XML/Input: Difference between revisions

Revision as of 15:49, 9 August 2011

Given the following XML fragment, extract the list of student names using whatever means desired. If the only viable method is to use XPath, refer the reader to the task XML and XPath.

<lang xml><Students>
 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />

</Students></lang>

Expected Output

April
Bob
Chad
Dave
Émily
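
As a neutral reference point for the task, here is a minimal sketch in Python using only the standard library's xml.etree.ElementTree. The names SAMPLE and student_names are illustrative and not part of the task description.

```python
import xml.etree.ElementTree as ET

# The task's sample document, including the enclosing <Students> root element.
SAMPLE = """<Students>
  <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
  <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
  <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
  <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
    <Pet Type="dog" Name="Rover" />
  </Student>
  <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>"""


def student_names(xml_text):
    """Return the Name attribute of every Student element, in document order."""
    root = ET.fromstring(xml_text)
    # iter("Student") visits only Student elements, so the Pet named Rover is skipped.
    return [student.get("Name") for student in root.iter("Student")]


if __name__ == "__main__":
    for name in student_names(SAMPLE):
        print(name)
```

Filtering on the element tag rather than on the attribute name is what keeps the pet's Name out of the result.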

Ada

Works with: GNAT

Uses XML/Ada from AdaCore.

extract_students.adb: <lang Ada>with Sax.Readers;
with Input_Sources.Strings;
with Unicode.CES.Utf8;
with My_Reader;

procedure Extract_Students is

  Sample_String : String :=

"<Students>" &

  "<Student Name=""April"" Gender=""F"" DateOfBirth=""1989-01-02"" />" &
  "<Student Name=""Bob"" Gender=""M"" DateOfBirth=""1990-03-04"" />" &
  "<Student Name=""Chad"" Gender=""M"" DateOfBirth=""1991-05-06"" />" &
  "<Student Name=""Dave"" Gender=""M"" DateOfBirth=""1992-07-08"">" &
     "<Pet Type=""dog"" Name=""Rover"" />" &
  "</Student>" &
  "<Student DateOfBirth=""1993-09-10"" Gender=""F"" Name=""Émily"" />" &

"</Students>";

  Reader : My_Reader.Reader;
  Input : Input_Sources.Strings.String_Input;

begin

  Input_Sources.Strings.Open (Sample_String, Unicode.CES.Utf8.Utf8_Encoding, Input);
  My_Reader.Parse (Reader, Input);
  Input_Sources.Strings.Close (Input);

end Extract_Students;</lang>

my_reader.ads: <lang Ada>with Sax.Attributes;
with Sax.Readers;
with Unicode.CES;
package My_Reader is

  type Reader is new Sax.Readers.Reader with null record;
  procedure Start_Element
    (Handler       : in out Reader;
     Namespace_URI : Unicode.CES.Byte_Sequence := "";
     Local_Name    : Unicode.CES.Byte_Sequence := "";
     Qname         : Unicode.CES.Byte_Sequence := "";
     Atts          : Sax.Attributes.Attributes'Class);

end My_Reader;</lang>

my_reader.adb: <lang Ada>with Ada.Text_IO;
package body My_Reader is

  procedure Start_Element
    (Handler       : in out Reader;
     Namespace_URI : Unicode.CES.Byte_Sequence := "";
     Local_Name    : Unicode.CES.Byte_Sequence := "";
     Qname         : Unicode.CES.Byte_Sequence := "";
     Atts          : Sax.Attributes.Attributes'Class) is
  begin
     if Local_Name = "Student" then
        Ada.Text_IO.Put_Line (Sax.Attributes.Get_Value (Atts, "Name"));
     end if;
  end Start_Element;

end My_Reader;</lang>

Output:

April
Bob
Chad
Dave
Émily

Alternative using a DOM document

<lang Ada>with Ada.Text_IO;
with Sax.Readers;
with Input_Sources.Strings;
with Unicode.CES.Utf8;
with DOM.Readers;
with DOM.Core.Documents;
with DOM.Core.Nodes;
with DOM.Core.Attrs;

procedure Extract_Students is

  Sample_String : String :=

"<Students>" &

  "<Student Name=""April"" Gender=""F"" DateOfBirth=""1989-01-02"" />" &
  "<Student Name=""Bob"" Gender=""M"" DateOfBirth=""1990-03-04"" />" &
  "<Student Name=""Chad"" Gender=""M"" DateOfBirth=""1991-05-06"" />" &
  "<Student Name=""Dave"" Gender=""M"" DateOfBirth=""1992-07-08"">" &
     "<Pet Type=""dog"" Name=""Rover"" />" &
  "</Student>" &
  "<Student DateOfBirth=""1993-09-10"" Gender=""F"" Name=""Émily"" />" &

"</Students>";

  Input : Input_Sources.Strings.String_Input;
  Reader : DOM.Readers.Tree_Reader;
  Document : DOM.Core.Document;
  List : DOM.Core.Node_List;

begin

  Input_Sources.Strings.Open (Sample_String, Unicode.CES.Utf8.Utf8_Encoding, Input);
  DOM.Readers.Parse (Reader, Input);
  Input_Sources.Strings.Close (Input);
  Document := DOM.Readers.Get_Tree (Reader);
  List := DOM.Core.Documents.Get_Elements_By_Tag_Name (Document, "Student");
  for I in 0 .. DOM.Core.Nodes.Length (List) - 1 loop
     Ada.Text_IO.Put_Line
       (DOM.Core.Attrs.Value
          (DOM.Core.Nodes.Get_Named_Item
             (DOM.Core.Nodes.Attributes
                (DOM.Core.Nodes.Item (List, I)), "Name")
          )
      );
  end loop;
  DOM.Readers.Free (Reader);

end Extract_Students;</lang>

The output is the same.

Aikido

Put the XML in a file called t.xml. <lang aikido>import xml

var s = openin ("t.xml")
var tree = XML.parseStream (s)

foreach node tree {

   if (node.name == "Students") {
       foreach studentnode node {
           if (studentnode.name == "Student") {
               println (studentnode.getAttribute ("Name"))
           }
       }
   }

}

</lang> The output is (Aikido doesn't support unicode rendering):

April
Bob
Chad
Dave
Émily

AutoHotkey

Simply using regular expressions: <lang AutoHotkey>students =
(
<Students>
 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>
)

quote = " ; "
pos = 1
while, pos := RegExMatch(students, "Name=.(\w+)" . quote . "\sGender" , name, pos + 1)
names .= name1 . "`n"

msgbox % names</lang>

AWK

The following code extracts the value of the attribute "Name" from every Student tag. It does not decode character references of the form &#CODE;. A quick way to cope with them is to output a very simple HTML structure, so that their interpretation is left to an HTML reader or browser.

<lang awk>function parse_buf() {

   if ( match(buffer, /<Student[ \t]+[^>]*Name[ \t]*=[ \t]*"([^"]*)"/, mt) != 0 ) {
     students[mt[1]] = 1
   }
   buffer = ""

}

BEGIN {

 FS=""
 mode = 0
 buffer = ""
 li = 1

}

mode==1 {

 for(i=1; i <= NF; i++) {
   buffer = buffer $i
   if ( $i == ">" ) {
     mode = 0;
     break;
   }
 }
 if ( mode == 0 ) {
   li = i
 } else {
   li = 1
 }
 # let us process the buffer if "complete"
 if ( mode == 0 ) {
   parse_buf()
 }

}

mode==0 {

 for(i=li; i <= NF; i++) {
   if ( $i == "<" ) {
     mode = 1
     break;
   }
 }
 for(j=i; i <= NF; i++) {
   buffer = buffer $i
   if ( $i == ">" ) {
     mode = 0
     parse_buf()
   }
 }
 li = 1

}

END {

 for(k in students) {
   print k
 }

}</lang> Using getXML.awk written by Jan Weber, one could do this:

Works with: gawk

or

Works with: nawk

<lang awk>awk -f getXML.awk sample.xml | awk '

   $1 == "TAG"                 {tag = $2}
   tag == "Student" && /Name=/ {print substr($0, index($0, "=") + 1)}

'</lang> Using xmlparser.awk by Steve Coile, one can do this:

Works with: gawk

<lang awk>gawk -f xmlparser.awk sample.xml | awk '

   $1 == "begin"                                         {tag = $2}
   $1 == "attrib"                                        {attrib = $2}
   $1 == "value" && tag == "STUDENT" && attrib == "name" {print $2}

'</lang>

Both of these produce this output

April
Bob
Chad
Dave
&#x00C9;mily
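
The output above still shows the character reference &#x00C9; because the scripts copy the attribute value verbatim. A minimal sketch of the missing decoding step follows, written in Python rather than AWK and assuming the raw values have already been extracted. The list raw_names and the function decode_references are illustrative. Note that html.unescape also accepts HTML named entities, which is a superset of what XML defines.

```python
import html

# Attribute values as printed by the AWK pipelines above (illustrative list).
raw_names = ["April", "Bob", "Chad", "Dave", "&#x00C9;mily"]


def decode_references(value):
    """Replace numeric and named character references with the characters they denote."""
    return html.unescape(value)


decoded = [decode_references(name) for name in raw_names]
print("\n".join(decoded))
```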

Works with: XMLgawk

Scripts in AWK are often one-liners. This one-liner implementation searches for Student tags and then displays the contents of their Name attribute. The following line is meant to be typed in on the command line of a Unix shell or an MS-DOS command window.

gawk -lxml 'XMLSTARTELEM == "Student" {print XMLATTR["Name"]}' rosetta.xml

Output:

April
Bob
Chad
Dave
Émily

C

Library: LibXML

Uses: libxml

Uses: C Runtime

<lang c>#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <libxml/parser.h>
#include <libxml/tree.h>

/* Walk the node list (and, recursively, all children) and print the
   Name attribute of every Student element. */
static void print_names(xmlNode *node)
{
  xmlNode *cur_node = NULL;
  for (cur_node = node; cur_node; cur_node = cur_node->next) {
    if (cur_node->type == XML_ELEMENT_NODE) {
      /* libxml2 stores names as xmlChar*, so cast for strcmp */
      if ( strcmp((const char *)cur_node->name, "Student") == 0 ) {
        xmlAttr *prop = NULL;
        if ( (prop = xmlHasProp(cur_node, BAD_CAST "Name")) != NULL ) {
          printf("%s\n", prop->children->content);
        }
      }
    }
    print_names(cur_node->children);
  }
}

const char *buffer =

 "<Students>\n"
 "  <Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />\n"
 "  <Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />\n"
 "  <Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />\n"
 "  <Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">\n"
 "    <Pet Type=\"dog\" Name=\"Rover\" />\n"
 "  </Student>\n"
 "  <Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />\n"
 "</Students>\n";

int main() {

 xmlDoc *doc = NULL;
 xmlNode *root = NULL;

 doc = xmlReadMemory(buffer, strlen(buffer), NULL, NULL, 0);
 if ( doc != NULL ) {
   root = xmlDocGetRootElement(doc);
   print_names(root);
   xmlFreeDoc(doc);
 }
 xmlCleanupParser();
 return 0;

}</lang>

C++

Library: Qt

Uses: Qt

<lang cpp>/*
Using the Qt library's XML parser.
*/
#include <iostream>

#include <QDomDocument>
#include <QObject>

int main() {

   QDomDocument doc;

   doc.setContent(
      QObject::tr(
         "<Students>\n"
         "<Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />\n"
         "<Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />\n"
         "<Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />\n"
         "<Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">\n"
         "<Pet Type=\"dog\" Name=\"Rover\" />\n"
         "</Student>\n"
         "<Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />\n"
         "</Students>"));
   
   QDomElement n = doc.documentElement().firstChildElement("Student");
   while(!n.isNull()) {
       std::cout << qPrintable(n.attribute("Name")) << std::endl;
       n = n.nextSiblingElement();
   }
   return 0;

}</lang>

C#

<lang csharp>using System;
using System.Linq;
using System.Xml.Linq;

class Program {

   static void Main(string[] args)
   {   
       XDocument xmlDoc = XDocument.Load("XMLFile1.xml");
       var query = from p in xmlDoc.Descendants("Student")
                   select p.Attribute("Name");

       foreach (var item in query)
       {
           Console.WriteLine(item.Value);
       }
       Console.ReadLine();
   }

} </lang>

Clojure

Uses: java.io

Uses: clojure.xml

This version uses the standard Clojure function xml-seq. <lang lisp>(import '(java.io ByteArrayInputStream))
(use 'clojure.xml) ; defines 'parse

(def xml-text "<Students>

 <Student Name='April' Gender='F' DateOfBirth='1989-01-02' />
 <Student Name='Bob' Gender='M'  DateOfBirth='1990-03-04' />
 <Student Name='Chad' Gender='M'  DateOfBirth='1991-05-06' />
 <Student Name='Dave' Gender='M'  DateOfBirth='1992-07-08'>
   <Pet Type='dog' Name='Rover' />
 </Student>
 <Student DateOfBirth='1993-09-10' Gender='F' Name='Émily' />

</Students>")

(def students (parse (-> xml-text .getBytes ByteArrayInputStream.))) </lang>

The parse produces a data structure where each element is represented as a map with :tag, :attrs, and :content keys. Thus the "April" element becomes {:tag :Student, :attrs {:Name "April", :Gender "F", :DateOfBirth "1989-01-02"}, :content nil}. xml-seq produces a sequence of such nodes by walking the resulting tree.

<lang lisp> (doseq [{:keys [tag attrs]} (xml-seq students)]

 (if (= :Student tag)
   (println (:Name attrs))))

</lang>
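
To make the shape of that data structure concrete, here is a rough Python analogue. It is a sketch, not Clojure's implementation: the helpers to_node and node_seq are hypothetical names that mimic the tag/attrs/content maps and the depth-first walk performed by xml-seq, and text content is ignored for brevity.

```python
import xml.etree.ElementTree as ET


def to_node(elem):
    """Convert an Element into a dict shaped like clojure.xml's element maps."""
    children = [to_node(child) for child in elem]
    # Like the Clojure example, an element without children gets no content.
    return {"tag": elem.tag, "attrs": dict(elem.attrib), "content": children or None}


def node_seq(node):
    """Yield the node and all of its descendants depth-first, as xml-seq does."""
    yield node
    for child in node["content"] or []:
        yield from node_seq(child)


doc = to_node(ET.fromstring(
    "<Students>"
    "<Student Name='April' Gender='F' DateOfBirth='1989-01-02' />"
    "<Student Name='Dave' Gender='M' DateOfBirth='1992-07-08'>"
    "<Pet Type='dog' Name='Rover' /></Student>"
    "</Students>"))

# Keep only Student nodes, exactly as the doseq filters on :Student.
names = [n["attrs"]["Name"] for n in node_seq(doc) if n["tag"] == "Student"]
print(names)
```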

Common Lisp

Library: Closure XML

<lang lisp>(defparameter *xml-blob* "<Students>

 <Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />
 <Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />
 <Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />
 <Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">
   <Pet Type=\"dog\" Name=\"Rover\" />
 </Student>
 <Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />

</Students>")

(let* ((document (cxml:parse *xml-blob* (cxml-dom:make-dom-builder)))

      (students (dom:item (dom:get-elements-by-tag-name document "Students") 0))
      (student-names '()))
 (dom:do-node-list (child (dom:child-nodes students) (nreverse student-names))
   (when (dom:element-p child)
     (push (dom:get-attribute child "Name") student-names))))</lang>

produces<lang lisp>("April" "Bob" "Chad" "Dave" "Émily")</lang>

D

Library: KXML

<lang d>import kxml.xml;
char[]xmlinput = "<Students>

 <Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />
 <Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />
 <Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />
 <Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">
   <Pet Type=\"dog\" Name=\"Rover\" />
 </Student>
 <Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />

</Students>";

void main() {

       auto root = readDocument(xmlinput);
       foreach(students;root.getChildren) if (!students.isCData && students.getName == "Students") {
               // now look for student subnodes
               foreach(student;students.getChildren) if (!student.isCData && student.getName == "Student") {
                       // we found a student!
                       std.stdio.writefln("%s",student.getAttribute("Name"));
               }
               // we only want one, so break out of the loop once we find a match
               break;
       }

}</lang>

Delphi

<lang Delphi>//You need to use these units
uses

 SysUtils,
 Dialogs,
 XMLIntf,
 XMLDoc;

//..............................................

//This function processes the XML
function GetStudents(aXMLInput: string): string;
var

 XMLDoc: IXMLDocument;
 i: Integer;

begin

 //Creating the TXMLDocument instance
 XMLDoc:= TXMLDocument.Create(nil);

 //Loading the XML string
 XMLDoc.LoadFromXML(aXMLInput);

 //Parsing the xml document
 for i:=0 to XMLDoc.DocumentElement.ChildNodes.Count - 1 do
    Result:= Result + XMLDoc.DocumentElement.ChildNodes.Get(i).GetAttributeNS('Name', '') + #13#10;

 //Removing the trailing #13#10 characters
 Result:= Trim(Result);

end;

//..............................................

//Consuming code example (fragment)
var

 XMLInput: string;

begin

 XMLInput:= '<Students>' +
               '<Student Name="April" Gender="F" DateOfBirth="1989-01-02" />' +
               '<Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />' +
               '<Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />' +
               '<Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">' +
                 '<Pet Type="dog" Name="Rover" />' +
               '</Student>' +
               '<Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />'+
             '</Students>';
 Showmessage(GetStudents(XMLInput));

end; </lang>

Factor

<lang factor>USING: io sequences xml xml.data xml.traversal ;

: print-student-names ( string -- )
    string>xml "Student" tags-named [ "Name" attr print ] each ;

"""<Students>

 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />

</Students>""" print-student-names</lang>

Fantom

<lang fantom> using xml

class XmlInput {

 public static Void main () 
 {
   // create the XML parser
   parser := XParser(File("sample-xml.xml".toUri).in)
   // parse the document, creating an XML document
   XDoc doc := parser.parseDoc
   // walk through each child element from the root of the document
   doc.root.elems.each |elem|
   {
     // printing the Name attribute of all Students
     if (elem.name == "Student") { echo (elem.get("Name")) }
   }
 }

} </lang>

Forth

Library: Forth Foundation Library

<lang forth>include ffl/est.fs
include ffl/str.fs
include ffl/xis.fs

\ Build input string
str-create xmlstr
: x+ xmlstr str-append-string ;

s\" <Students>\n" x+
s\" <Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />\n" x+
s\" <Student Name=\"Bob\" Gender=\"M\" DateOfBirth=\"1990-03-04\" />\n" x+
s\" <Student Name=\"Chad\" Gender=\"M\" DateOfBirth=\"1991-05-06\" />\n" x+
s\" <Student Name=\"Dave\" Gender=\"M\" DateOfBirth=\"1992-07-08\">\n" x+
s\" <Pet Type=\"dog\" Name=\"Rover\" />\n" x+
s\" </Student>\n" x+
s\" <Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />\n" x+
s\" </Students>\n" x+

\ Setup xml parser
xis-create xmlparser
xmlstr str-get xmlparser xis-set-string

\ Parse the xml
: xmlparse
  BEGIN
    xmlparser xis-read  dup xis.error <> over xis.done <> AND
  WHILE
    dup xis.start-tag = over xis.empty-element = OR IF
      drop
      s" Student" compare 0= IF
        0 ?DO
          2swap s" Name" compare 0= IF
            type cr
          ELSE
            2drop
          THEN
        LOOP
      ELSE
        xis+remove-attribute-parameters
      THEN
    ELSE
      xis+remove-read-parameters
    THEN
  REPEAT
  drop
;

xmlparse</lang>

Go

Go's xml.Unmarshal uses reflection to fill in data-structures recursively.

<lang go>package main

import (

  "xml"
  "strings"

)

type Students struct {

  Student []Student

}

type Student struct {

  Name string "attr"
  Gender string "attr"
  DateOfBirth string "attr"
  Pet []Pet

}

type Pet struct {

  Type string "attr"
  Name string "attr"

}

const XML_DATA = ` <Students>

  <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
  <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
  <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
  <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
     <Pet Type="dog" Name="Rover" />
  </Student>
  <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />

</Students> `

func main() {

  data := new(Students)
  err := xml.Unmarshal(strings.NewReader(XML_DATA), data)
  if err != nil { panic(err) }
  for _, s := range data.Student {
     println(s.Name)
  }

}</lang>

Output:

April
Bob
Chad
Dave
Émily
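
The idea of filling nested data structures by inspecting their declared fields can be sketched outside Go as well. The following Python analogue is an illustration only, not how Go's xml package works internally: the dataclasses and the unmarshal helper are hypothetical, string fields are read from attributes of the same name, and list fields collect child elements whose tag matches the field name.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field, fields
from typing import List, get_args, get_origin


@dataclass
class PetRecord:
    Type: str = ""
    Name: str = ""


@dataclass
class StudentRecord:
    Name: str = ""
    Gender: str = ""
    DateOfBirth: str = ""
    Pet: List[PetRecord] = field(default_factory=list)


@dataclass
class StudentsRecord:
    Student: List[StudentRecord] = field(default_factory=list)


def unmarshal(elem, cls):
    """Build an instance of cls from elem by inspecting the fields cls declares."""
    values = {}
    for f in fields(cls):
        if get_origin(f.type) is list:
            # A list field collects every child element named after the field.
            (item_cls,) = get_args(f.type)
            values[f.name] = [unmarshal(child, item_cls) for child in elem.findall(f.name)]
        else:
            # A plain string field is read from the attribute of the same name.
            values[f.name] = elem.get(f.name, "")
    return cls(**values)


XML_DATA = """<Students>
  <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
  <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
  <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
  <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
    <Pet Type="dog" Name="Rover" />
  </Student>
  <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>"""

data = unmarshal(ET.fromstring(XML_DATA), StudentsRecord)
for s in data.Student:
    print(s.Name)
```

Because the record types describe the whole document, the pet data is available too (for example data.Student[3].Pet), even though the task only needs the names.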

Groovy

<lang groovy>def input = """<Students>

 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />

</Students>"""

def students = new XmlParser().parseText(input)
students.each { println it.'@Name' }</lang>

Haskell

<lang haskell>import Data.Maybe
import Text.XML.Light

students="<Students>"++

       " <Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />"++
       " <Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />"++
       " <Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\"/>"++
       " <Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">"++
       "   <Pet Type=\"dog\" Name=\"Rover\" />  </Student>"++
       " <Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />"++
       "</Students>"

xmlRead elm name = mapM_ putStrLn

     . concatMap (map (fromJust.findAttr (unqual name)).filterElementsName (== unqual elm))
     . onlyElems.  parseXML</lang>

Show names: <lang haskell>*Main> xmlRead "Student" "Name" students
April
Bob
Chad
Dave
Émily</lang>

HicEst

<lang HicEst>CHARACTER in*1000, out*100

READ(ClipBoard) in
EDIT(Text=in, SPR='"', Right='<Student', Right='Name=', Word=1, WordEnd, APpendTo=out, DO)</lang>

out is returned as:
April  Bob  Chad  Dave  Émily

J

J's system includes several XML processing libraries. This task is probably best addressed using XPath (this is the type of problem XPath was designed to solve), but the task description implicitly discourages that method. So we can use the SAX library instead:

<lang j>load'xml/sax'

saxclass 'Students'
startElement =: ([: smoutput 'Name' getAttribute~ [)^:('Student'-:])
cocurrent'base'

process_Students_ XML</lang>

April
Bob
Chad
Dave
Émily

and the definition of the variable XML: <lang j>XML=: noun define
<Students>
 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>
)</lang>
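
For comparison with the XPath route mentioned at the start of this section, here is a small Python sketch. It relies on the limited XPath subset that the standard library's ElementTree supports in findall, so it is not a full XPath engine. The variable names are illustrative.

```python
import xml.etree.ElementTree as ET

XML = """<Students>
  <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
  <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
  <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
  <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
    <Pet Type="dog" Name="Rover" />
  </Student>
  <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>"""

root = ET.fromstring(XML)
# ".//Student[@Name]" selects Student elements at any depth that carry a Name attribute.
names = [student.get("Name") for student in root.findall(".//Student[@Name]")]
print("\n".join(names))
```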

Java

Uses: java.io

Uses: org.xml.sax

<lang java>import java.io.IOException;
import java.io.StringReader;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;
import org.xml.sax.helpers.XMLReaderFactory;

public class StudentHandler extends DefaultHandler {

 public static void main(String[] args)throws Exception{
   String xml = "<Students>\n"+
   "<Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />\n"+
   "<Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />\n"+
   "<Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />\n"+
   "<Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">\n"+
   "  <Pet Type=\"dog\" Name=\"Rover\" />\n"+
   "</Student>\n"+
   "<Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />\n"+
   "</Students>";
   StudentHandler handler = new StudentHandler();
   handler.parse(new InputSource(new StringReader(xml)));
 }

 public void parse(InputSource src) throws SAXException, IOException {

XMLReader parser = XMLReaderFactory.createXMLReader();

   parser.setContentHandler(this);
   parser.parse(src);
 }

 @Override
 public void characters(char[] ch, int start, int length) throws SAXException {
   //if there were text as part of the elements, we would deal with it here
   //by adding it to a StringBuffer, but we don't have to for this task
   super.characters(ch, start, length);
 }

 @Override
 public void endElement(String uri, String localName, String qName) throws SAXException {
   //this is where we would get the info from the StringBuffer if we had to,
   //but all we need is attributes
   super.endElement(uri, localName, qName);
 }

 @Override
 public void startElement(String uri, String localName, String qName, Attributes attributes) throws SAXException {
   if(qName.equals("Student")){
     System.out.println(attributes.getValue("Name"));
   }
 }

}</lang>
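
The same callback pattern, where the parser invokes a handler method for each opening tag, can be sketched with Python's standard xml.sax module. This is an illustrative analogue rather than a translation of the Java code: the handler collects names in a list (the attribute names is an assumption of this sketch) instead of printing them directly.

```python
import xml.sax


class StudentHandler(xml.sax.ContentHandler):
    """Collect the Name attribute of each Student start tag."""

    def __init__(self):
        super().__init__()
        self.names = []

    def startElement(self, name, attrs):
        # Called once per opening tag; attrs behaves like a read-only mapping.
        if name == "Student":
            self.names.append(attrs.get("Name"))


XML = (
    "<Students>"
    "<Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />"
    "<Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />"
    "<Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />"
    "<Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">"
    "<Pet Type=\"dog\" Name=\"Rover\" />"
    "</Student>"
    "<Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />"
    "</Students>"
)

handler = StudentHandler()
# Without an XML declaration the parser assumes UTF-8, so encode accordingly.
xml.sax.parseString(XML.encode("utf-8"), handler)
print("\n".join(handler.names))
```

As in the Java version, no characters or endElement handling is needed because all required data lives in attributes.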

JavaScript

This version tested against Chrome 5, Firefox 3.6, and IE 8: <lang JavaScript> var xmlstr = '<Students>' +

 '<Student Name="April" Gender="F" DateOfBirth="1989-01-02" />' +
 '<Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />' +
 '<Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />' +
 '<Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">' +
   '<Pet Type="dog" Name="Rover" />' +
 '</Student>' +
 '<Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />' +

'</Students>';

var list = xmlstr.match(/<Student .*? \/>/g);
var output = '';
for (var i = 0; i < list.length; i++) {

 if (i > 0) {
   output += ', ';
 }
 var tmp = list[i].match(/Name="(.*?)"/);
 output += tmp[1];

}

// Bounce it through a HTML element to handle Unicode for us
var l = document.createElement('p');
l.innerHTML = output;
alert(l.innerHTML); </lang>

Works with: Mozilla Firefox version 3.6

Alternatively, use the E4X featureset (currently only in Firefox): <lang JavaScript> var xmlstr = '<Students>' +

 '<Student Name="April" Gender="F" DateOfBirth="1989-01-02" />' +
 '<Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />' +
 '<Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />' +
 '<Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">' +
   '<Pet Type="dog" Name="Rover" />' +
 '</Student>' +
 '<Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />' +

'</Students>';
var xml = XML(xmlstr);
var list = xml.Student.@Name;
var output = '';
for (var i = 0; i < list.length(); i++) {

 if (i > 0) {
   output += ', ';
 }
 output += list[i];

}

alert(output); </lang>

newLISP

<lang newlisp> (set 'xml-input "<Students>

 <Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />
 <Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />
 <Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />
 <Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">
   <Pet Type=\"dog\" Name=\"Rover\" />
 </Student>
 <Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />

</Students>")

(set 'sexp (xml-parse xml-input))

(dolist (x (ref-all "Name" sexp)) (if (= (length x) 6) (println (last (sexp (chop x)))))) </lang>

Output:

April 
Bob 
Chad 
Dave 
Émily

Objeck

<lang objeck> use XML;

bundle Default {

 class Test {
   function : Main(args : String[]) ~ Nil {
     in := String->New();
     in->Append("<Students>");
     in->Append("<Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />");
     in->Append("<Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />");
     in->Append("<Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />");
     in->Append("<Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">");
     in->Append("<Pet Type=\"dog\" Name=\"Rover\" />");
     in->Append("</Student>");
     in->Append("<Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" /></Students>");
   
     parser := XmlParser->New(in);
     if(parser->Parse()) {
       root := parser->GetRoot();
       children := root->GetChildren("Student");
       each(i : children) {
         child : XMLElement := children->Get(i)->As(XMLElement);
         XMLElement->DecodeString(child->GetAttribute("Name"))->PrintLine();
       };
     };
   }
 }

} </lang>

OCaml

From the toplevel, using the library xml-light: <lang ocaml># #directory "+xml-light" (* or maybe "+site-lib/xml-light" *) ;;

# #load "xml-light.cma" ;;

# let x = Xml.parse_string "

 <Students>
   <Student Name='April' Gender='F' DateOfBirth='1989-01-02' />
   <Student Name='Bob' Gender='M'  DateOfBirth='1990-03-04' />
   <Student Name='Chad' Gender='M'  DateOfBirth='1991-05-06' />
   <Student Name='Dave' Gender='M'  DateOfBirth='1992-07-08'>
     <Pet Type='dog' Name='Rover' />
   </Student>
   <Student DateOfBirth='1993-09-10' Gender='F' Name='Émily' />
 </Students>"
 in
 Xml.iter (function
   Xml.Element ("Student", attrs, _) ->
      List.iter (function ("Name", name) -> print_endline name | _ -> ()) attrs
 | _ -> ()) x
 ;;

April
Bob
Chad
Dave
Émily
- : unit = ()</lang>

Another solution using the library xmlm: <lang ocaml>#directory "+xmlm"
#load "xmlm.cmo"

open Xmlm

let str = "

 <Students>
   <Student Name='April' Gender='F' DateOfBirth='1989-01-02' />
   <Student Name='Bob'   Gender='M' DateOfBirth='1990-03-04' />
   <Student Name='Chad'  Gender='M' DateOfBirth='1991-05-06' />
   <Student Name='Dave'  Gender='M' DateOfBirth='1992-07-08'>
     <Pet Type='dog' Name='Rover' />
   </Student>
   <Student DateOfBirth='1993-09-10' Gender='F' Name='Émily' />
 </Students>"

let xi = make_input(`String(0, str))

let () =

 while not(eoi xi) do
   match Xmlm.input xi with
   | `El_start ((_, "Student"), attrs) ->
       List.iter (function ((_, "Name"), name) -> print_endline name | _ -> ()) attrs
   | _ -> ()
 done</lang>

using the ocaml expat wrapper:

<lang ocaml>open Expat

let xml_str = "

 <Students>
   <Student Name='April' Gender='F' DateOfBirth='1989-01-02' />
   <Student Name='Bob'   Gender='M' DateOfBirth='1990-03-04' />
   <Student Name='Chad'  Gender='M' DateOfBirth='1991-05-06' />
   <Student Name='Dave'  Gender='M' DateOfBirth='1992-07-08'>
     <Pet Type='dog' Name='Rover' />
   </Student>
   <Student DateOfBirth='1993-09-10' Gender='F' Name='Émily' />
 </Students>"

let () =

 let p = parser_create None in
 set_start_element_handler p
   (fun tag attrs ->
      if tag = "Student" then
        List.iter (function ("Name", name) -> print_endline name | _ -> ()) attrs
   );
 parse p xml_str;
 final p;</lang>

Oz

<lang oz>declare

 [XMLParser] = {Module.link ['x-oz://system/xml/Parser.ozf']}
 Parser = {New XMLParser.parser init}

 Data =
  "<Students>"
 #"  <Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />"
 #"  <Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />"
 #"  <Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />"
 #"  <Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">"
 #"    <Pet Type=\"dog\" Name=\"Rover\" />"
 #"  </Student>"
 #"  <Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />"
 #"</Students>"

 fun {IsStudentElement X}
    case X of element(name:'Student' ...) then true
    else false
    end
 end

 fun {GetStudentName element(attributes:As ...)}
    [NameAttr] = {Filter As fun {$ attribute(name:N ...)} N == 'Name' end}
 in
    NameAttr.value
 end

 [StudentsDoc] = {Parser parseVS(Data $)}
 Students = {Filter StudentsDoc.children IsStudentElement}
 StudentNames = {Map Students GetStudentName}

in

 {ForAll StudentNames System.showInfo}</lang>

Perl

Library: XML::Simple

<lang perl>use XML::Simple;

my $ref = XMLin('<Students>

 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />

</Students>');

print join( "\n", map { $_->{'Name'} } @{$ref->{'Student'}});</lang>

PHP

<lang php><?php
$data = '<Students>
 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>';
$xml = new XMLReader();
$xml->xml( $data );
while ( $xml->read() )
  if ( XMLReader::ELEMENT == $xml->nodeType && $xml->localName == 'Student' )
    echo $xml->getAttribute('Name') . "\n";
?></lang>

PicoLisp

<lang PicoLisp>(load "@lib/xm.l")

(mapcar

  '((L) (attr L 'Name))
  (body (in "file.xml" (xml))) )</lang>

Output:

-> ("April" "Bob" "Chad" "Dave" "Émily")

PowerShell

<lang PowerShell>[xml]$xml = @'
<Students>
  <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
  <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
  <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
  <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
    <Pet Type="dog" Name="Rover" />
  </Student>
  <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>
'@

foreach ($node in $xml.DocumentElement.ChildNodes) {$node.Name}
</lang>

PureBasic

Uses a PureBasic XML library (which is linked automatically) that is based on the expat XML parser, licensed under the MIT license.
<lang PureBasic>Define studentNames.String, src$

src$ = "<Students>"
src$ + "<Student Name='April' Gender='F' DateOfBirth='1989-01-02' />"
src$ + "<Student Name='Bob' Gender='M' DateOfBirth='1990-03-04' />"
src$ + "<Student Name='Chad' Gender='M' DateOfBirth='1991-05-06' />"
src$ + "<Student Name='Dave' Gender='M' DateOfBirth='1992-07-08'>"
src$ + "<Pet Type='dog' Name='Rover' />"
src$ + "</Student>"
src$ + "<Student DateOfBirth='1993-09-10' Gender='F' Name='Émily' />"
src$ + "</Students>"

;This procedure is generalized to match any attribute of any normal element's node name
;i.e. get_values(MainXMLNode(0),"Pet","Type",@petName.String) and displaying petName\s
;would display "dog".

Procedure get_values(*cur_node, nodeName$, attribute$, *valueResults.String)

 ;If nodeName$ and attribute$ are matched then the value
 ;will be added to the string structure pointed to by *valueResults .
 Protected result$

 While *cur_node
   If XMLNodeType(*cur_node) = #PB_XML_Normal
     
     result$ = GetXMLNodeName(*cur_node)
     If result$ = nodeName$
       If ExamineXMLAttributes(*cur_node)
         While NextXMLAttribute(*cur_node)
           If XMLAttributeName(*cur_node) = attribute$
             If *valueResults <> #Null
               *valueResults\s + XMLAttributeValue(*cur_node) + Chr(13) ;value + carriage-return
             EndIf 
           EndIf
         Wend
       EndIf
     EndIf 
     
   EndIf 
   
   get_values(ChildXMLNode(*cur_node), nodeName$, attribute$, *valueResults)
   *cur_node = NextXMLNode(*cur_node)
 Wend

EndProcedure

CatchXML(0,@src$,Len(src$))

If IsXML(0)

 get_values(MainXMLNode(0), "Student", "Name",@studentNames)
 MessageRequester("Student Names", studentNames\s)
 FreeXML(0)

EndIf</lang>

Sample output:

April
Bob
Chad
Dave
Émily

Python

<lang python>import xml.dom.minidom

doc = """<Students>

 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />

</Students>"""

doc = xml.dom.minidom.parseString(doc)

for i in doc.getElementsByTagName("Student"):
    print(i.getAttribute("Name"))</lang>
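
As an alternative sketch (not part of the original entry), the same extraction can be written with the standard library's xml.etree.ElementTree, assuming Python 3. The names XML_TEXT and student_names are illustrative only. Iterating over "Student" elements, rather than over every element that has a Name attribute, keeps the pet's name out of the result.

```python
import xml.etree.ElementTree as ET

# Sample document from the task description (hypothetical constant name).
XML_TEXT = """<Students>
  <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
  <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
  <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
  <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
    <Pet Type="dog" Name="Rover" />
  </Student>
  <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>"""


def student_names(xml_text):
    """Return the Name attribute of every Student element, in document order."""
    root = ET.fromstring(xml_text)
    # iter("Student") visits only elements with that tag, so <Pet> is skipped.
    return [student.get("Name") for student in root.iter("Student")]


if __name__ == "__main__":
    for name in student_names(XML_TEXT):
        print(name)
```

The helper returns a list instead of printing directly, so the caller decides how to display or further process the names.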

R

Library: XML

<lang R>library(XML)

#Read in XML string

str <- readLines(tc <- textConnection('<Students>

 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />

</Students>'))
close(tc)
str</lang>

[1] "<Students>"                                                                 
[2] "  <Student Name=\"April\" Gender=\"F\" DateOfBirth=\"1989-01-02\" />"       
[3] "  <Student Name=\"Bob\" Gender=\"M\"  DateOfBirth=\"1990-03-04\" />"        
[4] "  <Student Name=\"Chad\" Gender=\"M\"  DateOfBirth=\"1991-05-06\" />"       
[5] "  <Student Name=\"Dave\" Gender=\"M\"  DateOfBirth=\"1992-07-08\">"         
[6] "    <Pet Type=\"dog\" Name=\"Rover\" />"                                    
[7] "  </Student>"                                                               
[8] "  <Student DateOfBirth=\"1993-09-10\" Gender=\"F\" Name=\"Émily\" />"
[9] "</Students>"

<lang R>#Convert to an XML tree
xmltree <- xmlTreeParse(str)

#Retrieve the students, and how many there are
students <- xmltree$doc$children$Students
nstudents <- length(students)

#Get each of their names
studentsnames <- character(nstudents)
for(i in 1:nstudents) {
   this.student <- students$children[i]$Student
   studentsnames[i] <- this.student$attributes["Name"]
}

#Change the encoding so that Emily displays correctly
Encoding(studentsnames) <- "UTF-8"
studentsnames</lang>

[1] "April" "Bob"   "Chad"  "Dave"  "Émily"

REBOL

<lang REBOL>REBOL [
	Title: "XML Reading"
	Author: oofoe
	Date: 2009-12-08
	URL: http://rosettacode.org/wiki/XML_Reading
]

xml: {
<Students>

 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />

</Students>
}

; REBOL has a simple built-in XML parser. It's not terribly fancy, but
; it's easy to use. It converts the XML into a nested list of blocks
; which can be accessed using standard REBOL path operators. The only
; annoying part (in this case) is that it does try to preserve
; whitespace, so some of the parsed elements are just things like line
; endings and whatnot, which I need to ignore.

; Once I have drilled down to the individual student records, I can
; just use the standard REBOL 'select' to locate the requested
; property.

data: parse-xml xml
students: data/3/1/3 ; Drill down to student records.
foreach student students [
	if block! = type? student [ ; Ignore whitespace elements.
		print select student/2 "Name"
	]
]</lang>

Output:

April
Bob
Chad
Dave
Émily

REXX

version 1

<lang rexx>/*REXX program to extract student names from an XML string(s). */
g.=''
g.1='<Students>'
g.2='  <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />'
g.3='  <Student Name="Bob" Gender="M" DateOfBirth="1990-03-04" />'
g.4='  <Student Name="Chad" Gender="M" DateOfBirth="1991-05-06" />'
g.5='  <Student Name="Dave" Gender="M" DateOfBirth="1992-07-08">'
g.6='    <Pet Type="dog" Name="Rover" />'
g.7='  </Student>'
g.8='  <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />'
g.9='</Students>'

say copies('~',70)
  do j=1 while g.j\==''
  g.j=space(g.j)
  say g.j
  end
say copies('~',70)
  do j=1 while g.j\==''
  parse var g.j 'Name="' studname '"'
  if studname\=='' then say studname
  end
say copies('~',70)</lang>
Output:
<lang rexx>
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
<Students>
<Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
<Student Name="Bob" Gender="M" DateOfBirth="1990-03-04" />
<Student Name="Chad" Gender="M" DateOfBirth="1991-05-06" />
<Student Name="Dave" Gender="M" DateOfBirth="1992-07-08">
<Pet Type="dog" Name="Rover" />
</Student>
<Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
April
Bob
Chad
Dave
Rover
Émily
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
</lang>

version 2

<lang rexx>/*REXX program to extract student names from an XML string(s). */
g.=''
g.1='<Students>'
g.2='  <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />'
g.3='  <Student Name="Bob" Gender="M" DateOfBirth="1990-03-04" />'
g.4='  <Student Name="Chad" Gender="M" DateOfBirth="1991-05-06" />'
g.5='  <Student Name="Dave" Gender="M" DateOfBirth="1992-07-08">'
g.6='    <Pet Type="dog" Name="Rover" />'
g.7='  </Student>'
g.8='  <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />'
g.9='</Students>'

say copies('~',70)
  do j=1 while g.j\==''
  g.j=space(g.j)
  say g.j
  end
say copies('~',70)
  do j=1 while g.j\==''
  parse var g.j 'Name="' studname '"'
  if studname=='' then iterate
  if pos('&',studname)\==0 then studname=xmlTranE(studname)
  say studname
  end
say copies('~',70)
exit

/*─────────────────────────────────────XML! subroutine──────────────────*/
xml!: parse arg ,_                     /*tran an XML entity (&xxxx;) */
xmlEntity!='&'_";"
if pos(xmlEntity!,x)\==0 then x=changestr(xmlEntity!,x,arg(1))
if left(_,2)=='#x' then do
                        xmlEntity!='&'left(_,3)translate(substr(_,4))";"
                        x=changestr(xmlEntity!,x,arg(1))
                        end
return x

/*─────────────────────────────────────XMLTRANE subroutine──────────────*/
xmlTranE: procedure; parse arg x
                                /*Following are a few of the chars in   */
                                /*the DOS (DOS under Windows)  codepage.*/
x=xml!('♥',"hearts");  x=xml!('♦',"diams");   x=xml!('♣',"clubs");   x=xml!('♠',"spades")
x=xml!('♂',"male");    x=xml!('♀',"female");  x=xml!('↕',"UpDownArrow")
x=xml!('¶',"para");    x=xml!('§',"sect");    x=xml!('↑',"uarr");    x=xml!('↓',"darr")
x=xml!('←',"larr");    x=xml!('1c'x,"rarr");  x=xml!('!',"excl");    x=xml!("'","apos")
x=xml!('$',"dollar");  x=xml!('"',"quot");    x=xml!('*',"ast");     x=xml!('/',"sol")
x=xml!(':',"colon");   x=xml!(';',"semi");    x=xml!('<',"lt");      x=xml!('=',"equals")
x=xml!('>',"gt");      x=xml!('@',"commat");  x=xml!('[',"lbrack");  x=xml!('\',"bsol")
x=xml!(']',"rbrack");  x=xml!('^',"Hat");     x=xml!('`',"grave");   x=xml!('{',"lbrace")
x=xml!('|',"vert");    x=xml!('}',"rbrace")
x=xml!('Ç',"Ccedil");  x=xml!('Ç','#x00c7');  x=xml!('ü',"uuml");    x=xml!('ü','#x00fc')
x=xml!('é',"eacute");  x=xml!('é','#x00e9');  x=xml!('â',"acirc");   x=xml!('â','#x00e2')
x=xml!('ä',"auml");    x=xml!('ä','#x00e4');  x=xml!('à',"agrave");  x=xml!('à','#x00e0')
x=xml!('å',"aring");   x=xml!('å','#x00e5');  x=xml!('ç',"ccedil");  x=xml!('ç','#x00e7')
x=xml!('ê',"ecirc");   x=xml!('ê','#x00ea');  x=xml!('ë',"euml");    x=xml!('ë','#x00eb')
x=xml!('è',"egrave");  x=xml!('è','#x00e8');  x=xml!('ï',"iuml");    x=xml!('ï','#x00ef')
x=xml!('î',"icirc");   x=xml!('î','#x00ee');  x=xml!('ì',"igrave");  x=xml!('ì','#x00ec')
x=xml!('Ä',"Auml");    x=xml!('Ä','#x00c4');  x=xml!('Å',"Aring");   x=xml!('Å','#x00c5')
x=xml!('É',"Eacute");  x=xml!('É','#x00c9');  x=xml!('æ',"aelig");   x=xml!('æ','#x00e6')
x=xml!('Æ',"AElig");   x=xml!('Æ','#x00c6');  x=xml!('ô',"ocirc");   x=xml!('ô','#x00f4')
x=xml!('ö',"ouml");    x=xml!('ö','#x00f6');  x=xml!('ò',"ograve");  x=xml!('ò','#x00f2')
x=xml!('û',"ucirc");   x=xml!('û','#x00fb');  x=xml!('ù',"ugrave");  x=xml!('ù','#x00f9')
x=xml!('ÿ',"yuml");    x=xml!('ÿ','#x00ff');  x=xml!('Ö',"Ouml");    x=xml!('Ö','#x00d6')
x=xml!('Ü',"Uuml");    x=xml!('Ü','#x00dc');  x=xml!('¢',"cent");    x=xml!('£',"pound")
x=xml!('¥',"yen");     x=xml!('ƒ',"fnof");    x=xml!('á',"aacute");  x=xml!('á','#x00e1')
x=xml!('í',"iacute");  x=xml!('í','#x00ed');  x=xml!('ó',"oacute");  x=xml!('ó','#x00f3')
x=xml!('ú',"uacute");  x=xml!('ú','#x00fa');  x=xml!('ñ',"ntilde");  x=xml!('ñ','#x00f1')
x=xml!('Ñ',"Ntilde");  x=xml!('Ñ','#x00d1');  x=xml!('¿',"iquest");  x=xml!('⌐',"bnot")
x=xml!('¬',"not");     x=xml!('½',"frac12");  x=xml!('¼',"frac14");  x=xml!('¡',"iexcl")
x=xml!('«',"laquo");   x=xml!('»',"raquo");   x=xml!('α',"alpha");   x=xml!('ß',"beta")
x=xml!('ß',"szlig");   x=xml!('ß','#x00df');  x=xml!('Γ',"Gamma");   x=xml!('π',"pi")
x=xml!('Σ',"Sigma");   x=xml!('σ',"sigma");   x=xml!('µ',"mu");      x=xml!('τ',"tau")
x=xml!('Φ',"Phi");     x=xml!('Θ',"Theta");   x=xml!('δ',"delta");   x=xml!('∞',"infin")
x=xml!('φ',"phi");     x=xml!('ε',"epsilon"); x=xml!('∩',"cap");     x=xml!('≡',"equiv")
x=xml!('±',"plusmn");  x=xml!('≥',"ge");      x=xml!('≤',"le");      x=xml!('÷',"div")
x=xml!('≈',"approx");  x=xml!('∙',"bull");    x=xml!('°',"deg");     x=xml!('·',"middot")
x=xml!('√',"radic");   x=xml!('²',"sup2");    x=xml!('■',"squart "); x=xml!('&',"amp")
return x</lang>
Output:
<lang rexx>
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
<Students>
<Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
<Student Name="Bob" Gender="M" DateOfBirth="1990-03-04" />
<Student Name="Chad" Gender="M" DateOfBirth="1991-05-06" />
<Student Name="Dave" Gender="M" DateOfBirth="1992-07-08">
<Pet Type="dog" Name="Rover" />
</Student>
<Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
</Students>
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
April
Bob
Chad
Dave
Rover
Émily
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
</lang>

Ruby

Library: REXML

<lang ruby>require 'rexml/document'
include REXML

doc = Document.new(File.new("sample.xml"))
# or
# doc = Document.new(xml_string)

# without using xpath
doc.each_recursive do |node|
  puts node.attributes["Name"] if node.name == "Student"
end

# using xpath
doc.each_element("*/Student") {|node| puts node.attributes["Name"]}</lang>

Scala

Scala has native XML support, with query constructs similar to XPath and XQuery.

<lang scala>val students =

 <Students>
   <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
   <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
   <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
   <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
     <Pet Type="dog" Name="Rover" />
   </Student>
   <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
 </Students>

students \ "Student" \\ "@Name" foreach println</lang>

Slate

Some lines in this example are too long (more than 80 characters). Please fix the code if it's possible and remove this message.

Slate's XML Reader is still being developed at the time of this writing.

<lang slate>slate[1]> [ |tree|

 tree: (Xml SimpleParser newOn: '<Students>
   <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
   <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
   <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
   <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
     <Pet Type="dog" Name="Rover" />
   </Student>
   <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" />
 </Students>') parse.
 tree name = 'Students' ifTrue: [(tree children select: #is: `er <- Xml Element)
                                        do: [|:child| child name = 'Student' ifTrue: [inform: (child attributes at: 'Name' ifAbsent: ['Noname'])]]].

] do.
April
Bob
Chad
Dave
Émily
Nil</lang>

Tcl

Using

Library: tDOM

<lang tcl>package require tdom
set tree [dom parse $xml]
set studentNodes [$tree getElementsByTagName Student] ;# or: set studentNodes [[$tree documentElement] childNodes]

foreach node $studentNodes {
    puts [$node getAttribute Name]
}</lang>

Using

Library: TclXML

<lang tcl>package require xml
set parser [xml::parser -elementstartcommand elem]
proc elem {name attlist args} {
    if {$name eq "Student"} {
        puts [dict get $attlist Name]
    }
}
$parser parse $xml</lang>

Using just pure-Tcl (originally on http://wiki.tcl.tk/3919):
<lang Tcl>proc xml2list xml {

   regsub -all {>\s*<} [string trim $xml " \n\t<>"] "\} \{" xml
   set xml [string map {> "\} \{#text \{" < "\}\} \{"}  $xml]
   set res ""   ;# string to collect the result
   set stack {} ;# track open tags
   set rest {}
   foreach item "{$xml}" {
       switch -regexp -- $item {

           ^# {append res "{[lrange $item 0 end]} " ; #text item}
           ^/ {
               regexp {/(.+)} $item -> tagname ;# end tag
               set expected [lindex $stack end]
               set stack [lrange $stack 0 end-1]
               append res "\}\} "
           }
           /$ { # singleton - start and end in one <> group
               regexp {([^ ]+)( (.+))?/$} $item -> tagname - rest
               set rest [lrange [string map {= " "} $rest] 0 end]
               append res "{$tagname [list $rest] {}} "
           }
           default {
               set tagname [lindex $item 0] ;# start tag
               set rest [lrange [string map {= " "} $item] 1 end]
               lappend stack $tagname
               append res "\{$tagname [list $rest] \{"
           }
       }
   }
   string map {"\} \}" "\}\}"} [lindex $res 0]   ;#"
}
proc deent str {
   regsub -all {&\#x(.+?);} $str {\\u\1} str
   subst -nocommands -novar $str
}
#----------------------- Testing the whole thing:

set xml {<Students>

 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Émily" /></Students>

}
foreach i [lindex [xml2list $xml] 2] {

   if {[lindex $i 0] eq "Student"} {
       foreach {att val} [lindex $i 1] {
           if {$att eq "Name"} {puts [deent $val]}
       }
   }

}</lang>

TUSCRIPT

<lang tuscript>
$$ MODE TUSCRIPT
MODE DATA
$$ SET xmldata =*
<Students>

 <Student Name="April" Gender="F" DateOfBirth="1989-01-02" />
 <Student Name="Bob" Gender="M"  DateOfBirth="1990-03-04" />
 <Student Name="Chad" Gender="M"  DateOfBirth="1991-05-06" />
 <Student Name="Dave" Gender="M"  DateOfBirth="1992-07-08">
   <Pet Type="dog" Name="Rover" />
 </Student>
 <Student DateOfBirth="1993-09-10" Gender="F" Name="Emily" />

</Students>
$$ MODE TUSCRIPT
COMPILE
LOOP x = xmldata
SET name=GET_TAG_NAME (x)
IF (name=="student") THEN
 SET studentname=GET_ATTRIBUTE (x,"Name")
 IF (studentname!="") PRINT studentname
ENDIF
ENDLOOP
ENDCOMPILE
</lang>
Output:

April
Bob
Chad
Dave
Emily

Vedit macro language

This implementation finds every Student tag and displays the value of its Name attribute.
<lang vedit>Repeat(ALL) {

   Search("<Student|X", ERRBREAK)
   #1 = Cur_Pos
   Match_Paren()
   if (Search_Block(/Name=|{",'}/, #1, Cur_Pos, BEGIN+ADVANCE+NOERR+NORESTORE)==0) { Continue }
   #2 = Cur_Pos
   Search(/|{",'}/)
   Type_Block(#2, Cur_Pos)
   Type_Newline

}</lang>

Output:

April
Bob
Chad
Dave
Émily

Visual Basic .NET

<lang vbnet>Dim xml = <Students>
              <Student Name="April"/>
              <Student Name="Bob"/>
              <Student Name="Chad"/>
              <Student Name="Dave"/>
              <Student Name="Emily"/>
          </Students>

Dim names = (From node In xml...<Student> Select node.@Name).ToArray

For Each name In names
     Console.WriteLine(name)
Next</lang>
