The Hans Rausing Endangered Languages ProjectThe Hans Rausing Endangered Languages Project   The Hans Rausing Endangered Languages Project

XML Day

Venue: The Endangered Languages Archive, R201
Co-ordinators: David Nathan and Ed Garrett
27 October 2006, 10am - 5pm

Last updated: 21 October 2006

The day will consist of a mix of tutorial, practical, and discussion sessions, including group work and problem solving. The workshop will provide participants with opportunities to:
  • learn about the history, purpose, and formalism of XML
  • understand the applications, strengths, and weaknesses of XML
  • see how XML can be applied to linguistic data
  • gain basic hands on experience with designing, "reading", evaluating, and editing XML
  • learn about technologies that are closely related to and used in conjunction with XML

Pre-reading and preparatory exercises are now available from ELAR. It is essential that you attempt these before the workshop.

Program (provisional)

10:00 - 10:40 Roots, principles and formalism. Lookahead to architecture. Namespaces and Unicode. Workflow. Discussion of preparatory material.
10:40 - 11:20 Introduction to Oxygen. Practical exercises on document marking up and well-formedness
11:20 - 11:40 Break
11:40 - 12:40 Constraining document grammar: DTD and Schema. Constructing DTDs. XPath and XQuery.
12:40 - 13:00 More Oxygen functions. XPath exercises.
13:00 - 14:00 Lunch break (lunch not supplied)
14:00 - 14:40 XSLT and CSS, XSL:FO
14:40 - 15:20 XSLT exercises
15:20 - 15:40 Break
15:40 - 16:20 Real world: office applications and creating and manipulating XML
16:20 - 17:00 Exercises: creating and manipulating XML, bringing it all together

Key:

  Classroom sessions
  Group practical sessions
  Breaks (GYO refreshments and lunch)

 

Outline

Theory

  • XML roots
  • XML principles and formalism
  • XML vs. relational databases

Practice

  • Case studies in representing linguistic data and theories
  • Good XML design
  • XML editors
  • The view from mobilisation
  • Born-XML and marriages of convenience
  • Examples of XML in linguistics: modelling interlinear glossed text, Transcriber, ELAN files

Architecture of XML-related technologies

  • Constraining XML with DTDs and Schemas
  • Namespaces and different XML vocabularies
  • Transforming/displaying XML with XSLT, CSS, and XSL-FO
  • Searching XML using XPath and XQuery
  • Unicode and XML