The Hans Rausing Endangered Languages Project  The Hans Rausing Endangered Languages Project

LSA 2011 Organized Session and Poster Session

Metadata in Language Documentation and Description

Held as part of the Annual Meeting of the Linguistic Society of America, Pittsburgh USA, 6-9 January 2011 [meeting website]

Organizers: Peter K. Austin (SOAS, London), Jeff Good (University at Buffalo, NY)

Introduction

Show/hide Introduction

Since the LSA Committee on Endangered Language Preservation was established in 1992, interest among linguists in documenting endangered languages has grown tremendously, and the new field of documentary linguistics has emerged. Dedicated sessions at annual meetings, new funding initiatives such as the NSF/NEH Documenting Endangered Languages program and the Hans Rausing Endangered Languages Project, and organizations such the Open Language Archives Community have helped both to stimulate and to support this interest. The urgency of the need for more and better language documentation projects is another major stimulus, both for languages that are nearly extinct and for those that may be teetering on the brink of survival.

As linguists have expanded their efforts at documentation there has been an increasing interest in collecting, preserving, and disseminating old and new records—e.g., audio, video, and text—of the world's languages. In order to for this to be done well, it is necessary to associate metadata, data about the data, with these records to ensure that their context, meaning and use can be properly determined. Metadata is therefore required not only for archiving and preservation, but also for management, identification, retrieval and understanding of the data within a documentation project. Metadata is also necessary for the proper treatment of ethical responsibilities in research, including providing various levels of access and use of language materials.

There have been several approaches to notation of metadata, ranging from limited standardized sets (such as OLAC and IMDI) to open descriptive accounts reflecting the diversity of language documentation projects. Recently, there have been suggestions that a new direction needs to be taken, namely development of ‘meta-documentary linguistics' which is concerned with the methods, tools, and theoretical underpinnings for setting up, carrying out and concluding a documentary linguistics research project. This is the documentation of the documentation research itself.

This tutorial and poster session will provide those interested in documenting and describing languages with concrete information about metadata and meta-documentation, covering the different models and how they can be employed from the perspectives of creators and users of the documentary materials in linguistics and neighboring fields. These sessions will bring together field linguists, computational linguists, language archivists, anthropologists, and archaeologists to discuss the issues from an interdisciplinary perspective. The poster session will focus on presentation of a number of archives of endangered languages materials and display their approaches to metadata.

Tutorial talks covered general topics such as how to design a metadata system and what it can be used for, what kinds of metadata researchers are collecting, how linguists' metadata relates to that developed by anthropologists and archaeologists, and what information archives need for the best description and preservation of language materials. The poster session presented specific case studies from on-going archiving projects. It also served as an opportunity for audience members to have individual discussions with representatives of archives which may be suitable for their materials.

Hide Introduction

Presentations

Peter K. Austin, Endangered Languages Academic Program, Department of Linguistics, School of Oriental and African Studies, University of London
What is metadata and what is it good for?
Abstract (PDF)
Talk (PDF)
Lisa Conathan, Beinecke Rare Book and Manuscript Library, Yale University
An archivist's view of meta-documentation
Abstract (PDF)
Talk (PDF)
Bryan Hanks, Department of Anthropology, University of Pittsburgh
Metadata in archaeology? Engaging with materiality and cultural transmission
Abstract (PDF)
Talk (PDF)
David Nathan, Endangered Languages Archive, Department of Linguistics, School of Oriental and African Studies, University of London
Better data about metadata: A survey of depositor metadata submitted to the Endangered Languages Archive
Abstract (PDF)
Talk (PDF)
Carol Ember, Human Relations Area Files, Yale University and Jeff Good, Department of Linguistics, University at Buffalo
Metadata across the four fields of anthropology
Abstract (PDF)
Talk (PDF)

Posters

Steven Bird, Department of Computer Science and Software Engineering, University of Melbourne and Gary Simons, SIL International
OLAC: Accessing the world's language resources
Abstract (PDF)
Poster (PDF)
Amy Campbell, Andrew Garrett, Hannah Haynie, Justin Spence, Ronald Sprouse, and John Sylak, Department of Linguistics, University of California, Berkeley
Geographical metadata in the Berkeley language archives
Abstract (PDF)
Poster (PDF)
Debbie Chang, Graduate Institute of Applied Linguistics
TAPS: Checklist for responsible archiving of digital language resources
Abstract (PDF)
Poster (PDF)
Gary Holton, Alaska Native Language Archive, University of Alaska, Fairbanks
Improving finding aids for endangered language archives
Abstract (PDF)
Poster (PDF)
Heidi Johnson, The Archive of the Indigenous Languages of Latin America, University of Texas at Austin
The Archive of the Indigenous Languages of Latin America
Abstract (PDF)
Poster (PDF)
Mary S. Linn, Sam Noble Oklahoma Museum of Natural History, University of Oklahoma
Metadata afterward: Gathering information for the Carolyn Quintero collection
Abstract (PDF)
Poster (PDF)
David Nathan, Edward Garrett, Thomas Castle, and Jennifer Marshall, Endangered Languages Archive, Department of Linguistics, School of Oriental and African Studies, University of London
Endangered Languages Archive
Abstract (PDF)
Poster (PDF)
Nicholas Thieberger, Department of Linguistics and Applied Linguistics, University of Melbourne and Department of Linguistics, University of Hawai'i at Manoa
The Pacific and Regional Archive for Digital Sources in Endangered Cultures
Abstract (PDF)
Poster (PDF)
Laura Welcher, The Rosetta Project, The Long Now Foundation
The Rosetta Project Distributed Archive Model

References and links

Conathan, Lisa. 2011. Archiving and language documentation. In Peter K. Austin and Julia Sallabank (eds.) The Cambridge Handbook of Endangered Languages, 235-254. Cambridge: Cambridge University Press.

Good, Jeff 2002. A gentle introduction to metadata. Open Language Archives Community. link

Good, Jeff. 2011. Data and language documentation. In Peter K. Austin and Julia Sallabank (eds.) The Cambridge Handbook of Endangered Languages , 212-234. Cambridge: Cambridge University Press.

Johnson, Heidi 2004. Language documentation and archiving, or how to build a better corpus. In Peter K. Austin (ed.) Language Documentation and Description , Volume 2 , 140–53. London : SOAS. link

Nathan, David. 2011. Digital archiving. In Peter K. Austin and Julia Sallabank (eds.) The Cambridge Handbook of Endangered Languages, 255-274. Cambridge: Cambridge University Press.

Nathan, David and Peter K. Austin 2004. Reconceiving metadata: language documentation through thick and thin. In Peter K. Austin (ed.) Language Documentation and Description, Volume 2 , 179–87. London: SOAS. link

Simons, Gary and Steven Bird (eds.) 2008. OLAC Metadata. Open Language Archives Community. link

OREL - Online Resources For Endangered Languages - Metadata section (ELAR/Lameen Souag)

EMELD - Electronic Metastructures for Endangered Languages Documentation

OLAC - Open Languages Archives Community

IMDI - ISLE Metadata Initiative

TEI - Text Encoding Initiative