WORKSHOP PROGRAM


Locations: Presentations and panel sessions will take place in Logan Hall Room B17 (coded L below); working groups and lunches will be in the IRCS suite (coded I). For more information, please see the venues page.




Tuesday 12 December


8:00-L: REGISTRATION AT LOGAN

8:30-L: Introduction

Steven Bird, Gary Simons, Aravind Joshi.

9:00-L: Archive Infrastructure

Anthony Aristar and Helen Aristar Dry: E-MELD: Electronic metastructure for endangered languages data

Susan Hockey: Towards a model for web-based language documentation and description: some contributions from digital libraries and humanities computing research

Helen Aguera: NEH experiences and needs

Nancy Ide, Laurent Romary: XML support for annotated language resources

Patrick McConvell: Two-way research resources for indigenous languages: Positioning Resources in the GARMA

10:40-L: Break

11:00-L: PANEL 1: Legal, Ethical, and Policy Issues Concerning the Recording and Publication of Primary Language Materials

Jonathan Amith (chair), Megan Crowhurst, Mark Liberman, Patrick McConvell, Bill Poser, and Joel Sherzer

12:30-I: LUNCH (provided)
1:50 MOVE TO LOGAN

2:00-L: An Infrastructure for Open Language Archives

Michael Nelson: The Open Archives Initiative

Steven Bird and Gary Simons: White paper on establishing an infrastructure for open language archiving

3:40-L: Break

4:00-L: Data Infrastructure: Objects

Session chair: Gary Simons

Michael Nelson: Smart Objects and Dumb Archives: Insuring the Long-Term Integrity of Digital Information

Neal Audenaert: Language Data Repository: Project abstract

Saturnino Luz, Mona Baker: TEC: a toolkit and API for distributed corpus processing

Jean Carletta, Amy Isard, David McKelvie: Linguistic data processing for everyman

Peter Wittenburg, Hennie Brugman, Daan Broeder: Annotations, formats and data types in the DOBES project

Richmond H. Thomason, Bill Poser: Representing and reasoning in computer-readable field linguistics archives

6:00 CLOSE





Wednesday 13 December


8:30 ARRIVE AT LOGAN

8:40-L: Data Infrastructure: Character Encoding

Session chair: Dafydd Gibbon

Anthony McEnery, Paul Baker, Mark Leisher, Rob Gaizauskas, Hamish Cunningham: Mapping multiple South Asian 8-bit character sets to the Unicode Standard

Peter Constable: Unicode Character Encoding of Archived Linguistic Data

Eric Albright: Design of an Electronic Method for Describing Writing Systems

Bill Poser: A Sorting Tool and Issues in Sorting

Sharon Correll: Extensible rendering technology for web-based data access

Peter Constable: Phonetic Fonts and Phonetic Data Encoding

10:40-I: Break

11:00-I: Working Groups

Discuss requirements for the character encoding working group. Pass results directly to group.

12:00-I: LUNCH (provided)
1:30-L: MOVE TO LOGAN

1:40-L: Metadata

Session chair: Steven Bird

Gary Simons: Language identification in metadata descriptions of language archive holdings

Daan Broeder, Pirkko Suihkonen, Peter Wittenburg: Developing a standard for meta-descriptions of multimedia language resources

Gary Holton: Metadata for linguistic documentation archives

Lev Michael: Creating discourse data metadata for the AILLA Project: Lessons learned and needs felt

3:00-I: Break

3:20-I: Working Groups

Discuss requirements for the metadata working group. Pass results directly to group.

4:20 MOVE TO LOGAN

4:30-L: Brief Advertisements

A chance for participants to give brief advertisements for projects and initiatives which are not included in the program.

4:45-L: PANEL 2: Community Infrastructure

Helen Aristar Dry (chair), Megan Crowhurst, Nicholas Ostler, Doug Whalen

6:00 CLOSE

CONFERENCE DINNER AT THE PALLADIUM
6:00 HORS D'OEUVRES AND DRINKS, 7:00 DINNER
(for all presenters and participants)





Thursday 14 December


[Character encoding and metadata working groups meet at IRCS, and report back to main workshop after the mid-afternoon break.]

8:30-L: ARRIVE AT LOGAN

8:40-L: Annotated Text and Signals

Session chair: Susan Hockey

Elena Maslova: Text corpora: grammatical queries and morphemic indices

Anne Mahoney, Jeffrey A. Rydberg-Cox, David A. Smith, Clifford E. Wulfman: Generalizing the Perseus XML Document Manager

Douglas Whalen, David Harrison, Dennis Holt: Searching for reflexes in linguistic archives: The Endangered Language Fund Algonquian Project

Steven Weinberger: Speech Accent Archive: Issues and Methods

Mike Maxwell: Creating Morphological Data: From Markup to Generalizations

10:20-L: Break

10:40-L: Annotated Text and Signals (cont)

Session chair: Nancy Ide

Ronald Sprouse: Data types for interlinear text

Boyd Michailovsky, Michel Jacobson: A linguistic archive on the web

Kazuaki Maeda, Steven Bird: A Formal Framework for Interlinear Text

David Weber: E.g.

12:00-I: LUNCH (provided)
1:30-L: MOVE TO LOGAN

1:40-L: Lexicons

Session chair: Nicoletta Calzolari

Martha Palmer, Ralph Grishman, Nicoletta Calzolari, Antonio Zampolli: Standardizing Multilingual Lexicons

Dafydd Gibbon: On lexical objects and their properties

John Bell, Steven Bird: A Preliminary Study of the Structure of Lexicon Entries

Jeff Good, Ronald Sprouse: SGML markup of dictionaries with special reference to comparative and etymological data

Steve Tinney: XML and the Corpus-Based Dictionary: Development and Implementation of the Pennsylvania Sumerian Dictionary

3:20-L: Break

3:40-L: Character encoding and metadata working groups report back

4:20-L: PANEL 3: Archivists Respond

5:30 DINNER

7:30-I: Working Groups and Demo Session

Working groups writing up

Informal demonstrations

9:30 CLOSE





Friday 15 December


All Friday events will take place in the IRCS suite.

9:00-I: ARRIVE AT IRCS

9:10-I: Working groups report back

10:40-I: Break

11:00-I: Final working group session

Final writing up

12:00: LUNCH
1:30-I: ARRIVE AT IRCS

1:40-I: Resolutions and Actions

3:00-I: Break

3:30: CLOSE

