Language Archive Survey Results


1. Name and Location

Archive Name: Corpus Documentale Latinum Portugaliae (a saeculo nono post Christum usque ad duodecimum)
Archive URL: http://
Host Institution: Universidade Nova de Lisboa/Faculdade de Ciências Sociais e Humanas/Departamento de Linguística
Country: Portugal
Contact Person: Dr António H A Emiliano
Email Address: a.emiliano@mail.telepac.pt


2. Catalog

2.1 If the archive has a catalog in a standardized format, what fields does it contain? If not, what contextual information about the resources are collected? What other information would you like to collect if you could?
No standardized catalogue yet. Each text will have a header containing several fields. In the future, this header will conform to SGML.

2.2 If the electronic catalog conforms to some standard, please tell us the name of the standard.
--

2.3 To what extent have the archived materials been cataloged electronically?
no electronic catalog

2.4 If there is an online public access catalog, please give its URL.
--


3. Holdings

3.1 What geographical regions and languages are covered?
Main Regions Covered: Europe
Approx Number of Languages: 2
Main Languages: Medieval Notarial Latin and Old Portuguese

3.2 Please give impressionistic estimates of the archive holdings for each of the data types.
DATA TYPE                                          NON-DIGITAL   DIGITAL
Texts:                                             none          large
Wordlists, Vocabularies, Lexicons, Dictionaries:   none          large
Field Notes, Correspondence, Misc files:           none          none
Descriptions (Grammars, Phonologies, etc):         small         none
Audio Recordings:                                  none          none
Video Recordings:                                  none          none

3.3 Please list any other data types which are not included above, or any other comments on the archive holdings:
Digitization of the texts from direct transcription of manuscripts is still in its initial phase. No existing editions are being used or will be used to create or enlarge the corpus. Transcription of the texts is being done according to extremely conservative criteria, because the corpus is intended mainly as a resource for linguistic studies.

3.4 What proportion of the holdings are unique to the archive and not available elsewhere?
virtually everything


4. Electronic Publication

4.1 To what extent are the archive holdings published electronically, where "published" means that there is a well-defined procedure such that anyone at all can get a standard copy of the data, either on digital media or over the internet?
nothing published

4.2 To what extent are the archive holdings accessible over the web?
nothing accessible

4.3 Is permission required before materials can be accessed?

4.4 Is there any fee for materials?

4.5 How are author and/or editor defined for the electronic publications? Is there a bibliographical citation method?
does not apply

4.6 Do the electronic publications have ISBN numbers?

4.7 What plans are there to expand the electronic publication of archive holdings?
When the transcription of the manuscripts reaches the first half of the 11th century, electronic publication of the corpus in a standard format will be planned. The foreseen completion date for this phase is 2003.


5. General Issues

5.1 Who is the legal owner of archived materials?

5.2 Beyond legal ownership, are there any asserted or perceived moral rights concerning archived materials? Do the holders of the archive see the original speakers or their representatives as controlling publication?

5.3 In cases where no electronic publication is planned, why is this so? (e.g. funding, licensing, technical know-how, lack of interest).

5.4 Is any of the data in a proprietary format (e.g. MS Word)? If so, are there plans to transfer it to an open standard (e.g., XML)?
The existing data is currently in MS Word. We have plans to transfer it to an open standard such as SGML (following closely and wherever possible the TEI Guidelines).
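As an illustration only, a per-text header of the kind mentioned above might be generated along the following lines once the data leaves MS Word. This is a minimal sketch, not the archive's actual design: the function `build_header`, the sample field values, and the choice of fields are all hypothetical, and the output is XML-style markup that borrows TEI element names without being a complete or validated TEI header.

```python
import xml.etree.ElementTree as ET


def build_header(title: str, language: str, source: str) -> ET.Element:
    """Build a small TEI-style header for one transcribed text.

    Hypothetical helper: the archive's real header fields are not
    specified in the survey, so these three are assumptions.
    """
    header = ET.Element("teiHeader")

    # File description: title, publication status, and source of the text.
    file_desc = ET.SubElement(header, "fileDesc")
    title_stmt = ET.SubElement(file_desc, "titleStmt")
    ET.SubElement(title_stmt, "title").text = title
    publication_stmt = ET.SubElement(file_desc, "publicationStmt")
    ET.SubElement(publication_stmt, "p").text = "Unpublished"
    source_desc = ET.SubElement(file_desc, "sourceDesc")
    ET.SubElement(source_desc, "p").text = source

    # Profile description: the language of the text.
    profile_desc = ET.SubElement(header, "profileDesc")
    lang_usage = ET.SubElement(profile_desc, "langUsage")
    ET.SubElement(lang_usage, "language").text = language

    return header


# Sample values are invented for illustration.
header = build_header(
    title="Hypothetical charter 1",
    language="Medieval Notarial Latin",
    source="Transcribed directly from the manuscript",
)
header_markup = ET.tostring(header, encoding="unicode")
print(header_markup)
```

Keeping the header fields in a structured, non-proprietary form like this would also make it straightforward to derive a catalog from the texts themselves at a later stage.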


6. Do you have any other comments about digital archives of language material, or on this survey?
Digital archives are the most secure way to preserve textual information for future generations, and in our case they are an extremely important means for preserving and making available a part of Portugal's historical, legal, cultural and linguistic heritage. This survey is of course extremely useful.


