Language Archive Survey Results


1. Name and Location

Archive Name: Comparative Bantu Online Dictionary
Archive URL: http://linguistics.berkeley.edu/CBOLD
Host Institution: University of California, Berkeley
Country: USA
Contact Person: Jeff Good
Email Address: jcgood@socrates.berkeley.edu


2. Catalog

2.1 If the archive has a catalog in a standardized format, what fields does it contain? If not, what contextual information about the resources are collected? What other information would you like to collect if you could?
Folder Name, Language, Guthrie (Number), (Word) Count, Condition, Web Search (Available)

2.2 If the electronic catalog conforms to some standard, please tell us the name of the standard.

2.3 To what extent have the archived materials been cataloged electronically?
virtually everything

2.4 If there is an online public access catalog, please give its URL.
http://linguistics.berkeley.edu/cgi-bin/CBOLD/jgspltscrn.cgi?simpledbq=inventory.txt


3. Holdings

3.1 What geographical regions and languages are covered?
Main Regions Covered: Africa
Approx Number of Languages: 200
Main Languages: Bantu languages

3.2 Please give impressionistic estimates of the archive holdings for each of the data types.
DATA TYPE NON-DIGITAL DIGITAL
Texts: none
Wordlists, Vocabularies, Lexicons, Dictionaries: large
Field Notes, Correspondence, Misc files: none
Descriptions (Grammars, Phonologies, etc): small
Audio Recordings: none
Video Recordings: none

3.3 Please list any other data types which are not included above, or any other comments on the archive holdings:

3.4 What proportion of the holdings are unique to the archive and not available elsewhere?
a significant amount


4. Electronic Publication

4.1 To what extent are the archive holdings published electronically, where "published" means that there is a well-defined procedure such that anyone at all can get a standard copy of the data, either on digital media or over the internet?
virtually everything

4.2 To what extent are the archive holdings accessible over the web?
virtually everything

4.3 Is permission required before materials can be accessed?
no

4.4 Is there any fee for materials?
no

4.5 How are author and/or editor defined for the electronic publications? Is there a bibliographical citation method?
Author is defined as the author of the original source data. There is no particular bibliographical citation method. The author name and year are incorporated into each file name, however.

4.6 Do the electronic publications have ISBN numbers?
no

4.7 What plans are there to expand the electronic publication of archive holdings?
Some of the archives are searchable online. In the future more and more will be searchable this way.


5. General Issues

5.1 Who is the legal owner of archived materials?
For the most part, the legal owner is the original author of the source. We have permission to distribute these soruces.

5.2 Beyond legal ownership, are there any asserted or perceived moral rights concerning archived materials? Do the holders of the archive see the original speakers or their representatives as controlling publication?
We only have lexicons, so there have not been moral rights concerns. The original speakers are not thought of as controlling publication. (If we ever distribute texts, this may become an issue.)

5.3 In cases where no electronic publication is planned, why is this so? (e.g. funding, licensing, technical know-how, lack of interest).

5.4 Is any of the data in a proprietary format (e.g. MS Word)? If so, are there plans to transfer it to an open standard (e.g., XML)?
Much of the data is in FileMaker or Word format. There are plans to transfer the data into SGML.


6. Do you have any other comments about digital archives of language material, or on this survey?



Back to the index page