Full description
The Language Data Commons of Australia (LDaCA) is making nationally significant language data available for academic and non-academic use and to provide a model for ensuring continued access with appropriate community control.
LDaCA is enabling a sustainable long-term repository for ingesting and curating existing language data collections of national significance. These collections include intangible cultural heritage of the languages of some of the world's longest continuous cultures in one of the world's most linguistically diverse regions (Aboriginal and Torres Strait Islander languages and regional languages of the Pacific), and data which is important for cyber-security (AusTalk, Australian National Corpus, corpora of regional languages), for gauging popular opinions and sentiment (Australian Twitter Corpus), and for emergency communication (languages of the region and some Indigenous languages).
As a portal to these language records, with associated metadata, LDaCA ensures long-lasting access for analysis and re-use of these invaluable data in a culturally, ethically and legally appropriate manner.
Collections in the LDaCA portal include:
- AustLit
- A COrpus of Oz Early English (COOEE)
- Braided Channels
- The LaTrobe Corpus of Spoken Australian English (LTCSAusE)
- International Corpus of English (Aus)
- Australian Corpus of English
- Australian Radio Talkback
User Contributed Tags
Login to tag this record with meaningful keywords to make it easier to discover