Language Data Commons of Australia

Also known as: LDaCA

The University of Queensland

Ben Foley (Aggregated by)

Professor Michael Haugh (Aggregated by)

Simon Musgrave (Aggregated by)

Full description

The Language Data Commons of Australia (LDaCA) is making nationally significant language data available for academic and non-academic use and providing a model for ensuring continued access with appropriate community control.

LDaCA is enabling a sustainable long-term repository for ingesting and curating existing language data collections of national significance. These collections include intangible cultural heritage of the languages of some of the world's longest continuous cultures in one of the world's most linguistically diverse regions (Aboriginal and Torres Strait Islander languages and regional languages of the Pacific), as well as data that are important for cyber-security (AusTalk, Australian National Corpus, corpora of regional languages), for gauging popular opinions and sentiment (Australian Twitter Corpus), and for emergency communication (languages of the region and some Indigenous languages).

As a portal to these language records, with associated metadata, LDaCA ensures long-lasting access for analysis and re-use of these invaluable data in a culturally, ethically and legally appropriate manner.

Collections in the LDaCA portal include:

AustLit
A COrpus of Oz Early English (COOEE)
Braided Channels
The LaTrobe Corpus of Spoken Australian English (LTCSAusE)
International Corpus of English (Aus)
Australian Corpus of English
Australian Radio Talkback

This dataset is part of a larger collection

Subjects

Language Data Commons of Australia
