Data

Language Data Commons of Australia

Also known as: LDaCA
The University of Queensland
Ben Foley (Aggregated by) Professor Michael Haugh (Aggregated by) Simon Musgrave (Aggregated by)
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=https://data.ldaca.edu.au/ LDaCA Data Portal&rft.title=Language Data Commons of Australia&rft.publisher=The University of Queensland&rft.description=The Language Data Commons of Australia (LDaCA) is making nationally significant language data available for academic and non-academic use and to provide a model for ensuring continued access with appropriate community control. LDaCA is enabling a sustainable long-term repository for ingesting and curating existing language data collections of national significance. These collections include intangible cultural heritage of the languages of some of the world's longest continuous cultures in one of the world's most linguistically diverse regions (Aboriginal and Torres Strait Islander languages and regional languages of the Pacific), and data which is important for cyber-security (AusTalk, Australian National Corpus, corpora of regional languages), for gauging popular opinions and sentiment (Australian Twitter Corpus), and for emergency communication (languages of the region and some Indigenous languages). As a portal to these language records, with associated metadata, LDaCA ensures long-lasting access for analysis and re-use of these invaluable data in a culturally, ethically and legally appropriate manner. Collections in the LDaCA portal include: AustLit A COrpus of Oz Early English (COOEE) Braided Channels The LaTrobe Corpus of Spoken Australian English (LTCSAusE) International Corpus of English (Aus) Australian Corpus of English Australian Radio Talkback  &rft.creator=Ben Foley&rft.creator=Professor Michael Haugh&rft.creator=Simon Musgrave&rft.date=2023&rft_rights=Licence and access rights in individual collections vary. Refer to the licence description on individual records in the collection for more information.&rft_subject=LANGUAGE STUDIES&rft_subject=LANGUAGE, COMMUNICATION AND CULTURE&rft_subject=Australian history&rft_subject=HISTORICAL STUDIES&rft_subject=HISTORY AND ARCHAEOLOGY&rft_subject=Aboriginal and Torres Strait Islander linguistics and languages&rft_subject=Conservation of Aboriginal and Torres Strait Islander heritage&rft_subject=Aboriginal and Torres Strait Islander language education&rft_subject=English Language&rft_subject=Data models, storage and indexing&rft_subject=Corpus linguistic&rft_subject=Discourse and Pragmatics&rft_subject=LINGUISTICS&rft_subject=Language documentation and description&rft_subject=Phonetics and speech science&rft_subject=Sociolinguistics&rft_subject=History, Heritage and Archaeology&rft_subject=Indigenous Studies&rft_subject=Aboriginal and Torres Strait Islander culture, language and history&rft_subject=Aboriginal and Torres Strait Islander Education&rft_subject=EDUCATION&rft_subject=SPECIALIST STUDIES IN EDUCATION&rft_subject=INFORMATION AND COMPUTING SCIENCES&rft_subject=Data management and data science&rft.type=dataset&rft.language=English Access the data

Licence & Rights:

view details

Licence and access rights in individual collections vary. Refer to the licence description on individual records in the collection for more information.

Access:

Other

Full description

The Language Data Commons of Australia (LDaCA) is making nationally significant language data available for academic and non-academic use and to provide a model for ensuring continued access with appropriate community control.

LDaCA is enabling a sustainable long-term repository for ingesting and curating existing language data collections of national significance. These collections include intangible cultural heritage of the languages of some of the world's longest continuous cultures in one of the world's most linguistically diverse regions (Aboriginal and Torres Strait Islander languages and regional languages of the Pacific), and data which is important for cyber-security (AusTalk, Australian National Corpus, corpora of regional languages), for gauging popular opinions and sentiment (Australian Twitter Corpus), and for emergency communication (languages of the region and some Indigenous languages).

As a portal to these language records, with associated metadata, LDaCA ensures long-lasting access for analysis and re-use of these invaluable data in a culturally, ethically and legally appropriate manner.

Collections in the LDaCA portal include:

  • AustLit
  • A COrpus of Oz Early English (COOEE)
  • Braided Channels
  • The LaTrobe Corpus of Spoken Australian English (LTCSAusE)
  • International Corpus of English (Aus)
  • Australian Corpus of English
  • Australian Radio Talkback

 

This dataset is part of a larger collection

Click to explore relationships graph