Full description
The project involves collecting the child reading dataset for the language is Xhosa, a South African Bantu language. The collected dataset is then processed with the help of native speakers and utilized to train state-of-the-art machine learning models focussed on assessing whether the child has spoken the word correctly or not. The dataset contains 14,972 recordings with an average of 4 seconds each. Each recording is annotated by three independent markers and consists of children speaking a particular word or letter from the Xhosa language in a classroom setting. Please note that the attached zip file contains ~14,000 files. If you download this file to a Onedrive or Sharepoint location, you may be affected by the 10,000 files limit to download. When unzipping or downloading, take care to ensure that all the files are downloaded completely.Created: 2025-05-21
Data time period: 02 2024 to 30 11 2024
Spatial Coverage And Location
text: South Africa
Subjects
Annotated |
Applied Computing |
Applied Computing Not Elsewhere Classified |
Applied Mathematics |
Applied Mathematics Not Elsewhere Classified |
Artificial Intelligence |
Artificial Intelligence Not Elsewhere Classified |
Assessment |
Culture and Society |
Children |
Classroom |
Communication |
Communication Not Elsewhere Classified |
Communication Technologies, Systems and Services |
Communication Technologies, Systems and Services Not Elsewhere Classified |
Data Management and Data Science |
Data Management and Data Science Not Elsewhere Classified |
Education |
Education and Training |
EGRA |
EGRA-AI |
Expanding Knowledge |
Early Grade |
Education Systems |
Education Systems Not Elsewhere Classified |
Expanding Knowledge |
Expanding Knowledge in Language, Communication and Culture |
Information and Communication Services |
Information and Computing Sciences |
Information Systems, Technologies and Services |
Information Systems, Technologies and Services Not Elsewhere Classified |
Language, Communication and Culture |
Language Studies |
Language Studies Not Elsewhere Classified |
Learner and Learning |
Learner and Learning Not Elsewhere Classified |
Mathematical Sciences |
Machine Learning |
Machine Learning Not Elsewhere Classified |
Other Education and Training |
Other Education and Training Not Elsewhere Classified |
Teaching and Curriculum |
Teaching and Curriculum Not Elsewhere Classified |
isiXhosa |
User Contributed Tags
Login to tag this record with meaningful keywords to make it easier to discover
Identifiers
- DOI : 10.26183/93X0-QY45
- Local : research-data.westernsydney.edu.au/published/7dfe822035f011f096a41d0408cdc7bb
