Full description
The project involves collecting the child reading dataset for the language is Xhosa, a South African Bantu language. The collected dataset is then processed with the help of native speakers and utilized to train state-of-the-art machine learning models focussed on assessing whether the child has spoken the word correctly or not. The dataset contains 14,972 recordings with an average of 4 seconds each. Each recording is annotated by three independent markers and consists of children speaking a particular word or letter from the Xhosa language in a classroom setting.
Created: 2025-05-21
Data time period: 02 2024 to 30 11 2024
Spatial Coverage And Location
text: South Africa
User Contributed Tags
Login to tag this record with meaningful keywords to make it easier to discover
- DOI : 10.26183/93X0-QY45
- Local : research-data.westernsydney.edu.au/published/7dfe822035f011f096a41d0408cdc7bb