Data

Bengali Audio-Visual Corpus for Visual Speech Recognition

Charles Sturt University
Pondit, Ashish ; Rukon, Muhammad Eshaque Ali ; Das, Anik ; Kabir, Ashad
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=https://researchoutput.csu.edu.au/en/datasets/59a9e2de-7efa-4037-830c-b227ffb265d3&rft.title=Bengali Audio-Visual Corpus for Visual Speech Recognition&rft.identifier=59a9e2de-7efa-4037-830c-b227ffb265d3&rft.publisher=Springer&rft.description=The BenAV dataset contains a lexicon of 50 words from 128 speakers (107 male and 21 female) with 26,300 utterances. The average number of speakers for each word is 18 (max 20, min 12, and standard deviation 1.826). The total duration of the dataset is 7.3 hours. This is the first Bengali audio-visual dataset that can be used for various research, including acoustic speech recognition and audio-visual speech recognition.&rft.creator=Pondit, Ashish &rft.creator=Rukon, Muhammad Eshaque Ali &rft.creator=Das, Anik &rft.creator=Kabir, Ashad &rft.date=2021&rft.type=dataset&rft.language=English Access the data

Access:

Open

Full description

The BenAV dataset contains a lexicon of 50 words from 128 speakers (107 male and 21 female) with 26,300 utterances. The average number of speakers for each word is 18 (max 20, min 12, and standard deviation 1.826). The total duration of the dataset is 7.3 hours. This is the first Bengali audio-visual dataset that can be used for various research, including acoustic speech recognition and audio-visual speech recognition.

Notes

External Organisations
Chittagong University of Engineering & Technology; Chittagong University of Engineering and Technology (CUET)
Associated Persons
Ashish Pondit (Creator); Muhammad Eshaque Ali Rukon (Creator); Anik Das (Creator)

Created: 2021

Issued: 2021-03-10

This dataset is part of a larger collection

Click to explore relationships graph

User Contributed Tags    

Login to tag this record with meaningful keywords to make it easier to discover

Other Information
BenAV: A Bengali audio-visual corpus for visual speech recognition

url : http://researchoutput.csu.edu.au/en/publications/76c343ad-5f12-4ed7-8d9b-5b8ccf770a3f

Conference paper

Identifiers
  • global : 59a9e2de-7efa-4037-830c-b227ffb265d3