Data

The QUT-NOISE Databases and Protocols

Queensland University of Technology
Dean, David ; Sridharan, Sridha ; Vogt, Robert ; Mason, Michael
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=info:doi10.4225/09/58819f7a21a21&rft.title=The QUT-NOISE Databases and Protocols&rft.identifier=10.4225/09/58819f7a21a21&rft.publisher=Queensland University of Technology&rft.description=The QUT-NOISE Databases and Protocols Overview This distribution contains the QUT-NOISE database and the code required to create the QUT-NOISE-TIMIT database from the QUT-NOISE database and a locally installed copy of the TIMIT database. It also contains code to create the QUT-NOISE-SRE protocol on top of an existing speaker recognition evaluation database (such as NIST evaluations). Further information on the QUT-NOISE and QUT-NOISE-TIMIT databases is available in our paper:  D. Dean, S. Sridharan, R. Vogt, M. Mason (2010) , in Proceedings of Interspeech 2010, Makuhari Messe International Convention Complex, Makuhari, Japan. This paper is also available in the file: docs/Dean2010, The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithm.pdf, distributed with this database. Further information on the QUT-NOISE-SRE protocol is available in our paper:  D. Dean, A. Kanagasundaram, H. Ghaemmaghami, M. Hafizur, S. Sridharan (2015) . In Proceedings of Interspeech 2015, September, Dresden, Germany. Licensing The QUT-NOISE data itself is licensed CC-BY-SA, and the code required to create the QUT-NOISE-TIMIT database and QUT-NOISE-SRE protocols is licensed under the BSD license. Please consult the approriate LICENSE.txt files (in the code and QUT-NOISE directories) for more information. To attribute this database, please include the following citation:  D. Dean, S. Sridharan, R. Vogt, M. Mason (2010) , in Proceedings of Interspeech 2010, Makuhari Messe International Convention Complex, Makuhari, Japan. If your work is based upon the QUT-NOISE-SRE, please also include this citation:  D. Dean, A. Kanagasundaram, H. Ghaemmaghami, M. Hafizur, S. Sridharan (2015) . In Proceedings of Interspeech 2015, September, Dresden, Germany. Download and Installation Download the following QUT-NOISE*.zip files: (26.7 MB, md5sum: 672461fd88782e9ea10d5c2cb7a84196) (1.6 GB, md5sum: f87fb213c0e1c439e1b727fb258ef2cd) (1.7 GB, md5sum: d680118b4517e1257a9263b99d1ac401) (1.4 GB, md5sum: d99572ae1c118b749c1ffdb2e0cf0d2e) (1.4 GB, md5sum: fe107ab341e6bc75de3a32c69344190e) (1.6 GB, md5sum: 68d5ebc2e60cb07927cc4d33cdf2f017) Creating QUT-NOISE-TIMIT Obtaining TIMIT In order to construct the QUT-NOISE-TIMIT database from the QUT-NOISE data supplied here you will need to obtain a copy of the TIMIT database from the . If you just want to use the QUT-NOISE database, or you wish to combine it with different speech data, TIMIT is not required. Creating QUT-NOISE-TIMIT Once you have obtained TIMIT, download and install a copy of  and install it in your MATLABPATH. Run matlab in the QUT-NOISE/code directory, and run the function: createQUTNOISETIMIT('/location/of/timit-cd/timit'). This will create the QUT-NOISE-TIMIT database in the QUT-NOISE/QUT-NOISE-TIMIT directory. If you wish to verify that the QUT-NOISE-TIMIT database matches that evaluated in our original paper, please check that the md5sums (use md5sum on unix-based OSes) match those in the QUT-NOISE-TIMIT/md5sum.txt file. Using the QUT-NOISE-SRE protocol The code related to the QUT-NOISE-SRE protocol can be used in two ways: To create a collection of noisy audio files across the scenarios in the QUT-NOISE database at different noise levels, or, To recreate a list of file names based on the QUT-NOISE-SRE protocl produced by another researcher, having already done (1). This allows existing research to be reproduced without having to send large volumes of audio around. If you are interested in creating your own noisy database from an existing SRE database (1 above), please look at the example script exampleQUTNOISESRE.sh in the QUT-NOISE/code directory. You will need to make some modifications, but it should give you the right idea. If you are interested in creating our QUT-NOISE-NIST2008 database published at Interspeech 2015, you can find the list of created noisy files in the QUT-NOISE-NIST2008.train.short2.list and QUT-NOISE-NIST2008.test.short3.list files in the QUT-NOISE/code directory. These files can be recreated as follows (provided you have access to the NIST2008 SRE data):  Run matlab in the QUT-NOISE/code directory, and run the following functions:  createQUTNOISESREfiles('NIST2008.train.short2.list', ...  'QUT-NOISE-NIST2008.train.short2.list', ...  ' &rft.creator=Dean, David &rft.creator=Sridharan, Sridha &rft.creator=Vogt, Robert &rft.creator=Mason, Michael &rft.date=2010&rft.edition=1&rft.relation=http://eprints.qut.edu.au/85240/&rft.relation=http://eprints.qut.edu.au/38144/&rft.coverage=Brisbane&rft_rights=© 2010 QUT and is licensed under the Creative Commons Attribution-ShareAlike 3.0 Australia License.&rft_rights=Creative Commons Attribution-Share Alike 3.0 http://creativecommons.org/licenses/by-sa/4.0/&rft_subject=Speech Processing; Speaker Recognition; Biometrics;&rft.type=dataset&rft.language=English Access the data

Licence & Rights:

Open Licence view details
CC-BY-SA

Creative Commons Attribution-Share Alike 3.0
http://creativecommons.org/licenses/by-sa/4.0/

© 2010 QUT and is licensed under the Creative Commons Attribution-ShareAlike 3.0 Australia License.

Access:

Other

Contact Information

Postal Address:
Dr Simon Denman

s.denman@qut.edu.au

Full description

The QUT-NOISE Databases and Protocols

Overview

This distribution contains the QUT-NOISE database and the code required to create the QUT-NOISE-TIMIT database from the QUT-NOISE database and a locally installed copy of the TIMIT database. It also contains code to create the QUT-NOISE-SRE protocol on top of an existing speaker recognition evaluation database (such as NIST evaluations). Further information on the QUT-NOISE and QUT-NOISE-TIMIT databases is available in our paper: 


D. Dean, S. Sridharan, R. Vogt, M. Mason (2010) , in Proceedings of Interspeech 2010, Makuhari Messe International Convention Complex, Makuhari, Japan.

This paper is also available in the file: docs/Dean2010, The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithm.pdf, distributed with this database.

Further information on the QUT-NOISE-SRE protocol is available in our paper: 
D. Dean, A. Kanagasundaram, H. Ghaemmaghami, M. Hafizur, S. Sridharan (2015) . In Proceedings of Interspeech 2015, September, Dresden, Germany.

Licensing

The QUT-NOISE data itself is licensed CC-BY-SA, and the code required to create the QUT-NOISE-TIMIT database and QUT-NOISE-SRE protocols is licensed under the BSD license. Please consult the approriate LICENSE.txt files (in the code and QUT-NOISE directories) for more information. To attribute this database, please include the following citation: 


D. Dean, S. Sridharan, R. Vogt, M. Mason (2010) , in Proceedings of Interspeech 2010, Makuhari Messe International Convention Complex, Makuhari, Japan.

If your work is based upon the QUT-NOISE-SRE, please also include this citation: 
D. Dean, A. Kanagasundaram, H. Ghaemmaghami, M. Hafizur, S. Sridharan (2015) . In Proceedings of Interspeech 2015, September, Dresden, Germany.

Download and Installation

Download the following QUT-NOISE*.zip files:

  • (26.7 MB, md5sum: 672461fd88782e9ea10d5c2cb7a84196)
  • (1.6 GB, md5sum: f87fb213c0e1c439e1b727fb258ef2cd)
  • (1.7 GB, md5sum: d680118b4517e1257a9263b99d1ac401)
  • (1.4 GB, md5sum: d99572ae1c118b749c1ffdb2e0cf0d2e)
  • (1.4 GB, md5sum: fe107ab341e6bc75de3a32c69344190e)
  • (1.6 GB, md5sum: 68d5ebc2e60cb07927cc4d33cdf2f017)

Creating QUT-NOISE-TIMIT

Obtaining TIMIT

In order to construct the QUT-NOISE-TIMIT database from the QUT-NOISE data supplied here you will need to obtain a copy of the TIMIT database from the . If you just want to use the QUT-NOISE database, or you wish to combine it with different speech data, TIMIT is not required.

Creating QUT-NOISE-TIMIT

  • Once you have obtained TIMIT, download and install a copy of  and install it in your MATLABPATH.
  • Run matlab in the QUT-NOISE/code directory, and run the function: createQUTNOISETIMIT('/location/of/timit-cd/timit'). This will create the QUT-NOISE-TIMIT database in the QUT-NOISE/QUT-NOISE-TIMIT directory.
  • If you wish to verify that the QUT-NOISE-TIMIT database matches that evaluated in our original paper, please check that the md5sums (use md5sum on unix-based OSes) match those in the QUT-NOISE-TIMIT/md5sum.txt file.
  • Using the QUT-NOISE-SRE protocol
    • The code related to the QUT-NOISE-SRE protocol can be used in two ways:
      1. To create a collection of noisy audio files across the scenarios in the QUT-NOISE database at different noise levels, or,
      2. To recreate a list of file names based on the QUT-NOISE-SRE protocl produced by another researcher, having already done (1). This allows existing research to be reproduced without having to send large volumes of audio around.
    • If you are interested in creating your own noisy database from an existing SRE database (1 above), please look at the example script exampleQUTNOISESRE.sh in the QUT-NOISE/code directory. You will need to make some modifications, but it should give you the right idea.
    • If you are interested in creating our QUT-NOISE-NIST2008 database published at Interspeech 2015, you can find the list of created noisy files in the QUT-NOISE-NIST2008.train.short2.list and QUT-NOISE-NIST2008.test.short3.list files in the QUT-NOISE/code directory.
    • These files can be recreated as follows (provided you have access to the NIST2008 SRE data): 

      Run matlab in the QUT-NOISE/code directory, and run the following functions: 

      createQUTNOISESREfiles('NIST2008.train.short2.list', ... 
      'QUT-NOISE-NIST2008.train.short2.list', ... 
      '

This dataset is part of a larger collection

Click to explore relationships graph

Spatial Coverage And Location

text: Brisbane

Subjects

User Contributed Tags    

Login to tag this record with meaningful keywords to make it easier to discover

Identifiers