Data

Short Tailed Shearwater DREAM gene sequencing fastq files

Australian Ocean Data Network
Polanowski, A., Deagle, B.E. and De Paoli-Iseppi, R. ; DEAGLE, BRUCE E. ; DE PAOLI-ISEPPI, RICARDO ; POLANOWSKI, ANDREA
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=Dataset DOI&rft.title=Short Tailed Shearwater DREAM gene sequencing fastq files&rft.identifier=Dataset DOI&rft.publisher=Australian Antarctic Data Centre&rft.description=This data set includes unprocessed sample .fastq files from two separate Illumina NextSeq runs, labelled as 'Run_1' and 'Run_2', respectively. Sample names: e.g. STS15059, 'STS' is the abbreviation of Short-tailed shearwater. The first two digits of the numeric refer to the year of collection e.g. '15' = 2015. Finally, the following number refers to the sequential unique ID for that year, e.g. '059' is the fifty-ninth sample for the years' collection. Leg bands are also recorded and are generally a 5-digit number and are unique to the individual bird. Longitudinal samples can be identified using these band IDs. E.g. in Run_2, an individual with the band number: 52196, was collected in 2015 as 'STS15065' and again in 2017 as 'STS17044'. Run_1: N = 35 individual samples are split across 4 lanes e.g. 'STS16020_S35_L001(/L002/L003/L004)_R1_001/fastq' and need to be merged before conversion to .fasta format and downstream analysis. Run_2: N = 36 individual samples were provided as a single merged file from the service provider, e.g. 'STS15059_S34_R1_001.fastq'. Sample_info: This excel spreadsheet has information on samples as follows: 'Band': 5-digit number on leg band. 'Sample': Sample number within run. 'UID': The unique ID for collection year e.g. STS15007. 'Age': The known-age of the animal rounded to whole year. 'Index (NebNext)': The NEB index used for NGS sample identification. 'Note': Additional information on if a sample was a between or within run replicate or longitudinal replicate. Analysis of these data will be published in: [tba: R. De Paoli-Iseppi et al. 2018. Molecular Ecology Resources].Progress Code: onGoing&rft.creator=Polanowski, A., Deagle, B.E. and De Paoli-Iseppi, R. &rft.creator=DEAGLE, BRUCE E. &rft.creator=DE PAOLI-ISEPPI, RICARDO &rft.creator=POLANOWSKI, ANDREA &rft.date=2018&rft.coverage=westlimit=148.08609; southlimit=-40.31723; eastlimit=148.34579; northlimit=-40.20825&rft.coverage=westlimit=148.08609; southlimit=-40.31723; eastlimit=148.34579; northlimit=-40.20825&rft_rights=This metadata record is publicly available.&rft_rights=These data are publicly available, however owing to their size are only available on request to the Australian Antarctic Data Centre.&rft_rights= https://creativecommons.org/licenses/by/4.0/legalcode&rft_rights=This data set conforms to the CCBY Attribution License (http://creativecommons.org/licenses/by/4.0/). Please follow instructions listed in the citation reference provided at http://data.aad.gov.au/aadc/metadata/citation.cfm?entry_id=AAS_4014_shearwater_DREAM when using these data. http://creativecommons.org/licenses/by/4.0/).&rft_rights=Portable Network Graphic&rft_rights=https://i.creativecommons.org/l/by/3.0/88x31.png&rft_rights=Creative Commons by Attribution logo&rft_rights=Attribution 4.0 International (CC BY 4.0)&rft_rights=Legal code for Creative Commons by Attribution 4.0 International license&rft_rights=Attribution 4.0 International (CC BY 4.0)&rft_rights= https://creativecommons.org/licenses/by/4.0/legalcode&rft.type=dataset&rft.language=English Access the data

Licence & Rights:

Other view details
Unknown

https://creativecommons.org/licenses/by/4.0/legalcode

This data set conforms to the CCBY Attribution License (http://creativecommons.org/licenses/by/4.0/).

Please follow instructions listed in the citation reference provided at http://data.aad.gov.au/aadc/metadata/citation.cfm?entry_id=AAS_4014_shearwater_DREAM when using these data.
http://creativecommons.org/licenses/by/4.0/).

Attribution 4.0 International (CC BY 4.0)

https://creativecommons.org/licenses/by/4.0/legalcode

This metadata record is publicly available.

These data are publicly available, however owing to their size are only available on request to the Australian Antarctic Data Centre.

Portable Network Graphic

https://i.creativecommons.org/l/by/3.0/88x31.png

Creative Commons by Attribution logo

Attribution 4.0 International (CC BY 4.0)

Legal code for Creative Commons by Attribution 4.0 International license

Access:

Other

Contact Information

metadata@aad.gov.au

Brief description

This data set includes unprocessed sample .fastq files from two separate Illumina NextSeq runs, labelled as 'Run_1' and 'Run_2', respectively.

Sample names: e.g. STS15059, 'STS' is the abbreviation of Short-tailed shearwater. The first two digits of the numeric refer to the year of collection e.g. '15' = 2015. Finally, the following number refers to the sequential unique ID for that year, e.g. '059' is the fifty-ninth sample for the years' collection.

Leg bands are also recorded and are generally a 5-digit number and are unique to the individual bird. Longitudinal samples can be identified using these band IDs. E.g. in Run_2, an individual with the band number: 52196, was collected in 2015 as 'STS15065' and again in 2017 as 'STS17044'.

Run_1: N = 35 individual samples are split across 4 lanes e.g. 'STS16020_S35_L001(/L002/L003/L004)_R1_001/fastq' and need to be merged before conversion to .fasta format and downstream analysis.

Run_2: N = 36 individual samples were provided as a single merged file from the service provider, e.g. 'STS15059_S34_R1_001.fastq'.

Sample_info: This excel spreadsheet has information on samples as follows:
'Band': 5-digit number on leg band.
'Sample': Sample number within run.
'UID': The unique ID for collection year e.g. STS15007.
'Age': The known-age of the animal rounded to whole year.
'Index (NebNext)': The NEB index used for NGS sample identification.
'Note': Additional information on if a sample was a between or within run replicate or longitudinal replicate.

Analysis of these data will be published in: [tba: R. De Paoli-Iseppi et al. 2018. Molecular Ecology Resources].

Lineage

Progress Code: onGoing

Notes

Purpose
Raw .fastq sequence files for replication of published results or other genomic analyses.

Data time period: 2015-11-01 to 2017-12-31

148.34579,-40.20825 148.34579,-40.31723 148.08609,-40.31723 148.08609,-40.20825 148.34579,-40.20825

148.21594,-40.26274

text: westlimit=148.08609; southlimit=-40.31723; eastlimit=148.34579; northlimit=-40.20825

User Contributed Tags    

Login to tag this record with meaningful keywords to make it easier to discover

Other Information
Request a copy of the data (GET DATA)

uri : http://data.aad.gov.au/eds/4791/download