Data

BIOMATE data compilation (v1.0)

University of Tasmania, Australia
Baldry, Kimberlee
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=http://metadata.imas.utas.edu.au/geonetwork/srv/eng/search?uuid=3e0afabb-ebd2-4972-bb67-8626c2643129&rft.title=BIOMATE data compilation (v1.0)&rft.identifier=http://metadata.imas.utas.edu.au/geonetwork/srv/eng/search?uuid=3e0afabb-ebd2-4972-bb67-8626c2643129&rft.description=Biological ocean data collected from ships find reuse in aggregations of historical data. These data are heavily relied upon to document long term change, validate satellite algorithms for ocean biology and are useful in assessing the performance of autonomous platforms and biogeochemical models. There is a need to combine subsurface biological and physical data into one aggregate data product to support reproducible research. Existing aggregate products are dissimilar in source data, have largely been isolated to the surface ocean and most omit physical data. These products cannot easily be used to explore subsurface bio-physical relationships. We present the first version of a biological ocean data reformatting effort (BIO-MATE, https://gitlab.com/KBaldry/BIO-MATE). BIO-MATE uses R software that reformats openly sourced published datasets from oceanographic voyages. These reformatted biological and physical data from underway sensors, profiling sensors and pigments analysis are stored in an interoperable and reproducible BIO-MATE data product for easy access and use.Maintenance and Update Frequency: none-plannedStatement: The BIO-MATE aggregate data product brings together ship-based data that have been collected by a Principal Investigator (PI) and openly published as publicly accessible data. The first version of BIO-MATE includes published datasets associated with four types of measurements: 1. sensors in the vessels underway seawater in-take (underway sensor data stream), 2. profiling sensors mounted to sampling rosettes (profiling sensor data stream), 3. particulate organic carbon (POC) measured in the laboratory (POC data stream), and 4. pigments measured in the laboratory (pigment data stream). A semi-automated workflow and the BIO-MATE R software (https://github.com/KimBaldry/BIOMATE-Rpackage) were used to reformat published datasets, and produce the BIO-MATE data product. Reformatted data files follow the WHP-Exchange format (https://exchange-format.readthedocs.io/en/latest/index.html), except pigment headers follow MAREDAT standards. The software arranges reformatted WHPE files into four data streams in local directories that include separate WHPE files, for each EXPOCODE, and for underway sensors, profiling sensor casts and pigment measurements. An algorithm links biological data in the pigment and POC data streams to the physical data in the profiling sensor and underway sensor data streams. Biological data records are given a profiling sensor identification tag (CTD_ID) if matched to physical data in BIO-MATE. Information is included in the BIO-MATE data product, for citing published datasets, laboratory analysis methodologies (for the PIG data stream) and the data repositories through which published data records were accessed. Each citation is recorded as a BibTEX entry, compatible with EndNote, R and LaTEX. Each BibTEX entry has a tag that is referenced in the processing metadata. Limited quality assurance has been performed on the BIO-MATE data product and is variable across published datasets. The initial integrity of these data records lies with the Principal Investigators of the published data record. As a result, reformatted data have varying levels of quality control and post-processing. Data included in the data product were made available by the following data repositories; PANGAEA, AODN/ IMOS, SeaBASS, CCHDO, AADC, GLODAP, PAL-LTER, CSIRO, MDGS and BCO-DMO. Records of data access dates, source addresses and digital object identifiers are recorded as metadata, alongside appropriate data citations. We acknowledge the enormous community effort undertaken in the collection, analysis and publication of this data and thank principle investigators for publishing their data in open access repositories.&rft.creator=Baldry, Kimberlee &rft.date=2015&rft.coverage=westlimit=-180; southlimit=-88.937; eastlimit=-180; northlimit=-13.8293&rft.coverage=westlimit=-180; southlimit=-88.937; eastlimit=-180; northlimit=-13.8293&rft_rights= http://creativecommons.org/licenses/by/4.0/&rft_rights=http://i.creativecommons.org/l/by/4.0/88x31.png&rft_rights=WWW:LINK-1.0-http--related&rft_rights=License Graphic&rft_rights=Creative Commons Attribution 4.0 International License&rft_rights=http://creativecommons.org/international/&rft_rights=WWW:LINK-1.0-http--related&rft_rights=WWW:LINK-1.0-http--related&rft_rights=License Text&rft_rights=The citation in a list of references is: citation author name/s (year metadata published), metadata title. Citation author organisation/s. File identifier and Data accessed at (add http link).&rft_rights=All data, except that sourced from MGDS, is available under CC-BY 4.0. MGDS data is licensed as it is by the source under CC-BY-NC-SA 3.0 US. When using BIOMATE data, please cite individual datasets using the citation information provided.&rft_rights=Please contact the researcher with any queries.&rft_rights=Creative Commons Attribution 4.0 International License http://creativecommons.org/licenses/by/4.0&rft_subject=oceans&rft_subject=EARTH SCIENCE | BIOSPHERE | ECOSYSTEMS | MARINE ECOSYSTEMS | PELAGIC | OCEANIC ZONE&rft_subject=EARTH SCIENCE | OCEANS | OCEAN CHEMISTRY | PIGMENTS | CHLOROPHYLL&rft_subject=EARTH SCIENCE | OCEANS | OCEAN CHEMISTRY | BIOGEOCHEMICAL CYCLES&rft_subject=EARTH SCIENCE | BIOSPHERE | ECOLOGICAL DYNAMICS | ECOSYSTEM FUNCTIONS | BIOGEOCHEMICAL CYCLES&rft_subject=EARTH SCIENCE | OCEANS | OCEAN CHEMISTRY | PIGMENTS&rft_subject=EARTH SCIENCE | OCEANS | SALINITY/DENSITY | SALINITY&rft_subject=EARTH SCIENCE | OCEANS | OCEAN TEMPERATURE&rft_subject=EARTH SCIENCE | BIOSPHERE | ECOSYSTEMS | AQUATIC ECOSYSTEMS | PLANKTON | PHYTOPLANKTON&rft_subject=Biological Oceanography&rft_subject=EARTH SCIENCES&rft_subject=OCEANOGRAPHY&rft_subject=Chemical Oceanography&rft.type=dataset&rft.language=English Access the data

Licence & Rights:

Open Licence view details
CC-BY

http://creativecommons.org/licenses/by/4.0/

Creative Commons Attribution 4.0 International License
http://creativecommons.org/licenses/by/4.0

http://i.creativecommons.org/l/by/4.0/88x31.png

WWW:LINK-1.0-http--related

License Graphic

Creative Commons Attribution 4.0 International License

http://creativecommons.org/international/

WWW:LINK-1.0-http--related

WWW:LINK-1.0-http--related

License Text

The citation in a list of references is: citation author name/s (year metadata published), metadata title. Citation author organisation/s. File identifier and Data accessed at (add http link).

All data, except that sourced from MGDS, is available under CC-BY 4.0. MGDS data is licensed as it is by the source under CC-BY-NC-SA 3.0 US. When using BIOMATE data, please cite individual datasets using the citation information provided.

Please contact the researcher with any queries.

Access:

Open

Brief description

Biological ocean data collected from ships find reuse in aggregations of historical data. These data are heavily relied upon to document long term change, validate satellite algorithms for ocean biology and are useful in assessing the performance of autonomous platforms and biogeochemical models. There is a need to combine subsurface biological and physical data into one aggregate data product to support reproducible research. Existing aggregate products are dissimilar in source data, have largely been isolated to the surface ocean and most omit physical data. These products cannot easily be used to explore subsurface bio-physical relationships. We present the first version of a biological ocean data reformatting effort (BIO-MATE, https://gitlab.com/KBaldry/BIO-MATE). BIO-MATE uses R software that reformats openly sourced published datasets from oceanographic voyages. These reformatted biological and physical data from underway sensors, profiling sensors and pigments analysis are stored in an interoperable and reproducible BIO-MATE data product for easy access and use.

Lineage

Maintenance and Update Frequency: none-planned
Statement: The BIO-MATE aggregate data product brings together ship-based data that have been collected by a Principal Investigator (PI) and openly published as publicly accessible data. The first version of BIO-MATE includes published datasets associated with four types of measurements: 1. sensors in the vessels underway seawater in-take (underway sensor data stream), 2. profiling sensors mounted to sampling rosettes (profiling sensor data stream), 3. particulate organic carbon (POC) measured in the laboratory (POC data stream), and 4. pigments measured in the laboratory (pigment data stream).

A semi-automated workflow and the BIO-MATE R software (https://github.com/KimBaldry/BIOMATE-Rpackage) were used to reformat published datasets, and produce the BIO-MATE data product. Reformatted data files follow the WHP-Exchange format (https://exchange-format.readthedocs.io/en/latest/index.html), except pigment headers follow MAREDAT standards. The software arranges reformatted WHPE files into four data streams in local directories that include separate WHPE files, for each EXPOCODE, and for underway sensors, profiling sensor casts and pigment measurements. An algorithm links biological data in the pigment and POC data streams to the physical data in the profiling sensor and underway sensor data streams. Biological data records are given a profiling sensor identification tag (CTD_ID) if matched to physical data in BIO-MATE. Information is included in the BIO-MATE data product, for citing published datasets, laboratory analysis methodologies (for the PIG data stream) and the data repositories through which published data records were accessed. Each citation is recorded as a BibTEX entry, compatible with EndNote, R and LaTEX. Each BibTEX entry has a tag that is referenced in the processing metadata.

Limited quality assurance has been performed on the BIO-MATE data product and is variable across published datasets. The initial integrity of these data records lies with the Principal Investigators of the published data record. As a result, reformatted data have varying levels of quality control and post-processing.

Data included in the data product were made available by the following data repositories; PANGAEA, AODN/
IMOS, SeaBASS, CCHDO, AADC, GLODAP, PAL-LTER, CSIRO, MDGS and BCO-DMO. Records of data access dates, source addresses and digital object identifiers are recorded as metadata, alongside appropriate data citations. We acknowledge the enormous community effort undertaken in the collection, analysis and publication of this data and thank principle investigators for publishing their data in open access repositories.

Notes

Credit
Australian Research Council’s Special Research Initiative for Antarctic Gateway Partnership (Project ID SR140300001)
Credit
2019 Fellowship from the Scientific Committee of Antarctic Research

Data time period: 1985-01-06 to 2019-05-10

This dataset is part of a larger collection

Click to explore relationships graph

-180,-86 -180,-13.8293

-180,-51.38315

text: westlimit=-180; southlimit=-88.937; eastlimit=-180; northlimit=-13.8293

Other Information
DATA ACCESS - browse and download BIOMATE data summaries and full reformatted data

uri : https://data.imas.utas.edu.au/attachments/3e0afabb-ebd2-4972-bb67-8626c2643129

(DATA ACCESS - download full data package [.zip])

uri : https://data.imas.utas.edu.au/attachments/3e0afabb-ebd2-4972-bb67-8626c2643129/BIOMATE_v1_full_datapackage.zip

SUPPLEMENTARY INFO - biomate data descriptor [pdf]

uri : https://data.imas.utas.edu.au/attachments/3e0afabb-ebd2-4972-bb67-8626c2643129/BIOMATE_data_descriptor.pdf

Identifiers
  • global : 3e0afabb-ebd2-4972-bb67-8626c2643129