Data
Title: Dataset for a globally synthesised and flagged bee occurrence dataset and cleaning workflow

Identifier: https://doi.org/10.25451/flinders.21709757.v7

Publisher: Flinders University

Creators: Alice Hughes, Allan Smith-Pardo, Angela Nava-Bolaños, Armando Falcón-Brindis, Bruno Ribeiro, Diego A. Guevara, Diego de Pedro, Elinor M. Lichtenberg, Erica E. Fischer, Erika M. Tucker, James Dorey, John S. Ascher, Katherine A. Parys, Keng-Lou James Hung, Laura Melissa Guzman, Lindsie M. McCabe, Matthew S. Rogan, Michael Christopher Orr, Neil S. Cobb, Paige R. Chesshire, Robert O'Reilly, Robert L. Minckley, Santiago José Elías Velazco, Shannon M. Collins, Silas Bossert, Terry Griswold, Tracy A. Zarrillo, Walter Jetz, Yanina V. Sica

Date: 2024

Rights: Reusable for any purpose (CC-BY)

Subjects: bees, hymenoptera, occurrence data, GBIF, ALA, Macroecology, R, Data cleaning, Anthophila, Apoidea, Apidae, Megachilidae, Stenotritidae, Colletidae, Halictidae, Andrenidae, Melittidae, macroecology and macroevolution, Global, Ecology not elsewhere classified, Animal systematics and taxonomy, Biogeography and phylogeography, Invertebrate biology, Global change biology, Ecosystem function, Ecosystem services (incl. pollination), Landscape ecology

Type: dataset

Language: English

Licence & Rights:

Reusable for any purpose (CC-BY)

Full description

Species occurrence data are foundational for research, conservation, and science communication, but the limited availability and accessibility of reliable data represent a major obstacle, particularly for insects, which face mounting pressures. We present BeeBDC, a new R package, and a global bee occurrence dataset to address this issue. We combined >18.3 million bee occurrence records from multiple public repositories (GBIF, SCAN, iDigBio, USGS, ALA) and smaller datasets, then standardised, flagged, deduplicated, and cleaned the data using the reproducible BeeBDC R workflow. Specifically, we harmonised species names (following established global taxonomy), country names, and collection dates, and we added record-level flags for a series of potential quality issues. These data are provided in two formats, “cleaned” and “flagged-but-uncleaned”. The BeeBDC package, with its online documentation, lets end users modify filtering parameters to suit their research questions. By publishing reproducible R workflows and globally cleaned datasets, we can increase the accessibility and reliability of downstream analyses. This workflow can be implemented for other taxa to support research and conservation.
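
The difference between the “flagged-but-uncleaned” and “cleaned” formats can be illustrated with a minimal sketch. This is not the BeeBDC R API and not the dataset's real schema: the field names, the flag names (`flag_has_species`, `flag_has_country`, `flag_not_duplicate`) and the helper functions are all hypothetical, written in Python only to show the general flag-then-filter idea under those assumptions.

```python
# Illustrative only: field names and flag names are hypothetical,
# not the columns or functions used by the BeeBDC package.
records = [
    {"id": 1, "species": "Apis mellifera", "country": "Australia", "date": "2019-03-02"},
    {"id": 2, "species": "", "country": "Australia", "date": "2019-03-02"},
    {"id": 3, "species": "Apis mellifera", "country": "Australia", "date": "2019-03-02"},
    {"id": 4, "species": "Bombus terrestris", "country": "", "date": "2020-07-15"},
]


def flag_records(rows):
    """Return copies of rows with boolean quality flags added (True = passes)."""
    seen = set()
    flagged = []
    for row in rows:
        key = (row["species"], row["country"], row["date"])
        out = dict(row)  # copy, so the input rows are left unchanged
        out["flag_has_species"] = bool(row["species"])
        out["flag_has_country"] = bool(row["country"])
        # The first record with a given key passes; later repeats are flagged.
        out["flag_not_duplicate"] = key not in seen
        seen.add(key)
        flagged.append(out)
    return flagged


def clean_records(flagged_rows, required_flags):
    """Keep only the rows that pass every flag the user chooses to enforce."""
    return [r for r in flagged_rows if all(r[f] for f in required_flags)]


# "Flagged-but-uncleaned": every record is retained, with flags attached.
flagged = flag_records(records)

# "Cleaned": records failing any enforced flag are dropped.
cleaned = clean_records(
    flagged, ["flag_has_species", "flag_has_country", "flag_not_duplicate"]
)

# A user who does not need country names can enforce fewer flags.
lenient = clean_records(flagged, ["flag_has_species", "flag_not_duplicate"])
```

Keeping the flags separate from the filtering step is what allows different users to derive differently cleaned subsets from the same flagged data.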

Issued: 2023-10-18

Created: 2024-06-17

This dataset is part of a larger collection

Identifiers