Data

FlowCam Plankton Image Collection - Perth Coastal Waters 2022

Commonwealth Scientific and Industrial Research Organisation
Jackett, Chris ; Strzelecki, Joanna ; Eriksen, Ruth ; Uribe Palomino, Julian ; McLaughlin, James
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=info:doi10.25919/cdaq-va45&rft.title=FlowCam Plankton Image Collection - Perth Coastal Waters 2022&rft.identifier=https://doi.org/10.25919/cdaq-va45&rft.publisher=Commonwealth Scientific and Industrial Research Organisation&rft.description=The Perth Coastal Waters 2022 image collection represents a comprehensive plankton imaging survey conducted at 15 sampling stations along the coast of Western Australia in August 2022. Using a FlowCam 8400 imaging flow cytometer, the project captured and processed over 218,000 individual plankton images extracted from 2,453 collage images. The dataset covers coastal waters from 32.264°S to 32.117°S and 115.683°E to 115.760°E, with samples collected using standardized 100 μm mesh plankton nets at depths ranging from -19.0m to -4.0m. Each image includes detailed morphological measurements, optical properties, and volumetric calculations, making this dataset particularly valuable for developing automated plankton classification systems using machine learning techniques. The dataset represents a significant advancement in marine plankton imaging and automated taxonomic classification efforts, with all data processed and standardized according to FAIR (Findable, Accessible, Interoperable, Reusable) principles.\nLineage: The dataset was created through a systematic sampling and processing workflow. Samples were collected using standardized 100 μm mesh plankton nets at 15 coastal stations, with comprehensive station metadata including GPS coordinates, sampling depths, collection times, and environmental parameters recorded in standardized logs. The FlowCam 8400 imaging flow cytometer initially captured samples as VisualSpreadsheet collage images in PNG format, with each collage containing multiple particle vignettes arranged in a grid pattern. These collages were then processed using a custom Marimba FlowCam Pipeline that employed OpenCV computer vision techniques to extract and validate individual vignettes while maintaining their associated metadata. The Marimba Pipeline implemented quality control measures and organized the data in a hierarchical directory structure by sampling station, date, and replicate number. Each extracted vignette was renamed according to a standardized convention encoding critical metadata including platform ID, station ID, magnification, and field of view. The final dataset comprises 218,314 individual vignettes in JPG format, with embedded EXIF metadata including morphological parameters (area, diameter, circularity), optical properties (RGB intensities, transparency), volumetric calculations (biovolume across multiple geometric models), and positional data.&rft.creator=Jackett, Chris &rft.creator=Strzelecki, Joanna &rft.creator=Eriksen, Ruth &rft.creator=Uribe Palomino, Julian &rft.creator=McLaughlin, James &rft.date=2025&rft.edition=v1&rft.coverage=westlimit=115.683; southlimit=-32.264; eastlimit=115.76; northlimit=-32.117; projection=WGS84&rft_rights=Creative Commons Attribution-Noncommercial 4.0 Licence https://creativecommons.org/licenses/by-nc/4.0/&rft_rights=Data is accessible online and may be reused in accordance with licence conditions&rft_rights=All Rights (including copyright) CSIRO 2025.&rft_subject=phytoplankton&rft_subject=FlowCam&rft_subject=imaging flow cytometry&rft_subject=Perth coastal waters&rft_subject=Western Australia&rft_subject=microscopy&rft_subject=CSIRO&rft_subject=coastal monitoring&rft_subject=biological imaging&rft_subject=marine ecology&rft_subject=environmental monitoring&rft_subject=pelagic zone&rft_subject=Other environmental sciences not elsewhere classified&rft_subject=Other environmental sciences&rft_subject=ENVIRONMENTAL SCIENCES&rft_subject=Image processing&rft_subject=Computer vision and multimedia computation&rft_subject=INFORMATION AND COMPUTING SCIENCES&rft.type=dataset&rft.language=English Access the data

Licence & Rights:

Non-Commercial Licence view details
CC-BY-NC

Creative Commons Attribution-Noncommercial 4.0 Licence
https://creativecommons.org/licenses/by-nc/4.0/

Data is accessible online and may be reused in accordance with licence conditions

All Rights (including copyright) CSIRO 2025.

Access:

Open view details

Accessible for free

Contact Information



Brief description

The Perth Coastal Waters 2022 image collection represents a comprehensive plankton imaging survey conducted at 15 sampling stations along the coast of Western Australia in August 2022. Using a FlowCam 8400 imaging flow cytometer, the project captured and processed over 218,000 individual plankton images extracted from 2,453 collage images. The dataset covers coastal waters from 32.264°S to 32.117°S and 115.683°E to 115.760°E, with samples collected using standardized 100 μm mesh plankton nets at depths ranging from -19.0m to -4.0m. Each image includes detailed morphological measurements, optical properties, and volumetric calculations, making this dataset particularly valuable for developing automated plankton classification systems using machine learning techniques. The dataset represents a significant advancement in marine plankton imaging and automated taxonomic classification efforts, with all data processed and standardized according to FAIR (Findable, Accessible, Interoperable, Reusable) principles.
Lineage: The dataset was created through a systematic sampling and processing workflow. Samples were collected using standardized 100 μm mesh plankton nets at 15 coastal stations, with comprehensive station metadata including GPS coordinates, sampling depths, collection times, and environmental parameters recorded in standardized logs. The FlowCam 8400 imaging flow cytometer initially captured samples as VisualSpreadsheet collage images in PNG format, with each collage containing multiple particle vignettes arranged in a grid pattern. These collages were then processed using a custom Marimba FlowCam Pipeline that employed OpenCV computer vision techniques to extract and validate individual vignettes while maintaining their associated metadata. The Marimba Pipeline implemented quality control measures and organized the data in a hierarchical directory structure by sampling station, date, and replicate number. Each extracted vignette was renamed according to a standardized convention encoding critical metadata including platform ID, station ID, magnification, and field of view. The final dataset comprises 218,314 individual vignettes in JPG format, with embedded EXIF metadata including morphological parameters (area, diameter, circularity), optical properties (RGB intensities, transparency), volumetric calculations (biovolume across multiple geometric models), and positional data.

Available: 2025-03-07

Data time period: 2022-08-25 to 2022-08-25

This dataset is part of a larger collection

Click to explore relationships graph

115.76,-32.117 115.76,-32.264 115.683,-32.264 115.683,-32.117 115.76,-32.117

115.7215,-32.1905