Brief description
Marimba is an open-source Python framework designed for researchers, data scientists, and engineers to help structure, process, package and distribute FAIR (Findable, Accessible, Interoperable, and Reusable) scientific image datasets. The framework provides comprehensive functionality for managing the entire workflow of scientific image data processing, from post-acquisition data importing through to final dataset packaging and distribution. Marimba includes built-in support for parallel processing, file hard linking to optimise storage usage, and integration with the iFDO (image FAIR Digital Object) metadata standard. The framework is well documented and comes with four operational Pipeline examples that demonstrate its use across different applications: Zeiss Axio microscopy, FlowCam plankton imagery, deep-sea towed camera surveys, and historical image digitisation. While initially developed for marine science applications, Marimba's modular design makes it adaptable for any field dealing with scientific imagery.Lineage: Marimba was conceptualised at CSIRO in late 2022, with substantial elements of its initial design and implementation developed during the CSIRO Image Data Collection and Delivery Hackathon in early 2023. Further collaborative development between CSIRO and MBARI occurred in late 2023. The framework was open-sourced on GitHub and published as a Python package on PyPI in mid-2024, and officially launched at the Marine Imaging Workshop in late 2024. The development process involved extensive collaboration between software engineers, marine scientists, and data managers to ensure robust functionality and adherence to FAIR data principles.
Available: 2025-02-25
Data time period: 2022-12-01 to ..
Subjects
Data Management and Data Science |
Data Management and Data Science Not Elsewhere Classified |
FAIR data principles |
Information and Computing Sciences |
Python framework |
Software Engineering |
Software Engineering Not Elsewhere Classified |
automation |
data management |
data packaging |
data standardisation |
image processing |
marine imaging |
marine science |
open source software |
parallel processing |
research software |
scientific imaging |
scientific workflows |
underwater imagery |
User Contributed Tags
Login to tag this record with meaningful keywords to make it easier to discover
Identifiers
- Handle : 102.100.100/661195
- URL : data.csiro.au/collection/csiro:64797