Data

Taxonomy summaries of the metagenomics data sequenced as part of the investigation into the Panzi outbreak.

Flinders University
Robert Edwards (Aggregated by)
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=info:doi10.25451/flinders.29879330.v2&rft.title=Taxonomy summaries of the metagenomics data sequenced as part of the investigation into the Panzi outbreak.&rft.identifier=https://doi.org/10.25451/flinders.29879330.v2&rft.publisher=Flinders University&rft.description=These tables contain the taxonomic summaries of the sequences generated as part of the investigation into the disease outbreak in Panzi in 2024. Tony Wawina-Bokalanga and colleagues generated the sequences, and the taxonomic summaries were generated by comparing those sequences to the UniRef50 database using MMSeqs2 using the atavide-lite pipeline.For each taxonomic level (kingdom, phylum, class, order, family, genus, species) there are two files, raw for the raw data which is the number of reads that mapped at that taxonomic level, and norm for the normalised number of reads that mapped, which is given by the number of reads that mapped at that level, dividided by the total number of reads that mapped, multiplied by 1,000,000.The tables are tab-separated values that can be easily read by R, Python, Pandas, Excel, OpenOffice, or any other software.&rft.creator=Robert Edwards&rft.date=2025&rft_rights=REUSABLE-FOR-ANY-PURPOSE-(CC-BY)&rft_subject=metagneomics&rft_subject=panzi&rft_subject=DRC&rft_subject=outbreak&rft_subject=plasmodium&rft_subject=Medical bacteriology&rft_subject=Medical infection agents (incl. prions)&rft_subject=Medical parasitology&rft_subject=Medical virology&rft.type=dataset&rft.language=English Access the data

Licence & Rights:

Other view details
Reusable-for-any-purpose

REUSABLE-FOR-ANY-PURPOSE-(CC-BY)

Full description

These tables contain the taxonomic summaries of the sequences generated as part of the investigation into the disease outbreak in Panzi in 2024. Tony Wawina-Bokalanga and colleagues generated the sequences, and the taxonomic summaries were generated by comparing those sequences to the UniRef50 database using MMSeqs2 using the atavide-lite pipeline.

For each taxonomic level (kingdom, phylum, class, order, family, genus, species) there are two files, raw for the raw data which is the number of reads that mapped at that taxonomic level, and norm for the normalised number of reads that mapped, which is given by the number of reads that mapped at that level, dividided by the total number of reads that mapped, multiplied by 1,000,000.

The tables are tab-separated values that can be easily read by R, Python, Pandas, Excel, OpenOffice, or any other software.


Issued: 2025-08-11

Created: 2025-08-11

This dataset is part of a larger collection

Click to explore relationships graph
Subjects

User Contributed Tags    

Login to tag this record with meaningful keywords to make it easier to discover

Identifiers