Full description
These tables contain taxonomic summaries of the sequences generated during the investigation into the 2024 disease outbreak in Panzi. Tony Wawina-Bokalanga and colleagues generated the sequences. The taxonomic summaries were produced by comparing those sequences to the UniRef50 database with MMseqs2, as part of the atavide-lite pipeline.
For each taxonomic level (kingdom, phylum, class, order, family, genus, species) there are two files: raw, containing the number of reads that mapped at that taxonomic level, and norm, containing the normalised number of reads, calculated as the number of reads that mapped at that level divided by the total number of reads that mapped, multiplied by 1,000,000.
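The normalisation described above can be sketched in Python as follows. This is only an illustration of the stated formula, not the atavide-lite code: the function name normalise_per_million and the example counts are made up for this sketch, and it assumes the total is taken over all reads mapped at one taxonomic level for one sample.

```python
def normalise_per_million(raw_counts):
    """Convert raw read counts into reads per million mapped reads.

    raw_counts maps a taxon name to the number of reads mapped to it.
    Assumption: the denominator is the sum of all counts in raw_counts.
    """
    total = sum(raw_counts.values())
    if total == 0:
        # No mapped reads: avoid dividing by zero and report zeros.
        return {taxon: 0.0 for taxon in raw_counts}
    return {
        taxon: count / total * 1_000_000
        for taxon, count in raw_counts.items()
    }


# Made-up counts for illustration only; these are not values from the dataset.
example_raw = {"Taxon A": 750, "Taxon B": 200, "Taxon C": 50}
example_norm = normalise_per_million(example_raw)
```

In this made-up example, Taxon A accounts for 750 of 1,000 mapped reads, which corresponds to 750,000 reads per million.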
The tables are tab-separated values files that can be read by R, Python (for example with Pandas), Excel, OpenOffice, or most other data analysis software.
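As a minimal sketch of reading one of these tables, the following uses only the Python standard library. The layout is an assumption, since this description does not state it: the sketch supposes that the first column holds the taxon name and that each remaining column holds one sample. The function name read_taxonomy_table, the column names, and the inline example data are all hypothetical.

```python
import csv
import io


def read_taxonomy_table(handle):
    """Read a tab-separated table into a nested dictionary.

    Assumed layout (not confirmed by the dataset description):
    a header row, taxon names in the first column, one column per sample.
    Returns the list of sample names and a {taxon: {sample: value}} mapping.
    """
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)
    samples = header[1:]
    table = {}
    for row in reader:
        if not row:
            continue  # skip blank lines
        table[row[0]] = {
            sample: float(value) for sample, value in zip(samples, row[1:])
        }
    return samples, table


# Made-up, in-memory data standing in for a real file.
example_tsv = (
    "taxonomy\tsample_1\tsample_2\n"
    "Taxon A\t750\t10\n"
    "Taxon B\t250\t30\n"
)
samples, table = read_taxonomy_table(io.StringIO(example_tsv))
```

To read a downloaded file instead, open it with open(path, newline="") and pass the resulting handle to the function; the path is whatever name the file was saved under.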
Issued: 2025-08-11
Created: 2025-08-11