Data

Metagenome-assembled genomes recovered from a full-scale petroleum refinery wastewater treatment plant

Macquarie University
Bronwyn Campbell (Aggregated by) Fengji Wu (Aggregated by) Paul Greenfield (Aggregated by)
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=info:doi10.25949/29567549.v1&rft.title=Metagenome-assembled genomes recovered from a full-scale petroleum refinery wastewater treatment plant&rft.identifier=10.25949/29567549.v1&rft.publisher=Macquarie University&rft.description=The activated sludge samples were collected from three tanks within a full-scale petroleum refinery wastewater biotreatment system, including replicates (n = 3). The total DNA of each sample was extracted using a MagPure soil DNA KF Kit (Magen, China) according to the manufacturer's instructions. Sequencing libraries were then constructed using a Kapa Library Quantification Kit (Kapa Biosystems, USA), also according to the manufacturer's instructions. The libraries were sequenced with 150 base pair (bp) paired-end reads on an Illumina Hiseq Xten (Illumina Inc., San Diego, CA, USA), producing nine metagenomes that were subsequently accessioned. The raw reads from these metagenomes (used in the present study) were obtained via NCBI BioProject accession PRJNA823832Because of the recirculating nature of the biotreatment system itself, we combined data from all tanks to improve the genomic assembly. All unassembled reads from the nine metagenomes were merged and passed through the following error correction, assembly, binning, quality assessment, and annotation pipeline. The default settings were used for all tools, except where specified. Reads were error-corrected using Blue v2, and merging and assembly were achieved using SPAdes v3.15.5 with the meta flag. MAGs were then obtained from the assembly by binning contigs using MetaBAT v2.15, with a minimum contig length of 1000 bp and a minimum contig depth of 1.0. Quality assessment of the resulting MAGs was performed with CheckM v1.1.11, and only those MAGs with completeness scores > 90 % and redundancy (contamination) scores &rft.creator=Bronwyn Campbell&rft.creator=Fengji Wu&rft.creator=Paul Greenfield&rft.date=2026&rft_rights= https://creativecommons.org/licenses/by-nc-sa/4.0/&rft_subject=Industrial microbiology (incl. biofeedstocks)&rft_subject=Metagenome assembled genomes&rft_subject=petroleum refinery wastewater treatment plant&rft_subject=activated sludge&rft.type=dataset&rft.language=English Access the data

Licence & Rights:

Other view details

Full description

The activated sludge samples were collected from three tanks within a full-scale petroleum refinery wastewater biotreatment system, including replicates (n = 3). The total DNA of each sample was extracted using a MagPure soil DNA KF Kit (Magen, China) according to the manufacturer's instructions. Sequencing libraries were then constructed using a Kapa Library Quantification Kit (Kapa Biosystems, USA), also according to the manufacturer's instructions. The libraries were sequenced with 150 base pair (bp) paired-end reads on an Illumina Hiseq Xten (Illumina Inc., San Diego, CA, USA), producing nine metagenomes that were subsequently accessioned. The raw reads from these metagenomes (used in the present study) were obtained via NCBI BioProject accession PRJNA823832

Because of the recirculating nature of the biotreatment system itself, we combined data from all tanks to improve the genomic assembly. All unassembled reads from the nine metagenomes were merged and passed through the following error correction, assembly, binning, quality assessment, and annotation pipeline. The default settings were used for all tools, except where specified. Reads were error-corrected using Blue v2, and merging and assembly were achieved using SPAdes v3.15.5 with the meta flag. MAGs were then obtained from the assembly by binning contigs using MetaBAT v2.15, with a minimum contig length of 1000 bp and a minimum contig depth of 1.0. Quality assessment of the resulting MAGs was performed with CheckM v1.1.11, and only those MAGs with completeness scores > 90 % and redundancy (contamination) scores < 1 % were retained for further examination.

A total of twenty metagenome-assembled genomes (MAGs) were recovered from the petroleum refinery wastewater treatment plant in southern China. These MAGs are associated with a peer-reviewed paper https://doi.org/10.1016/j.micres.2025.128299

Issued: 06 02 2026

This dataset is part of a larger collection

Click to explore relationships graph
Subjects

User Contributed Tags    

Login to tag this record with meaningful keywords to make it easier to discover

Identifiers