Full description
We searched the NCBI BioProject database and downloaded 1,012 experiments with raw sequence data from 14 projects, covering 7 major cancer types: head and neck, lung, breast, prostate, gastric, colon, and liver cancer. We preprocessed the sequence reads and performed variant calling, followed by a series of filtering steps to remove non-functional variants and minimize false positives, which yielded a refined list of 6,981 variants.
All raw data were downloaded from the NCBI BioProject database at https://www.ncbi.nlm.nih.gov/bioproject/
The BioProject IDs are listed below:
PRJNA485408
PRJNA448888
PRJEB15399
PRJNA281253
PRJEB4979
PRJNA343124
PRJNA603789
PRJNA603782
PRJNA575243
PRJNA475218
PRJNA281419
PRJEB32931
PRJNA307236
PRJNA407354
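For readers who want to look up the source projects, the following is a minimal sketch that turns the accession list above into per-project landing page links. It assumes that appending an accession to the BioProject base URL given above resolves to that project's page; the names `BIOPROJECT_BASE`, `BIOPROJECT_IDS`, and `project_url` are illustrative and not part of the original record.

```python
# Base URL as stated in the record description.
BIOPROJECT_BASE = "https://www.ncbi.nlm.nih.gov/bioproject/"

# The 14 BioProject accessions listed in this record.
BIOPROJECT_IDS = [
    "PRJNA485408", "PRJNA448888", "PRJEB15399", "PRJNA281253",
    "PRJEB4979", "PRJNA343124", "PRJNA603789", "PRJNA603782",
    "PRJNA575243", "PRJNA475218", "PRJNA281419", "PRJEB32931",
    "PRJNA307236", "PRJNA407354",
]


def project_url(accession: str) -> str:
    """Build a landing page URL for one accession (assumed URL pattern)."""
    return BIOPROJECT_BASE + accession


# Map each accession to its assumed landing page URL.
project_urls = {acc: project_url(acc) for acc in BIOPROJECT_IDS}

if __name__ == "__main__":
    for acc, url in project_urls.items():
        print(acc, url)
```

The sketch only builds links; it does not download any sequence data or reproduce the preprocessing, variant calling, or filtering steps described above.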
Issued: 2022
Subjects
Biological Sciences
Biomedical and Clinical Sciences
Bioinformatics and Computational Biology
Genetics
Oncology and Carcinogenesis
Identifiers
- usc : 11162845980002621