Data

A pangenome analysis pipeline provides insights into functional gene identification in rice

The University of Western Australia
Wang, Jian ; Yang, Wu ; Zhang, Shaohong ; Hu, Haifei ; Yuan, Yuxuan ; Dong, Jingfang ; Chen, Luo ; Ma, Yamei ; Yang, Tifeng ; Zhou, Lian ; Chen, Jiansong ; Liu, Bin ; Li, Chengdao ; Edwards, David ; Zhao, Junliang
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=info:doi10.6084/m9.figshare.c.6585470&rft.title=A pangenome analysis pipeline provides insights into functional gene identification in rice&rft.identifier=10.6084/m9.figshare.c.6585470&rft.publisher=Figshare&rft.description=Abstract Background A pangenome aims to capture the complete genetic diversity within a species and reduce bias in genetic analysis inherent in using a single reference genome. However, the current linear format of most plant pangenomes limits the presentation of position information for novel sequences. Graph pangenomes have been developed to overcome this limitation. However, bioinformatics analysis tools for graph format genomes are lacking. Results To overcome this problem, we develop a novel strategy for pangenome construction and a downstream pangenome analysis pipeline (PSVCP) that captures genetic variants’ position information while maintaining a linearized layout. Using PSVCP, we construct a high-quality rice pangenome using 12 representative rice genomes and analyze an international rice panel with 413 diverse accessions using the pangenome as the reference. We show that PSVCP successfully identifies causal structural variations for rice grain weight and plant height. Our results provide insights into rice population structure and genomic diversity. We characterize a new locus (qPH8-1) associated with plant height on chromosome 8 undetected by the SNP-based genome-wide association study (GWAS). Conclusions Our results demonstrate that the pangenome constructed by our pipeline combined with a presence and absence variation-based GWAS can provide additional power for genomic and genetic analysis. The pangenome constructed in this study and the associated genome sequence and genetic variants data provide valuable genomic resources for rice genomics research and improvement in future.&rft.creator=Wang, Jian &rft.creator=Yang, Wu &rft.creator=Zhang, Shaohong &rft.creator=Hu, Haifei &rft.creator=Yuan, Yuxuan &rft.creator=Dong, Jingfang &rft.creator=Chen, Luo &rft.creator=Ma, Yamei &rft.creator=Yang, Tifeng &rft.creator=Zhou, Lian &rft.creator=Chen, Jiansong &rft.creator=Liu, Bin &rft.creator=Li, Chengdao &rft.creator=Edwards, David &rft.creator=Zhao, Junliang &rft.date=2023&rft.relation=http://research-repository.uwa.edu.au/en/publications/c86eea9f-c111-4d88-89e9-de81df6e2832&rft.type=dataset&rft.language=English Access the data

Access:

Open

Full description

Abstract Background A pangenome aims to capture the complete genetic diversity within a species and reduce bias in genetic analysis inherent in using a single reference genome. However, the current linear format of most plant pangenomes limits the presentation of position information for novel sequences. Graph pangenomes have been developed to overcome this limitation. However, bioinformatics analysis tools for graph format genomes are lacking. Results To overcome this problem, we develop a novel strategy for pangenome construction and a downstream pangenome analysis pipeline (PSVCP) that captures genetic variants’ position information while maintaining a linearized layout. Using PSVCP, we construct a high-quality rice pangenome using 12 representative rice genomes and analyze an international rice panel with 413 diverse accessions using the pangenome as the reference. We show that PSVCP successfully identifies causal structural variations for rice grain weight and plant height. Our results provide insights into rice population structure and genomic diversity. We characterize a new locus (qPH8-1) associated with plant height on chromosome 8 undetected by the SNP-based genome-wide association study (GWAS). Conclusions Our results demonstrate that the pangenome constructed by our pipeline combined with a presence and absence variation-based GWAS can provide additional power for genomic and genetic analysis. The pangenome constructed in this study and the associated genome sequence and genetic variants data provide valuable genomic resources for rice genomics research and improvement in future.

Notes

External Organisations
Guangdong Academy of Agricultural Sciences; Murdoch University; The University of Western Australia
Associated Persons
Jian Wang (Contributor); Haifei Hu (Contributor); Bin Liu (Contributor); Chengdao Li (Contributor); David Edwards (Contributor); Junliang Zhao (Contributor)Wu Yang (Contributor); Shaohong Zhang (Contributor); Jingfang Dong (Contributor); Luo Chen (Contributor); Yamei Ma (Contributor); Tifeng Yang (Contributor); Lian Zhou (Contributor); Jiansong Chen (Contributor)

Issued: 2023-04-13

This dataset is part of a larger collection

Click to explore relationships graph

User Contributed Tags    

Login to tag this record with meaningful keywords to make it easier to discover

Identifiers