Data

Additional file 1 of Machine learning models outperform deep learning models, provide interpretation and facilitate feature selection for soybean trait prediction

The University of Western Australia
Gill, Mitchell ; Anderson, Robyn ; Hu, Ricky ; Bennamoun, Mohammed ; Petereit, Jakob ; Valliyodan, Babu ; Nguyen, Henry T. ; Batley, Jacqueline ; Bayer, Philipp ; Edwards, Dave
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=info:doi10.6084/m9.figshare.19556149&rft.title=Additional file 1 of Machine learning models outperform deep learning models, provide interpretation and facilitate feature selection for soybean trait prediction&rft.identifier=10.6084/m9.figshare.19556149&rft.publisher=Figshare&rft.description=Additional file 1: Supplementary Figure 1. P-value of each SNPs association for a) flower colour b) seed coat colour c) pod colour in the soybean VCF. SNPs coloured red have been determined as significantly associated for the given trait as they have a p-value less than the -log10(8) significance threshold for this GWAS. Supplementary Figure 2. Graphs ranking the top 20 most input SNPs by gain as identified by XGBoost models for trait predictions for traits with regions of importance identified from XGBoost. Blue bars are region of importance, whereas other colours represent collections of important SNPs on the same chromosome. Black bars represent left over SNPs with no relation to other SNPs in the ranking. SNP rankings for genome wide SNP input for A) flower colour B) seed coat colour C) pubescence density D) seed weight. Supplementary Figure 3. Top 20 ranked SNPs for XGBoost Seed Oil Prediction. Supplementary Figure 4. Top 20 ranked SNPs for XGBoost Pod Colour Prediction. Supplementary Figure 5. Top 20 ranked SNPs for XGBoost Seed Protein Prediction. Supplementary Table 1. Targeted Regions of SNPs for Reduced Input Models. Supplementary Table 2. List of soybean germplasm in the pangenome with the sequence coverage. (ND, not defined). Supplementary Table 3. Trait Data Types.&rft.creator=Gill, Mitchell &rft.creator=Anderson, Robyn &rft.creator=Hu, Ricky &rft.creator=Bennamoun, Mohammed &rft.creator=Petereit, Jakob &rft.creator=Valliyodan, Babu &rft.creator=Nguyen, Henry T. &rft.creator=Batley, Jacqueline &rft.creator=Bayer, Philipp &rft.creator=Edwards, Dave &rft.date=2022&rft.relation=http://research-repository.uwa.edu.au/en/publications/27419f0b-42ff-44a8-9902-505d8a3c0758&rft_subject=FOS: Computer and information sciences&rft_subject=Artificial Intelligence and Image Processing&rft.type=dataset&rft.language=English Access the data

Access:

Open

Full description

Additional file 1: Supplementary Figure 1. P-value of each SNPs association for a) flower colour b) seed coat colour c) pod colour in the soybean VCF. SNPs coloured red have been determined as significantly associated for the given trait as they have a p-value less than the -log10(8) significance threshold for this GWAS. Supplementary Figure 2. Graphs ranking the top 20 most input SNPs by gain as identified by XGBoost models for trait predictions for traits with regions of importance identified from XGBoost. Blue bars are region of importance, whereas other colours represent collections of important SNPs on the same chromosome. Black bars represent left over SNPs with no relation to other SNPs in the ranking. SNP rankings for genome wide SNP input for A) flower colour B) seed coat colour C) pubescence density D) seed weight. Supplementary Figure 3. Top 20 ranked SNPs for XGBoost Seed Oil Prediction. Supplementary Figure 4. Top 20 ranked SNPs for XGBoost Pod Colour Prediction. Supplementary Figure 5. Top 20 ranked SNPs for XGBoost Seed Protein Prediction. Supplementary Table 1. Targeted Regions of SNPs for Reduced Input Models. Supplementary Table 2. List of soybean germplasm in the pangenome with the sequence coverage. (ND, not defined). Supplementary Table 3. Trait Data Types.

Notes

External Organisations
University of Central Missouri
Associated Persons
Mitchell Gill (Creator); Robyn Anderson (Creator); Ricky Hu (Creator)Babu Valliyodan (Creator); Henry T. Nguyen (Creator)

Issued: 2022-04-08

This dataset is part of a larger collection

Click to explore relationships graph
Subjects

User Contributed Tags    

Login to tag this record with meaningful keywords to make it easier to discover

Identifiers