Data

CotLeaf-X: Cotton Leaf Surface Images dataset with eXpert labels

Commonwealth Scientific and Industrial Research Organisation
Rolland, Vivien ; Egan, Lucy ; Thompson, Susie ; Farazi, Moshiur ; Conaty, Warren ; Liu, Shiming ; Stiller, Warwick
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=info:doi10.25919/eqhx-1x73&rft.title=CotLeaf-X: Cotton Leaf Surface Images dataset with eXpert labels&rft.identifier=10.25919/eqhx-1x73&rft.publisher=Commonwealth Scientific and Industrial Research Organisation (CSIRO)&rft.description=This dataset is a collection of 5,049 images of cotton leaf surfaces acquired with a hand-held microscope to develop deep learning models for leaf hairiness and assist Cotton breeders in their variety selection efforts. These images were collected from two populations (A: 3,276 images; B: 1,773 images) over the 2021-2022 season in a field located at Australian Cotton Research Institute, -30.21, 149.60, Narrabri, NSW, Australia. Populations and genotypes have been anonymized to protect germplasm Intellectual Property. This dataset is being released together with our HairNet2 paper (Farazi et al 2024). See below for links to related Datasets and Publications.Plant genotypes and growth conditions: Two cotton populations called A and B, were selected for their heterogeneous leaf hairiness, with population A being generally less hairy than population B. Both populations were planted in the summer growing season of 2021-22 at ACRI. Seeds of each genotype were planted in a field on the 23rd of October 2021 at a planting density of 10-12 plants/m2 in rows spaced at 1m. Each genotype was grown in a single 13m plot. Leaf selection and imaging Leaf samples from these plant populations were collected on the 2nd and 6th of March 2022 (at 19 weeks, first open boll stage). Leaf 3 was harvested from 10 plants per genotype, placed in a paper bag and imaged the same day using the same protocol and equipment as in Rolland, Farazi et al 2022, with the following distinctions: - for population A, two images were collected per leaf: one along the central midvein and one on the leaf blade. - for population B, one image was collected per leaf: along the central midvein. The abaxial side of leaves were imaged at a magnification of about 31x with a portable AM73915 Dino-lite Edge 3.0 (AnMo Electronics Corporation, Taiwan) microscope equipped with a RK-04F folding manual stage (AnMo Electronics Corporation, Taiwan) and connected to a digital tablet running DinoCapture 2.0 (AnMo Electronics Corporation, Taiwan). The exact angle of the mid-vein in each image was not fixed. However, either end of the mid-vein was always cut by the left and right borders of the field of view, and never by the top and bottom ones. Visual scoring of images by human expert A human expert scored all CotLeaf-X images using arbitrary ordinal scales (0 − 5 for population A and 2 − 5.5 for population B), where higher numbers corresponded to images with more trichomes.&rft.creator=Rolland, Vivien &rft.creator=Egan, Lucy &rft.creator=Thompson, Susie &rft.creator=Farazi, Moshiur &rft.creator=Conaty, Warren &rft.creator=Liu, Shiming &rft.creator=Stiller, Warwick &rft.date=2024&rft.edition=v4&rft.relation=http://toadd.com&rft.relation=https://doi.org/10.1186/s13007-021-00820-8&rft.coverage=northlimit=-30.1544; southlimit=-30.2606; westlimit=149.5471; eastLimit=149.6796; projection=WGS84&rft_rights=All Rights (including copyright) CSIRO 2023.&rft_rights=Creative Commons Attribution-Noncommercial-Share Alike Licence https://creativecommons.org/licenses/by-nc-sa/4.0/&rft_subject=Cotton&rft_subject=Leaf&rft_subject=Plant&rft_subject=Hair&rft_subject=Trichome&rft_subject=Hairiness&rft_subject=Pubescence&rft_subject=Machine learning&rft_subject=Deep Learning&rft_subject=Yield&rft_subject=Whitefly&rft_subject=Spidermite&rft_subject=Gin trash&rft_subject=Artificial intelligence not elsewhere classified&rft_subject=Artificial intelligence&rft_subject=INFORMATION AND COMPUTING SCIENCES&rft_subject=Crop and pasture improvement (incl. selection and breeding)&rft_subject=Crop and pasture production&rft_subject=AGRICULTURAL, VETERINARY AND FOOD SCIENCES&rft_subject=Computer vision&rft_subject=Computer vision and multimedia computation&rft_subject=Plant pathology&rft_subject=Plant biology&rft_subject=BIOLOGICAL SCIENCES&rft_subject=Plant biology not elsewhere classified&rft_subject=Image processing&rft_subject=Plant physiology&rft_subject=Agronomy&rft_subject=Plant developmental and reproductive biology&rft_subject=Crop and pasture protection (incl. pests, diseases and weeds)&rft.type=dataset&rft.language=English Access the data

Licence & Rights:

Non-Commercial Licence view details
CC-BY-NC-SA

Creative Commons Attribution-Noncommercial-Share Alike Licence
https://creativecommons.org/licenses/by-nc-sa/4.0/

All Rights (including copyright) CSIRO 2023.

Access:

Open view details

Data is accessible online and may be reused in accordance with licence conditions

Brief description

This dataset is a collection of 5,049 images of cotton leaf surfaces acquired with a hand-held microscope to develop deep learning models for leaf hairiness and assist Cotton breeders in their variety selection efforts. These images were collected from two populations (A: 3,276 images; B: 1,773 images) over the 2021-2022 season in a field located at Australian Cotton Research Institute, -30.21, 149.60, Narrabri, NSW, Australia. Populations and genotypes have been anonymized to protect germplasm Intellectual Property.

This dataset is being released together with our HairNet2 paper (Farazi et al 2024). See below for links to related Datasets and Publications.

Lineage

Plant genotypes and growth conditions:
Two cotton populations called A and B, were selected for their heterogeneous leaf hairiness, with population A being generally less hairy than population B. Both populations were planted in the summer growing season of 2021-22 at ACRI. Seeds of each genotype were planted in a field on the 23rd of October 2021 at a planting density of 10-12 plants/m2 in rows spaced at 1m. Each genotype was grown in a single 13m plot.

Leaf selection and imaging
Leaf samples from these plant populations were collected on the 2nd and 6th of March 2022 (at 19 weeks, first open boll stage). Leaf 3 was harvested from 10 plants per genotype, placed in a paper bag and imaged the same day using the same protocol and equipment as in Rolland, Farazi et al 2022, with the following distinctions:
- for population A, two images were collected per leaf: one along the central midvein and one on the leaf blade.
- for population B, one image was collected per leaf: along the central midvein.
The abaxial side of leaves were imaged at a magnification of about 31x with a portable AM73915 Dino-lite Edge 3.0 (AnMo Electronics Corporation, Taiwan) microscope equipped with a RK-04F folding manual stage (AnMo Electronics Corporation, Taiwan) and connected to a digital tablet running DinoCapture 2.0 (AnMo Electronics Corporation, Taiwan). The exact angle of the mid-vein in each image was not fixed. However, either end of the mid-vein was always cut by the left and right borders of the field of view, and never by the top and bottom ones.

Visual scoring of images by human expert
A human expert scored all CotLeaf-X images using arbitrary ordinal scales (0 − 5 for population A and 2 − 5.5 for population B), where higher numbers corresponded to images with more trichomes.

Data time period: 2021-01-01 to 2022-01-01

This dataset is part of a larger collection

Click to explore relationships graph

149.6796,-30.1544 149.6796,-30.2606 149.5471,-30.2606 149.5471,-30.1544 149.6796,-30.1544

149.61335,-30.2075

Identifiers