Data

The expansion and diversification of pentatricopeptide repeat RNA editing factors in plants

The University of Western Australia
Small, Ian ; Gutmann, Bernard ; Royan, Santana ; Schallenberg-Rüdinger, Mareike ; Lenz, Henning ; Castleden, Ian ; McDowell, Rose ; Vacher, Michael ; Tonti-Filippini, Julian ; Bond, Charlie ; Knoop, Volker
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=info:doi10.5061/dryad.vdncjsxqf&rft.title=The expansion and diversification of pentatricopeptide repeat RNA editing factors in plants&rft.identifier=10.5061/dryad.vdncjsxqf&rft.publisher=DRYAD&rft.description=The RNA-binding pentatricopeptide repeat (PPR) family comprises hundreds to thousands of genes in most plants, but only a few dozen in algae, evidence of massive gene expansions during land plant evolution. The nature and timing of these expansions has not been well-defined due to the sparse sequence data available from early-diverging land plant lineages. We exploit the comprehensive OneKP dataset of over 1,000 transcriptomes from diverse plants and algae to establish a clear picture of the evolution of this massive gene family, focusing on the proteins typically associated with RNA editing, which show the most spectacular variation in numbers and domain composition across the plant kingdom. We characterise over 2,250,000 PPR motifs in over 400,000 proteins. In lycophytes, polypod ferns and hornworts, nearly 10% of expressed protein-coding genes encode putative PPR editing factors, whereas they are absent from algae and complex-thalloid liverworts. We show that rather than a single expansion, most land plant lineages with high numbers of editing factors have continued to generate novel sequence diversity. We identify sequence variation that implies functional differences between PPR proteins in seed plants versus non-seed plants and which we propose to be linked to seed-plant-specific editing cofactors. Finally, using the sequence variation across the dataset, we develop a structural model of the catalytic DYW domain associated with C-to-U editing and identify a clade of unique DYW variants that are strong candidates as U-to-C RNA editing factors, given their phylogenetic distribution and sequence characteristics.&rft.creator=Small, Ian &rft.creator=Gutmann, Bernard &rft.creator=Royan, Santana &rft.creator=Schallenberg-Rüdinger, Mareike &rft.creator=Lenz, Henning &rft.creator=Castleden, Ian &rft.creator=McDowell, Rose &rft.creator=Vacher, Michael &rft.creator=Tonti-Filippini, Julian &rft.creator=Bond, Charlie &rft.creator=Knoop, Volker &rft.date=2019&rft.relation=http://research-repository.uwa.edu.au/en/publications/c9330a9e-12fb-4624-9920-dab7865f448b&rft.type=dataset&rft.language=English Access the data

Access:

Open

Full description

The RNA-binding pentatricopeptide repeat (PPR) family comprises hundreds to thousands of genes in most plants, but only a few dozen in algae, evidence of massive gene expansions during land plant evolution. The nature and timing of these expansions has not been well-defined due to the sparse sequence data available from early-diverging land plant lineages. We exploit the comprehensive OneKP dataset of over 1,000 transcriptomes from diverse plants and algae to establish a clear picture of the evolution of this massive gene family, focusing on the proteins typically associated with RNA editing, which show the most spectacular variation in numbers and domain composition across the plant kingdom. We characterise over 2,250,000 PPR motifs in over 400,000 proteins. In lycophytes, polypod ferns and hornworts, nearly 10% of expressed protein-coding genes encode putative PPR editing factors, whereas they are absent from algae and complex-thalloid liverworts. We show that rather than a single expansion, most land plant lineages with high numbers of editing factors have continued to generate novel sequence diversity. We identify sequence variation that implies functional differences between PPR proteins in seed plants versus non-seed plants and which we propose to be linked to seed-plant-specific editing cofactors. Finally, using the sequence variation across the dataset, we develop a structural model of the catalytic DYW domain associated with C-to-U editing and identify a clade of unique DYW variants that are strong candidates as U-to-C RNA editing factors, given their phylogenetic distribution and sequence characteristics.

Notes

External Organisations
University of Bonn
Associated Persons
Santana Royan (Creator); Ian Castleden (Creator); Michael Vacher (Creator)Mareike Schallenberg-Rüdinger (Creator); Henning Lenz (Creator); Volker Knoop (Creator)

Issued: 2019-12-11

This dataset is part of a larger collection

Click to explore relationships graph

User Contributed Tags    

Login to tag this record with meaningful keywords to make it easier to discover

Identifiers