Full description
The SUB-cellular location database for Arabidopsis proteins (SUBA) provides a powerful tool to investigate subcellular localisation in Arabidopsis through the unification of disparate datasets and through the provision of a web accessible interface for the construction of powerful user based queries resulting in a one-stop-shop for protein localisation in this model plant. Subcellular localisation information can contribute towards our understanding of protein function, protein redundancy and of biological inter-relationships. While a variety of technologies are currently employed to determine the sub-cellular location of proteins much of this information is not available in an integrated manner. In an attempt to get a clearer picture of their experimental data and to more generally understand subcellular partitioning we have brought together various data sources to build SUBA. The database has a web accessible interface that allows advanced combinatorial queries to be undertaken on the contained data.
SUBA houses large scale proteomic data sets and GFP localisation from cellular compartments of Arabidopsis. It also contains precompiled bioinformatic predictions for protein subcellular localisations. With the upgrade to SUBA3, protein-protein-interaction data was added and more recently a classification algorithm SUBAcon was integrated for forming a localisation consensus call. SUBA was last updated in June 2014 and is based on the TAIR10 genome annotation release. The experimental data sets are generally updated annually mid year.
SUBA3 in January 2015 contains 4725 entries based on chimeric fusion studies (GFP), comprising 2987 distinct proteins from 1414 publications as well as 27601 entries based on subcellular proteomic studies, comprising 8787 distinct proteins from 143 publications. There are 27385 protein-protein-interaction pairs stored in SUBA3.
Data time period: 07 2009
User Contributed Tags
Login to tag this record with meaningful keywords to make it easier to discover