|
|
| Research article summary (published 30 Dec 2007): |
Vector-G: multi-modular SVM-based heterotrimeric G protein prediction.
Full Abstract
Heterotrimeric G proteins interact with G protein-coupled receptors in response to stimulation by hormones, neurotransmitters, chemokines, and sensory signals to intracellular signaling cascades. Recently reported studies indicate that G protein subunits play a significant role in different eukaryotic diseases including inflammation, neurological diseases, cardiovascular diseases, endocrine disorders as well as plant pathogen response, infectious hyphae growth, differentiation and virulence of pathogenic fungi. Thus a study of their functions, signaling pathways, and protein interactions may lead to the development of various preventive approaches. The diversity of alpha, beta and gamma subunits of G proteins necessitates a prediction algorithm that helps in the identification of new proteins such as Gbeta where WD-40 repeats are not well characterized. The currently available techniques for finding G proteins are homology based search analyses and wet lab experiments, which are not very effective in finding new classes of proteins. We present here a robust computational method for finding new G proteins and their homologs using a SVM based pattern recognition algorithm. Several physicochemical and compositional properties including dipeptide, tripeptide and hydrophobicity composition are used for generating the SVM classifiers. This method has 96.17%, 95.38%, 97.6% sensitivity and 99.45%, 100%, 100% specificity on test sets for G protein alpha, beta, and gamma subunits, respectively. This algorithm correctly predicts the known alpha, beta and gamma subunits reported in literature. One important contribution of this algorithm is that it helps in improving genome annotation of several proteins as G proteins and serves as a useful tool for comparative genomic analysis of G proteins. Using this method, novel G protein subunits are predicted in 31 genomes covering plant, fungi and animal kingdom. The software is available at the website http://biomine.cs.uah.edu/bioinformatics/svm_prog/scripts/GProteins/vectorg.html. Supplementary files: The supplementary files are available on http://www.bioinfo.de/isb/2008/08/0013/supplementary_ material/.
Author information
Author/s: Jain, Preti (P); Wadhwa, Puneet (P); Aygun, Ramazan (R); Podila, Gopi (G);
Affiliation: Department of Biological Sciences, University of Alabama in Huntsville, Huntsville, AL 35899, USA.
Journal and publication information
Publication Type: Journal Article; Research Support, U.S. Gov't, Non-P.H.S.
Journal: In silico biology (In Silico Biol), published in Netherlands. (Language: eng)
Reference: 2008-; vol 8 (issue 2) : pp 141-55
Dates: Created 2008/10/20; Completed 2008/11/04;
PMID: 18928202, status: MEDLINE (last retrieval date: 2/18/2009, IMS Date: )
Sourced from the National Library of Medicine. Abstract text and other information may be subject to copyright.
External Links for this article
(including full text providers, if available):
Click Electronic Full-text Provider Links to see options for finding the electronic full text links to this article. Note there may be a subscription or fee required for access to the full text. See our FAQ for information on finding FREE full text articles.
This article may also be located in paper journal collections available in many libraries. Use the Journal and Publication Information above to find the full article.
MeSH headings (categories)
This article was linked to the MESH Headings shown below.
Related articles
These are the highest related articles currently in the database:
- Predicting protein structural class by SVM with class-wise optimized features and decision probabilities.
2 Mar 2008 - A fast SEQUEST cross correlation algorithm.
4 Sep 2008 - Automated derivation and refinement of sequence length patterns for protein sequences using evolutionary computation.
30 Aug 2005 - Pattern recognition methods for protein functional site prediction.
29 Sep 2005 - COPid: composition based protein identification.
30 Dec 2007 - Support vector training of protein alignment models.
30 Aug 2008 - The 3of5 web application for complex and comprehensive pattern matching in protein sequences.
14 Mar 2006 - Sequence handling by sequence analysis toolbox v1.0.
30 Dec 2006 - Internet basics.
29 Apr 2001 - DBAli tools: mining the protein structure space.
May 2007
Related Article Map
Legend:
- FREE Full text Article.
- Abstract only.
- Title only. More help.
See a large map of 100+ related articles.