|
|
| Research article summary (published 11 Sep 2007): |
|
Free Full Text! See links below |
Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs.
Full Abstract
BACKGROUND: In past number of methods have been developed for predicting subcellular location of eukaryotic, prokaryotic (Gram-negative and Gram-positive bacteria) and human proteins but no method has been developed for mycobacterial proteins which may represent repertoire of potent immunogens of this dreaded pathogen. In this study, attempt has been made to develop method for predicting subcellular location of mycobacterial proteins. RESULTS: The models were trained and tested on 852 mycobacterial proteins and evaluated using five-fold cross-validation technique. First SVM (Support Vector Machine) model was developed using amino acid composition and overall accuracy of 82.51% was achieved with average accuracy (mean of class-wise accuracy) of 68.47%. In order to utilize evolutionary information, a SVM model was developed using PSSM (Position-Specific Scoring Matrix) profiles obtained from PSI-BLAST (Position-Specific Iterated BLAST) and overall accuracy achieved was of 86.62% with average accuracy of 73.71%. In addition, HMM (Hidden Markov Model), MEME/MAST (Multiple Em for Motif Elicitation/Motif Alignment and Search Tool) and hybrid model that combined two or more models were also developed. We achieved maximum overall accuracy of 86.8% with average accuracy of 89.00% using combination of PSSM based SVM model and MEME/MAST. Performance of our method was compared with that of the existing methods developed for predicting subcellular locations of Gram-positive bacterial proteins. CONCLUSION: A highly accurate method has been developed for predicting subcellular location of mycobacterial proteins. This method also predicts very important class of proteins that is membrane-attached proteins. This method will be useful in annotating newly sequenced or hypothetical mycobacterial proteins. Based on above study, a freely accessible web server TBpred http://www.imtech.res.in/raghava/tbpred/ has been developed.
Author information
Author/s: Rashid, Mamoon (M); Saha, Sudipto (S); Raghava, Gajendra Ps (GP);
Affiliation: Bioinformatics Centre, Institute of Microbial Technology, Sector-39A, Chandigarh, India. mamoon(-atsign-)imtech.res.in
Journal and publication information
Publication Type: Journal Article; Research Support, Non-U.S. Gov't
Journal: BMC bioinformatics (BMC Bioinformatics), published in England. (Language: eng)
Reference: 2007-; vol 8 (issue ) : pp 337
Dates: Created 2007/12/19; Completed 2008/01/16; Revised 2008/11/20;
PMID: 17854501, status: MEDLINE (last retrieval date: 2/18/2009, IMS Date: )
Sourced from the National Library of Medicine. Abstract text and other information may be subject to copyright.
External Links for this article
(including full text providers, if available):
Click Electronic Full-text Provider Links to see options for finding the electronic full text links to this article. Note there may be a subscription or fee required for access to the full text. See our FAQ for information on finding FREE full text articles.
This article may also be located in paper journal collections available in many libraries. Use the Journal and Publication Information above to find the full article.
MeSH headings (categories)
This article was linked to the MESH Headings shown below.
Related articles
These are the highest related articles currently in the database:
- Protein subcellular localization based on PSI-BLAST and machine learning.
29 Nov 2006 - Implicit motif distribution based hybrid computational kernel for sequence classification.
12 Dec 2004 - Efficient similarity search in protein structure databases by k-clique hashing.
8 Jul 2004 - Automatic transcription factor classifier based on functional domain composition.
19 Jun 2006 - Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space.
29 Jun 2008 - Protein subcellular localization prediction based on compartment-specific features and structure conservation.
6 Sep 2007 - An iterative refinement algorithm for consistency based multiple structural alignment methods.
27 Jun 2006 - Motif extraction and protein classification.
30 Dec 2004 - Profile-based string kernels for remote homology detection and motif extraction.
30 Dec 2003 - Profile-based string kernels for remote homology detection and motif extraction.
30 May 2005
Related Article Map
Legend:
- FREE Full text Article.
- Abstract only.
- Title only. More help.
See a large map of 100+ related articles.