|
|
| Research article summary (published 9 Oct 2007): |
|
Free Full Text! See links below |
XSTREAM: a practical algorithm for identification and architecture modeling of tandem repeats in protein sequences.
Full Abstract
BACKGROUND: Biological sequence repeats arranged in tandem patterns are widespread in DNA and proteins. While many software tools have been designed to detect DNA tandem repeats (TRs), useful algorithms for identifying protein TRs with varied levels of degeneracy are still needed. RESULTS: To address limitations of current repeat identification methods, and to provide an efficient and flexible algorithm for the detection and analysis of TRs in protein sequences, we designed and implemented a new computational method called XSTREAM. Running time tests confirm the practicality of XSTREAM for analyses of multi-genome datasets. Each of the key capabilities of XSTREAM (e.g., merging, nesting, long-period detection, and TR architecture modeling) are demonstrated using anecdotal examples, and the utility of XSTREAM for identifying TR proteins was validated using data from a recently published paper. CONCLUSION: We show that XSTREAM is a practical and valuable tool for TR detection in protein and nucleotide sequences at the multi-genome scale, and an effective tool for modeling TR domains with diverse architectures and varied levels of degeneracy. Because of these useful features, XSTREAM has significant potential for the discovery of naturally-evolved modular proteins with applications for engineering novel biostructural and biomimetic materials, and identifying new vaccine and diagnostic targets.
Author information
Author/s: Newman, Aaron M (AM); Cooper, James B (JB);
Affiliation: Biomolecular Science and Engineering Program, University of California, Santa Barbara, CA 93106, USA. a_newman(-atsign-)lifesci.ucsb.edu
Journal and publication information
Publication Type: Journal Article; Research Support, Non-U.S. Gov't
Journal: BMC bioinformatics (BMC Bioinformatics), published in England. (Language: eng)
Reference: 2007-; vol 8 (issue ) : pp 382
Dates: Created 2008/02/07; Completed 2008/03/27; Revised 2008/11/20;
PMID: 17931424, status: MEDLINE (last retrieval date: 2/18/2009, IMS Date: )
Sourced from the National Library of Medicine. Abstract text and other information may be subject to copyright.
External Links for this article
(including full text providers, if available):
Click Electronic Full-text Provider Links to see options for finding the electronic full text links to this article. Note there may be a subscription or fee required for access to the full text. See our FAQ for information on finding FREE full text articles.
This article may also be located in paper journal collections available in many libraries. Use the Journal and Publication Information above to find the full article.
MeSH headings (categories)
This article was linked to the MESH Headings shown below.
Related articles
These are the highest related articles currently in the database:
- A fast SEQUEST cross correlation algorithm.
4 Sep 2008 - Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space.
29 Jun 2008 - Automatic transcription factor classifier based on functional domain composition.
19 Jun 2006 - An iterative refinement algorithm for consistency based multiple structural alignment methods.
27 Jun 2006 - Prediction of peptides observable by mass spectrometry applied at the experimental set level.
30 Oct 2007 - IsoSVM--distinguishing isoforms and paralogs on the protein level.
4 Mar 2006 - An efficient algorithm for optimizing whole genome alignment with noise.
12 May 2004 - SVM-HUSTLE--an iterative semi-supervised machine learning approach for pairwise protein remote homology detection.
30 Jan 2008 - A structural alignment kernel for protein structures.
16 Jan 2007 - Semi-supervised LC/MS alignment for differential proteomics.
13 Jul 2006
Related Article Map
Legend:
- FREE Full text Article.
- Abstract only.
- Title only. More help.
See a large map of 100+ related articles.