|
|
| Research article summary (published 4 Sep 2008): |
A fast SEQUEST cross correlation algorithm.
Full Abstract
The SEQUEST program was the first and remains one of the most widely used tools for assigning a peptide sequence within a database to a tandem mass spectrum. The cross correlation score is the primary score function implemented within SEQUEST and it is this score that makes the tool particularly sensitive. Unfortunately, this score is computationally expensive to calculate, and thus, to make the score manageable, SEQUEST uses a less sensitive but fast preliminary score and restricts the cross correlation to just the top 500 peptides returned by the preliminary score. Classically, the cross correlation score has been calculated using Fast Fourier Transforms (FFT) to generate the full correlation function. We describe an alternate method of calculating the cross correlation score that does not require FFTs and can be computed efficiently in a fraction of the time. The fast calculation allows all candidate peptides to be scored by the cross correlation function, potentially mitigating the need for the preliminary score, and enables an E-value significance calculation based on the cross correlation score distribution calculated on all candidate peptide sequences obtained from a sequence database.
Author information
Author/s: Eng, Jimmy K (JK); Fischer, Bernd (B); Grossmann, Jonas (J); Maccoss, Michael J (MJ);
Affiliation: Department of Genome Sciences, University of Washington, Seattle, Washington, USA. engj(-atsign-)u.washington.edu
Journal and publication information
Publication Type: Journal Article; Research Support, Non-U.S. Gov't
Journal: Journal of proteome research (J Proteome Res), published in United States. (Language: eng)
Reference: 2008-Oct; vol 7 (issue 10) : pp 4598-602
Dates: Created 2008/10/03; Completed 2008/11/13;
PMID: 18774840, status: MEDLINE (last retrieval date: 2/18/2009, IMS Date: 18 Feb 2009 00:00:00)
Sourced from the National Library of Medicine. Abstract text and other information may be subject to copyright.
External Links for this article
(including full text providers, if available):
Click Electronic Full-text Provider Links to see options for finding the electronic full text links to this article. Note there may be a subscription or fee required for access to the full text. See our FAQ for information on finding FREE full text articles.
This article may also be located in paper journal collections available in many libraries. Use the Journal and Publication Information above to find the full article.
MeSH headings (categories)
This article was linked to the MESH Headings shown below.
Related articles
These are the highest related articles currently in the database:
- Sequence patterns derived from the automated prediction of functional residues in structurally-aligned homologous protein families.
6 Apr 2004 - Multi-spectra peptide sequencing and its applications to multistage mass spectrometry.
29 Jun 2008 - De novo peptide sequencing via manual interpretation of MS/MS spectra.
29 Apr 2002 - Protein identification by tandem mass spectrometry and sequence database searching.
30 Dec 2006 - Automated image alignment for 2D gel electrophoresis in a high-throughput proteomics pipeline.
26 Feb 2008 - Automating genomic data mining via a sequence-based matrix format and associative rule set.
13 Jul 2005 - Vector-G: multi-modular SVM-based heterotrimeric G protein prediction.
30 Dec 2007 - Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs.
11 Sep 2007 - XSTREAM: a practical algorithm for identification and architecture modeling of tandem repeats in protein sequences.
9 Oct 2007 - Prediction of peptides observable by mass spectrometry applied at the experimental set level.
30 Oct 2007
Related Article Map
Legend:
- FREE Full text Article.
- Abstract only.
- Title only. More help.
See a large map of 100+ related articles.