|
||
|
||
- | ||
Searching A Sequence Database With Complex Proteomic Data |
||
- | ||
- | ||
Introduction:
So far we have learned how to identify proteins with peptide-mass fingerprinting, sequence tag, and with un-interpreted MS/MS data. Most of the procedures we have performed have been fairly manual. Generally proteomic data is gathered during an LC/MS run, that is, mass spectra are collected during an HPLC separation of a complex mixture of peptides. Modern mass spectrometers can collect 3 to10 fragment spectra per second. Most proteomic LC/MS analyses last several hours, and most multidimensional separations, i.e. strong cation exchange followed by reverse phase LC/MS, can take anywhere from 12 to 24 hours, and that is for a single sample. A two hour run with a mass spectrometer collecting 10 fragment spectra every second could yield a lot of spectra. If you don't have a hundred graduate students, then sophisticated software is your only option. |
||
A Complex Proteomic Protein ID Exercise: Use the short but complex LC/MS run shown in Figure 1 below, and search a sequence database. For this example you will be using the web version of !Xtandem, part of "The Global Proteome Machine" at UBC
|
||
Figure 1. This is a complex peptide separation collected on an LCQ DECA XP Plus ion trap mass spectrometer over a relatively short period of time, 10 min. An entire proteome was digested with trypsin. A small aliquot was separated on a 0.075X100mm reverse phase column. Two fragment spectra were collected every second. There are about 1200 spectra in the 10 min separation shown above. | ||
To simplify the exercise three hundred of the most intense fragment spectra were concatenated into a simple notepad text file. Right click here and choose to "Save Target As..." then choose a location on your computer to save it. You will need to "un-zip" this proteomics file. To do the search browse to http://www.theGPM.org , once there, under the "Search Sites" heading click on the "Genomes" link, the submission form shown below will come up, or you can just click on this link to get there directly. Some of the information has been changed on the entry page shown below. This was done to match our data file. You will need to change some of the fields to match the page shown below. We changed the minimum number of fields: taxon to mosquito, modifications to Carbamidomethyly (C), enzyme to trypsin, and predefined methods to ion trap. In the future you should explore all of the fields and learn what they do, and learn how they can help you in your particular search. | ||
|
||
Figure 2. This is the data entry page that comes up at "theGPM" | ||
- Once you are finished making the changes to the submission form Press the Find Proteins button near the top third of the page, it is an orange button. The search will take a few minutes, there are many MS/MS spectra contained in the file you just submitted. |
||
Results: Explore the results page. How many proteins were identified in this short proteomics experiment? Notice the sequence coverage for each protein. How many proteins have two or more peptides identified? | ||
-- | ||
Conclusion: All right, it's official you are a proteomics expert. Don't forget to pick up your certificate as you exit. Oh yes, one last thing before we can issue your certificate please proceed to the additional reading and tutorial reference page and complete the assigned reading. Congratulations! | ||
Suggested Reading For This Module:
|
||
e-mail the webmaster@ionsource.com with all inquiries | ||
home
| terms of use
(disclaimer) Copyright � 2004-2016 IonSource All rights reserved. Last updated: Tuesday, January 19, 2016 02:48:31 PM |
||