The Aurum data set is a collection of commercially purchased proteins that have been checked for purity, tryptically digested, and analyzed using an ABI 4700 (MALDI TOF/TOF) mass spectrometer. It is intended as a reference set of spectra that can serve many purposes, such as training computer algorithms or studying the fragmentation pathways of high-energy CID. The data set contains tryptic digests of over 250 known proteins, and each protein has been checked for purity using 1D gel analysis. Proteins in the Aurum data set were overexpressed in E. coli and purified using a sequence tag by Genway (San Diego, CA). The FASTA file distributed with this data set contains the NCBI nr protein sequence for each protein, as specified by Genway, with the appropriate sequence tag included.
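As a rough illustration of how the FASTA sequences might be related to the tryptic digests, the sketch below parses FASTA text and performs a simple in-silico tryptic digestion. The function names and the example record are hypothetical and are not part of the data set. The cleavage rule used (cut after K or R unless the next residue is P, with no missed cleavages) is the conventional trypsin approximation, assumed here rather than specified by this data set.

```python
import re
from typing import Dict, Iterable, List


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into a {header: sequence} mapping."""
    records: Dict[str, List[str]] = {}
    header = None
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]  # drop the leading '>'
            records[header] = []
        elif header is not None:
            records[header].append(line)  # sequence may span several lines
    return {h: "".join(parts) for h, parts in records.items()}


def tryptic_peptides(sequence: str) -> List[str]:
    """Cleave after K or R unless followed by P (assumed rule, no missed cleavages)."""
    # Zero-width split points; requires Python 3.7 or later.
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


if __name__ == "__main__":
    # Hypothetical record; a real file would be opened and passed as `open(path)`.
    example = [">demo tagged protein", "MKRPAA", "KGG"]
    for name, seq in read_fasta(example).items():
        print(name, tryptic_peptides(seq))
```

Because the FASTA entries include the sequence tag, peptides derived from the tag would appear in such a digest alongside those from the native protein sequence.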
More information about the Aurum data set.
This is a list of people who have contributed to this project in one way or another. The primary contact is John Strahler; please contact him if you have any questions regarding this data set.
This project was supported in part by the National Resource for Proteomics and Pathways (NRPP) and the Michigan Proteome Consortium (MPC, www.proteomeconsortium.org).
The goal of this project is to represent the data set succinctly and accurately. When possible, we include the raw spectral information along with any meaningful curation results. If you have trouble reading the raw spectral data, please use the ProteomeCommons.org IO Framework's conversion utility to translate the data into a usable format.
If you have trouble reading either the spotting information or the curation results, please see the credits section and contact the appropriate person. Note that we do not provide free technical support, nor will we interpret the data for your needs; however, we will be happy to answer simple questions to help you work with the data.