Skip to Main Content
The budding yeast, Saccharomyces cerevisiae, has been experimentally manipulated for several decades. Much of the information generated is available in the Saccharomyces genome database (SGD, http://www.yeastgenome.org/). SGD contains large datasets of both genomic and proteomic information, as well as tools for data analysis. This paper will highlight three datasets that are maintained by SGD. First, a large dataset of hand-curated information is provided in machine readable format for each gene of the Saccharomyces genome. These hand-curated annotations use the gene ontology (GO) controlled vocabularies for biological process, molecular function and cellular component and each contains categorical evidence codes and literature references. A second area of focus is on metabolic pathways. A new dataset of hand-curated information on metabolic pathways within budding yeast was released in May 2003. This resource can be searched to view biochemical reactions and pathways and their component gene products. This resource also maps data from genome-wide expression analyses onto the pathway overview providing a visualization of the changes in gene expression in the context of cellular metabolism. These pathways are created and edited using the pathway tools software but the content is reviewed and updated by SGD. A third dataset has recently become available as the result of two comparative genomic analyses. Two groups sequenced the genomes of several yeasts closely related to S. cerevisiae, and then completed a gene-by-gene comparison of these genomes. These genome comparisons were combined with available experimental evidence by SGD. Using these data the annotations for the S.cerevisiae reference genome were improved. All these datasets are freely available from the SGD ftp site.