HGPI

Human Genome Project Information Archive
1990–2003

Archive Site Provided for Historical Purposes


Sponsored by the U.S. Department of Energy Human Genome Program

Human Genome News Archive Edition
go to list of issues »
Vol.10, No.3-4   October 1999 

In this issue...

Available in PDF

DOE '99 Oakland Highlights

Genome Project

In the News 

Microbial Genomics

Ethical, Legal, and Social Issues

Informatics

Web, Other Resources, Publications

Funding 

Meeting Calendars & Acronyms

  • Genome and Biotech Meeting 
  • Training Courses and Workshops 
  • Acronyms

HGN archives and subscriptions

Human Genome Project Information home

Microbial Genomics

EcoCyc Database for E. coli

The EcoCyc electronic database is a literature-derived resource that describes the genome and biochemical machinery of Escherichia coli. The database contains up-to-date annotations and the DNA sequences of all genes in E. coli and describes all known pathways of its small-molecule metabolism. Each pathway and its component reactions and enzymes are annotated in rich detail, with extensive references to the biomedical literature. For example, the detail provided for each E.coli enzyme includes its cofactors, activators, inhibitors, and subunit structure. In July, the database included 159 metabolic pathways, 946 reactions, 629 enzymes, and 4390 genes.

EcoCyc also is used by investigators to annotate other microbial genomes. Because E.coli has the largest fraction of gene products whose functions were determined experimentally, sequence- similarity matches to E.coli are less likely to result in incorrect function predictions than are matches to other microbial genomes. Those genomes may have a higher rate of annotation errors due to computational, and perhaps transitive, misannotation.

EcoCyc Features
EcoCyc's Pathway Tools software provides a powerful environment for creating, managing, and publishing pathway and genome databases (DBs) on the Web. One of the components is the Pathway Genome Navigator, which allows users to query these DBs and to visualize and compare the resulting pathways, genes, genome maps, reactions, and enzymes. The PathoLogic program computationally predicts an organism's metabolic-pathway complement and creates a DB describing that prediction. A set of graphical tools allows users to edit the annotations interactively.

Future Directions
The EcoCyc project is now moving beyond E.coli metabolic pathways to include signal-transduction pathways, transport proteins, regulation of gene expression, and tRNAs. Version5.0, released in June, contains detailed annotations of E.coli phosphotransferase-system transporters authored by collaborators Milton Saier and Ian Paulsen (University of California, San Diego). In addition, Julio Collado (Universidad Nacional Autonoma de Mexico) is adding descriptions of E.coli gene regulation, including operons, promoters, and DNA-binding proteins. Once the regulatory mechanisms are added to EcoCyc, researchers will be able to compare known mechanisms of E.coli gene regulation with microarray-derived gene-expression data.

Pangea Systems, a bioinformatics company in Oakland, California, makes EcoCyc available free to the academic community and for a fee to commercial organizations. The company is interested in collaborating with academic genome centers to use the Pathway Tools for curation and Web publishing of their genomes. [Peter Karp, pkarp@pangeasystems.com; Monica Riley, mriley@mbl.edu; EcoCyc]

See also: "Developing EcoCyc"


The electronic form of the newsletter may be cited in the following style:
Human Genome Program, U.S. Department of Energy, Human Genome News (v10n3-4).

Human Genome Project 1990–2003

The Human Genome Project (HGP) was an international 13-year effort, 1990 to 2003. Primary goals were to discover the complete set of human genes and make them accessible for further biological study, and determine the complete sequence of DNA bases in the human genome. See Timeline for more HGP history.

Human Genome News

Published from 1989 until 2002, this newsletter facilitated HGP communication, helped prevent duplication of research effort, and informed persons interested in genome research.