TRANSCRIPTOME 2002: From Functional Genomics to Systems Biology
March 10-13, 2002
Seattle, Washington, USA


Large-Scale Identification of the Promoter Regions of Human Genes Using Full-length Enriched and 5'-end Enriched cDNA Libraries Constructed by the Oligo-Capping Method

Yutaka Suzuki1, Riu Yamashita1, Sawako Suzuki1, Aya Miura1, Munetomo Hida1, Junko Mizushima-Sugano1, Hata Hiroko1, Takao Isogai2, Kenta Nakai1, Sumio Sugano1, 1The Institute of Medical Science, The University of Tokyo, Tokyo, JAPAN, 2The Helix Research Institute, Chiba, JAPAN

In order to identify the promoters from the large volumes of genomic sequences, we utilized mRNA start sites determined by a large scale sequencing of the cDNA libraries constructed by the 'oligo-capping' method. So far, we have accumulated 271,304 5' end sequences through random sequencing of more than 130 kinds of goligo-capped (full-length enriched and 5 f-end enriched) human cDNA libraries. Among them, 149,950 have been corresponded to cDNA sequences of known genes (8,959 genes). Sequence comparison between our entries and those of a reference sequence database, RefSeq, revealed that 4270 (48%) of RefSeq sequences should be extended towards the 5' ends. We, then, computationally mapped these 5f-end sequences onto the human draft genomic sequence obtained from Golden Path. The results of the mapping provided us with not only the precise positional information of the transcriptional start sites and the adjacent promoters of the corresponding genes but also with the detailed information on distribution patterns of transcriptional start sites, which should reflect the dynamic nature of the transcription initiation events in vivo. All of these data are presented in our new database, DataBase of Transcriptional Start Sites (DBTSS;

Return to Table of Contents * Speaker Abstracts * Poster Abstracts * View the Photos

Return to Meetings Home Page

This site produced by the Human Genome Management Information System of Oak Ridge National Laboratory.