Archive Site Provided for Historical Purposes
Sponsored by the U.S. Department of Energy Human Genome Program
Human Genome News, March 1991; 2(6)
The second Joint DOE-NIH Informatics Task Force (JITF) meeting, November 30-December 1, 1990, addressed the Human Genome Project need for public databases containing mapping and sequencing data. Discussions were organized around presentations on existing map and sequence databases by representatives from data-housing facilities. Several presentations focused on the evolution, current status, and future development plans of data repositories.
Other speakers were Scott Tingey (Du Pont) and Brian Hauge (Massachusetts General Hospital). Tingey discussed the Molecular Breeding Program (plants) and offered some solutions to the problems of storing vast quantities of laboratory data in accessible forms. Hauge described the progress and database requirements of the Arabidopsis genome mapping program.
The task force concurred on the following guidelines for establishing genome data resources:
The four JITF working groups made brief reports on their work. The action items that follow were included in the reports or resulted from them.
The group will concentrate on mapping data; Lipman recommended establishing connections with model organism mapping data projects. Branscomb was appointed liaison to the Human Genome Organisation committee on physical mapping data.
The group's aim was reported to be fostering capability and not mandating actions; it recognized that the Internet TCP/IP protocol suite is the U.S. connectivity standard. The working group recommended that all genome centers and genome data resources be Internet accessible and that the funding agencies provide connection guidance and support. The group pointed out that network resource availability would create a second cycle of demand from individual researchers; NIH and DOE should expect this demand to increase and be prepared for it.
Although DOE, NIH, and the National Science Foundation (NSF) have separate genome and computation fellowships, the working group stressed that the Human Genome Project has an opportunity to make a real impact on interdisciplinary computation and biology training by designing a fellowship that would be available at a number of levels: predoctoral, postdoctoral, and mid-career. Another short-term goal of the working group is the development of a summer course in genome informatics for investigators whose primary training is in biology.
The need for analytical tools and genome informatics training was discussed. The group noted that NSF has taken the lead in biocomputing training. Frank Olken (Lawrence Berkeley Laboratory) pointed out that the Human Genome Project should support basic research in database theory, because advances in database theory and practice are necessary to achieve project goals. Lipman suggested that the cost for such research would be less than that for one genome center, and that Human Genome Project administrators need to be aware of current research and to encourage computer scientists to meet the project's database requirements.
Prior to its next meeting (tentatively scheduled for March 14-15), JITF plans to organize a workshop on laboratory support databases and associated software to develop a requirement specification for a general laboratory support tool.
Reported by David Benton, NIH NCHGR and Robert Robbins, NSF/DOE
David Benton is Assistant to the Director for Scientific Data Management at NCHGR.
Robert Robbins is Program Director for Database Activities in the Biological, Behavioral, and Social Sciences at NSF; he is assisting DOE in genome informatics and computational activities through the courtesy of NSF.
The electronic form of the newsletter may be cited in the following style:
Human Genome Program, U.S. Department of Energy, Human Genome News (v2n6).
The Human Genome Project (HGP) was an international 13-year effort, 1990 to 2003. Primary goals were to discover the complete set of human genes and make them accessible for further biological study, and determine the complete sequence of DNA bases in the human genome. See Timeline for more HGP history.
Published from 1989 until 2002, this newsletter facilitated HGP communication, helped prevent duplication of research effort, and informed persons interested in genome research.