http://www.ncbi.nlm.nih.gov/


One of the roles of NCBI is to archive raw DNA sequence data. The sequence information comes from research efforts in laboratories around the world as well as from large-scale, dedicated genome sequencing centers. The resulting database is referred to as GenBank. GenBank shares its resources with the European and Japanese equivalents so that there are three primary public repositories of such information in the world. Because of the automation of DNA sequencing over the past decade, these databases are increasing in size exponentially. GenBank includes the sequences of the E. coli, Drosophila, and human genome, as well as data from thousands of other species. At present it comprises more than 13 million DNA sequences with a cumulative length of 14 billion base pairs. You can see the rate of growth by clicking here (after viewing the data, close the pop-up window that it appeared in).

back   next