Open Access News

News from the open access movement


Tuesday, August 23, 2005

Milestone in the growth of OA data

Public Collections of DNA and RNA Sequence Reach 100 Gigabases, a press release from the National Library of Medicine (August 22). Excerpt:
For nearly 20 years, the three leading public repositories for DNA and RNA sequence data have collaborated to provide access to the ever increasing amount of genetic data produced by institutions around the world. The three repositories have now reached a significant milestone by collecting and disseminating 100 gigabases of sequence data. For a frame of reference, one hundred billion bases is about equal to the number of nerve cells in a human brain and a bit less than the number of stars in the Milky Way. These 100,000,000,000 bases, or "letters" of the genetic code, represent both individual genes and partial and complete genomes of over 165,000 organisms. While a single gene from organisms as diverse as humans, elephants, earthworms, fruitflies, apple trees, and bacteria can range from less than one hundred to over several thousand bases long, an organism's genome can be longer than one billion bases. The free access to this information allows scientists to study and compare the same data as their colleagues nearly anywhere in the world, and makes possible collaborative research that will lead ultimately to cures for diseases and improved health. Thanks to their data exchange policy, the three members of the International Nucleotide Sequence Database Collaboration: GenBank (Bethesda, Maryland USA), European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-Bank in Hinxton, UK), and the DNA Data Bank of Japan (Mishima, Japan) all reached this milestone together.