GenBank has reached 200 billion base pairs from > 350,000 spp.

“Almost the number of stars in the Milky Way.” Through this stellar comparison, the National Institutes of Health proudly announced in 2005 that the content of their computerized collection of DNA sequences called GenBank had reached 50 billion bases or units of DNA. Today, it contains far more, over 200 billion bases from over 350,000 different species, making it one of the largest scientific database in the world.

Here is the announcement of the availability of the Nirenberg papers: “GenBank & The Early Years of “Big Data”

“Deciphering the Genetic Code: A 50 Year Anniversary” January, 2015

Marshall Nirenberg in the lab in early 1960’s, when he completed the first summary document of the genetic code — how triplets (DNA sequences) direct amino acids to form proteins.  Pictures of the group and more about the papers are here:

A young man in a lab coat and plastic gloves holds up a glass tube in a laboratory.

PubMed bibliographic records are enhanced by Images from NCBI

“The PubMed Abstract display for PubMed Central® articles will be enhanced to include an image strip generated from the soon-to-be-released National Center for Biotechnology Information (NCBI) Images database.”  To see an example, click on the linked article below:


Canese K. PubMed® Display Enhanced with Images from the New NCBI Images Database. NLM Tech Bull. 2010 Sep-Oct;(376):e14.