Table 1Files on the taxonomy FTP site

FileUncompresses toDescription
taxdump.tar.Zareadme.txtA terse description of the dmp files
nodes.dmpStructure of the database; lists each taxid with its parent taxid, rank, and other values associated with each node (genetic codes, etc.)
names.dmpLists all the names associated with each taxid
delnodes.dmpDeleted taxid list
merged.dmpMerged nodes file
division.dmpGenBank division files
gencode.dmpGenetic codes files
gc.prtPrint version of genetic codes
gi_taxid_nucl.dmp.gzgi_taxid_nucl.dmpA list of gi_taxid pairs for every live gi-identified sequence in the nucleotide sequence database
gi_taxid_prot.dmp.gzgi_taxid_prot.dmpA list of gi_taxid pairs for every live gi-identified sequence in the protein sequence database
gi_taxid_nucl_diff.dmpgi_taxid_nucl_diffList of differences between latest gi_taxid_nucl and previous listing
gi_taxid_prot_diff.dmpgi_taxid_prot_diffList of differences between latest gi_taxid_prot and previous listing
a

For non-UNIX users, the file taxdmp.zip includes the same (zip compressed) data.

From: Chapter 4, The Taxonomy Project

Cover of The NCBI Handbook
The NCBI Handbook [Internet].
McEntyre J, Ostell J, editors.

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.