U.S. flag

An official website of the United States government

estd21 (Wheeler et al. 2008)

Organism:
Human
Study Type:
Control Set
Submitter:
Richard Gibbs
Description:
The association of genetic variation with disease and drug response, and improvements in nucleic acid technologies, have given great optimism for the impact of 'genomic medicine'. However, the formidable size of the diploid human genome, approximately 6 gigabases, has prevented the routine application of sequencing methods to deciphering complete individual human genomes. To realize the full potential of genomics for human health, this limitation must be overcome. Here we report the DNA sequence of a diploid genome of a single individual, James D. Watson, sequenced to 7.4-fold redundancy in two months using massively parallel sequencing in picolitre-size reaction vessels. This sequence was completed in two months at approximately one-hundredth of the cost of traditional capillary electrophoresis methods. Comparison of the sequence to the reference genome led to the identification of 3.3 million single nucleotide polymorphisms, of which 10,654 cause amino-acid substitution within the coding sequence. In addition, we accurately identified small-scale (2-40,000 base pair (bp)) insertion and deletion polymorphism as well as copy number variation resulting in the large-scale gain and loss of chromosomal segments ranging from 26,000 to 1.5 million base pairs. Overall, these results agree well with recent results of sequencing of a single individual by traditional methods. However, in addition to being faster and significantly less expensive, this sequencing technology avoids the arbitrary loss of genomic sequences inherent in random shotgun sequencing by bacterial cloning because it amplifies DNA in a cell-free system. As a result, we further demonstrate the acquisition of novel human sequence, including novel genes not previously identified by traditional genomic sequencing. This is the first genome sequenced by next-generation technologies. Therefore it is a pilot for the future challenges of 'personalized genome sequencing'. See Variant Summary counts for estd21 in dbVar Variant Summary.
Project:
PRJNA107641
Publication(s):
Wheeler et al. 2008

Detailed Information: Download 23 Variant Regions, Download 23 Variant Calls, Download Both, FTP

Variant Summary

Assembly used for analysis:
Remapped: GRCh38.p12 (hg38)
Remapped: GRCh37.p13 (hg19)
Submitted: NCBI36 (hg18)

Sequence IDChrNumber of Variant RegionsNumber of Variant CallsPlacement typeLink to graphical display
NC_000001.11Chr155RemappedNC_000001.11
NC_000002.12Chr211RemappedNC_000002.12
NC_000003.12Chr311RemappedNC_000003.12
NC_000004.12Chr422RemappedNC_000004.12
NC_000005.10Chr511RemappedNC_000005.10
NC_000006.12Chr611RemappedNC_000006.12
NC_000007.14Chr711RemappedNC_000007.14
NC_000009.12Chr911RemappedNC_000009.12
NC_000011.10Chr1111RemappedNC_000011.10
NC_000014.9Chr1433RemappedNC_000014.9
NC_000015.10Chr1511RemappedNC_000015.10
NC_000016.10Chr1622RemappedNC_000016.10
NC_000017.11Chr1711RemappedNC_000017.11
NC_000022.11Chr2211RemappedNC_000022.11
NT_187523.1Chr2|NT_187523.111RemappedNT_187523.1
NT_187647.1Chr2|NT_187647.111RemappedNT_187647.1
NT_167246.2Chr6|NT_167246.211RemappedNT_167246.2
NT_187600.1Chr14|NT_187600.111RemappedNT_187600.1
NT_187663.1Chr17|NT_187663.111RemappedNT_187663.1
NT_167251.2Chr17|NT_167251.211RemappedNT_167251.2
NT_187633.1Chr22|NT_187633.111RemappedNT_187633.1
Sequence IDChrNumber of Variant RegionsNumber of Variant CallsPlacement typeLink to graphical display
NC_000001.10Chr155RemappedNC_000001.10
NC_000002.11Chr211RemappedNC_000002.11
NC_000003.11Chr311RemappedNC_000003.11
NC_000004.11Chr422RemappedNC_000004.11
NC_000005.9Chr511RemappedNC_000005.9
NC_000006.11Chr611RemappedNC_000006.11
NC_000007.13Chr711RemappedNC_000007.13
NC_000009.11Chr911RemappedNC_000009.11
NC_000011.9Chr1111RemappedNC_000011.9
NC_000014.8Chr1433RemappedNC_000014.8
NC_000015.9Chr1511RemappedNC_000015.9
NC_000016.9Chr1622RemappedNC_000016.9
NC_000017.10Chr1711RemappedNC_000017.10
NC_000022.10Chr2222RemappedNC_000022.10
NW_003871055.3Chr1|NW_003871055.311RemappedNW_003871055.3
NW_004166863.1Chr14|NW_004166863.111RemappedNW_004166863.1
NW_003871086.1Chr17|NW_003871086.111RemappedNW_003871086.1
Sequence IDChrNumber of Variant RegionsNumber of Variant CallsPlacement typeLink to graphical display
NC_000001.9Chr155SubmittedNC_000001.9
NC_000002.10Chr211SubmittedNC_000002.10
NC_000003.10Chr311SubmittedNC_000003.10
NC_000004.10Chr422SubmittedNC_000004.10
NC_000005.8Chr511SubmittedNC_000005.8
NC_000006.10Chr611SubmittedNC_000006.10
NC_000007.12Chr711SubmittedNC_000007.12
NC_000009.10Chr911SubmittedNC_000009.10
NC_000011.8Chr1111SubmittedNC_000011.8
NC_000014.7Chr1433SubmittedNC_000014.7
NC_000015.8Chr1511SubmittedNC_000015.8
NC_000016.8Chr1622SubmittedNC_000016.8
NC_000017.9Chr1711SubmittedNC_000017.9
NC_000022.9Chr2222SubmittedNC_000022.9

Variant Region remap statusVariant Call remap status
Sequence IDChrVariant Regions on sourcePerfectGoodPassFailMultVariant Calls on sourcePerfectGoodPassFailMult
NC_000001.9Chr1550000550000
NC_000002.10Chr2110000110000
NC_000003.10Chr3110000110000
NC_000004.10Chr4220000220000
NC_000005.8Chr5110000110000
NC_000006.10Chr6110000110000
NC_000007.12Chr7110000110000
NC_000009.10Chr9110000110000
NC_000011.8Chr11110000110000
NC_000014.7Chr14320001320001
NC_000015.8Chr15100100100100
NC_000016.8Chr16210001210001
NC_000017.9Chr17100001100001
NC_000022.9Chr22220000220000
Variant Region remap statusVariant Call remap status
Sequence IDChrVariant Regions on sourcePerfectGoodPassFailMultVariant Calls on sourcePerfectGoodPassFailMult
NC_000001.9Chr1550000550000
NC_000002.10Chr2100001100001
NC_000003.10Chr3110000110000
NC_000004.10Chr4220000220000
NC_000005.8Chr5110000110000
NC_000006.10Chr6100001100001
NC_000007.12Chr7110000110000
NC_000009.10Chr9110000110000
NC_000011.8Chr11110000110000
NC_000014.7Chr14311001311001
NC_000015.8Chr15100100100100
NC_000016.8Chr16210100210100
NC_000017.9Chr17100001100001
NC_000022.9Chr22220000220000

Samplesets

Number of Samplesets: 2

Sampleset ID:
1
Description:
Test single sample
Size:
1
Organisms:
Homo sapiens
Sampleset Phenotype(s):
None reported
  • Download Samples as CSV file
  • Samples for sampleset 1
    Sample IDCell TypeSubject ID SexEthnicitySubject Phenotype
    WATSONWhite bloodWATSONMaleCaucasianNot reported
    Sampleset ID:
    2
    Name:
    Reference Sample Sampleset
    Size:
    2
    Organisms:
    Homo sapiens
    Sampleset Phenotype(s):
    None reported
  • Download Samples as CSV file
  • Samples for sampleset 2
    Sample IDCell TypeSubject ID SexEthnicitySubject AgeSubject Phenotype
    Standard reference Caucasian male, Kleberg CytogenStandard reference Caucasian male, Kleberg CytogenMaleCaucasianNot reported
    NA10851B-LymphocyteNA10851MaleCaucasian52 YearsNot reported

    Experimental Details

    Experiment IDTypeMethodAnalysisPlatformsDataNumber of Variant Calls
    1DiscoveryOligo aCGHProbe signal intensityAgilent-014584 Human Genome 244K CGH Microarray (Alpha Test), [Mapping250K_Nsp] Affymetrix Mapping 250K Nsp SNP Array, [Mapping250K_Sty] Affymetrix Mapping 250K Sty2 SNP ArrayGEO, GEO23
    2ValidationOligo aCGHProbe signal intensityNimblegen HD2 Microarray17
    3ValidationSequencingRead depthRoche 454SRA19
    4ValidationOligo aCGHProbe signal intensityAgilent-014584 Human Genome 244K CGH Microarray (Alpha Test), [Mapping250K_Nsp] Affymetrix Mapping 250K Nsp SNP Array, [Mapping250K_Sty] Affymetrix Mapping 250K Sty2 SNP ArrayGEO, GEO15

    Validations

    Experiment IDMethodAnalysisPlatformDataNumber of Variant Calls Validated
    2Oligo aCGHProbe signal intensityNimblegen HD2 Microarray17
    3SequencingRead depthRoche 454SRA19
    4Oligo aCGHProbe signal intensityAgilent-014584 Human Genome 244K CGH Microarray (Alpha Test), [Mapping250K_Nsp] Affymetrix Mapping 250K Nsp SNP Array, [Mapping250K_Sty] Affymetrix Mapping 250K Sty2 SNP ArrayGEO, GEO15
    Support Center