Genome assembly ASM246719v1

NCBI RefSeq assembly
GCF_002467195.1
Submitted GenBank assembly
GCA_002467195.1
Taxon
Escherichia coli (E. coli)
Strain
MOD1-EC128
WGS project
NJNG01
Submitter
FDA/CFSAN
Date
Oct 6, 2017

Assembly statistics

Statistic               RefSeq      GenBank
Genome size             5 Mb        5 Mb
Total ungapped length   5 Mb        5 Mb
Number of contigs       234         234
Contig N50              122.8 kb    122.8 kb
Contig L50              14          14
GC percent              50.5        50.5
Genome coverage         21.5x       21.5x
Assembly level          Contig      Contig
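Contig N50 and Contig L50 in the statistics above follow the usual definitions: with contigs sorted from longest to shortest, N50 is the length of the contig at which the running total first reaches half of the total assembly length, and L50 is the number of contigs needed to get there. The following is a minimal sketch in Python, assuming the contig lengths are available as a list of integers. The function name `n50_l50` and the toy lengths are illustrative only; they are not part of any NCBI tool and are not taken from this assembly.

```python
def n50_l50(contig_lengths):
    """Return (N50, L50) for a list of contig lengths in bases."""
    if not contig_lengths:
        raise ValueError("need at least one contig length")
    # Walk the contigs from longest to shortest.
    ordered = sorted(contig_lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # The first contig that brings the running sum to half the
        # total length defines N50; its rank in the list is L50.
        if running >= half_total:
            return length, count


# Hypothetical toy lengths, not taken from ASM246719v1.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
n50, l50 = n50_l50(toy_lengths)
print(n50, l50)
```

Read this way, the reported Contig L50 of 14 means that the 14 longest of the 234 contigs together cover at least half of the assembly, and the shortest of those 14 is about 122.8 kb long.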

Sample details

BioSample ID
SAMN06045578
Description
Pathogen: environmental/food/other sample from Escherichia coli
Submitter
FDA Center for Food Safety and Applied Nutrition
Collection date
1979
Strain
MOD1-EC128
Attribute package
environmental/food/other
Isolate name alias
CFSAN057030, B41
Collected by
Central Veterinary Laboratory, England
Geographic location
United Kingdom
Host
Bos taurus
Isolation source
cow feces (Bos taurus)
Interagency Food Safety Analytics Collaboration (IFSAC) category
veterinary clinical/research, cow
Ontological term
bos taurus:NCBITAXON_9913, cattle:FOODON_03411161, feces:UBERON_0001988
Source type
animal
CFSAN
CFSAN057030
Models
Pathogen.env
Package
Pathogen.env.1.0
Submission date
2016-11-21T14:16:21.656
Publication date
2016-11-21T00:00:00.000
Last updated
2022-07-02T11:31:24.069

Assembly methods

Sequencing technology
Illumina NextSeq 500
Assembly method
SPAdes v. 3.8.2

Additional genomes

Browse all Escherichia coli genomes (311553)

BioProject

PRJNA230969

GenomeTrakr Project: US Food and Drug Administration

Pathogen Detection Resource

Annotation details

RefSeq
Provider
NCBI RefSeq
Name
GCF_002467195.1-RS_2024_06_14
Date
Jun 14, 2024
Genes
5,016
Protein-coding
4,663
Software version
6.7

About PGAP

The NCBI Prokaryotic Genome Annotation Pipeline (PGAP) uses multiple approaches to predict protein-coding and RNA genes and other functional elements directly from sequence.


Quality analysis

CheckM analysis (v1.2.2)

Completeness: 98.74% (32nd Percentile)

Contamination: 0.3%

Calculated on the Prokaryotic Genome Annotation Pipeline (PGAP) gene set with the Escherichia coli CheckM marker set. For more information on CheckM, see Parks et al., Genome Res (2015).

Taxonomy check

Taxonomy check status
OK
Best match status
species_match
Submitted organism name
Escherichia coli
Submitted species name
Escherichia coli

Average Nucleotide Identity (ANI) match details

Field                    Best match type-strain for submitted organism   Best match type-strain
Type assembly            GCA_000210475.1                                 GCA_000210475.1
Organism name            Escherichia coli ETEC H10407                    Escherichia coli
Type category            claderef                                        claderef
ANI                      99.3%                                           99.3%
Assembly coverage        89.93%                                          89.93%
Type assembly coverage   84.27%                                          84.27%

Chromosomes

Note: This contig-level genome assembly includes 234 contigs and no assembled chromosomes.

Revision history

This record has not been revised

GenBank           RefSeq            Name          Level    Date
GCA_002467195.1   GCF_002467195.1   ASM246719v1   Contig   Oct 6, 2017