Genome assembly ASM253773v1
- NCBI RefSeq assembly
- GCF_002537735.1
- Submitted GenBank assembly
- GCA_002537735.1
- Taxon
- Escherichia coli (E. coli)
- Strain
- MOD1-EC6054
- WGS project
- NMDT01
- Submitter
- FDA/CFSAN
- Date
- Oct 13, 2017
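The RefSeq and GenBank accessions listed above differ only in their prefix (GCF_ versus GCA_); the numeric identifier and version are the same. A minimal Python sketch of checking that pairing follows. The helper names `parse_assembly_accession` and `is_paired` and the regular expression are illustrative assumptions, not part of any NCBI tool.

```python
import re
from typing import NamedTuple

# Assumed pattern: GCA_ or GCF_, nine digits, a dot, then a version number.
_ACCESSION_RE = re.compile(r"^(GC[AF])_(\d{9})\.(\d+)$")


class AssemblyAccession(NamedTuple):
    prefix: str   # "GCA" (GenBank) or "GCF" (RefSeq)
    number: str   # nine-digit identifier, kept as text to preserve leading zeros
    version: int


def parse_assembly_accession(accession: str) -> AssemblyAccession:
    """Split an assembly accession into prefix, number, and version."""
    match = _ACCESSION_RE.match(accession.strip())
    if match is None:
        raise ValueError(f"not an assembly accession: {accession!r}")
    prefix, number, version = match.groups()
    return AssemblyAccession(prefix, number, int(version))


def is_paired(a: AssemblyAccession, b: AssemblyAccession) -> bool:
    """True when one accession is GenBank, the other RefSeq, and the rest matches."""
    return (
        {a.prefix, b.prefix} == {"GCA", "GCF"}
        and a.number == b.number
        and a.version == b.version
    )


refseq = parse_assembly_accession("GCF_002537735.1")
genbank = parse_assembly_accession("GCA_002537735.1")
print(is_paired(refseq, genbank))
```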
Assembly statistics
Statistic | RefSeq | GenBank |
---|---|---|
Genome size | 5 Mb | 5 Mb |
Total ungapped length | 5 Mb | 5 Mb |
Number of contigs | 70 | 70 |
Contig N50 | 232.6 kb | 232.6 kb |
Contig L50 | 8 | 8 |
GC percent | 50.5 | 50.5 |
Genome coverage | 53.6x | 53.6x |
Assembly level | Contig | Contig |
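Contig N50 is the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembly length; contig L50 is the number of contigs in that set. The sketch below shows one way to compute both from a list of contig lengths. The function name `n50_l50` and the toy lengths are hypothetical and are not the contig lengths of this assembly.

```python
from typing import Iterable, Tuple


def n50_l50(contig_lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of contig lengths."""
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("at least one contig length is required")
    total = sum(lengths)
    running = 0
    # Walk from the longest contig down until half the assembly is covered.
    for count, length in enumerate(lengths, start=1):
        running += length
        if 2 * running >= total:
            return length, count
    raise AssertionError("unreachable: the full set always covers the total")


# Toy example with made-up lengths (in bp), for illustration only.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
n50, l50 = n50_l50(toy_lengths)
print(f"N50 = {n50}, L50 = {l50}")
```

Applied to the 70 contigs of this assembly, the same calculation would be expected to reproduce the reported values (N50 of 232.6 kb, L50 of 8).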
Sample details
- BioSample ID
- SAMN05439501
- Description
- Pathogen: environmental/food/other sample from Escherichia coli
- Submitter
- FDA Center for Food Safety and Applied Nutrition
- Isolate name alias
- CFSAN044291, 104
- Strain
- MOD1-EC6054
- Attribute package
- clinical/host-associated
- Isolation source
- feces
- Ontological term
- feces:uberon_0001988
- Interagency Food Safety Analytics Collaboration (IFSAC) category
- clinical/research| human
- Source type
- human
- Collection date
- Jun 30, 2010
- Geographic location
- USA
- Host
- Homo sapiens
- Collected by
- Pennsylvania State University| Escherichia coli Reference Center
- Food product origin geographic location
- Norway
- Project name
- GenomeTrakr
- Sequenced by
- Pennsylvania State University| Escherichia coli Reference Center
- SRA
- SRS1593192
- CFSAN
- CFSAN044291
- Models
- Pathogen.env
- Package
- Pathogen.env.1.0
- Submission date
- 2016-07-25T16:57:29.133
- Publication date
- 2016-07-25T00:00:00.000
- Last updated
- 2024-08-08T11:39:05.019
Assembly methods
- Sequencing technology
- Illumina NextSeq 500
- Assembly method
- SPAdes v. 3.8.2
Additional genomes
Browse all Escherichia coli genomes (311553)
- BioProject
- PRJNA230969 (GenomeTrakr Project: US Food and Drug Administration)
Annotation details
Field | RefSeq |
---|---|
Provider | NCBI RefSeq |
Name | GCF_002537735.1-RS_2024_12_01 |
Date | Dec 1, 2024 |
Genes | 4,892 |
Protein-coding | 4,626 |
Software version | 6.9 |
About PGAP
The NCBI Prokaryotic Genome Annotation Pipeline (PGAP) uses multiple approaches to predict protein-coding and RNA genes and other functional elements directly from sequence.
Quality analysis
CheckM analysis (v1.2.3)
Completeness: 99.18% (65th Percentile)
Contamination: 0.5%
Calculated on the Prokaryotic Genome Annotation Pipeline (PGAP) gene set with the Escherichia coli CheckM marker set. For more information on CheckM, see Parks et al., Genome Res (2015).
Taxonomy check
- Taxonomy check status
- OK
- Best match status
- species_match
- Submitted organism name
- Escherichia coli
- Submitted species name
- Escherichia coli
Average Nucleotide Identity (ANI) match details
Attribute | Best match type-strain for submitted organism | Best match type-strain |
---|---|---|
Type assembly | GCA_000013265.1 | GCA_000013265.1 |
Organism name | Escherichia coli UTI89 | Escherichia coli |
Type category | claderef | claderef |
ANI | 98.91% | 98.91% |
Assembly coverage | 90.49% | 90.49% |
Type assembly coverage | 87.02% | 87.02% |
Chromosomes
Note: This contig-level genome assembly includes 70 contigs and no assembled chromosomes.
Revision history
This record has not been revised
GenBank | RefSeq | Name | Level | Date | Action |
---|---|---|---|---|---|
GCA_002537735.1 | GCF_002537735.1 | ASM253773v1 | Contig | Oct 13, 2017 | |