Genome assembly ASM314565v1

FTP
Actions
NCBI RefSeq assembly
GCF_003145655.1
Submitted GenBank assembly
GCA_003145655.1
Taxon
Escherichia coli (E. coli)
Strain
Spr2013_WWKa_OUT_27
WGS project
NBDK01
Submitter
BIOTEC
Date
May 21, 2018

Assembly statistics

RefSeqGenBank
Genome size5.2 Mb5.2 Mb
Total ungapped length5.2 Mb5.2 Mb
Number of scaffolds161161
Scaffold N5079.9 kb79.9 kb
Scaffold L502121
Number of contigs181181
Contig N5079.9 kb79.9 kb
Contig L502121
GC percent5151
Genome coverage10.0x10.0x
Assembly levelScaffoldScaffold
View sequencesview RefSeq sequencesview GenBank sequences

Sample details

BioSample ID
SAMN06641851
Description
Pathogen: environmental/food/other sample from Escherichia coli
Submitter
BIOTEC
Strain
Spr2013_WWKa_OUT_27
Sample type
whole organism
Geographic location
Germany: Dresden-Kaditz
Isolation source
wastewater outflow
Latitude and longitude
51.0707 N 13.6808 E
Collection date
Mar 27, 2013
Sample name
Spr2013_WWKa_OUT_27
SRA
SRS8730424
Models
Pathogen.env
Package
Pathogen.env.1.0
Submission date
2017-03-24T10:37:04.540
Publication date
2017-09-18T00:00:00.000
Last updated
2021-06-05T07:09:51.295

Assembly methods

Sequencing technology
Illumina MiSeq
Comment
Bacteria and source DNA available from Thomas Berendonk.
Assembly method
ABySS v. 1.5.2

Additional genomes

Browse all Escherichia coli genomes (312177)

BioProject

PRJNA380388

E. coli genomes from wastewater treatment plant inflow and outflow

Pathogen Detection Resource

Publications

Showing 1 of 1

View in PubMed

Annotation details

RefSeq
ProviderNCBI RefSeq
NameGCF_003145655.1-RS_2024_07_08
DateJul 8, 2024
Genes5,153
Protein-coding4,808
Software version6.7
View RefSeq annotation

About PGAP

The NCBI Prokaryotic Genome Annotation Pipeline (PGAP) uses multiple approaches to predict protein-coding and RNA genes and other functional elements directly from sequence.

Continue reading

Quality analysis

CheckM analysis (v1.2.3)

Completeness: 98.63% (28th Percentile, dark blue bar)

Contamination: 2.74%

Completeness of Escherichia coli RefSeq assemblies

Calculated on the Prokaryotic Genome Annotation Pipeline (PGAP) gene set with the Escherichia coli CheckM marker set. For more information on CheckM, see Parks, et al. Genome Res (2015).

Taxonomy check

Taxonomy check status
OK
Best match status
species_match
Submitted organism name
Escherichia coli
Submitted species name
Escherichia coli

Average Nucleotide Identity (ANI) match details

Best match type-strain for submitted organismBest match type-strain
Type assemblyGCA_000210475.1GCA_000210475.1
Organism nameEscherichia coli ETEC H10407Escherichia coli
Type categorycladerefcladeref
ANI98.74%98.74%
Assembly coverage84%84%
Type assembly coverage81.48%81.48%

Chromosomes

Note: This scaffold-level genome assembly includes 161 scaffolds and no assembled chromosomes.

Revision history

This record has not been revised

GenBank
RefSeq
Name
Level
Date
Action
GCA_003145655.1GCF_003145655.1ASM314565v1ScaffoldMay 21, 2018