LOCUS NZ_NMLB01000123 3055 bp DNA linear CON 25-AUG-2024
DEFINITION Escherichia coli strain MOD1-EC6809
MOD1-EC6809_123_length_3055_cov_25.7941, whole genome shotgun
sequence.
ACCESSION NZ_NMLB01000123 NZ_NMLB01000000
VERSION NZ_NMLB01000123.1
DBLINK BioProject: PRJNA224116
BioSample: SAMN04992173
Assembly: GCF_002468525.1
KEYWORDS WGS; RefSeq.
SOURCE Escherichia coli
ORGANISM Escherichia coli
Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE 1 (bases 1 to 3055)
AUTHORS Gangiredla,J., Mammel,M.K., Barnaba,T.J., Tartera,C., Gebru,S.T.,
Patel,I.R., Leonard,S.R., Kotewicz,M.L., Lampel,K.A., Elkins,C.A.
and Lacher,D.W.
TITLE Species-Wide Collection of Escherichia coli Isolates for
Examination of Genomic Diversity
JOURNAL Genome Announc 5 (50), e01321-17 (2017)
PUBMED 29242221
REMARK Publication Status: Online-Only
REFERENCE 2 (bases 1 to 3055)
AUTHORS Gangiredla,J., Lacher,D.W., Mammel,M.K., Barnaba,T., Tartera,C.,
Gebru,S., Patel,I.R., Leonard,S.R., Lampel,K.A. and Elkins,C.A.
TITLE Direct Submission
JOURNAL Submitted (07-JUL-2017) CFSAN-ORS-DM-MMSB, US Food and Drug
Administration, 5100 Paint Branch Parkway, College Park, MD 20740,
USA
COMMENT REFSEQ INFORMATION: The reference sequence is identical to
NMLB01000123.1.
The annotation was added by the NCBI Prokaryotic Genome Annotation
Pipeline (PGAP). Information about PGAP can be found here:
https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
##Genome-Assembly-Data-START##
Assembly Method :: SPAdes v. 3.8.2
Genome Representation :: Full
Expected Final Version :: Yes
Genome Coverage :: 32.5x
Sequencing Technology :: Illumina MiSeq
##Genome-Assembly-Data-END##
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI RefSeq
Annotation Name :: GCF_002468525.1-RS_2024_08_25
Annotation Date :: 08/25/2024 10:24:16
Annotation Pipeline :: NCBI Prokaryotic Genome
Annotation Pipeline (PGAP)
Annotation Method :: Best-placed reference protein
set; GeneMarkS-2+
Annotation Software revision :: 6.8
Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA
Genes (total) :: 5,581
CDSs (total) :: 5,466
Genes (coding) :: 5,059
CDSs (with protein) :: 5,059
Genes (RNA) :: 115
rRNAs :: 8, 8, 3 (5S, 16S, 23S)
complete rRNAs :: 7, 1 (5S, 23S)
partial rRNAs :: 1, 8, 2 (5S, 16S, 23S)
tRNAs :: 85
ncRNAs :: 11
Pseudo Genes (total) :: 407
CDSs (without protein) :: 407
Pseudo Genes (ambiguous residues) :: 0 of 407
Pseudo Genes (frameshifted) :: 140 of 407
Pseudo Genes (incomplete) :: 284 of 407
Pseudo Genes (internal stop) :: 65 of 407
Pseudo Genes (multiple problems) :: 74 of 407
CRISPR Arrays :: 2
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..3055
/organism="Escherichia coli"
/mol_type="genomic DNA"
/submitter_seqid="MOD1-EC6809_123_length_3055_cov_25.7941"
/strain="MOD1-EC6809"
/isolation_source="feces"
/host="Bos taurus"
/db_xref="taxon:562"
/geo_loc_name="USA"
/collection_date="1995-06-27"
/collected_by="Pennsylvania State University| Escherichia
coli Reference Center"
gene complement(211..378)
/locus_tag="A7Y96_RS29735"
CDS complement(211..378)
/locus_tag="A7Y96_RS29735"
/inference="COORDINATES: similar to AA
sequence:RefSeq:WP_000499451.1"
/note="Derived by automated computational analysis using
gene prediction method: Protein Homology."
/codon_start=1
/transl_table=11
/product="DUF3927 family protein"
/protein_id="WP_000499458.1"
/translation="MFYKLCLLAVVILLLVMVMMDFTSRIMLVLTDGALVCGIVVLLW
PVIKRNSLHNA"
gene complement(464..1207)
/locus_tag="A7Y96_RS27860"
CDS complement(464..1207)
/locus_tag="A7Y96_RS27860"
/inference="COORDINATES: similar to AA
sequence:RefSeq:NP_308843.3"
/note="Derived by automated computational analysis using
gene prediction method: Protein Homology."
/codon_start=1
/transl_table=11
/product="hypothetical protein"
/protein_id="WP_001302581.1"
/translation="MLSSCIGVVMKDGALLRSSSLFIAYMGCLGWGSAYFYGWGTSFY
YGFPWWIVGAGVDDVARSLFFAVIVIAIFLIGWGIGVVFFFAVKRKHSMQELNVFRLY
FAVELLFVPAIIEFSILRQKIQVPLLLLSAAIALAVTISIRSYGRFLSVSCFYDKPFI
KKHFFEIVMIAFVAYFWLFSFLTGYYKPQFKKEYEMINYNDGWYYVLARYDNCLVLST
SFNAGSKRFVIYQSAQDKNLQVDIVRTRI"
gene complement(1460..2083)
/locus_tag="A7Y96_RS27865"
CDS complement(1460..2083)
/locus_tag="A7Y96_RS27865"
/inference="COORDINATES: similar to AA
sequence:RefSeq:NP_311528.1"
/GO_function="GO:0003677 - DNA binding [Evidence IEA]"
/GO_process="GO:0006355 - regulation of DNA-templated
transcription [Evidence IEA]"
/note="Derived by automated computational analysis using
gene prediction method: Protein Homology."
/codon_start=1
/transl_table=11
/product="antitermination protein"
/protein_id="WP_001235460.1"
/translation="MRLESVAKFHSPKSPMMSDSPRATASDSLSGTDVMAAMGMAQSQ
AGFGMAAFCGKHELSQNDKQKAINYLMQFAHKVSGKYRGVAKLEGNTKAKVLQVLATF
AYADYCRSAATPGARCRDCHGTGRAVDIAKTEQWGIVAEKECGRCKGVGYSRMPASAA
YRAVTMLIPNLTQPTWSRTVKPLYDALVVQCHKEESIADNILNAITR"
gene complement(2080..2745)
/locus_tag="A7Y96_RS27870"
CDS complement(2080..2745)
/locus_tag="A7Y96_RS27870"
/inference="COORDINATES: similar to AA
sequence:RefSeq:NP_308840.1"
/note="Derived by automated computational analysis using
gene prediction method: Protein Homology."
/codon_start=1
/transl_table=11
/product="serine/threonine protein phosphatase"
/protein_id="WP_001028854.1"
/translation="MNIYERIDGSKYRNIWVVGDLHGCYTNLMKKLETIGFDTKKDLL
ISVGDLVDRGTENVECLELITFPWFRAVRGNHEQMMIDGLSERGNVNHWLLNGGGWFF
NLDYDKEILAKALAHKADELPLIIELVSKGKKYVICHADYPCDKYEFGKPVDHQQVIW
NRERISNSQDGIVKEIKGADTFIFGHTPAVKPLKFANQMYIDTGAVFCGNLTLIQVQG
EGA"
gene complement(2742..>3055)
/locus_tag="A7Y96_RS27875"
CDS complement(2742..>3055)
/locus_tag="A7Y96_RS27875"
/inference="COORDINATES: similar to AA
sequence:RefSeq:NP_311530.1"
/note="Derived by automated computational analysis using
gene prediction method: Protein Homology."
/codon_start=3
/transl_table=11
/product="recombination protein NinG"
/protein_id="WP_172902210.1"
/translation="CISCGTLTSAQWDAGHYRTTAAAPQLRFDERNIHKQCVVCNQHK
SGNLVPYRVELISRIGQEAVEEIESNHNRYRWTVEECRAIKAEYQQKLKKLRNSRSEV
A"
CONTIG join(NMLB01000123.1:1..3055)
//