U.S. flag

An official website of the United States government

Escherichia coli strain MOD1-EC6809 MOD1-EC6809_123_length_3055_cov_25.7941, whole genome shotgun sequence

NCBI Reference Sequence: NZ_NMLB01000123.1

FASTA Graphics 

LOCUS       NZ_NMLB01000123         3055 bp    DNA     linear   CON 25-AUG-2024
DEFINITION  Escherichia coli strain MOD1-EC6809
            MOD1-EC6809_123_length_3055_cov_25.7941, whole genome shotgun
            sequence.
ACCESSION   NZ_NMLB01000123 NZ_NMLB01000000
VERSION     NZ_NMLB01000123.1
DBLINK      BioProject: PRJNA224116
            BioSample: SAMN04992173
            Assembly: GCF_002468525.1
KEYWORDS    WGS; RefSeq.
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
            Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 3055)
  AUTHORS   Gangiredla,J., Mammel,M.K., Barnaba,T.J., Tartera,C., Gebru,S.T.,
            Patel,I.R., Leonard,S.R., Kotewicz,M.L., Lampel,K.A., Elkins,C.A.
            and Lacher,D.W.
  TITLE     Species-Wide Collection of Escherichia coli Isolates for
            Examination of Genomic Diversity
  JOURNAL   Genome Announc 5 (50), e01321-17 (2017)
   PUBMED   29242221
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 3055)
  AUTHORS   Gangiredla,J., Lacher,D.W., Mammel,M.K., Barnaba,T., Tartera,C.,
            Gebru,S., Patel,I.R., Leonard,S.R., Lampel,K.A. and Elkins,C.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-JUL-2017) CFSAN-ORS-DM-MMSB, US Food and Drug
            Administration, 5100 Paint Branch Parkway, College Park, MD 20740,
            USA
COMMENT     REFSEQ INFORMATION: The reference sequence is identical to
            NMLB01000123.1.
            The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.8.2
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 32.5x
            Sequencing Technology  :: Illumina MiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI RefSeq
            Annotation Name                   :: GCF_002468525.1-RS_2024_08_25
            Annotation Date                   :: 08/25/2024 10:24:16
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 6.8
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA
            Genes (total)                     :: 5,581
            CDSs (total)                      :: 5,466
            Genes (coding)                    :: 5,059
            CDSs (with protein)               :: 5,059
            Genes (RNA)                       :: 115
            rRNAs                             :: 8, 8, 3 (5S, 16S, 23S)
            complete rRNAs                    :: 7, 1 (5S, 23S)
            partial rRNAs                     :: 1, 8, 2 (5S, 16S, 23S)
            tRNAs                             :: 85
            ncRNAs                            :: 11
            Pseudo Genes (total)              :: 407
            CDSs (without protein)            :: 407
            Pseudo Genes (ambiguous residues) :: 0 of 407
            Pseudo Genes (frameshifted)       :: 140 of 407
            Pseudo Genes (incomplete)         :: 284 of 407
            Pseudo Genes (internal stop)      :: 65 of 407
            Pseudo Genes (multiple problems)  :: 74 of 407
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..3055
                     /organism="Escherichia coli"
                     /mol_type="genomic DNA"
                     /submitter_seqid="MOD1-EC6809_123_length_3055_cov_25.7941"
                     /strain="MOD1-EC6809"
                     /isolation_source="feces"
                     /host="Bos taurus"
                     /db_xref="taxon:562"
                     /geo_loc_name="USA"
                     /collection_date="1995-06-27"
                     /collected_by="Pennsylvania State University| Escherichia
                     coli Reference Center"
     gene            complement(211..378)
                     /locus_tag="A7Y96_RS29735"
     CDS             complement(211..378)
                     /locus_tag="A7Y96_RS29735"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_000499451.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="DUF3927 family protein"
                     /protein_id="WP_000499458.1"
                     /translation="MFYKLCLLAVVILLLVMVMMDFTSRIMLVLTDGALVCGIVVLLW
                     PVIKRNSLHNA"
     gene            complement(464..1207)
                     /locus_tag="A7Y96_RS27860"
     CDS             complement(464..1207)
                     /locus_tag="A7Y96_RS27860"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:NP_308843.3"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="WP_001302581.1"
                     /translation="MLSSCIGVVMKDGALLRSSSLFIAYMGCLGWGSAYFYGWGTSFY
                     YGFPWWIVGAGVDDVARSLFFAVIVIAIFLIGWGIGVVFFFAVKRKHSMQELNVFRLY
                     FAVELLFVPAIIEFSILRQKIQVPLLLLSAAIALAVTISIRSYGRFLSVSCFYDKPFI
                     KKHFFEIVMIAFVAYFWLFSFLTGYYKPQFKKEYEMINYNDGWYYVLARYDNCLVLST
                     SFNAGSKRFVIYQSAQDKNLQVDIVRTRI"
     gene            complement(1460..2083)
                     /locus_tag="A7Y96_RS27865"
     CDS             complement(1460..2083)
                     /locus_tag="A7Y96_RS27865"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:NP_311528.1"
                     /GO_function="GO:0003677 - DNA binding [Evidence IEA]"
                     /GO_process="GO:0006355 - regulation of DNA-templated
                     transcription [Evidence IEA]"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="antitermination protein"
                     /protein_id="WP_001235460.1"
                     /translation="MRLESVAKFHSPKSPMMSDSPRATASDSLSGTDVMAAMGMAQSQ
                     AGFGMAAFCGKHELSQNDKQKAINYLMQFAHKVSGKYRGVAKLEGNTKAKVLQVLATF
                     AYADYCRSAATPGARCRDCHGTGRAVDIAKTEQWGIVAEKECGRCKGVGYSRMPASAA
                     YRAVTMLIPNLTQPTWSRTVKPLYDALVVQCHKEESIADNILNAITR"
     gene            complement(2080..2745)
                     /locus_tag="A7Y96_RS27870"
     CDS             complement(2080..2745)
                     /locus_tag="A7Y96_RS27870"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:NP_308840.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="serine/threonine protein phosphatase"
                     /protein_id="WP_001028854.1"
                     /translation="MNIYERIDGSKYRNIWVVGDLHGCYTNLMKKLETIGFDTKKDLL
                     ISVGDLVDRGTENVECLELITFPWFRAVRGNHEQMMIDGLSERGNVNHWLLNGGGWFF
                     NLDYDKEILAKALAHKADELPLIIELVSKGKKYVICHADYPCDKYEFGKPVDHQQVIW
                     NRERISNSQDGIVKEIKGADTFIFGHTPAVKPLKFANQMYIDTGAVFCGNLTLIQVQG
                     EGA"
     gene            complement(2742..>3055)
                     /locus_tag="A7Y96_RS27875"
     CDS             complement(2742..>3055)
                     /locus_tag="A7Y96_RS27875"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:NP_311530.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=3
                     /transl_table=11
                     /product="recombination protein NinG"
                     /protein_id="WP_172902210.1"
                     /translation="CISCGTLTSAQWDAGHYRTTAAAPQLRFDERNIHKQCVVCNQHK
                     SGNLVPYRVELISRIGQEAVEEIESNHNRYRWTVEECRAIKAEYQQKLKKLRNSRSEV
                     A"
CONTIG      join(NMLB01000123.1:1..3055)
//
Feature
Display: FASTA GenBank Help
Details

Supplemental Content

Change region shown

Customize view

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...
External link. Please review our privacy policy.