U.S. flag

An official website of the United States government

Massilioclostridium coli isolate MGYG-HGUT-01560, whole genome shotgun sequence

NCBI Reference Sequence: NZ_CABKWT010000005.1

FASTA Graphics 

LOCUS       NZ_CABKWT010000005      5028 bp    DNA     linear   CON 15-APR-2024
DEFINITION  Massilioclostridium coli isolate MGYG-HGUT-01560, whole genome
            shotgun sequence.
ACCESSION   NZ_CABKWT010000005 NZ_CABKWT010000000
VERSION     NZ_CABKWT010000005.1
DBLINK      BioProject: PRJNA224116
            BioSample: SAMEA5851063
            Assembly: GCF_902376115.1
KEYWORDS    WGS; RefSeq.
SOURCE      Massilioclostridium coli
  ORGANISM  Massilioclostridium coli
            Bacteria; Bacillati; Bacillota; Clostridia; Eubacteriales;
            Clostridiaceae; Massilioclostridium.
REFERENCE   1
  CONSRTM   EMBL-EBI Metagenomics Team
  TITLE     Direct Submission
  JOURNAL   Submitted (06-AUG-2019) EMG, The European Bioinformatics Institute
            (EMBL-EBI), Wellcome Genome Campus, CB10 1SD, United Kingdom
COMMENT     REFSEQ INFORMATION: The reference sequence is identical to
            CABKWT010000005.1.
            The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI RefSeq
            Annotation Name                   :: GCF_902376115.1-RS_2024_04_14
            Annotation Date                   :: 04/14/2024 19:57:54
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 6.7
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA
            Genes (total)                     :: 2,590
            CDSs (total)                      :: 2,511
            Genes (coding)                    :: 2,481
            CDSs (with protein)               :: 2,481
            Genes (RNA)                       :: 79
            rRNAs                             :: 5, 5, 5 (5S, 16S, 23S)
            complete rRNAs                    :: 5, 5, 5 (5S, 16S, 23S)
            tRNAs                             :: 60
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 30
            CDSs (without protein)            :: 30
            Pseudo Genes (ambiguous residues) :: 0 of 30
            Pseudo Genes (frameshifted)       :: 5 of 30
            Pseudo Genes (incomplete)         :: 21 of 30
            Pseudo Genes (internal stop)      :: 6 of 30
            Pseudo Genes (multiple problems)  :: 2 of 30
            CRISPR Arrays                     :: 4
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..5028
                     /organism="Massilioclostridium coli"
                     /mol_type="genomic DNA"
                     /isolate="MGYG-HGUT-01560"
                     /isolation_source="human gut"
                     /db_xref="taxon:1870991"
                     /note="contig: NZ_FMIZ01000002.1"
     gene            complement(<1..4099)
                     /locus_tag="FXY84_RS11020"
     CDS             complement(<1..4099)
                     /locus_tag="FXY84_RS11020"
                     /inference="COORDINATES: protein motif:HMM:NF036578.4"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="family 78 glycoside hydrolase catalytic domain"
                     /protein_id="WP_069986653.1"
                     /translation="MKKFKRVIGLILAVCMLLSVFSIQTGALQGAKLTLSNLKTEELT
                     NPLGVDTTNPNFSWVIQSSERGTLQTAYQIKVYQESPDGTVVWDTGKVTSPNSINVKY
                     EGDALASSTKYYWQVKVWDNHGNESAFSEPAYFSTGKFDPETELKADWITYNADAGEV
                     DYNPIKIKFDQPITAQYFRFAVQKMGLPSTVGQYRTRIAEIEAYNSQTDPNYENNLLY
                     KKSTSTQNQDNVSKIWNNSYINDGVRDSVKVPAGYQSRYALDPDLQNEKYIDENGDYI
                     NNSNGKVKSFNDWVNFKLGSDVTIDEVWIYPALGNEAISDPSKVADFPSSYTVSFTNN
                     DSLYAAKPSLTDSGEDINTQWVIDEQVVDAEAPDNELPTGNSLPLIAKSFNTDSGKQI
                     KSATLYSSGLGIYEMRINGNKVTDNVLEPGLTEYEKSIFYNTYDVTELLNNEGENVIG
                     AMLGNGIYDNPRYALRYSKADRTNGKLKLYAQLEITYEDGSTQTILSDDSWKWTNGPT
                     VLSHWYGGEDYDARLEQPGWDSPGFDYSNWQNCVIENTMLSSSDGSEVPMGAFKSRMY
                     PGSKIVDTHETVNFYHPEENVYVFDLGVNFAGWFELNATLPEGTKLKMLPAEQFNSST
                     QRVSQASYGETPLYDTYIFKGDENGETWHPNFMYHGFRWLEVTVIGDTDVELTPDMIR
                     GLEIMVSNEQVGEFETSDQDVNAVHDLILRSAENNMYDTYTDCPQREKLGWMEQSHLT
                     YELLSYNYNIAAYMEKIAQDQREGQYEDGLMPSTLPGYSPQGGNYNDDLSWGGAAILV
                     PWYTYETYGDKQILEKSWDAMQKLMQHYNNRRDSYKQGLDEAIAADPSLSYTDFDYVL
                     YDYGLGDWGEYQGDPYGVKVQTQISQSTVLITTPVYAQLAKTMSEIATTLGDTEKAAE
                     YATLYENIKAEFNKLFFNYETGMYKGPVKENNKQQEAFAPLQDFDLQSVYSLALFNDL
                     VPDGYEEMVLQNLVNNIVENDYHLNTGEVTLKYMISVLRENGYNDLVYKMAMNDTMPS
                     YTYFIGRNTSLPEHWNGSGSQSHIMMGHIDQWFYEGIGGINNDGIAFENFTLSPYIPE
                     GMTSANTATTTKYGEIRSNWNYTDGNFNWEVVVPTNTTATIVIPVKDATAVTESGNDI
                     LGKDGNGLTFAGIDENGYCTYTVGSGSYHFTATDKAQPSKTILNTVIAYAEEQQADPA
                     FDNVIADVQKSFTAALENAKAVAANTGATQEEVNAAWQTLLNEIHKLGFVKGDITSLQ
                     KLVNTASSYDLTKYVEAGQAEFKEALAAAQELLADKDNALANEIETAETNLLNAMLNL
                     RFKADKSVLEQVIAEANSKDATAYTAESYAVLTSAVEKATNVLADE"
     gene            complement(4748..>5028)
                     /locus_tag="FXY84_RS13015"
                     /pseudo
     CDS             complement(4748..>5028)
                     /locus_tag="FXY84_RS13015"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006355537.1"
                     /note="incomplete; too short partial abutting assembly
                     gap; missing N-terminus; Derived by automated
                     computational analysis using gene prediction method:
                     Protein Homology."
                     /pseudo
                     /codon_start=3
                     /transl_table=11
                     /product="hypothetical protein"
CONTIG      join(CABKWT010000005.1:1..5028)
//
complement(<1..4099)
/locus_tag="FXY84_RS11020"
/inference="COORDINATES: protein motif:HMM:NF036578.4"
/note="Derived by automated computational analysis using
gene prediction method: Protein Homology."
/codon_start=1
/transl_table=11
/product="family 78 glycoside hydrolase catalytic domain"
/protein_id="WP_069986653.1"
/translation="MKKFKRVIGLILAVCMLLSVFSIQTGALQGAKLTLSNLKTEELT
NPLGVDTTNPNFSWVIQSSERGTLQTAYQIKVYQESPDGTVVWDTGKVTSPNSINVKY
EGDALASSTKYYWQVKVWDNHGNESAFSEPAYFSTGKFDPETELKADWITYNADAGEV
DYNPIKIKFDQPITAQYFRFAVQKMGLPSTVGQYRTRIAEIEAYNSQTDPNYENNLLY
KKSTSTQNQDNVSKIWNNSYINDGVRDSVKVPAGYQSRYALDPDLQNEKYIDENGDYI
NNSNGKVKSFNDWVNFKLGSDVTIDEVWIYPALGNEAISDPSKVADFPSSYTVSFTNN
DSLYAAKPSLTDSGEDINTQWVIDEQVVDAEAPDNELPTGNSLPLIAKSFNTDSGKQI
KSATLYSSGLGIYEMRINGNKVTDNVLEPGLTEYEKSIFYNTYDVTELLNNEGENVIG
AMLGNGIYDNPRYALRYSKADRTNGKLKLYAQLEITYEDGSTQTILSDDSWKWTNGPT
VLSHWYGGEDYDARLEQPGWDSPGFDYSNWQNCVIENTMLSSSDGSEVPMGAFKSRMY
PGSKIVDTHETVNFYHPEENVYVFDLGVNFAGWFELNATLPEGTKLKMLPAEQFNSST
QRVSQASYGETPLYDTYIFKGDENGETWHPNFMYHGFRWLEVTVIGDTDVELTPDMIR
GLEIMVSNEQVGEFETSDQDVNAVHDLILRSAENNMYDTYTDCPQREKLGWMEQSHLT
YELLSYNYNIAAYMEKIAQDQREGQYEDGLMPSTLPGYSPQGGNYNDDLSWGGAAILV
PWYTYETYGDKQILEKSWDAMQKLMQHYNNRRDSYKQGLDEAIAADPSLSYTDFDYVL
YDYGLGDWGEYQGDPYGVKVQTQISQSTVLITTPVYAQLAKTMSEIATTLGDTEKAAE
YATLYENIKAEFNKLFFNYETGMYKGPVKENNKQQEAFAPLQDFDLQSVYSLALFNDL
VPDGYEEMVLQNLVNNIVENDYHLNTGEVTLKYMISVLRENGYNDLVYKMAMNDTMPS
YTYFIGRNTSLPEHWNGSGSQSHIMMGHIDQWFYEGIGGINNDGIAFENFTLSPYIPE
GMTSANTATTTKYGEIRSNWNYTDGNFNWEVVVPTNTTATIVIPVKDATAVTESGNDI
LGKDGNGLTFAGIDENGYCTYTVGSGSYHFTATDKAQPSKTILNTVIAYAEEQQADPA
FDNVIADVQKSFTAALENAKAVAANTGATQEEVNAAWQTLLNEIHKLGFVKGDITSLQ
KLVNTASSYDLTKYVEAGQAEFKEALAAAQELLADKDNALANEIETAETNLLNAMLNL
RFKADKSVLEQVIAEANSKDATAYTAESYAVLTSAVEKATNVLADE"
Warning: Cannot highlight feature because no sequence is shown. Show the sequence
Feature NZ_CABKWT010000005 : 1 segment (minus strand)
Display: FASTA GenBank Help
Details

Supplemental Content

Change region shown

Customize view

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...
External link. Please review our privacy policy.