U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Gm2237 predicted gene 2237 [ Mus musculus (house mouse) ]

Gene ID: 100039441, updated on 27-Nov-2024

Summary

Official Symbol
Gm2237provided by MGI
Official Full Name
predicted gene 2237provided by MGI
Primary source
MGI:MGI:3780407
See related
Ensembl:ENSMUSG00000093979 AllianceGenome:MGI:3780407
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Expression
Broad expression in CNS E18 (RPKM 3.3), CNS E14 (RPKM 2.1) and 18 other tissues See more
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See Gm2237 in Genome Data Viewer
Location:
14 A3; 14 10.05 cM
Exon count:
9
Annotation release Status Assembly Chr Location
RS_2024_02 current GRCm39 (GCF_000001635.27) 14 NC_000080.7 (19613869..19635866, complement)
108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 14 NC_000080.6 (19563801..19585798, complement)

Chromosome 14 - NC_000080.7Genomic Context describing neighboring genes Neighboring gene predicted gene, 48105 Neighboring gene ubiquitin specific peptidase 7 pseudogene Neighboring gene predicted gene 2244 Neighboring gene predicted gene 5458 Neighboring gene predicted gene, 41102

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

General protein information

Preferred Names
uncharacterized protein LOC100039441
Names
alpha6-takusan

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001374119.1NP_001361048.1  uncharacterized protein LOC100039441 isoform a

    Status: VALIDATED

    Description
    Transcript Variant: This variant (1) encodes the longest isoform (a).
    Source sequence(s)
    AC174797
    Conserved Domains (1) summary
    pfam04822
    Location:68148
    Takusan
  2. NM_001374120.1NP_001361049.1  uncharacterized protein LOC100039441 isoform b

    Status: VALIDATED

    Source sequence(s)
    AC174797
    Consensus CDS
    CCDS88584.1
    UniProtKB/TrEMBL
    A6NAS6, E9Q501, L7N2C2
    Related
    ENSMUSP00000133164.2, ENSMUST00000170694.9
    Conserved Domains (1) summary
    pfam04822
    Location:48128
    Takusan
  3. NM_001374121.1NP_001361050.1  uncharacterized protein LOC100039441 isoform c

    Status: VALIDATED

    Description
    Transcript Variant: This variant (3), as well as variant 4, encodes isoform c.
    Source sequence(s)
    AC174797
    Consensus CDS
    CCDS88583.1
    UniProtKB/TrEMBL
    A6NAS0, A6NAU1
    Conserved Domains (1) summary
    pfam04822
    Location:173
    Takusan
  4. NM_001374122.1NP_001361051.1  uncharacterized protein LOC100039441 isoform c

    Status: VALIDATED

    Description
    Transcript Variant: This variant (4), as well as variant 3, encodes isoform c.
    Source sequence(s)
    AC174797
    Consensus CDS
    CCDS88583.1
    UniProtKB/TrEMBL
    A6NAS0, A6NAU1
    Conserved Domains (1) summary
    pfam04822
    Location:173
    Takusan

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000080.7 Reference GRCm39 C57BL/6J

    Range
    19613869..19635866 complement
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. XM_036158298.1XP_036014191.1  uncharacterized protein LOC100039441 isoform X2

    UniProtKB/TrEMBL
    A6NAS6, E9Q501, L7N2C2
    Conserved Domains (1) summary
    pfam04822
    Location:48128
    Takusan
  2. XM_036158299.1XP_036014192.1  uncharacterized protein LOC100039441 isoform X3

    UniProtKB/TrEMBL
    A6NAS0, A6NAU1
    Related
    ENSMUSP00000108214.3, ENSMUST00000112595.3
    Conserved Domains (1) summary
    pfam04822
    Location:173
    Takusan
  3. XM_036158297.1XP_036014190.1  uncharacterized protein LOC100039441 isoform X1

    UniProtKB/TrEMBL
    E9PW72
    Conserved Domains (1) summary
    pfam04822
    Location:59139
    Takusan
  4. XM_036158300.1XP_036014193.1  uncharacterized protein LOC100039441 isoform X4

  5. XM_017316239.3XP_017171728.1  uncharacterized protein LOC100039441 isoform X5

  6. XM_017316240.3XP_017171729.1  uncharacterized protein LOC100039441 isoform X6

Suppressed Reference Sequence(s)

The following Reference Sequences have been suppressed. Explain

  1. NG_008060.3: Suppressed sequence

    Description
    NG_008060.3: This RefSeq was removed because it is now thought that this gene does encode a protein.