U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination
    • Showing Current items.

    Gtf2h4 general transcription factor II H, polypeptide 4 [ Mus musculus (house mouse) ]

    Gene ID: 14885, updated on 27-Nov-2024

    Summary

    Official Symbol
    Gtf2h4provided by MGI
    Official Full Name
    general transcription factor II H, polypeptide 4provided by MGI
    Primary source
    MGI:MGI:1338799
    See related
    Ensembl:ENSMUSG00000001524 AllianceGenome:MGI:1338799
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    p52; BTF2 p52
    Summary
    This gene encodes a subunit of the general transcription factor multiprotein complex that plays roles in basal transcription, DNA repair and cell cycle control. [provided by RefSeq, Dec 2014]
    Expression
    Ubiquitous expression in thymus adult (RPKM 68.7), limb E14.5 (RPKM 11.4) and 28 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See Gtf2h4 in Genome Data Viewer
    Location:
    17 B1; 17 18.7 cM
    Exon count:
    14
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 17 NC_000083.7 (35978624..35984607, complement)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 17 NC_000083.6 (35667728..35673750, complement)

    Chromosome 17 - NC_000083.7Genomic Context describing neighboring genes Neighboring gene surfactant associated 2 Neighboring gene valyl-tRNA synthetase 2, mitochondrial Neighboring gene STARR-positive B cell enhancer ABC_E7529 Neighboring gene CapStarr-seq enhancer MGSCv37_chr17:35808865-35809171 Neighboring gene STARR-positive B cell enhancer ABC_E7530 Neighboring gene STARR-seq mESC enhancer starr_42470 Neighboring gene discoidin domain receptor family, member 1 Neighboring gene STARR-seq mESC enhancer starr_42472 Neighboring gene predicted gene, 23864 Neighboring gene predicted gene 4577

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)
    • Endonuclease-mediated (4) 

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by MGI

    Function Evidence Code Pubs
    enables ATPase activator activity IEA
    Inferred from Electronic Annotation
    more info
     
    enables RNA polymerase II general transcription initiation factor activity IEA
    Inferred from Electronic Annotation
    more info
     
    enables RNA polymerase II general transcription initiation factor activity ISO
    Inferred from Sequence Orthology
    more info
     
    enables double-stranded DNA binding IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    Process Evidence Code Pubs
    involved_in nucleotide-excision repair IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    involved_in nucleotide-excision repair IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in transcription by RNA polymerase II ISO
    Inferred from Sequence Orthology
    more info
     
    involved_in transcription by RNA polymerase II ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    Component Evidence Code Pubs
    part_of core TFIIH complex portion of holo TFIIH complex ISO
    Inferred from Sequence Orthology
    more info
     
    part_of core TFIIH complex portion of holo TFIIH complex ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    located_in nuclear speck IEA
    Inferred from Electronic Annotation
    more info
     
    located_in nuclear speck ISO
    Inferred from Sequence Orthology
    more info
     
    located_in nucleus ISO
    Inferred from Sequence Orthology
    more info
     
    located_in nucleus ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    part_of transcription factor TFIID complex IEA
    Inferred from Electronic Annotation
    more info
     
    part_of transcription factor TFIID complex ISO
    Inferred from Sequence Orthology
    more info
     
    part_of transcription factor TFIIH core complex IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    part_of transcription factor TFIIH holo complex IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    part_of transcription factor TFIIH holo complex ISO
    Inferred from Sequence Orthology
    more info
     
    part_of transcription factor TFIIH holo complex ISS
    Inferred from Sequence or Structural Similarity
    more info
     

    General protein information

    Preferred Names
    general transcription factor IIH subunit 4
    Names
    TFIIH basal transcription factor complex p52 subunit
    basic transcription factor 2 52 kDa subunit

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001413326.1NP_001400255.1  general transcription factor IIH subunit 4 isoform 1

      Status: VALIDATED

      Source sequence(s)
      CR974483
      UniProtKB/Swiss-Prot
      O70422
      UniProtKB/TrEMBL
      Q542U3
    2. NM_001413327.1NP_001400256.1  general transcription factor IIH subunit 4 isoform 1

      Status: VALIDATED

      Source sequence(s)
      CR974483
      UniProtKB/Swiss-Prot
      O70422
      UniProtKB/TrEMBL
      Q542U3
      Related
      ENSMUSP00000124335.2, ENSMUST00000160734.8
    3. NM_001413328.1NP_001400257.1  general transcription factor IIH subunit 4 isoform 1

      Status: VALIDATED

      Source sequence(s)
      CR974483
      UniProtKB/Swiss-Prot
      O70422
      UniProtKB/TrEMBL
      Q542U3
    4. NM_001413330.1NP_001400259.1  general transcription factor IIH subunit 4 isoform 2

      Status: VALIDATED

      Source sequence(s)
      CR974483
    5. NM_001413331.1NP_001400260.1  general transcription factor IIH subunit 4 isoform 3

      Status: VALIDATED

      Source sequence(s)
      CR974483
    6. NM_001413332.1NP_001400261.1  general transcription factor IIH subunit 4 isoform 4

      Status: VALIDATED

      Source sequence(s)
      CR974483
    7. NM_001413333.1NP_001400262.1  general transcription factor IIH subunit 4 isoform 4

      Status: VALIDATED

      Source sequence(s)
      CR974483
    8. NM_010364.5NP_034494.1  general transcription factor IIH subunit 4 isoform 1

      See identical proteins and their annotated locations for NP_034494.1

      Status: VALIDATED

      Source sequence(s)
      CR974483
      Consensus CDS
      CCDS28702.1
      UniProtKB/Swiss-Prot
      O70422
      UniProtKB/TrEMBL
      Q3UEU9, Q542U3
      Related
      ENSMUSP00000001565.9, ENSMUST00000001565.15
      Conserved Domains (1) summary
      TIGR00625
      Location:19461
      tfb2; Transcription factor tfb2

    RNA

    1. NR_182136.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      CR974483
      Related
      ENSMUST00000160752.9

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000083.7 Reference GRCm39 C57BL/6J

      Range
      35978624..35984607 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)