U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination
    • Showing Current items.

    THAP5 THAP domain containing 5 [ Homo sapiens (human) ]

    Gene ID: 168451, updated on 10-Dec-2024

    Summary

    Official Symbol
    THAP5provided by HGNC
    Official Full Name
    THAP domain containing 5provided by HGNC
    Primary source
    HGNC:HGNC:23188
    See related
    Ensembl:ENSG00000177683 MIM:612534; AllianceGenome:HGNC:23188
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Homo sapiens
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
    Summary
    Enables protease binding activity. Involved in negative regulation of cell cycle and negative regulation of transcription by RNA polymerase II. Located in chromatin and nucleoplasm. [provided by Alliance of Genome Resources, Dec 2024]
    Expression
    Ubiquitous expression in thyroid (RPKM 13.7), testis (RPKM 10.7) and 25 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See THAP5 in Genome Data Viewer
    Location:
    7q31.1
    Exon count:
    6
    Annotation release Status Assembly Chr Location
    RS_2024_08 current GRCh38.p14 (GCF_000001405.40) 7 NC_000007.14 (108541759..108569768, complement)
    RS_2024_08 current T2T-CHM13v2.0 (GCF_009914755.1) 7 NC_060931.1 (109865676..109894376, complement)
    RS_2024_09 previous assembly GRCh37.p13 (GCF_000001405.25) 7 NC_000007.13 (108202576..108210212, complement)

    Chromosome 7 - NC_000007.14Genomic Context describing neighboring genes Neighboring gene patatin like phospholipase domain containing 8 Neighboring gene ribosomal protein L7 pseudogene 32 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr7:108164501-108165036 Neighboring gene H3K27ac hESC enhancer GRCh37_chr7:108165716-108166676 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 26503 Neighboring gene uncharacterized LOC124901722 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr7:108209746-108210473 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr7:108219440-108219950 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr7:108219951-108220459 Neighboring gene MPRA-validated peak6685 silencer Neighboring gene DnaJ heat shock protein family (Hsp40) member B9 Neighboring gene OCT4-NANOG hESC enhancer GRCh37_chr7:108233899-108234540 Neighboring gene uncharacterized LOC105375448

    Genomic regions, transcripts, and products

    Expression

    • Project title: Tissue-specific circular RNA induction during human fetal development
    • Description: 35 human fetal samples from 6 tissues (3 - 7 replicates per tissue) collected between 10 and 20 weeks gestational time were sequenced using Illumina TruSeq Stranded Total RNA
    • BioProject: PRJNA270632
    • Publication: PMID 26076956
    • Analysis date: Mon Apr 2 22:54:59 2018

    Bibliography

    GeneRIFs: Gene References Into Functions

    What's a GeneRIF?

    Phenotypes

    EBI GWAS Catalog

    Description
    Large-scale genome-wide association study of Asian population reveals genetic factors in FRMD4A and other loci influencing smoking initiation and nicotine dependence.
    EBI GWAS Catalog

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Clone Names

    • DKFZp313O1132

    Gene Ontology Provided by GOA

    Function Evidence Code Pubs
    enables DNA binding IEA
    Inferred from Electronic Annotation
    more info
     
    enables metal ion binding IEA
    Inferred from Electronic Annotation
    more info
     
    enables protease binding IPI
    Inferred from Physical Interaction
    more info
    PubMed 
    Component Evidence Code Pubs
    located_in chromatin IDA
    Inferred from Direct Assay
    more info
    PubMed 
    located_in nucleoplasm IDA
    Inferred from Direct Assay
    more info
     
    is_active_in nucleus IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    located_in nucleus IDA
    Inferred from Direct Assay
    more info
    PubMed 

    General protein information

    Preferred Names
    THAP domain-containing protein 5

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001130475.3NP_001123947.1  THAP domain-containing protein 5 isoform 1

      See identical proteins and their annotated locations for NP_001123947.1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (1) represents the longest transcript and encodes the longest isoform (1).
      Source sequence(s)
      AC005058, AL833137
      Consensus CDS
      CCDS47687.1
      UniProtKB/Swiss-Prot
      Q7Z6K1
      Related
      ENSP00000400500.2, ENST00000415914.4
      Conserved Domains (2) summary
      smart00980
      Location:485
      THAP; The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion
      cl23720
      Location:315373
      RILP-like; Rab interacting lysosomal protein-like 1 and 2 (Rilpl1 and Rilpl2)
    2. NM_001287598.1NP_001274527.1  THAP domain-containing protein 5 isoform 3

      See identical proteins and their annotated locations for NP_001274527.1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (3) differs in the 5' UTR, lacks a portion of the 5' coding region and initiates translation at a downstream start codon, compared to variant 1. Variants 3, 4 and 5 encode the same isoform (3), which is shorter at the N-terminus compared to isoform 1.
      Source sequence(s)
      AC005058, BC053634, BU567660
      UniProtKB/TrEMBL
      A4D226
    3. NM_001287599.1NP_001274528.1  THAP domain-containing protein 5 isoform 3

      See identical proteins and their annotated locations for NP_001274528.1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (4) differs in the 5' UTR, lacks a portion of the 5' coding region and initiates translation at a downstream start codon, compared to variant 1. Variants 3, 4 and 5 encode the same isoform (3), which is shorter at the N-terminus compared to isoform 1.
      Source sequence(s)
      AC005058, BF244164, BI830307
      UniProtKB/TrEMBL
      A4D226
    4. NM_001287601.1NP_001274530.1  THAP domain-containing protein 5 isoform 3

      See identical proteins and their annotated locations for NP_001274530.1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (5) differs in the 5' UTR, lacks an alternate exon in the 5' coding region and initiates translation at a downstream start codon, compared to variant 1. Variants 3, 4 and 5 encode the same isoform (3), which is shorter at the N-terminus compared to isoform 1.
      Source sequence(s)
      AC005058, AW407519, BC053634, BI830307
      UniProtKB/TrEMBL
      A4D226
    5. NM_182529.3NP_872335.2  THAP domain-containing protein 5 isoform 2

      See identical proteins and their annotated locations for NP_872335.2

      Status: VALIDATED

      Description
      Transcript Variant: This variant (2) differs in the 5' UTR, lacks a portion of the 5' coding region and initiates translation at a downstream start codon, compared to variant 1. It encodes isoform 2, which is shorter at the N-terminus compared to isoform 1.
      Source sequence(s)
      AC005058, BC053634, BU567660
      Consensus CDS
      CCDS34734.2
      UniProtKB/Swiss-Prot
      Q7Z6K1
      Related
      ENSP00000322440.5, ENST00000313516.5
      Conserved Domains (2) summary
      pfam05485
      Location:143
      THAP; THAP domain
      cl23720
      Location:273331
      RILP-like; Rab interacting lysosomal protein-like 1 and 2 (Rilpl1 and Rilpl2)

    RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2024_08

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCh38.p14 Primary Assembly

    Genomic

    1. NC_000007.14 Reference GRCh38.p14 Primary Assembly

      Range
      108541759..108569768 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_047419934.1XP_047275890.1  THAP domain-containing protein 5 isoform X1

    RNA

    1. XR_007059987.1 RNA Sequence

    Alternate T2T-CHM13v2.0

    Genomic

    1. NC_060931.1 Alternate T2T-CHM13v2.0

      Range
      109865676..109894376 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_054357385.1XP_054213360.1  THAP domain-containing protein 5 isoform X1

    RNA

    1. XR_008487538.1 RNA Sequence