U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from OMIM

    • Showing Current items.

    WDR31 WD repeat domain 31 [ Homo sapiens (human) ]

    Gene ID: 114987, updated on 10-Dec-2024

    Summary

    Official Symbol
    WDR31provided by HGNC
    Official Full Name
    WD repeat domain 31provided by HGNC
    Primary source
    HGNC:HGNC:21421
    See related
    Ensembl:ENSG00000148225 MIM:620951; AllianceGenome:HGNC:21421
    Gene type
    protein coding
    RefSeq status
    REVIEWED
    Organism
    Homo sapiens
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
    Summary
    This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene but the biological validity of some variants has not been determined. [provided by RefSeq, Jul 2008]
    Expression
    Broad expression in testis (RPKM 3.6), thyroid (RPKM 2.8) and 24 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See WDR31 in Genome Data Viewer
    Location:
    9q32
    Exon count:
    11
    Annotation release Status Assembly Chr Location
    RS_2024_08 current GRCh38.p14 (GCF_000001405.40) 9 NC_000009.12 (113313222..113340275, complement)
    RS_2024_08 current T2T-CHM13v2.0 (GCF_009914755.1) 9 NC_060933.1 (125512036..125539119, complement)
    RS_2024_09 previous assembly GRCh37.p13 (GCF_000001405.25) 9 NC_000009.11 (116075502..116102555, complement)

    Chromosome 9 - NC_000009.12Genomic Context describing neighboring genes Neighboring gene Neanderthal introgressed variant-containing enhancer experimental_106320 Neighboring gene pre-mRNA splicing tri-snRNP complex factor PRPF4 Neighboring gene ring finger protein 183 Neighboring gene Neanderthal introgressed variant-containing enhancer experimental_106329 Neighboring gene Neanderthal introgressed variant-containing enhancer experimental_106330 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 20198 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 20199 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 20200 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 20201 Neighboring gene B-box and SPRY domain containing Neighboring gene H3K4me1 hESC enhancer GRCh37_chr9:116131698-116132198 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr9:116132199-116132699 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 20202 Neighboring gene haloacid dehalogenase like hydrolase domain containing 3

    Genomic regions, transcripts, and products

    Expression

    • Project title: Tissue-specific circular RNA induction during human fetal development
    • Description: 35 human fetal samples from 6 tissues (3 - 7 replicates per tissue) collected between 10 and 20 weeks gestational time were sequenced using Illumina TruSeq Stranded Total RNA
    • BioProject: PRJNA270632
    • Publication: PMID 26076956
    • Analysis date: Mon Apr 2 22:54:59 2018

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Clone Names

    • FLJ35921

    Gene Ontology Provided by GOA

    Function Evidence Code Pubs
    enables protein binding IPI
    Inferred from Physical Interaction
    more info
    PubMed 

    General protein information

    Preferred Names
    WD repeat-containing protein 31

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001006615.3NP_001006616.1  WD repeat-containing protein 31 isoform 2

      Status: REVIEWED

      Description
      Transcript Variant: This variant (2) differs in the 5' UTR, lacks a portion of the 5' coding region and initiates translation at a downstream start codon, compared to variant 1. It encodes isoform 2 which is shorter than isoform 1.
      Source sequence(s)
      AI091187, AK074891, AL449305, DA209351
      UniProtKB/TrEMBL
      Q8NC90
      Conserved Domains (3) summary
      COG2319
      Location:2218
      WD40; WD40 repeat [General function prediction only]
      sd00039
      Location:2459
      7WD40; WD40 repeat [structural motif]
      cl02567
      Location:5225
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    2. NM_001012361.4NP_001012361.1  WD repeat-containing protein 31 isoform 1

      See identical proteins and their annotated locations for NP_001012361.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (1) represents the longest transcript and encodes the longest isoform (1).
      Source sequence(s)
      AI091187, AK093240, AL449305, DA209351
      Consensus CDS
      CCDS35110.1
      UniProtKB/Swiss-Prot
      Q5W0T9, Q8NA23, Q96EG8
      Related
      ENSP00000363308.3, ENST00000374193.9
      Conserved Domains (3) summary
      COG2319
      Location:61343
      WD40; WD40 repeat [General function prediction only]
      cd00200
      Location:61350
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      sd00039
      Location:65101
      7WD40; WD40 repeat [structural motif]
    3. NM_145241.5NP_660284.1  WD repeat-containing protein 31 isoform 3

      See identical proteins and their annotated locations for NP_660284.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (3) uses an alternate in-frame splice site in the coding region, compared to variant 1. It encodes isoform 3 which lacks one internal amino acid, compared to isoform 1.
      Source sequence(s)
      AI091187, AL449305, BC012352, DA209351
      Consensus CDS
      CCDS6792.1
      UniProtKB/Swiss-Prot
      Q8NA23
      Related
      ENSP00000345027.3, ENST00000341761.8
      Conserved Domains (2) summary
      cd00200
      Location:60349
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      sd00039
      Location:64100
      7WD40; WD40 repeat [structural motif]

    RNA

    1. NR_134677.2 RNA Sequence

      Status: REVIEWED

      Description
      Transcript Variant: This variant (4) uses alternate splice sites in two internal exons, compared to variant 1. This variant is represented as non-coding because the use of the 5'-most supported translational start codon, as used in variant 1, renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).
      Source sequence(s)
      AI091187, AL449305, AL833635, DA209351

    RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2024_08

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCh38.p14 Primary Assembly

    Genomic

    1. NC_000009.12 Reference GRCh38.p14 Primary Assembly

      Range
      113313222..113340275 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_047422716.1XP_047278672.1  WD repeat-containing protein 31 isoform X4

    2. XM_011518194.3XP_011516496.1  WD repeat-containing protein 31 isoform X3

      See identical proteins and their annotated locations for XP_011516496.1

      Conserved Domains (3) summary
      COG2319
      Location:88318
      WD40; WD40 repeat [General function prediction only]
      sd00039
      Location:85119
      7WD40; WD40 repeat [structural motif]
      cl02567
      Location:61325
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    3. XM_047422715.1XP_047278671.1  WD repeat-containing protein 31 isoform X4

    4. XM_047422713.1XP_047278669.1  WD repeat-containing protein 31 isoform X2

    5. XM_047422714.1XP_047278670.1  WD repeat-containing protein 31 isoform X3

    6. XM_047422712.1XP_047278668.1  WD repeat-containing protein 31 isoform X1

      UniProtKB/Swiss-Prot
      Q5W0T9, Q8NA23, Q96EG8

    Alternate T2T-CHM13v2.0

    Genomic

    1. NC_060933.1 Alternate T2T-CHM13v2.0

      Range
      125512036..125539119 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_054361894.1XP_054217869.1  WD repeat-containing protein 31 isoform X4

    2. XM_054361891.1XP_054217866.1  WD repeat-containing protein 31 isoform X2

    3. XM_054361893.1XP_054217868.1  WD repeat-containing protein 31 isoform X3

    4. XM_054361890.1XP_054217865.1  WD repeat-containing protein 31 isoform X1

      UniProtKB/Swiss-Prot
      Q5W0T9, Q8NA23, Q96EG8
    5. XM_054361895.1XP_054217870.1  WD repeat-containing protein 31 isoform X4

    6. XM_054361892.1XP_054217867.1  WD repeat-containing protein 31 isoform X3