NEW
Try the new Transcript table
These reference sequences exist independently of genome builds. Explain
These reference sequences are curated independently of the genome
annotation cycle, so their versions may not match the RefSeq versions in the current
genome build. Identify version mismatches by comparing the version of the RefSeq in
this section to the one reported in Genomic regions,
transcripts, and products above.
Genomic
-
NG_052981.2 RefSeqGene
- Range
-
5002..39445
- Download
- GenBank, FASTA, Sequence Viewer (Graphics)
mRNA and Protein(s)
-
NM_001291411.2 → NP_001278340.2 protein SON isoform E
Status: REVIEWED
- Description
- Transcript Variant: This variant (e) lacks multiple 3' coding exons and contains an alternate 3' exon, resulting in a distinct 3' coding region and 3' UTR, compared to variant f. It encodes isoform E which is shorter and has a distinct C-terminus, compared to isoform F.
- Source sequence(s)
-
AP000303
- Consensus CDS
-
CCDS74784.1
- Related
- ENSP00000371095.4, ENST00000381679.8
- Conserved Domains (4) summary
-
- PHA03247
Location:170 → 460
- PHA03247; large tegument protein UL36; Provisional
- PHA03379
Location:340 → 673
- PHA03379; EBNA-3A; Provisional
- PRK10811
Location:1289 → 1481
- rne; ribonuclease E; Reviewed
- NF000535
Location:730 → 903
- MSCRAMM_SdrC; MSCRAMM family adhesin SdrC
-
NM_001291412.3 → NP_001278341.1 protein SON isoform H
Status: REVIEWED
- Description
- Transcript Variant: This variant (h) represents the allele encoded by the GRCh38 reference genome and encodes isoform (H).
- Source sequence(s)
-
AP000303, AP000304
- Consensus CDS
-
CCDS77624.1
- UniProtKB/TrEMBL
- A0A994J4Y9, J3QSZ5
- Related
- ENSP00000371111.2, ENST00000381692.6
- Conserved Domains (2) summary
-
- pfam01585
Location:333 → 376
- G-patch; G-patch domain
- cl00054
Location:398 → 441
- DSRM; Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, ...
-
NM_001412132.1 → NP_001399061.1 protein SON isoform I
Status: REVIEWED
- Description
- Transcript Variant: This variant (i) uses the same exon combination as variant h but represents the allele encoded by the T2T genome assembly. The encoded isoform (I) has a slightly different sequence in the C-terminal region compared to isoform H.
- Source sequence(s)
-
CP068257
- UniProtKB/TrEMBL
- A0A994J4Y9, Q6ZRV7
-
NM_001412133.1 → NP_001399062.1 protein SON isoform J
Status: REVIEWED
- Description
- Transcript Variant: This variant (j) uses the same exon combination as variant f but represents the allele encoded by the T2T genome assembly. The encoded isoform (J) has a slightly different sequence in the C-terminal region compared to isoform F.
- Source sequence(s)
-
CP068257
-
NM_032195.3 → NP_115571.3 protein SON isoform B
Status: REVIEWED
- Description
- Transcript Variant: This variant (b) lacks multiple 3' coding exons and contains an alternate 3' exon, resulting in a distinct 3' coding region and 3' UTR, compared to variant f. The encoded isoform (B) is shorter and has a distinct C-terminus, compared to isoform F.
- Source sequence(s)
-
AP000303, AP000304
- Consensus CDS
-
CCDS13631.1
- Related
- ENSP00000300278.2, ENST00000300278.8
- Conserved Domains (4) summary
-
- PHA03247
Location:170 → 460
- PHA03247; large tegument protein UL36; Provisional
- PHA03379
Location:340 → 673
- PHA03379; EBNA-3A; Provisional
- PRK10811
Location:1289 → 1481
- rne; ribonuclease E; Reviewed
- NF000535
Location:730 → 903
- MSCRAMM_SdrC; MSCRAMM family adhesin SdrC
-
NM_138927.4 → NP_620305.3 protein SON isoform F
Status: REVIEWED
- Description
- Transcript Variant: This variant (f) represents the allele encoded by the GRCh38 reference genome and encodes isoform (F).
- Source sequence(s)
-
AF380184, AK307612, AP000303
- Consensus CDS
-
CCDS13629.1
- UniProtKB/Swiss-Prot
- D3DSF5, D3DSF6, E7ETE8, E7EU67, E7EVW3, E9PFQ2, O14487, O95981, P18583, Q14120, Q6PKE0, Q9H7B1, Q9P070, Q9P072, Q9UKP9, Q9UPY0
- Related
- ENSP00000348984.4, ENST00000356577.10
- Conserved Domains (6) summary
-
- PHA03247
Location:170 → 460
- PHA03247; large tegument protein UL36; Provisional
- PHA03379
Location:340 → 673
- PHA03379; EBNA-3A; Provisional
- PRK10811
Location:1289 → 1481
- rne; ribonuclease E; Reviewed
- NF000535
Location:730 → 903
- MSCRAMM_SdrC; MSCRAMM family adhesin SdrC
- pfam01585
Location:2305 → 2349
- G-patch; G-patch domain
- cl00054
Location:2369 → 2419
- DSRM_SF; double-stranded RNA binding motif (DSRM) superfamily
RNA
-
NR_103797.2 RNA Sequence
Status: REVIEWED
- Description
- Transcript Variant: This variant (c) contains an alternate internal exon and uses an alternate splice site at the 3' exon, compared to variant f. This variant is represented as non-coding because the use of the 5'-most expected translational start codon, as used in variant f, renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).
- Source sequence(s)
-
AP000303, AP000304
- Related
-
ENST00000455528.5
The following sections contain reference sequences that belong to a
specific genome build. Explain
This section includes genomic Reference
Sequences (RefSeqs) from all assemblies on which this gene is annotated, such as
RefSeqs for chromosomes and scaffolds (contigs) from both reference and alternate
assemblies. Model RNAs and proteins are also reported here.
Reference GRCh38.p14 Primary Assembly
Genomic
-
NC_000021.9 Reference GRCh38.p14 Primary Assembly
- Range
-
33543038..33577481
- Download
- GenBank, FASTA, Sequence Viewer (Graphics)
Alternate T2T-CHM13v2.0
Genomic
-
NC_060945.1 Alternate T2T-CHM13v2.0
- Range
-
31924877..31959322
- Download
- GenBank, FASTA, Sequence Viewer (Graphics)
The following Reference Sequences have been suppressed. Explain
These RefSeqs were suppressed for the
cited reason(s). Suppressed RefSeqs do not appear in BLAST databases, related
sequence links, or BLAST links (BLink), but may still be retrieved by clicking on
their accession.version below.
-
NM_003103.5: Suppressed sequence
- Description
- NM_003103.5: This RefSeq was permanently suppressed because currently there is insufficient support for the transcript and the protein.
-
NM_138925.1: Suppressed sequence
- Description
- NM_138925.1: This RefSeq was permanently suppressed because currently there is insufficient support for the transcript and the protein.
-
NR_103796.1: Suppressed sequence
- Description
- NR_103796.1: This RefSeq was permanently suppressed because it is now thought that this transcript variant does encode a protein.
-
NR_103798.1: Suppressed sequence
- Description
- NR_103798.1: This RefSeq was temporarily suppressed because currently there is not sufficient data to support this transcript.