NCBI CCDS banner
PubMed Entrez Gene BLAST OMIM
The CCDS database will be unavailable on Tuesday, December 17, 2024, starting at 8:00 a.m. EST for up to 60 minutes.
  

CCDS
Home
FTP
Process
Releases & Statistics

Collaborators
EBI
HGNC
MGI
NCBI

Contact Us
email CCDS

Genome Displays

Ensembl
NCBI
UCSC
VEGA

Related Resources
Gene
HomoloGene
MANE
RefSeq


Report for CCDS42894.1 (current version)

CCDS Status Species Chrom. Gene CCDS Release NCBI Annotation Release Ensembl Annotation Release Links
42894.1 Public Homo sapiens 20 ZNF831 24 110 108 CCDS HistoryNCBI Gene:128611Re-query CCDS DB by CCDS ID:42894.1Re-query CCDS DB by GeneID:128611See the combined annotation on chromosome 20 in Sequence Viewer

Public since: CCDS release 5, NCBI annotation release 36.3, Ensembl annotation release 47

Review status: Reviewed (by RefSeq and Havana)

Sequence IDs included in CCDS 42894.1

Original Current Source Nucleotide ID Protein ID MANE Status in CCDS Seq. Status Links
Original member Current member EBI ENST00000371030.4 ENSP00000360069.2 MANE Select Accepted alive Link to Ensembl Transcript Viewer:ENST00000371030.4Link to Ensembl Protein Viewer:ENSP00000360069.2Re-query CCDS DB by Nucleotide ID:ENST00000371030Re-query CCDS DB by Protein ID:ENSP00000360069
Original member Current member EBI ENST00000637017.1 ENSP00000490240.1 Accepted alive Link to Ensembl Transcript Viewer:ENST00000637017.1Link to Ensembl Protein Viewer:ENSP00000490240.1Re-query CCDS DB by Nucleotide ID:ENST00000637017Re-query CCDS DB by Protein ID:ENSP00000490240
Original member Current member NCBI NM_001384354.1 NP_001371283.1 Accepted alive Link to Nucleotide Sequence:NM_001384354.1Link to Protein Sequence:NP_001371283.1Re-query CCDS DB by Nucleotide ID:NM_001384354Re-query CCDS DB by Protein ID:NP_001371283Link to BLAST:NP_001371283.1
Original member Current member NCBI NM_178457.3 NP_848552.1 MANE Select Accepted alive Link to Nucleotide Sequence:NM_178457.3Link to Protein Sequence:NP_848552.1Re-query CCDS DB by Nucleotide ID:NM_178457Re-query CCDS DB by Protein ID:NP_848552Link to BLAST:NP_848552.1

RefSeq Length Related UniProtKB/SwissProt Length Identity Gaps Mismatches
NP_001371283.1 1677 Q5JPB2 1677 100% 0 0
NP_848552.1 1677 Q5JPB2 1677 100% 0 0

Chromosomal Locations for CCDS 42894.1

Assembly GRCh38.p14 (GCF_000001405.40)

On '+' strand of Chromosome 20 (NC_000020.11)
Genome Browser links: Link to NCBI NucleotideLink to UCSC Genome Browser on chromosome 20Link to Ensembl Genome Browser on chromosome 20See the combined annotation on chromosome 20 in Sequence Viewer

Chromosome Start Stop Links
20 59191020 59194757 Link to NCBI NucleotideLink to UCSC Genome Browser on chromosome 20Link to Ensembl Genome Browser on chromosome 20
20 59195869 59196005 Link to NCBI NucleotideLink to UCSC Genome Browser on chromosome 20Link to Ensembl Genome Browser on chromosome 20
20 59206905 59207056 Link to NCBI NucleotideLink to UCSC Genome Browser on chromosome 20Link to Ensembl Genome Browser on chromosome 20
20 59252978 59253138 Link to NCBI NucleotideLink to UCSC Genome Browser on chromosome 20Link to Ensembl Genome Browser on chromosome 20
20 59253898 59254743 Link to NCBI NucleotideLink to UCSC Genome Browser on chromosome 20Link to Ensembl Genome Browser on chromosome 20

CCDS Sequence Data
Blue highlighting indicates alternating exons.
Red highlighting indicates amino acids encoded across a splice junction.
 
Mouse over the nucleotide or protein sequence below and click on the highlighted codon or residue to select the pair.

Nucleotide Sequence (5034 nt):
ATGGAGGTTCCAGAACCCACCTGCCCTGCCCCTCCTGCGAGGGACCAGCCAGCTCCCACTCCTGGCCCTC
CA
GGGGCCCCAGGTGGCCAGGCCTCACCTCACCTGACCCTGGGCCCTGTCCTTCTGCCGCCAGAGCAGGG
C
CTGGCCCCCCCCACTGTGTTCCTGAAGGCCCTGCCCATCCCACTGTACCACACGGTGCCTCCCGGGGGC
CTC
CAGCCCCGCGCCCCGCTAGTGACGGGCAGCCTAGATGGGGGCAACGTGCCCTTCATACTCAGCCCTG
TG
CTGCAGCCTGAAGGGCCTGGCCCCACCCAGGTGGGGAAGCCGGCGGCCCCTACGCTGACGGTGAACAT
C
GTGGGCACTCTGCCTGTCCTGTCGCCGGGCCTGGGCCCCACGCTGGGCAGCCCAGGCAAGGTGCGGAAT
GCG
GGCAAGTACCTGTGTCCGCACTGTGGTCGCGACTGCCTGAAGCCCAGTGTTCTAGAGAAGCACATCC
GG
TCCCACACGGGTGAGAGGCCCTTCCCGTGTGCCACCTGCGGCATCGCCTTTAAGACCCAGAGCAATCT
C
TACAAGCACAGGCGGACGCAGACGCACCTCAACAACTCCCGGCTGTCCTCAGAGTCCGAGGGCGCCGGG
GGC
GGCCTCCTGGAGGAAGGGGACAAGGCCGGAGAGCCCCCCAGACCAGAGGGCAGGGGCGAGAGCAGGT
GC
CAGGGGATGCACGAAGGCGCCTCGGAGAGACCCCTTTCTCCGGGTGCCCACGTGCCCCTACTTGCCAA
G
AACCTGGATGTGAGGACCGAAGCTGCTCCCTGTCCAGGGTCCGCATTTGCCGACAGAGAGGCTCCTTGG
GAC
TCTGCCCCCATGGCGTCACCTGGGCTCCCAGCGGCCAGCACACAACCCTGGCGTAAGTTGCCAGAGC
AG
AAGTCGCCGACCGCCGGGAAGCCGTGCGCCCTGCAGCGGCAGCAGGCGACGGCAGCGGAGAAGCCCTG
G
GATGCCAAGGCCCCCGAGGGCCGGCTGCGGAAGTGTGAGAGCACCGACTCGGGGTACCTGTCGCGCTCC
GAC
AGCGCGGAGCAGCCGCATGCGCCCTGCAGCCCCCTGCACAGCCTTTCGGAGCACAGCGCCGAGTCCG
AG
GGGGAGGGCGGCCCGGGCCCGGGGCCAGGGGTCGCAGGGGCCGAGCCCGGGGCGCGAGAAGCCGGCCT
G
GAGCTGGAGAAGAAGCGGCTGGAGGAGCGCATCGCCCAGCTCATCTCCCACAACCAGGCGGTGGTGGAC
GAT
GCCCAGCTGGACAACGTGCGGCCCCGGAAGACCGGGCTGTCCAAACAGGGCAGCATCGACCTGCCCA
CG
CCCTACACCTACAAGGACTCCTTCCACTTTGACATCCGCGCGCTGGAGCCAGGCCGTAGGAGGGCCCC
G
GGCCCCGTGCGCTCCACCTGGACGCCCCCAGACAAGTCTCGGCCCCTCTTCTTCCACTCCGTCCCCACT
CAG
CTCTCCACCACCGTGGAATGTGTCCCCGTCACCAGGAGCAACTCGCTGCCCTTCGTCGAGGGCTCCA
GG
ACGTGGCTGGAGCCCAGGGAGCCCCGGGACCCCTGGTCCAGGACGCAGAAGCCTCTGAGCCCCAGGCC
C
GGCCCAGCCCGCCTGGGCTGCCGCTCGGGACTAAGCTCGACTGACGTTCCCAGTGGGCATCCCCGGGCC
CTG
GTCAGACAGGCCGCGGTGGAGGACCTGCCAGGCACCCCCATTGGCGATGCCCTGGTGCCCGCAGAGG
AC
ACAGACGCAAAGAGAACTGCTGCGCGGGAGGCCATGGCCGGCAAGGGCAGAGCGGGCGGCAGGAAGTG
C
GGCCAGAGAAGGCTGAAGATGTTCTCCCAGGAGAAGTGGCAGGTGTACGGGGATGAGACGTTCAAAAGG
ATC
TACCAGAAAATGAAAGCCAGTCCCCATGGAGGCAAGAAAGCCAGGGAGGTGGGAATGGGCAGTGGGG
CA
GAACTGGGCTTTCCTCTGCAGAAAGAGGCAGCAGGGAGCTCAGGCACAGTCCCCACCCAAGACAGGAG
G
ACCCCTGTCCATGAGGACATATCCGCAGGGGCAACGCCAGAGCCTTGGGGAAATCCACCAGCCCTGGAG
GCC
TCCTTGGTGACTGAACCCACTAAGCATGGGGAGACGGTGGCCAGGAGAGGAGACAGTGACCGACCCA
GG
GTGGAAGAGGCTGTGTCATCCCCTGCACTGGGTGGCAGAGACAGTCCCTGTTCAGGCAGTAGGAGCCC
C
CTGGTCTCTCCAAATGGGAGGCTGGAACTGGGGTGGCAGATGCCCCCAGCACCTGGCCCCCTCAAAGGG
GGT
GATGTAGAGGCTCCCAGGCCAGTTTGGCCGGACCCCAAGCTGGAAGGAGGTGCCCGAGGTGTGGGGG
AT
GTTCAGGAGACCTGCCTGTGGGCCCAGACTGTCCTGAGATGGCCCAGCAGGGGCTCAGGGGAGGACAA
G
CTCCCCTCAGAGAGGAAGAAGCTGAAAGTGGAGGACCTGCACAGCTGGAAGCAACCAGAGCCTGTGAGC
GCA
GAGACCCCAGGTGGGCCCACGCAGCCTGCCTCTTTGTCATCCCAGAAGCAGGATGCCGATCCCGGGG
AG
GTGCCAGGGGGCTCAAAGGAGAGTGCCAGGCAGGTGGGCGAGCCTCTGGAGTCCTCTGGAGCCTCCTT
G
GCTGCTGCTTCTGTTGCCCTGAAGAGGGTGGGGCCAAGGGACAAGGCTACCCCACTGCATCCTGCAGCC
CCA
GCCCCCGCAGAGCACCCCTCGCTGGCCACCCCACCTCAGGCTCCTAGAGTGCTCTCTGCCCTGGCAG
AT
AATGCCTTTTCCCCCAAGTACCTCCTCAGGTTACCTCAGGCAGAGACCCCCTTACCACTGCCCATTCC
C
TGGGGACCAAGGCACAGCCAGGACTCTCTCTGCAGCAGTGGGTGGCCTGAAGAACGGGCATCATTTGTT
GGG
TCAGGACTGGGGACCCCTCTTTCTCCCAGCCCAGCCTCAGGCCCCTCCCCAGGTGAGGCGGACAGCA
TC
CTGGAGGACCCCAGCTGTTCCAGGCCACAGGATGGGAGAAAAGGGGCACAGTTGGGGGGGGACAAGGG
G
GACAGGATGGCCACTAGCAGGCCAGCAGCCAGGGAGTTGCCCATCTCAGCACCAGGGGCTCCCAGGGAG
GCT
ACCTCCTCCCCGCCCACTCCAACGTGTGAGGCACACTTAGTTCAGGACATGGAGGGTGACAGCCACC
GT
ATCCATCGCCTCTGCATGGGCAGCACTTTGGCAAGGGCCAGGCTCTCTGGGGATGTCCTGAATCCCTG
G
GTACCCAACTGGGAGCTGGGGGAGCCTCCTGGGAATGCCCCAGAAGATCCTTCTTCAGGGCCCCTGGTG
GGC
CCCGACCCGTGTTCCCCCCTCCAGCCTGGCTCCTTCCTCACTGCCCTCACTCGGCCTCAGGGTGTGC
CC
CCAGGCTGGCCAGAGCTGGCCTTGTCTTCCCACTCAGGGACGTCCCGGAGCCACAGCACCCGCAGTCC
C
CACAGCACCCAAAACCCCTTTCCCTCACTGAAGGCTGAGCCGCGGCTCACGTGGTGTTGCCTGAGCCGC
AGT
GTCCCTCTGCCCGCGGAGCAGAAGGCAAAGGCGGCATCTGTGTACTTGGCGGTGCACTTTCCTGGTA
GC
AGCCTCCGAGATGAGGGTCCCAATGGCCCTCCTGGGAGCAATGGAGGATGGACCTGGACAAGCCCTGG
A
GAAGGAGGGCCGGCGCAGATGTCCAAGTTCTCCTACCCAACAGTCCCAGGGGTGATGCCCCAGCACCAG
GTG
TCTGAGCCAGAATGGAAGAAAGGCCTGCCTTGGAGGGCAAAGATGTCTCGTGGGAACAGCAAGCAGA
GA
AAACTGAAGATCAACCCTAAAAGGTACAAAGGGAATTTCTTGCAGAGCTGTGTTCAGCTGAGAGCCAG
T
AGACTTCGCACACCAACCTGGGTGCGAAGAAGAAGCCGCCACCCTCCCGCACTTGAGGGACTGAAGCCA
TGC
AGGACCCCTGGGCAGACCTCTTCAGAAATAGCAGGTCTGAATCTGCAAGAGGAGCCATCTTGTGCCA
CC
TCAGAATCACCTCCTTGTTGTGGGAAGGAAGAGAAGAAGGAAGGTGACTGCAGACAAACCTTAGGAAC
C
CTCTCTCTTGGTACAAGTTCAAGAATTGTCAGGGAAATGGACAAACGAACTGTGAAGGATATTTCTCCA
TCT
GCTGGTGAGCATGGTGACTGTACTACTCACAGCACTGCTGCCACATCAGGATTATCTCTGCAATCTG
AC
ACCTGCCTGGCAGTGGTTAATGACGTGCCTCTACCCCCTGGCAAAGGTCTTGACCTTGGGTTGCTGGA
G
ACTCAGCTGCTGGCCTCCCAGGATTCAGTCTCAACAGATCCCAAACCATACATCTTCTCAGATGCTCAA
AGG
CCTTCTTCCTTTGGGTCCAAAGGAACTTTTCCCCACCATGACATTGCTACCTCTGTGGCTGCCGTTT
GT
ATTTCTCTGCCAGTGAGAACAGATCACATAGCCCAGGAAATTCACAGTGCTGAATCACGAGACCACAG
C
CAGACTGCAGGGAGGACTCTGACATCAAGCTCCCCAGACAGCAAAGTCACAGAAGAGGGCAGAGCACAG
ACC
CTCTTGCCAGGGAGACCTTCATCTGGACAAAGAATTTCAGATTCGGTTCCACTGGAGTCAACTGAAA
AA
ACTCATCTTGAAATACCAGCTTCAGGACCAAGTTCAGCTAGTTCACACCACAAGGAAGGGAGACACAA
G
ACGTTTTTTCCTTCCAGAGGCCAGTATGGGTGTGGGGAAATGACTGTCCCCTGCCCCTCTTTAGGAAGT
GAC
GGTAGGAAACGTCAGGTATCTGGATTAATCACTCGGAAAGATTCTGTGGTTCCTTCTAAGCCAGAGC
AG
CCCATAGAAATTCCTGAAGCCCCTTCTAAATCCCTCAAGAAGAGGAGTCTGGAAGGAATGAGAAAGCA
A
ACTCGAGTAGAGTTCAGTGACACCAGCAGCGACGATGAAGACCGATTAGTTATAGAAATATGA


Translation (1677 aa):
MEVPEPTCPAPPARDQPAPTPGPPGAPGGQASPHLTLGPVLLPPEQGLAPPTVFLKALPIPLYHTVPPGG
L
QPRAPLVTGSLDGGNVPFILSPVLQPEGPGPTQVGKPAAPTLTVNIVGTLPVLSPGLGPTLGSPGKVRN
A
GKYLCPHCGRDCLKPSVLEKHIRSHTGERPFPCATCGIAFKTQSNLYKHRRTQTHLNNSRLSSESEGAG
G
GLLEEGDKAGEPPRPEGRGESRCQGMHEGASERPLSPGAHVPLLAKNLDVRTEAAPCPGSAFADREAPW
D
SAPMASPGLPAASTQPWRKLPEQKSPTAGKPCALQRQQATAAEKPWDAKAPEGRLRKCESTDSGYLSRS
D
SAEQPHAPCSPLHSLSEHSAESEGEGGPGPGPGVAGAEPGAREAGLELEKKRLEERIAQLISHNQAVVD
D
AQLDNVRPRKTGLSKQGSIDLPTPYTYKDSFHFDIRALEPGRRRAPGPVRSTWTPPDKSRPLFFHSVPT
Q
LSTTVECVPVTRSNSLPFVEGSRTWLEPREPRDPWSRTQKPLSPRPGPARLGCRSGLSSTDVPSGHPRA
L
VRQAAVEDLPGTPIGDALVPAEDTDAKRTAAREAMAGKGRAGGRKCGQRRLKMFSQEKWQVYGDETFKR
I
YQKMKASPHGGKKAREVGMGSGAELGFPLQKEAAGSSGTVPTQDRRTPVHEDISAGATPEPWGNPPALE
A
SLVTEPTKHGETVARRGDSDRPRVEEAVSSPALGGRDSPCSGSRSPLVSPNGRLELGWQMPPAPGPLKG
G
DVEAPRPVWPDPKLEGGARGVGDVQETCLWAQTVLRWPSRGSGEDKLPSERKKLKVEDLHSWKQPEPVS
A
ETPGGPTQPASLSSQKQDADPGEVPGGSKESARQVGEPLESSGASLAAASVALKRVGPRDKATPLHPAA
P
APAEHPSLATPPQAPRVLSALADNAFSPKYLLRLPQAETPLPLPIPWGPRHSQDSLCSSGWPEERASFV
G
SGLGTPLSPSPASGPSPGEADSILEDPSCSRPQDGRKGAQLGGDKGDRMATSRPAARELPISAPGAPRE
A
TSSPPTPTCEAHLVQDMEGDSHRIHRLCMGSTLARARLSGDVLNPWVPNWELGEPPGNAPEDPSSGPLV
G
PDPCSPLQPGSFLTALTRPQGVPPGWPELALSSHSGTSRSHSTRSPHSTQNPFPSLKAEPRLTWCCLSR
S
VPLPAEQKAKAASVYLAVHFPGSSLRDEGPNGPPGSNGGWTWTSPGEGGPAQMSK
FSYPTVPGVMPQHQ
V
SEPEWKKGLPWRAKMSRGNSKQRKLKINPK
RYKGNFLQSCVQLRASRLRTPTWVRRRSRHPPALEGLKP
C
RTPGQTSSEIA
GLNLQEEPSCATSESPPCCGKEEKKEGDCRQTLGTLSLGTSSRIVREMDKRTVKDISP
S
AGEHGDCTTHSTAATSGLSLQSDTCLAVVNDVPLPPGKGLDLGLLETQLLASQDSVSTDPKPYIFSDAQ
R
PSSFGSKGTFPHHDIATSVAAVCISLPVRTDHIAQEIHSAESRDHSQTAGRTLTSSSPDSKVTEEGRAQ
T
LLPGRPSSGQRISDSVPLESTEKTHLEIPASGPSSASSHHKEGRHKTFFPSRGQYGCGEMTVPCPSLGS
D
GRKRQVSGLITRKDSVVPSKPEQPIEIPEAPSKSLKKRSLEGMRKQTRVEFSDTSSDDEDRLVIEI




Links Key
 Links to:   History report
  BLAST report
  Entrez Gene
  Nucleotide report
  Protein report
 Re-query CCDS DB by:   CCDS ID
  Gene ID
  Nucleotide ID
  Protein ID
 Genome Browser Links:   Ensembl Genome Browser
  NCBI Sequence Viewer
  UCSC Genome Browser
  VEGA Genome Browser