Conserved domains on  [gi|1007352648|ref|NP_001308010|]

histone-lysine N-methyltransferase EZH1 isoform 4 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
SET_EZH1 cd19217
SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43) ...
599-734 2.79e-94

SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43), also termed ENX-2, or histone-lysine N-methyltransferase EZH1, is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380994  Cd Length: 136  Bit Score: 288.89  E-value: 2.79e-94
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 599 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRF 678
Cdd:cd19217     1 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRF 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1007352648 679 ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADALKYVGIERE 734
Cdd:cd19217    81 ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQQGEELFFDYRYSQADALKYVGIERE 136
PRC2_HTH_1 pfam18118
Polycomb repressive complex 2 tri-helical domain; This domain can be found in the Polycomb ...
150-253 3.99e-37

Polycomb repressive complex 2 tri-helical domain; This domain can be found in the Polycomb repressive complex 2 (PRC2) present in Homo sapiens. Polycomb complexes maintain repressive chromatin states by silencing gene expression. PRC2 does this by methylating lysine 27 of histone H3. This domain makes up part of the N-lobe which is involved in regulation.


Pssm-ID: 436286  Cd Length: 101  Bit Score: 134.05  E-value: 3.99e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 150 HGEEEmipgSVLISDAVFLELVDALNQYSDEEEEGHNDTSdGKQDDSKEDLPVTRKRKRH--AIEGNKKSSKKQFPNDMI 227
Cdd:pfam18118   1 HGDRE----GGFINDDIFVELVNALMQYYDDDDESEPESS-EKMSQAKKDERKTEEDERTekKDGDEKSESKKPFPSDII 75
                          90       100
                  ....*....|....*....|....*.
gi 1007352648 228 FSAIASMFPENGVPDDMKERYRELTE 253
Cdd:pfam18118  76 FQAISSMFPDKGTPEELKEKYKELTE 101
EZH2_WD-Binding pfam11616
WD repeat binding protein EZH2; This family of proteins represents Enhancer of zeste homolog 2, ...
39-68 2.49e-12

WD repeat binding protein EZH2; This family represents a 30-residue peptide of Enhancer of zeste homolog 2 (EZH2) that binds a WD-repeat domain of EED through residues 39-68. EED is a component of the PRC2 complex, which is involved in gene expression. This interaction is required for the HMTase activity of PRC2.


Pssm-ID: 463308  Cd Length: 30  Bit Score: 61.31  E-value: 2.49e-12
                          10        20        30
                  ....*....|....*....|....*....|
gi 1007352648  39 KALYVANFAKVQEKTQILNEEWKKLRVQPV 68
Cdd:pfam11616   1 KSLFVSNRQKIQERTELLNEEWKKLRIQPI 30
preSET_CXC pfam18264
CXC domain; This domain is found to the N-terminus of the SET domain in the EZH2 protein. It ...
551-582 5.89e-10

CXC domain; This domain is found N-terminal to the SET domain in the EZH2 protein. It is a zinc-binding domain.


Pssm-ID: 408079  Cd Length: 32  Bit Score: 54.84  E-value: 5.89e-10
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1007352648 551 GCRCKTQCNTKQCPCYLAVRECDPDLCLTCGA 582
Cdd:pfam18264   1 GCSCRATCYTKACLCYRANRECDPDLCNMCGA 32
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
424-465 9.68e-05

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. Binding is sequence dependent for repeats which contain the G/C-rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes, where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 40.25  E-value: 9.68e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1007352648 424 EWTGAEESLFRVFHGTY-FNNFCSIARLLGTKTCKQVFQFAVK 465
Cdd:cd00167     1 PWTEEEDELLLEAVKKYgKNNWEKIAKELPGRTPKQCRERWRN 43
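
Each hit above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal Python sketch for pulling those fields out of such a line when working from a saved text copy of this page. The function name parse_stats, the dictionary keys, and the regular expression are illustrative assumptions based only on the layout shown here; they are not part of any NCBI tool or API.

```python
import re

# Hypothetical pattern: it mirrors the labels printed on the page and
# tolerates the optional "[Multi-domain]" tag that follows some Pssm-IDs.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the statistics fields of one hit as a dict, or None if absent."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    # Statistics line of the SET_EZH1 hit, copied from the listing above.
    print(parse_stats(
        "Pssm-ID: 380994  Cd Length: 136  Bit Score: 288.89  E-value: 2.79e-94"
    ))
```

Sorting the parsed dictionaries by "e_value" would reproduce the ordering used in the hit list, assuming every hit has such a line.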
 
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
604-725 1.58e-41

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 147.48  E-value: 1.58e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648  604 KHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFNLNNDFVVDATRKGNKIRFANH 681
Cdd:smart00317   1 NKLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERPKAYDTDGakAFYLFDIDSDLCIDARRKGNLARFINH 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1007352648  682 SVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 725
Cdd:smart00317  81 SCEPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGSDYANE 124
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
602-725 5.45e-29

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 112.36  E-value: 5.45e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 602 LKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRR---GKVYDKYmssfLFNLNNDFVVDATRKGNKIRF 678
Cdd:COG2940     4 LHPRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERRephKEPLHTY----LFELDDDGVIDGALGGNPARF 79
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 1007352648 679 ANHSVNPNCYAkvvmVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 725
Cdd:COG2940    80 INHSCDPNCEA----DEEDGRIFIVALRDIAAGEELTYDYGLDYDEE 122
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
615-718 1.68e-26

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probable functional importance. In addition to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 104.53  E-value: 1.68e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 615 GWGTFIKESVQKNEFISEYCGE-LISQDEADRRGKVYDKYM-----SSFLFNLNND--FVVDAT--RKGNKIRFANHSVN 684
Cdd:pfam00856   1 GRGLFATEDIPKGEFIGEYVEVlLITKEEADKRELLYYDKLelrlwGPYLFTLDEDseYCIDARalYYGNWARFINHSCD 80
                          90       100       110
                  ....*....|....*....|....*....|....
gi 1007352648 685 PNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:pfam00856  81 PNCEVRVVYVNGGPRIVIFALRDIKPGEELTIDY 114
SANT smart00717
SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains;
424-465 6.62e-04

SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 37.97  E-value: 6.62e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1007352648  424 EWTGAEESLFRVFHGTY-FNNFCSIARLLGTKTCKQVFQFAVK 465
Cdd:smart00717   3 EWTEEEDELLIELVKKYgKNNWEKIAKELPGRTAEQCRERWRN 45
 
SET_EZH2 cd19218
SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43) ...
601-720 1.48e-86

SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43), also termed lysine N-methyltransferase 6, or ENX-1, or histone-lysine N-methyltransferase EZH2, is a catalytic subunit of the polycomb repressive complex 2 (PRC2)/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380995  Cd Length: 120  Bit Score: 268.32  E-value: 1.48e-86
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 601 GLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRFAN 680
Cdd:cd19218     1 GSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFAN 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 1007352648 681 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 720
Cdd:cd19218    81 HSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFDYRY 120
SET_EZH cd10519
SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar ...
604-720 9.81e-84

SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both EZH1 and EZH2 can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380917  Cd Length: 117  Bit Score: 260.64  E-value: 9.81e-84
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 604 KHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRFANHSV 683
Cdd:cd10519     1 KRLLLGKSDVAGWGLFLKEPIKKDEFIGEYTGELISQDEADRRGKIYDKYNSSYLFNLNDQFVVDATRKGNKIRFANHSS 80
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1007352648 684 NPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 720
Cdd:cd10519    81 NPNCYAKVMMVNGDHRIGIFAKRDIEAGEELFFDYGY 117
SET_SETD1-like cd10518
SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), ...
594-727 6.97e-43

SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), histone-lysine N-methyltransferases (KMT2A/KMT2B/KMT2C/KMT2D) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A), 1B (SETD1B), as well as histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B), 2C (KMT2C), 2D (KMT2D). These proteins are histone-lysine N-methyltransferases (EC 2.1.1.43) that specifically methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380916  Cd Length: 150  Bit Score: 151.98  E-value: 6.97e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 594 KNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATR 671
Cdd:cd10518     4 RFRQLRSRLKERLRVGKSGIHGWGLFAKRPIAAGEMVIEYVGEVIRPIVADKREKRYDEegGGGTYMFRIDEDLVIDATK 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1007352648 672 KGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADALK 727
Cdd:cd10518    84 KGNIARFINHSCDPNCYAKIITVDGEKHIVIFAKRDIAPGEELTYDYKFPIEDEEK 139
SET_SETD2-like cd10531
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), ...
615-722 1.75e-36

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2), ASH1-like protein (ASH1L) and similar proteins; This family includes SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2) and ASH1-like protein (ASH1L), which function as histone-lysine N-methyltransferases. SETD2 specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using dimethylated 'Lys-36' (H3K36me2) as substrate. NSD2 shows histone H3 'Lys-27' (H3K27me) methyltransferase activity. ASH1L specifically methylates 'Lys-36' of histone H3 (H3K36me). The family also includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins.


Pssm-ID: 380929  Cd Length: 136  Bit Score: 133.53  E-value: 1.75e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 615 GWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY--DKYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 692
Cdd:cd10531    11 GWGVKAKEDIQKGEFIIEYVGEVIDKKEFKERLDEYeeLGKSNFYILSLSDDVVIDATRKGNLSRFINHSCEPNCETQKW 90
                          90       100       110
                  ....*....|....*....|....*....|
gi 1007352648 693 MVNGDHRIGIFAKRAIQAGEELFFDYRYSQ 722
Cdd:cd10531    91 IVNGEYRIGIFALRDIPAGEELTFDYNFVN 120
SET_SETD1 cd19169
SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and ...
603-724 4.22e-35

SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A) and SET domain-containing protein 1B (SETD1B). These proteins are histone-lysine N-methyltransferases that specifically methylate 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated.


Pssm-ID: 380946  Cd Length: 148  Bit Score: 130.15  E-value: 4.22e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 603 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATRKGNKIRFAN 680
Cdd:cd19169    12 KKQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQVIRQSVADEREKRYEAigIGSSYLFRVDDDTIIDATKCGNLARFIN 91
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 1007352648 681 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 724
Cdd:cd19169    92 HSCNPNCYAKIITVESQKKIVIYSKRPIAVNEEITYDYKFPIED 135
SET_SET1 cd20072
SET domain (including post-SET domain) found in catalytic component of the Saccharomyces ...
603-724 2.37e-34

SET domain (including post-SET domain) found in catalytic component of the Saccharomyces cerevisiae COMPASS complex and similar proteins; The family contains mostly fungal SET domains, including SET1 found in the catalytic component of the Saccharomyces cerevisiae COMPASS (complex of proteins associated with Set1). SET1 is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me), when part of the SET1 histone methyltransferase (HMT) complex. The activity of this catalytic domain is established through forming a complex with a set of core proteins; it is extensively contacted by Cps60 (Bre2), Cps50 (Swd1), and Cps30 (Swd3).


Pssm-ID: 380998  Cd Length: 148  Bit Score: 127.93  E-value: 2.37e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 603 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATRKGNKIRFAN 680
Cdd:cd20072    12 KKQLKFARSAIHNWGLYAMENISAKDMVIEYVGEVIRQQVADEREKRYLRqgIGSSYLFRIDDDTVVDATKKGNIARFIN 91
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 1007352648 681 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 724
Cdd:cd20072    92 HCCDPNCTAKIIKVEGEKRIVIYAKRDIAAGEELTYDYKFPREE 135
SET_SETD2 cd19172
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and ...
615-720 9.66e-34

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and similar proteins; SETD2 (also termed HIF-1, huntingtin yeast partner B, huntingtin-interacting protein 1 (HIP-1), huntingtin-interacting protein B, lysine N-methyltransferase 3A or protein-lysine N-methyltransferase SETD2) acts as histone-lysine N-methyltransferase that specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using dimethylated 'Lys-36' (H3K36me2) as substrate. It has been shown that methylation is a posttranslational modification of dynamic microtubules and that SETD2 methylates alpha-tubulin at lysine 40, the same lysine that is marked by acetylation on microtubules. Methylation of microtubules occurs during mitosis and cytokinesis and can be ablated by SETD2 deletion, which causes mitotic spindle and cytokinesis defects, micronuclei, and polyploidy.


Pssm-ID: 380949 [Multi-domain]  Cd Length: 142  Bit Score: 126.16  E-value: 9.66e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 615 GWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK-YMSSFLF-NLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 692
Cdd:cd19172    13 GWGLRAAEDLPKGTFVIEYVGEVLDEKEFKRRMKEYAReGNRHYYFmALKSDEIIDATKKGNLSRFINHSCEPNCETQKW 92
                          90       100
                  ....*....|....*....|....*...
gi 1007352648 693 MVNGDHRIGIFAKRAIQAGEELFFDYRY 720
Cdd:cd19172    93 TVNGELRVGFFAKRDIPAGEELTFDYQF 120
SET_SETDB-like cd10538
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
548-718 1.60e-32

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2, and similar proteins; The family includes SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2. SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis. SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. 
EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin. This family also includes the pre-SET domain, which is found in a number of histone methyltransferases (HMTase), N-terminal to the SET domain. Pre-SET domain is a zinc binding motif which contains 9 conserved cysteines that coordinate three zinc ions. It is thought that this region plays a structural role in stabilizing SET domains. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380936 [Multi-domain]  Cd Length: 217  Bit Score: 125.18  E-value: 1.60e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 548 RFPGCRCKTQCNTKQCPCY---------------------LAVRECDPdlclTCGASEHwdckvvsCKNCSIQRGLKKHL 606
Cdd:cd10538    23 DSVGCKCKDDCLDSKCACAaesdgifaytkngllrlnnspPPIFECNS----KCSCDDD-------CKNRVVQRGLQARL 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 607 LLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNND---------FVVDATRKGNKIR 677
Cdd:cd10538    92 QVFRTSKKGWGVRSLEFIPKGSFVCEYVGEVITTSEADRRGKIYDKSGGSYLFDLDEFsdsdgdgeeLCVDATFCGNVSR 171
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 1007352648 678 FANHSVNPNCYA-KVVMVNGD---HRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10538   172 FINHSCDPNLFPfNVVIDHDDlryPRIALFATRDILPGEELTFDY 216
SET_SUV39H cd10542
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
538-737 7.03e-32

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homologs, SUV39H1, SUV39H2 and similar proteins; This family includes SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both of which act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. Also included are Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (SUV39H homolog) and Neurospora crassa DIM-5, both of which also methylate 'Lys-9' of histone H3.


Pssm-ID: 380940 [Multi-domain]  Cd Length: 245  Bit Score: 124.33  E-value: 7.03e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 538 FCQCNPDCQNRFPGCrCKTQCNTK---------QCPCYLAVRECDPdlclTCGASEhwdckvvSCKNCSIQRGLKKHL-L 607
Cdd:cd10542    24 GCECTEDCHNNNPTC-CPAESGVKfaydkqgrlRLPPGTPIYECNS----RCKCGP-------DCPNRVVQRGRKVPLcI 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 608 LAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLN-----NDFVVDATRKGNKIRFANHS 682
Cdd:cd10542    92 FRTSNGRGWGVKTLEDIKKGTFVMEYVGEIITSEEAERRGKIYDANGRTYLFDLDyndddCEYTVDAAYYGNISHFINHS 171
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 683 VNPN--CYAkVVMVNGD---HRIGIFAKRAIQAGEELFFDYRYSQADALKYVGIERETDV 737
Cdd:cd10542   172 CDPNlaVYA-VWINHLDprlPRIAFFAKRDIKAGEELTFDYLMTGTGGSSESTIPKPKDV 230
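
Each hit carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value) above its alignment. The following is a minimal sketch of reading that line into typed fields. The regular expression and the name parse_summary are illustrative assumptions based on the lines shown in this listing, not an NCBI-provided parser.

```python
import re

# Pattern inferred from the summary lines in this listing (assumption).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line: str) -> dict:
    """Parse one 'Pssm-ID: ...' summary line into a dictionary."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        # The bracketed flag is optional; absent means a plain domain hit.
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


hit = parse_summary(
    "Pssm-ID: 380940 [Multi-domain]  Cd Length: 245  Bit Score: 124.33  E-value: 7.03e-32"
)
print(hit)
```

Parsed records can then be sorted by their e_value field to reproduce the ordering of the hit list.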
SET_ASH1L cd19174
SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ...
605-721 8.22e-32

SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ASH1L (EC 2.1.1.43; also termed absent small and homeotic disks protein 1 homolog, KMT2H, or lysine N-methyltransferase 2H) acts as a histone-lysine N-methyltransferase that specifically methylates 'Lys-36' of histone H3 (H3K36me). It plays important roles in development; heterozygous mutation of ASH1L is associated with severe intellectual disability (ID) and multiple congenital anomalies (MCA).


Pssm-ID: 380951 [Multi-domain]  Cd Length: 141  Bit Score: 120.47  E-value: 8.22e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 605 HLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGK-VYDKYMSSFLFNLNNDFVVDATRKGNKIRFANHSV 683
Cdd:cd19174     1 GLERFRTEDKGWGVRTKEPIKAGQFIIEYVGEVVSEQEFRRRMIeQYHNHSHHYCLNLDSGMVIDGYRMGNEARFVNHSC 80
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 1007352648 684 NPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYS 721
Cdd:cd19174    81 DPNCEMQKWSVNGVYRIGLFALKDIPAGEELTYDYNFH 118
SET_EZH-like cd19168
SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb ...
604-718 2.97e-30

SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb repressive complex 2 (PRC2), and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both EZH1 and EZH2 can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380945  Cd Length: 124  Bit Score: 115.36  E-value: 2.97e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 604 KHLLLAPSDV-AGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRFANHS 682
Cdd:cd19168     1 KAVVLGKSQLeCGLGLFAAEDIKEGEFVIEYTGELISHDEGVRREHRRGDVSYLYLFEEQEGIWVDAAIYGNLSRYINHA 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 1007352648 683 VNP----NCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd19168    81 TDKvktgNCMPKIMYVNHEWRIKFTAIKDIKIGEELFFNY 120
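
Alignment rows carry a start position, the residues, and an end position, so each row can be checked for internal consistency and the covered fraction of the model can be estimated. The following is a minimal sketch. The row pattern and the helper names (parse_row, row_is_consistent, model_coverage) are hypothetical and inferred from the rows in this listing.

```python
import re

# Row layout inferred from this listing: label, start, sequence, end.
ROW_RE = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)\s*$"
)


def parse_row(line: str):
    """Split one alignment row into (label, start, sequence, end)."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return m["label"], int(m["start"]), m["seq"], int(m["end"])


def row_is_consistent(line: str) -> bool:
    """True when end - start + 1 equals the number of residues (gaps excluded)."""
    _, start, seq, end = parse_row(line)
    residues = sum(ch.isalpha() for ch in seq)
    return end - start + 1 == residues


def model_coverage(first_pos: int, last_pos: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned positions."""
    return (last_pos - first_pos + 1) / cd_length


rows = [
    "gi 1007352648 683 VNP----NCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDY 718",
    "Cdd:cd19168    81 TDKvktgNCMPKIMYVNHEWRIKFTAIKDIKIGEELFFNY 120",
]
print([row_is_consistent(r) for r in rows])
# cd19168 is aligned over model positions 1-120 with Cd Length 124.
print(f"model coverage: {model_coverage(1, 120, 124):.1%}")
```

A row that fails the check usually points to a copy or extraction error in the residues or in the numbering.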
SET_NSD cd19173
SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, ...
615-718 3.32e-30

SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, NSD2, NSD3 and similar proteins; The nuclear receptor-binding SET domain (NSD) family of histone H3 lysine 36 methyltransferases comprises NSD1, NSD2, and NSD3, which are primarily known to be involved in chromatin integrity and gene expression through mono-, di-, or trimethylation of lysine 36 of histone H3 (H3K36). NSD1 (EC 2.1.1.43; also termed histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B) or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or Wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as a histone-lysine N-methyltransferase with histone H3 'Lys-27' (H3K27me) methyltransferase activity. NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3.


Pssm-ID: 380950 [Multi-domain]  Cd Length: 142  Bit Score: 115.88  E-value: 3.32e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 615 GWGTFIKESVQKNEFISEYCGELISQDEADRR-GKVYDKYMSSFLF-NLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 692
Cdd:cd19173    13 GWGLRTKRDIKKGDFVIEYVGELIDEEECRRRlKKAHENNITNFYMlTLDKDRIIDAGPKGNLSRFMNHSCQPNCETQKW 92
                          90       100
                  ....*....|....*....|....*.
gi 1007352648 693 MVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd19173    93 TVNGDTRVGLFAVRDIPAGEELTFNY 118
SET_KMT2C_2D cd19171
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), ...
603-720 1.94e-29

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), 2D (KMT2D) and similar proteins; This family includes KMT2C and KMT2D. Both KMT2C (also termed HALR or MLL3) and KMT2D (also termed ALR or MLL2) act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me). They are subunits of the MLL2/3 complex, a coactivator complex of nuclear receptors involved in transcriptional coactivation.


Pssm-ID: 380948 [Multi-domain]  Cd Length: 153  Bit Score: 114.45  E-value: 1.94e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 603 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK-----YMssflFNLNNDFVVDATRKGNKIR 677
Cdd:cd19171    13 RSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGEIIRNEVANRREKIYESqnrgiYM----FRIDNDWVIDATMTGGPAR 88
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 1007352648 678 FANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 720
Cdd:cd19171    89 YINHSCNPNCVAEVVTFDKEKKIIIISNRRIAKGEELTYDYKF 131
SET_ASHR3-like cd19175
SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 ...
608-722 1.99e-29

SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins; This family includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3, also termed protein SET DOMAIN GROUP 4 or protein stamen loss), ASH1 homolog 3 (ASHH3, also termed protein SET DOMAIN GROUP 7) and homolog 4 (ASHH4, also termed protein SET DOMAIN GROUP 24). They all function as histone-lysine N-methyltransferases (EC 2.1.1.43).


Pssm-ID: 380952 [Multi-domain]  Cd Length: 139  Bit Score: 113.67  E-value: 1.99e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 608 LAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY-DKYMSSF-LFNLNNDFVVDATRKGNKIRFANHSVNP 685
Cdd:cd19175     4 LVKTEKCGWGLVADEDINAGEFIIEYVGEVIDDKTCEERLWDMkHKGEKNFyMCEIDKDMVIDATFKGNLSRFINHSCDP 83
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1007352648 686 NCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQ 722
Cdd:cd19175    84 NCELQKWQVDGETRIGVFAIRDIKKGEELTYDYQFVQ 120
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
602-725 5.45e-29

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 112.36  E-value: 5.45e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 602 LKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRR---GKVYDKYmssfLFNLNNDFVVDATRKGNKIRF 678
Cdd:COG2940     4 LHPRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERRephKEPLHTY----LFELDDDGVIDGALGGNPARF 79
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 1007352648 679 ANHSVNPNCYAkvvmVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 725
Cdd:COG2940    80 INHSCDPNCEA----DEEDGRIFIVALRDIAAGEELTYDYGLDYDEE 122
SET_LegAS4-like cd10522
SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and ...
614-718 6.44e-29

SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and similar proteins; LegAS4 is a type IV secretion system effector of Legionella pneumophila. It contains a SET domain that is involved in the modification of Lys4 of histone H3 (H3K4) in the nucleolus of the host cell, thereby enhancing heterochromatic rDNA transcription. It also contains an ankyrin repeat domain of unknown function at its C-terminal region.


Pssm-ID: 380920 [Multi-domain]  Cd Length: 122  Bit Score: 111.66  E-value: 6.44e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 614 AGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNdFVVDATRKGNKIRFANHSVNPNCYAKVVM 693
Cdd:cd10522    13 NGLGLFAAETIAKGEFVGEYTGEVLDRWEEDRDSVYHYDPLYPFDLNGDI-LVIDAGKKGNLTRFINHSDQPNLELIVRT 91
                          90       100
                  ....*....|....*....|....*
gi 1007352648 694 VNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10522    92 LKGEQHIGFVAIRDIKPGEELFISY 116
SET_KMT2A_2B cd19170
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), ...
611-720 9.58e-29

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B) and similar proteins; This family includes KMT2A and KMT2B. Both KMT2A (also termed ALL-1 or CXXC7 or MLL or MLL1 or TRX1 or HRX) and KMT2B (also termed MLL4 or TRX2) act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380947 [Multi-domain]  Cd Length: 152  Bit Score: 112.10  E-value: 9.58e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 611 SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYD-KYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYA 689
Cdd:cd19170    21 SPIHGRGLFCKRNIDAGEMVIEYAGEVIRSVLTDKREKYYEsKGIGCYMFRIDDDEVVDATMHGNAARFINHSCEPNCYS 100
                          90       100       110
                  ....*....|....*....|....*....|.
gi 1007352648 690 KVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 720
Cdd:cd19170   101 RVVNIDGKKHIVIFALRRILRGEELTYDYKF 131
SET_SETMAR cd10544
SET domain (including pre-SET and post-SET domains) found in SET domain and mariner ...
549-718 1.32e-26

SET domain (including pre-SET and post-SET domains) found in SET domain and mariner transposase fusion protein (SETMAR) and similar proteins; SETMAR (also termed metnase) is a DNA-binding protein that is indirectly recruited to sites of DNA damage through protein-protein interactions. It has a sequence-specific DNA-binding activity recognizing the 19-mer core of the 5'-terminal inverted repeats (TIRs) of the Hsmar1 element and displays a DNA nicking and end joining activity. SETMAR also acts as a histone-lysine N-methyltransferase that methylates 'Lys-4' and 'Lys-36' of histone H3. It specifically mediates dimethylation of H3 'Lys-36' at sites of DNA double-strand break and may recruit proteins required for efficient DSB repair through non-homologous end-joining.


Pssm-ID: 380942 [Multi-domain]  Cd Length: 254  Bit Score: 109.31  E-value: 1.32e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 549 FPGCRCKTQ-CNTKQCPC-------YLA--------------VRECDpDLClTCGASehwdckvvsCKNCSIQRGLKKHL 606
Cdd:cd10544    24 FPGCDCKTSsCEPETCSClrkygpnYDDdgclldfdgkysgpVFECN-SMC-KCSES---------CQNRVVQNGLQFKL 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 607 LLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLN----NDFV----VDATRKGNKIRF 678
Cdd:cd10544    93 QVFKTPKKGWGLRTLEFIPKGRFVCEYAGEVIGFEEARRRTKSQTKGDMNYIIVLRehlsSGKVletfVDPTYIGNIGRF 172
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 1007352648 679 ANHSVNPNCYAKVVMVNGD-HRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10544   173 LNHSCEPNLFMVPVRVDSMvPKLALFAARDIVAGEELSFDY 213
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
615-718 1.68e-26

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probable functional importance. In addition to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 104.53  E-value: 1.68e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 615 GWGTFIKESVQKNEFISEYCGE-LISQDEADRRGKVYDKYM-----SSFLFNLNND--FVVDAT--RKGNKIRFANHSVN 684
Cdd:pfam00856   1 GRGLFATEDIPKGEFIGEYVEVlLITKEEADKRELLYYDKLelrlwGPYLFTLDEDseYCIDARalYYGNWARFINHSCD 80
                          90       100       110
                  ....*....|....*....|....*....|....
gi 1007352648 685 PNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:pfam00856  81 PNCEVRVVYVNGGPRIVIFALRDIKPGEELTIDY 114
SET_SUV39H1 cd10525
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
592-719 1.13e-25

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 1 (SUV39H1) and similar proteins; SUV39H1 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A (KMT1A), position-effect variegation 3-9 homolog (SUV39H), or Su(var)3-9 homolog 1) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380923 [Multi-domain]  Cd Length: 255  Bit Score: 106.90  E-value: 1.13e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 592 SCKNCSIQRGLKKHL-LLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLN---NDFVV 667
Cdd:cd10525    74 DCPNRVVQKGIQYDLcIFRTDNGRGWGVRTLEKIRKNSFVMEYVGEIITSEEAERRGQIYDRQGATYLFDLDyveDVYTV 153
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1007352648 668 DATRKGNKIRFANHSVNPNCYAKVVMV-NGDH---RIGIFAKRAIQAGEELFFDYR 719
Cdd:cd10525   154 DAAYYGNISHFVNHSCDPNLQVYNVFIdNLDErlpRIALFATRTIRAGEELTFDYN 209
SET_KMT2A cd19206
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) ...
611-725 2.96e-24

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) and similar proteins; KMT2A (EC 2.1.1.43; also termed lysine N-methyltransferase 2A, ALL-1, CXXC-type zinc finger protein 7 (CXXC7), myeloid/lymphoid or mixed-lineage leukemia (MLL), myeloid/lymphoid or mixed-lineage leukemia protein 1 (MLL1), trithorax-like protein (TRX1), or zinc finger protein HRX) acts as a histone methyltransferase that plays an essential role in early development and hematopoiesis. It is a catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) and acetylation of 'Lys-16' of histone H4 (H4K16ac).


Pssm-ID: 380983 [Multi-domain]  Cd Length: 154  Bit Score: 99.33  E-value: 2.96e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 611 SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYD-KYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYA 689
Cdd:cd19206    21 SPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDsKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYS 100
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 1007352648 690 KVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 725
Cdd:cd19206   101 RVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDA 136
SET_EHMT cd10543
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
551-718 3.06e-24

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase EHMT1, EHMT2 and similar proteins; This family includes EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both of which act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380941 [Multi-domain]  Cd Length: 231  Bit Score: 102.03  E-value: 3.06e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 551 GCRCKTQCNTKQCPCYLAVREC--DPDLCLTcGASEHWDCKVV-----------SCKNCSIQRGLKKHLLLAPSDVAGWG 617
Cdd:cd10543    26 TCSCRDDCSSDNCVCGRLSVRCwyDKEGRLL-PDFNKLDPPLIfecnracscwrNCRNRVVQNGIRYRLQLFRTRGMGWG 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 618 TFIKESVQKNEFISEYCGELISQDEADRRGKvydkymSSFLFNLNND----FVVDATRKGNKIRFANHSVNPNCYAKVVM 693
Cdd:cd10543   105 VRALQDIPKGTFVCEYIGELISDSEADSRED------DSYLFDLDNKdgetYCIDARRYGNISRFINHLCEPNLIPVRVF 178
                         170       180       190
                  ....*....|....*....|....*....|.
gi 1007352648 694 VngDH------RIGIFAKRAIQAGEELFFDY 718
Cdd:cd10543   179 V--EHqdlrfpRIAFFASRDIKAGEELGFDY 207
SET_SETD1A cd19204
SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and ...
603-724 3.23e-24

SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and similar proteins; SETD1A (EC 2.1.1.43), also termed lysine N-methyltransferase 2F, or Set1/Ash2 histone methyltransferase complex subunit SET1, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Human SET domain-containing protein 1A (hSETD1A) expression occurs at a high rate in hepatocellular carcinoma patients and controls tumor metastasis in breast cancer by activating MMP expression.


Pssm-ID: 380981 [Multi-domain]  Cd Length: 153  Bit Score: 99.33  E-value: 3.23e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 603 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATRKGNKIRFAN 680
Cdd:cd19204    13 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQVVADMREKRYVQegIGSSYLFRVDHDTIIDATKCGNLARFIN 92
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 1007352648 681 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 724
Cdd:cd19204    93 HCCTPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPIED 136
SET_SUV39H2 cd10532
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
593-719 4.08e-24

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 2 (SUV39H2) and similar proteins; SUV39H2 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B (KMT1B), or Su(var)3-9 homolog 2) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380930 [Multi-domain]  Cd Length: 243  Bit Score: 101.89  E-value: 4.08e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 593 CKNCSIQRGLKKHL-LLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLN---NDFVVD 668
Cdd:cd10532    73 CPNRVVQKGTQYSLcIFRTSNGRGWGVKTLQKIKKNSFVMEYVGEVITSEEAERRGQFYDSKGITYLFDLDyesDEFTVD 152
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1007352648 669 ATRKGNKIRFANHSVNPNCYA-KVVMVNGD---HRIGIFAKRAIQAGEELFFDYR 719
Cdd:cd10532   153 AARYGNVSHFVNHSCDPNLQVfNVFIDNLDtrlPRIALFSTRTIKAGEELTFDYQ 207
SET_SETD8 cd10528
SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2. ...
598-718 9.66e-24

SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2.1.1.43; also termed N-lysine methyltransferase KMT5A, H4-K20-HMTase KMT5A, lysine N-methyltransferase 5A, lysine-specific methylase 5A, PR/SET domain-containing protein 07, PR-Set7 or PR/SET07) is a nucleosomal histone-lysine N-methyltransferase that specifically monomethylates 'Lys-20' of histone H4 (H4K20me1). It plays a central role in the silencing of euchromatic genes.


Pssm-ID: 380926 [Multi-domain]  Cd Length: 141  Bit Score: 97.65  E-value: 9.66e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 598 IQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY------DKYMSSFLFNlNNDFVVDAT- 670
Cdd:cd10528    11 ILSGKEEGLKVIEIDGKGRGVIATRPFEKGDFVVEYHGDLITITEAKKREALYakdpstGCYMYYFQYK-GKTYCVDATk 89
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1007352648 671 ---RKGNKIrfaNHSV-NPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10528    90 esgRLGRLI---NHSKkKPNLKTKLLVIDGVPHLILVAKRDIKPGEELLYDY 138
SET_NSD2 cd19211
SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) ...
615-718 1.39e-23

SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) and similar proteins; NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or Wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as a histone-lysine N-methyltransferase with histone H3 'Lys-36' (H3K36me) methyltransferase activity. NSD2 has been shown to mediate di- and trimethylation of H3K36 and dimethylation of H4K20 in different systems, and has been characterized as a transcriptional repressor interacting with histone deacetylase HDAC1 and histone demethylase LSD1. NSD2 mediates constitutive NF-kappaB signaling for cancer cell proliferation, survival and tumor growth. It is highly overexpressed in several types of human cancers, including small-cell lung cancers, neuroblastoma, carcinomas of stomach and colon, and bladder cancers, and its overexpression tends to be associated with tumor aggressiveness. WHSC1 is frequently deleted in Wolf-Hirschhorn syndrome (WHS).


Pssm-ID: 380988 [Multi-domain]  Cd Length: 142  Bit Score: 96.98  E-value: 1.39e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 615 GWGTFIKESVQKNEFISEYCGELISQDEA-DRRGKVYDKYMSSF-LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 692
Cdd:cd19211    13 GWGLIAKRDIKKGEFVNEYVGELIDEEECmARIKHAHENDITHFyMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKW 92
                          90       100
                  ....*....|....*....|....*.
gi 1007352648 693 MVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd19211    93 TVNGDTRVGLFAVCDIPAGTELTFNY 118
SET_NSD3 cd19212
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
615-718 1.70e-23

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 3 (NSD3) and similar proteins; NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3. NSD3 is amplified and overexpressed in multiple cancer types, including acute myeloid leukemia (AML), breast, lung, pancreatic and bladder cancers, as well as squamous cell carcinoma of the head and neck (SCCHN). NSD3 contributes to tumorigenesis by interacting with bromodomain-containing protein 4 (BRD4), a bromodomain and extraterminal (BET) protein that is a potential therapeutic target in AML. NSD3 is amplified in primary tumors and cell lines from breast carcinoma, and can promote the cell viability of small-cell lung cancer and pancreatic ductal adenocarcinoma. High NSD3 expression is implicated in poor grade and heavy smoking history in SCCHN. Thus, NSD3 may serve as a potential druggable target for selective cancer therapy.


Pssm-ID: 380989 [Multi-domain]  Cd Length: 142  Bit Score: 96.92  E-value: 1.70e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 615 GWGTFIKESVQKNEFISEYCGELISQDEADRRGK-VYDKYMSSF-LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 692
Cdd:cd19212    13 GWGLRTKRSIKKGEFVNEYVGELIDEEECRLRIKrAHENSVTNFyMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKW 92
                          90       100
                  ....*....|....*....|....*.
gi 1007352648 693 MVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd19212    93 TVNGDVRVGLFALCDIPAGMELTFNY 118
SET_SETD1B cd19205
SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and ...
603-724 6.80e-23

SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and similar proteins; SETD1B (EC 2.1.1.43), also termed lysine N-methyltransferase 2G, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Loss of SETD1B occurs in up to half of gastric and colorectal cancers, most commonly via SETD1B mutations, while de novo variants in SETD1B are associated with intellectual disability, epilepsy and autism.


Pssm-ID: 380982 [Multi-domain]  Cd Length: 153  Bit Score: 95.51  E-value: 6.80e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 603 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATRKGNKIRFAN 680
Cdd:cd19205    13 KKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDegIGSSYMFRVDHDTIIDATKCGNFARFIN 92
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 1007352648 681 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 724
Cdd:cd19205    93 HSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 136
SET_NSD1 cd19210
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
615-718 4.15e-22

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 1 (NSD1) and similar proteins; NSD1 (EC 2.1.1.43; also termed Histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B), or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD1 is altered in approximately 10% of head and neck cancer patients, with a 55% decrease in the risk of death in NSD1-mutated versus non-mutated patients; its disruption promotes favorable chemotherapeutic responses linked to hypomethylation.


Pssm-ID: 380987 [Multi-domain]  Cd Length: 142  Bit Score: 93.07  E-value: 4.15e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 615 GWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKY-MSSF-LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 692
Cdd:cd19210    13 GWGLRCKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHdITNFyMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKW 92
                          90       100
                  ....*....|....*....|....*.
gi 1007352648 693 MVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd19210    93 TVNGDTRVGLFALCDIKAGTELTFNY 118
SET_KMT2C cd19208
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) ...
603-724 1.21e-21

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) and similar proteins; KMT2C (EC 2.1.1.43; also termed lysine N-methyltransferase 2C, homologous to ALR protein (HALR), or myeloid/lymphoid or mixed-lineage leukemia protein 3 (MLL3)) acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me) and may be involved in leukemogenesis and developmental disorders. KMT2C is a catalytic subunit of the MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation. Overexpression of KMT2C is associated with estrogen receptor-positive breast cancer; KMT2C mediates the estrogen dependence of breast cancer through regulation of estrogen receptor alpha (ERalpha) enhancer function. KMT2C is frequently mutated in certain populations with diffuse-type gastric adenocarcinomas (DGA); its loss promotes epithelial-to-mesenchymal transition (EMT) and is associated with worse overall survival.


Pssm-ID: 380985 [Multi-domain]  Cd Length: 154  Bit Score: 92.00  E-value: 1.21e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 603 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYD-KYMSSFLFNLNNDFVVDATRKGNKIRFANH 681
Cdd:cd19208    14 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYEsQNRGVYMFRIDNDHVIDATLTGGPARYINH 93
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 1007352648 682 SVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 724
Cdd:cd19208    94 SCAPNCVAEVVTFEKGHKIIISSSRRIQKGEELCYDYKFDFED 136
SET_KMT2B cd19207
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) ...
611-725 8.90e-21

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) and similar proteins; KMT2B (EC 2.1.1.43; also termed lysine N-methyltransferase 2B, myeloid/lymphoid or mixed-lineage leukemia protein 4 (MLL2/MLL4), trithorax homolog 2 (TRX2), or WW domain-binding protein 7 (WBP-7)) acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is required during the transcriptionally active period of oocyte growth for the establishment and/or maintenance of bulk H3K4 trimethylation (H3K4me3), global transcriptional silencing that precedes resumption of meiosis, oocyte survival and normal zygotic genome activation.


Pssm-ID: 380984 [Multi-domain]  Cd Length: 154  Bit Score: 89.70  E-value: 8.90e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 611 SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYD-KYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYA 689
Cdd:cd19207    21 SAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDsKGIGCYMFRIDDFDVVDATMHGNAARFINHSCEPNCYS 100
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 1007352648 690 KVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 725
Cdd:cd19207   101 RVIHVEGQKHIVIFALRKIYRGEELTYDYKFPIEDA 136
SET_KMT2D cd19209
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) ...
603-724 1.45e-20

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) and similar proteins; KMT2D (EC 2.1.1.43; also termed lysine N-methyltransferase 2D, ALL1-related protein (ALR), or myeloid/lymphoid or mixed-lineage leukemia protein 2 (MLL2)) acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is a coactivator for estrogen receptor by being recruited by ESR1, thereby activating transcription. KMT2D is a subunit of the MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation.


Pssm-ID: 380986 [Multi-domain]  Cd Length: 155  Bit Score: 88.98  E-value: 1.45e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 603 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSS-FLFNLNNDFVVDATRKGNKIRFANH 681
Cdd:cd19209    15 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGiYMFRINNEHVIDATLTGGPARYINH 94
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 1007352648 682 SVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 724
Cdd:cd19209    95 SCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFED 137
SET_EHMT2 cd10533
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
552-718 1.97e-20

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 2 (EHMT2) and similar proteins; EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C (KMT1C), or protein G9a) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380931 [Multi-domain]  Cd Length: 239  Bit Score: 91.23  E-value: 1.97e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 552 CRCKTQCNTKQCPC-YLAVR--------------ECDPDLCLTCG-ASEHWDckvvSCKNCSIQRGLKKHLLLAPSDVAG 615
Cdd:cd10533    27 CTCVDDCSSSNCLCgQLSIRcwydkdgrllqefnKIEPPLIFECNqACSCWR----NCKNRVVQSGIKVRLQLYRTAKMG 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 616 WGTFIKESVQKNEFISEYCGELISQDEADRRGKvydkymSSFLFNLNND----FVVDATRKGNKIRFANHSVNPNCY-AK 690
Cdd:cd10533   103 WGVRALQTIPQGTFICEYVGELISDAEADVRED------DSYLFDLDNKdgevYCIDARYYGNISRFINHLCDPNIIpVR 176
                         170       180       190
                  ....*....|....*....|....*....|.
gi 1007352648 691 VVMVNGD---HRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10533   177 VFMLHQDlrfPRIAFFSSRDIRTGEELGFDY 207
SET_EHMT1 cd10535
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
537-718 5.91e-20

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 1 (EHMT1) and similar proteins; EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, or lysine N-methyltransferase 1D (KMT1D)) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380933 [Multi-domain]  Cd Length: 231  Bit Score: 89.61  E-value: 5.91e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 537 KFCQCNPDCQNRFPGC-----RCKTQCNTKQCPCYLAVrecDPDLCLTCG-ASEHWDckvvSCKNCSIQRGLKKHLLLAP 610
Cdd:cd10535    25 QYCVCIDDCSSSNCMCgqlsmRCWYDKDGRLLPEFNMA---EPPLIFECNhACSCWR----NCRNRVVQNGLRARLQLYR 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 611 SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKvydkymSSFLFNLNND----FVVDATRKGNKIRFANHSVNPN 686
Cdd:cd10535    98 TRDMGWGVRSLQDIPPGTFVCEYVGELISDSEADVREE------DSYLFDLDNKdgevYCIDARFYGNVSRFINHHCEPN 171
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 1007352648 687 CY-AKVVMVNGD---HRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10535   172 LVpVRVFMAHQDlrfPRIAFFSTRLIEAGEQLGFDY 207
SET_SUV39H_Clr4-like cd20073
SET domain (including pre-SET and post-SET domains) found in Schizosaccharomyces pombe H3K9 ...
583-718 1.28e-19

SET domain (including pre-SET and post-SET domains) found in Schizosaccharomyces pombe H3K9 methyltransferase Clr4 and similar proteins; This subfamily contains fission yeast Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (also known as Suv39h), the sole homolog of the mammalian SUV39H1 and SUV39H2 enzymes, which has a critical role in preventing aberrant heterochromatin formation. It is known to di- and tri-methylate Lys-9 of histone H3, a central heterochromatic histone modification, with its specificity profile most similar to that of the human SUV39H2 homolog.


Pssm-ID: 380999 [Multi-domain]  Cd Length: 259  Bit Score: 89.17  E-value: 1.28e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 583 SEHWDCKVvSCKNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLN 662
Cdd:cd20073    73 NENCDCGI-NCPNRVVQRGRKLPLEIFKTKHKGWGLRCPRFIKAGTFIGVYLGEVITQSEAEIRGKKYDNVGVTYLFDLD 151
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1007352648 663 -------NDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHR----IGIFAKRAIQAGEELFFDY 718
Cdd:cd20073   152 lfedqvdEYYTVDAQYCGDVTRFINHSCDPNLAIYSVLRDKSDSkiydLAFFAIKDIPALEELTFDY 218
SET_SETDB1 cd10517
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
592-722 4.09e-18

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes.


Pssm-ID: 380915 [Multi-domain]  Cd Length: 288  Bit Score: 85.42  E-value: 4.09e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 592 SCKNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY-DKYmssfLFNLN-------- 662
Cdd:cd10517   117 RCYNRVVQNGLQVRLQVFKTEKKGWGIRCLDDIPKGSFVCIYAGQILTEDEANEEGLQYgDEY----FAELDyievvekl 192
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1007352648 663 ----------NDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNG-DHR---IGIFAKRAIQAGEELFFDYRYSQ 722
Cdd:cd10517   193 kegyesdveeHCYIIDAKSEGNLGRYLNHSCSPNLFVQNVFVDThDLRfpwVAFFASRYIRAGTELTWDYNYEV 266
SET_SETD5-like cd10529
SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine ...
617-720 5.01e-17

SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. KMT2E (also termed inactive lysine N-methyltransferase 2E or myeloid/lymphoid or mixed-lineage leukemia protein 5 (MLL5)) associates with chromatin regions downstream of transcriptional start sites of active genes and thus regulates gene transcription. The family also includes Saccharomyces cerevisiae SET domain-containing proteins, SET3 and SET4, and Schizosaccharomyces pombe SET3. Most of these family members contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380927  Cd Length: 127  Bit Score: 77.70  E-value: 5.01e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 617 GTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKyMSSFLFNL----NNDFVVDATRKGNKIRFANHSVNPNCYAKVV 692
Cdd:cd10529    18 GLVATEDISPGEPILEYKGEVSLRSEFKEDNGFFKR-PSPFVFFYdgfeGLPLCVDARKYGNEARFIRRSCRPNAELRHV 96
                          90       100       110
                  ....*....|....*....|....*....|.
gi 1007352648 693 MV-NGDHRIGIFAKRAIQAGEELF--FDYRY 720
Cdd:cd10529    97 VVsNGELRLFIFALKDIRKGTEITipFDYDY 127
SET_SETDB cd10541
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), ...
544-720 1.03e-16

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), SET domain bifurcated 2 (SETDB2), and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380939 [Multi-domain]  Cd Length: 236  Bit Score: 80.28  E-value: 1.03e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 544 DCQNrfpGCRCKTQC--------NTKQCPC----------YLAVRECDPDLCLTCgaSEHWDCKVVSCKNCSIQRGLKKH 605
Cdd:cd10541    19 DCTD---GCRDKSKCachqltiqATACTPGgqdnptagyqYKRLEECLPTGVYEC--NKLCKCDPNMCQNRLVQHGLQVR 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 606 LLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY-DKYMSSFLFNLNNDFVVDATRKGNKIRFANHSVN 684
Cdd:cd10541    94 LQLFKTQNKGWGIRCLDDIAKGTFVCIYAGKILTDDFADKEGLEMgDEYFANLDHIEESCYIIDAKLEGNLGRYLNHSCS 173
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 1007352648 685 PNCYAKVVMVNGDHR----IGIFAKRAIQAGEELFFDYRY 720
Cdd:cd10541   174 PNLFVQNVFVDTHDLrfpwVAFFASKRIKAGTELTWDYNY 213
SET_AtSUVH-like cd10545
SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar ...
551-720 2.25e-16

SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar proteins; Arabidopsis thaliana SUVH protein (also termed suppressor of variegation 3-9 homolog protein) is a histone-lysine N-methyltransferase that methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression. Some family members contain a post-SET domain which binds a Zn2+ ion. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380943 [Multi-domain]  Cd Length: 232  Bit Score: 78.98  E-value: 2.25e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 551 GCRCKTQC--NTKQCPC--------------YLAVR-----ECDPdLClTCGASehwdckvvsCKNCSIQRGLKKHLLLA 609
Cdd:cd10545    23 GCDCKNRCtdGASDCACvkknggeipynfngRLIRAkpaiyECGP-LC-KCPPS---------CYNRVTQKGLRYRLEVF 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 610 PSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKvYDKYmssfLFNLNN-------------------------- 663
Cdd:cd10545    92 KTAERGWGVRSWDSIPAGSFICEYVGELLDTSEADTRSG-NDDY----LFDIDNrqtnrgwdggqrldvgmsdgerssae 166
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1007352648 664 -----DFVVDATRKGNKIRFANHSVNPNCYAKVVMVngDH------RIGIFAKRAIQAGEELFFDYRY 720
Cdd:cd10545   167 deessEFTIDAGSFGNVARFINHSCSPNLFVQCVLY--DHndlrlpRVMLFAADNIPPLQELTYDYGY 232
SET cd08161
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, ...
677-719 1.67e-15

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, Enhancer-of-zeste, Trithorax (SET) domain superfamily corresponds to SET domain-containing lysine methyltransferases, which catalyze site and state-specific methylation of lysine residues in histones that are fundamental in epigenetic regulation of gene activation and silencing in eukaryotic organisms. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains has been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as N-SET and C-SET. C-SET forms an unusual and conserved knot-like structure of probable functional importance. In addition to N-SET and C-SET, an insert region (I-SET) and flanking regions of high structural variability form part of the overall structure. Some family members contain a pre-SET domain, which is found in a number of histone methyltransferases (HMTase), and a post-SET domain, which harbors a zinc-binding site.


Pssm-ID: 380914 [Multi-domain]  Cd Length: 72  Bit Score: 71.51  E-value: 1.67e-15
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1007352648 677 RFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYR 719
Cdd:cd08161    30 RFINHSCEPNCEFEEVYVGGKPRVFIVALRDIKAGEELTVDYG 72
EZH2_WD-Binding pfam11616
WD repeat binding protein EZH2; This family of proteins represents Enhancer of zeste homolog 2, ...
39-68 2.49e-12

WD repeat binding protein EZH2; This family of proteins represents Enhancer of zeste homolog 2 (EZH2), a 30-residue peptide that binds to a WD-repeat domain of EED through residues 39-68. EED is a component of the PRC2 complex, which is involved in gene expression. This interaction is required for the HMTase activity of PRC2.


Pssm-ID: 463308  Cd Length: 30  Bit Score: 61.31  E-value: 2.49e-12
                          10        20        30
                  ....*....|....*....|....*....|
gi 1007352648  39 KALYVANFAKVQEKTQILNEEWKKLRVQPV 68
Cdd:pfam11616   1 KSLFVSNRQKIQERTELLNEEWKKLRIQPI 30
SET_SUV39H_DIM5-like cd19473
SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; ...
592-734 1.87e-11

SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; This subfamily contains Neurospora crassa DIM-5 (also termed H3-K9-HMTase dim-5, or HKMT) which functions as histone-lysine N-methyltransferase that specifically trimethylates histone H3 to form H3K9me3.


Pssm-ID: 380996 [Multi-domain]  Cd Length: 274  Bit Score: 65.42  E-value: 1.87e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 592 SCKNCSIQRGLKKHLLLAP-SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGK---------VY----DKY--MS 655
Cdd:cd19473    93 DCPNRVVERGRKVPLQIFRtSDGRGWGVRSTVDIKRGQFVDCYVGEIITPEEAQRRRDaatiaqrkdVYlfalDKFsdPD 172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 656 SFLFNLNND-FVVDATRKGNKIRFANHSVNPN--CYAKVvmvnGD------HRIGIFAKRAIQAGEELFFDYRYSQADAL 726
Cdd:cd19473   173 SLDPRLRGDpYEIDGEFMSGPTRFINHSCDPNlrIFARV----GDhadkhiHDLAFFAIKDIPRGTELTFDYVDGVTGLD 248

                  ....*...
gi 1007352648 727 KYVGIERE 734
Cdd:cd19473   249 DDAGDEEK 256
SET_SETD7 cd10530
SET domain found in SET domain-containing protein 7 (SETD7) and similar proteins; SETD7 (EC 2. ...
604-720 5.18e-11

SET domain found in SET domain-containing protein 7 (SETD7) and similar proteins; SETD7 (EC 2.1.1.43; also termed histone H3-K4 methyltransferase SETD7, H3-K4-HMTase SETD7, lysine N-methyltransferase 7 (KMT7) or SET7/9) is a histone-lysine N-methyltransferase that specifically monomethylates 'Lys-4' of histone H3. It plays a central role in the transcriptional activation of genes such as collagenase or insulin. Set7/9 also methylates non-histone proteins, including estrogen receptor alpha (ERa), suggesting it has a role in diverse biological processes. ERa methylation by Set7/9 stabilizes ERa and activates its transcriptional activities, which are involved in the carcinogenesis of breast cancer. In a high-throughput screen, treatment of human breast cancer cells (MCF7 cells) with cyproheptadine, a Set7/9 inhibitor, decreased the expression and transcriptional activity of ERa, thereby inhibiting estrogen-dependent cell growth.


Pssm-ID: 380928  Cd Length: 130  Bit Score: 60.78  E-value: 5.18e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 604 KHLLLAPSDV--AGWGTFIKESVQKNEFISEYCGELISQDEADRRgkvyDKYMSSFLFNLNNDFVVDATRKGNKIRF--- 678
Cdd:cd10530     7 ERVYVAESLIpsAGEGLFAKVAVGPNTVMSFYNGVRITHQEVDSR----DWSLNGNTISLDEETVIDVPEPYNSVSKyca 82
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1007352648 679 -----ANHSVNPNC-YAKVVmvngdH-RIG----IFAKRAIQAGEELFFDYRY 720
Cdd:cd10530    83 slghkANHSFTPNCiYDPFV-----HpRFGpikcIRTLRAVEAGEELTVAYGY 130
SET_SETDB2 cd10523
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) ...
588-720 2.88e-10

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) and similar proteins; SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380921 [Multi-domain]  Cd Length: 266  Bit Score: 61.77  E-value: 2.88e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 588 CKVVSCKNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQ--------------DEADRRGKVYDKY 653
Cdd:cd10523    92 CNRMLCQNRVVQHGLQVRLQVFKTEKKGWGVRCLDDIDKGTFVCIYAGRVLSRarspteplppklelPSENEVEVVTSWL 171
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1007352648 654 MSSFLFNLN-NDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHR----IGIFAKRAIQAGEELFFDYRY 720
Cdd:cd10523   172 ILSKKRKLReNVCFLDASKEGNVGRFLNHSCCPNLFVQNVFVDTHDKnfpwVAFFTNRVVKAGTELTWDYSY 243
preSET_CXC pfam18264
CXC domain; This domain is found N-terminal to the SET domain in the EZH2 protein. It ...
551-582 5.89e-10

CXC domain; This domain is found N-terminal to the SET domain in the EZH2 protein. It is a zinc-binding domain.


Pssm-ID: 408079  Cd Length: 32  Bit Score: 54.84  E-value: 5.89e-10
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1007352648 551 GCRCKTQCNTKQCPCYLAVRECDPDLCLTCGA 582
Cdd:pfam18264   1 GCSCRATCYTKACLCYRANRECDPDLCNMCGA 32
SET_SpSet7-like cd10540
SET domain found in Schizosaccharomyces pombe Set7 and similar proteins; Schizosaccharomyces ...
605-718 1.85e-06

SET domain found in Schizosaccharomyces pombe Set7 and similar proteins; Schizosaccharomyces pombe Set7 is a novel histone-lysine N-methyltransferase. The family also includes a viral histone H3 lysine 27 methyltransferase from Paramecium bursaria Chlorella virus 1 (PBCV-1).


Pssm-ID: 380938  Cd Length: 112  Bit Score: 47.25  E-value: 1.85e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 605 HLLLAPSDVAGWGTFIKESVQKNEFIsEYCGELISQDEAdrrgkvYDKYMSSFLFNLNndFVVDATRKGNKIRF---ANH 681
Cdd:cd10540     1 RLEVKPSTLKGRGVFATRPIKKGEVI-EEAPVIVLPKEE------YQHLCKTVLDHYV--FSWGDGCLALALGYgsmFNH 71
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1007352648 682 SVNPNCYakVVMVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10540    72 SYTPNAE--YEIDFENQTIVFYALRDIEAGEELTINY 106
SET_SpSET3-like cd19183
SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET ...
617-718 2.28e-06

SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET domain-containing protein 3 (SETD3) and similar proteins; Schizosaccharomyces pombe SETD3 functions as a transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. It is required for both gene activation and repression.


Pssm-ID: 380960  Cd Length: 173  Bit Score: 48.55  E-value: 2.28e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 617 GTFIKESVQKNEFISEYCGELISQDE--ADRRgkvyDKYMSSF------LFNLNNDFVVDATRKGNKIRFANHSVNPNCy 688
Cdd:cd19183    15 GLFADRPIPAGDPIQELLGEIGLQSEyiADPE----NQYQILGapkphvFFHPQSPLYIDTRRSGSVARFIRRSCRPNA- 89
                          90       100       110
                  ....*....|....*....|....*....|....
gi 1007352648 689 aKVVMV----NGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd19183    90 -ELVTVasdsGSVLKFVLYASRDISPGEEITIGW 122
SET_SMYD cd20071
SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing ...
678-718 8.69e-06

SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing protein, and similar proteins; The family includes SET and MYND domain-containing proteins, SMYD1-SMYD5. SMYD1 (EC 2.1.1.43; also termed BOP) is a heart and muscle specific SET-MYND domain containing protein, which functions as a histone methyltransferase and regulates downstream gene transcription. It methylates histone H3 at 'Lys-4' (H3K4me) and seems able to perform mono-, di-, and trimethylation. SMYD2 (also termed HSKM-B, or lysine N-methyltransferase 3C (KMT3C)) functions as a histone methyltransferase that methylates both histones and non-histone proteins, including p53/TP53 and RB1. It specifically methylates histone H3 'Lys-4' (H3K4me) and dimethylates histone H3 'Lys-36' (H3K36me2). SMYD3 (also termed zinc finger MYND domain-containing protein 1) functions as a histone methyltransferase that specifically methylates 'Lys-4' of histone H3, inducing di- and tri-methylation, but not monomethylation. It also methylates 'Lys-5' of histone H4. SMYD3 plays an important role in transcriptional activation as a member of an RNA polymerase complex. SMYD4 functions as a potential tumor suppressor that plays a critical role in breast carcinogenesis at least partly through inhibiting the expression of PDGFR-alpha. SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as a histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions.


Pssm-ID: 380997 [Multi-domain]  Cd Length: 122  Bit Score: 45.45  E-value: 8.69e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 1007352648 678 FANHSVNPNCyakVVMVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd20071    58 LLNHSCDPNA---VVVFDGNGTLRVRALRDIKAGEELTISY 95
SET_Suv4-20-like cd10524
SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of ...
621-718 1.12e-05

SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of variegation 4-20 (Suv4-20) and similar proteins; Suv4-20 (also termed Su(var)4-20) is a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-20' of histone H4. It acts as a dominant suppressor of position-effect variegation. The family also includes Suv4-20 homologs, lysine N-methyltransferase 5B (KMT5B) and lysine N-methyltransferase 5C (KMT5C). Both KMT5B (also termed lysine-specific methyltransferase 5B, or suppressor of variegation 4-20 homolog 1, or Su(var)4-20 homolog 1, or Suv4-20h1) and KMT5C (also termed lysine-specific methyltransferase 5C, or suppressor of variegation 4-20 homolog 2, or Su(var)4-20 homolog 2, or Suv4-20h2) are histone methyltransferases that specifically trimethylate 'Lys-20' of histone H4 (H4K20me3). They play central roles in the establishment of constitutive heterochromatin in pericentric heterochromatin regions.


Pssm-ID: 380922 [Multi-domain]  Cd Length: 141  Bit Score: 45.73  E-value: 1.12e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 621 KESVQKNEFISEYCGELISQDEADRRgkvydkymssFLFNLNNDF-VVDATRKGNK------IRFANHSVNPNCyakVVM 693
Cdd:cd10524    25 TKPIKKGEKIHELCGCIAELSEEEEA----------LLRPGGNDFsVMYSSRKKCSqlwlgpAAFINHDCRPNC---KFV 91
                          90       100
                  ....*....|....*....|....*
gi 1007352648 694 VNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10524    92 PTGKSTACVKVLRDIEPGEEITVYY 116
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
424-465 9.68e-05

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 40.25  E-value: 9.68e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1007352648 424 EWTGAEESLFRVFHGTY-FNNFCSIARLLGTKTCKQVFQFAVK 465
Cdd:cd00167     1 PWTEEEDELLLEAVKKYgKNNWEKIAKELPGRTPKQCRERWRN 43
SET_ATXR5_6-like cd10539
SET domain found in fungal protein lysine methyltransferase SET5 and similar proteins; The ...
629-718 3.09e-04

SET domain found in fungal protein lysine methyltransferase SET5 and similar proteins; The family includes Arabidopsis thaliana ATXR5 and ATXR6. Both ATXR5 (also termed protein SET DOMAIN GROUP 15, or TRX-related protein 5) and ATXR6 (also termed protein SET DOMAIN GROUP 34, or TRX-related protein 6) function as histone methyltransferases that specifically monomethylate 'Lys-27' of histone H3 (H3K27me1). They are required for chromatin structure and gene silencing.


Pssm-ID: 380937  Cd Length: 138  Bit Score: 41.63  E-value: 3.09e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 629 FISEYCGEL-----ISQDEADRrgkvydkyMSSFLFNLNND--FVVDATRKGNKIRFA----NHSVN----PNCYAKVVM 693
Cdd:cd10539    29 IIAEYTGDVdyirnREFDDNDS--------IMTLLLAGDPSksLVICPDKRGNIARFIsginNHTKDgkkkQNCKCVRYS 100
                          90       100
                  ....*....|....*....|....*
gi 1007352648 694 VNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10539   101 INGEARVLLVATRDIAKGERLYYDY 125
SANT smart00717
SANT ('SWI3, ADA2, N-CoR and TFIIIB') DNA-binding domains;
424-465 6.62e-04

SANT ('SWI3, ADA2, N-CoR and TFIIIB') DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 37.97  E-value: 6.62e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1007352648  424 EWTGAEESLFRVFHGTY-FNNFCSIARLLGTKTCKQVFQFAVK 465
Cdd:smart00717   3 EWTEEEDELLIELVKKYgKNNWEKIAKELPGRTAEQCRERWRN 45
SET_SETD5 cd19181
SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and ...
627-721 1.26e-03

SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. SETD5 loss-of-function mutations are a likely cause of a familial syndromic intellectual disability with variable phenotypic expression.


Pssm-ID: 380958  Cd Length: 150  Bit Score: 39.99  E-value: 1.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1007352648 627 NEFISEYCGELISQDEADRRGKVYDKYMSSFLF--NLNN-DFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIF 703
Cdd:cd19181    30 DTLIIEYRGKVMLRQQFEVNGHFFKRPYPFVLFysKFNGvEMCVDARTFGNDARFIRRSCTPNAEVRHMIADGMIHLCIY 109
                          90       100
                  ....*....|....*....|
gi 1007352648 704 AKRAIQAGEE--LFFDYRYS 721
Cdd:cd19181   110 AVAAIAKDAEvtIAFDYEYS 129
SET_LSMT cd10527
SET domain found in Rubisco large subunit methyltransferase (LSMT) and similar proteins; ...
678-718 4.71e-03

SET domain found in Rubisco large subunit methyltransferase (LSMT) and similar proteins; Rubisco LSMT is a non-histone protein methyltransferase responsible for the trimethylation of 'Lys-14' in the large subunit of Rubisco (ribulose-1,5-bisphosphate carboxylase/oxygenase). The family also includes the SET domain-containing proteins SETD3, SETD4 and SETD6, which belong to methyltransferase class VII, the classical non-histone SET domain methyltransferases. Members of this family contain a SET domain and a C-terminal RubisCO LSMT substrate-binding (Rubis-subs-bind) domain.


Pssm-ID: 380925 [Multi-domain]  Cd Length: 236  Bit Score: 39.35  E-value: 4.71e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 1007352648 678 FANHSVN-PNCyaKVVMVNGDHRIGIFAKRAIQAGEELFFDY 718
Cdd:cd10527   182 MLNHSPDaPNV--RYEYDEDEGSFVLVATRDIAAGEEVFISY 221
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
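
Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search reports only hits under the E-value threshold of 0.01. As an illustration only, the sketch below parses one such line from this page and checks it against that threshold. The regular expression and the function name `parse_summary` are assumptions made for this example, not an NCBI-provided format specification or API.

```python
import re

# Pattern written for the summary lines as they appear on this page (an assumption).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>\S+)"
)

E_VALUE_THRESHOLD = 0.01  # from the preset options listed above


def parse_summary(line):
    """Parse one hit summary line into a dict, or return None if it does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "evalue": float(match.group("evalue")),
    }


# Summary line copied from the cd20071 hit on this page.
line = "Pssm-ID: 380997 [Multi-domain]  Cd Length: 122  Bit Score: 45.45  E-value: 8.69e-06"
hit = parse_summary(line)
print(hit, hit["evalue"] < E_VALUE_THRESHOLD)
```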
