NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|281366755|ref|NP_001163851|]
View 

SET domain containing 1, isoform I [Drosophila melanogaster]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SET_SETD1 cd19169
SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and ...
1490-1637 5.39e-110

SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A) and SET domain-containing protein 1B (SETD1B). These proteins are histone-lysine N-methyltransferases that specifically methylate 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated.


:

Pssm-ID: 380946  Cd Length: 148  Bit Score: 345.09  E-value: 5.39e-110
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1490 ELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDA 1569
Cdd:cd19169     1 DLLKFNQLKFRKKQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQVIRQSVADEREKRYEAIGIGSSYLFRVDDDTIIDA 80
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755 1570 TKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCR 1637
Cdd:cd19169    81 TKCGNLARFINHSCNPNCYAKIITVESQKKIVIYSKRPIAVNEEITYDYKFPIEDEKIPCLCGAPQCR 148
RRM_Set1 cd12304
RNA recognition motif in the Set1-like family of histone-lysine N-methyltransferases; This ...
99-191 7.50e-46

RNA recognition motif in the Set1-like family of histone-lysine N-methyltransferases; This subfamily corresponds to the RRM of the Set1-like family of histone-lysine N-methyltransferases which includes Set1A and Set1B that are ubiquitously expressed vertebrates histone methyltransferases exhibiting high homology to yeast Set1. Set1A and Set1B proteins exhibit a largely non-overlapping subnuclear distribution in euchromatic nuclear speckles, strongly suggesting that they bind to a unique set of target genes and thus make non-redundant contributions to the epigenetic control of chromatin structure and gene expression. With the exception of the catalytic component, the subunit composition of the Set1A and Set1B histone methyltransferase complexes are identical. Each complex contains six human homologs of the yeast Set1/COMPASS complex, including Set1A or Set1B, Ash2 (homologous to yeast Bre2), CXXC finger protein 1 (CFP1; homologous to yeast Spp1), Rbbp5 (homologous to yeast Swd1), Wdr5 (homologous to yeast Swd3), and Wdr82 (homologous to yeast Swd2). The genomic targeting of these complexes is determined by the identity of the catalytic subunit present in each histone methyltransferase complex. Thus, the Set1A and Set1B complexes may exhibit both overlapping and non-redundant properties. Both Set1A and Set1B contain an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), an N- SET domain, and a C-terminal catalytic SET domain followed by a post-SET domain. In contrast to Set1B, Set1A additionally contains an HCF-1 binding motif that interacts with HCF-1 in vivo.


:

Pssm-ID: 409745 [Multi-domain]  Cd Length: 93  Bit Score: 159.82  E-value: 7.50e-46
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   99 PAVEVTIVNLNDNIDKQFLASMLDKCGTSDEINIYHHPITNKHLGIARIVFDSTKGARQFVEKYNQKSVMGKILDVFCDP 178
Cdd:cd12304     1 PPREVTFANLNDNINEGFLKDMCKKYGEVEEVKIYFHPKTGKHLGLARVVFDTTKGAKDCVEKLNQTSVMGNIIHVFLDP 80
                          90
                  ....*....|...
gi 281366755  179 FGATLKKSLESLT 191
Cdd:cd12304    81 KGRIIARLYEDIV 93
N-SET super family cl13245
COMPASS (Complex proteins associated with Set1p) component N; The n-SET or N-SET domain is a ...
1352-1496 5.27e-26

COMPASS (Complex proteins associated with Set1p) component N; The n-SET or N-SET domain is a component of the COMPASS complex, associated with SET1, conserved in yeasts and in other eukaryotes up to humans. The COMPASS complex functions to methylate the fourth lysine of Histone 3 and for the silencing of genes close to the telomeres of chromosomes. This domain promotes trimethylation in conjunction with an RRM domain and is necessary for binding of the Spp1 component of COMPASS into the complex.


The actual alignment was detected with superfamily member pfam11764:

Pssm-ID: 463344  Cd Length: 172  Bit Score: 106.22  E-value: 5.27e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755  1352 DAEDINFIKMSYLDHLHKEPYAMFLNNTHWVDHCTTDRAF--WPPPSKKRRK-DDELIRHKTGCARTEGFYKLDVREKAK 1428
Cdd:pfam11764   15 DDEDLKFLKKVLADLLQDDNSSDIGNLEYWAWKQKEIKALnrLRGPTEKEKKiEGYYVPNPTGCARTEGYKKIPDSEKSK 94
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755  1429 HKYH---YAKA-NTEDSFNEDRSDEPTALTNHHHNKLISK-MQGISREARSNQRRLLTAFGSM-----GESELLKFNQ 1496
Cdd:pfam11764   95 YLPHrrkVQKPrETRQAHAKEDPAAAAAAAALAAEKSVSKgEISSSRVNRANNRRFVADINAQkaalgTESDLLSLNQ 172
U2AF_lg super family cl36941
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
255-386 6.81e-08

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


The actual alignment was detected with superfamily member TIGR01642:

Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 57.21  E-value: 6.81e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   255 RDKERERDRhfkERSR-HSSERSYDRDRGMREnvgtsirrrrtfyrrrssdiSPEDSRDILIMTRERSRDSDSRPRDYCR 333
Cdd:TIGR01642    1 RDEEPDRER---EKSRgRDRDRSSERPRRRSR--------------------DRSRFRDRHRRSRERSYREDSRPRDRRR 57
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 281366755   334 SRERESFRDRKRSHEKGRDQPReKREHYYNSSKDREYRGRDRDRSAEIDQRDR 386
Cdd:TIGR01642   58 YDSRSPRSLRYSSVRRSRDRPR-RRSRSVRSIEQHRRRLRDRSPSNQWRKDDK 109
 
Name Accession Description Interval E-value
SET_SETD1 cd19169
SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and ...
1490-1637 5.39e-110

SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A) and SET domain-containing protein 1B (SETD1B). These proteins are histone-lysine N-methyltransferases that specifically methylate 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated.


Pssm-ID: 380946  Cd Length: 148  Bit Score: 345.09  E-value: 5.39e-110
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1490 ELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDA 1569
Cdd:cd19169     1 DLLKFNQLKFRKKQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQVIRQSVADEREKRYEAIGIGSSYLFRVDDDTIIDA 80
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755 1570 TKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCR 1637
Cdd:cd19169    81 TKCGNLARFINHSCNPNCYAKIITVESQKKIVIYSKRPIAVNEEITYDYKFPIEDEKIPCLCGAPQCR 148
RRM_Set1 cd12304
RNA recognition motif in the Set1-like family of histone-lysine N-methyltransferases; This ...
99-191 7.50e-46

RNA recognition motif in the Set1-like family of histone-lysine N-methyltransferases; This subfamily corresponds to the RRM of the Set1-like family of histone-lysine N-methyltransferases which includes Set1A and Set1B that are ubiquitously expressed vertebrates histone methyltransferases exhibiting high homology to yeast Set1. Set1A and Set1B proteins exhibit a largely non-overlapping subnuclear distribution in euchromatic nuclear speckles, strongly suggesting that they bind to a unique set of target genes and thus make non-redundant contributions to the epigenetic control of chromatin structure and gene expression. With the exception of the catalytic component, the subunit composition of the Set1A and Set1B histone methyltransferase complexes are identical. Each complex contains six human homologs of the yeast Set1/COMPASS complex, including Set1A or Set1B, Ash2 (homologous to yeast Bre2), CXXC finger protein 1 (CFP1; homologous to yeast Spp1), Rbbp5 (homologous to yeast Swd1), Wdr5 (homologous to yeast Swd3), and Wdr82 (homologous to yeast Swd2). The genomic targeting of these complexes is determined by the identity of the catalytic subunit present in each histone methyltransferase complex. Thus, the Set1A and Set1B complexes may exhibit both overlapping and non-redundant properties. Both Set1A and Set1B contain an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), an N- SET domain, and a C-terminal catalytic SET domain followed by a post-SET domain. In contrast to Set1B, Set1A additionally contains an HCF-1 binding motif that interacts with HCF-1 in vivo.


Pssm-ID: 409745 [Multi-domain]  Cd Length: 93  Bit Score: 159.82  E-value: 7.50e-46
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   99 PAVEVTIVNLNDNIDKQFLASMLDKCGTSDEINIYHHPITNKHLGIARIVFDSTKGARQFVEKYNQKSVMGKILDVFCDP 178
Cdd:cd12304     1 PPREVTFANLNDNINEGFLKDMCKKYGEVEEVKIYFHPKTGKHLGLARVVFDTTKGAKDCVEKLNQTSVMGNIIHVFLDP 80
                          90
                  ....*....|...
gi 281366755  179 FGATLKKSLESLT 191
Cdd:cd12304    81 KGRIIARLYEDIV 93
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
1504-1625 9.52e-37

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 135.15  E-value: 9.52e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   1504 LKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSC 1583
Cdd:smart00317    3 LEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERPKAYDTDGAKAFYLFDIDSDLCIDARRKGNLARFINHSC 82
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|..
gi 281366755   1584 NPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDE 1625
Cdd:smart00317   83 EPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGSDYANE 124
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
1501-1639 8.68e-34

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 127.00  E-value: 8.68e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1501 KKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIgssYLFRIDMETIIDATKCGNLARFIN 1580
Cdd:COG2940     5 HPRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERREPHKEPLHT---YLFELDDDGVIDGALGGNPARFIN 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 281366755 1581 HSCNPNCYAkvitIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGaqGCRGT 1639
Cdd:COG2940    82 HSCDPNCEA----DEEDGRIFIVALRDIAAGEELTYDYGLDYDEEEYPCRCP--NCRGT 134
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
1514-1619 1.76e-28

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 111.08  E-value: 1.76e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755  1514 WGLFAMEPIAADEMVIEYVG-QMIRPVVADLRETKYEAIGIG---SSYLFRID--METIIDAT--KCGNLARFINHSCNP 1585
Cdd:pfam00856    2 RGLFATEDIPKGEFIGEYVEvLLITKEEADKRELLYYDKLELrlwGPYLFTLDedSEYCIDARalYYGNWARFINHSCDP 81
                           90       100       110
                   ....*....|....*....|....*....|....
gi 281366755  1586 NCYAKVITIESEKKIVIYSKQPIGINEEITYDYK 1619
Cdd:pfam00856   82 NCEVRVVYVNGGPRIVIFALRDIKPGEELTIDYG 115
N-SET pfam11764
COMPASS (Complex proteins associated with Set1p) component N; The n-SET or N-SET domain is a ...
1352-1496 5.27e-26

COMPASS (Complex proteins associated with Set1p) component N; The n-SET or N-SET domain is a component of the COMPASS complex, associated with SET1, conserved in yeasts and in other eukaryotes up to humans. The COMPASS complex functions to methylate the fourth lysine of Histone 3 and for the silencing of genes close to the telomeres of chromosomes. This domain promotes trimethylation in conjunction with an RRM domain and is necessary for binding of the Spp1 component of COMPASS into the complex.


Pssm-ID: 463344  Cd Length: 172  Bit Score: 106.22  E-value: 5.27e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755  1352 DAEDINFIKMSYLDHLHKEPYAMFLNNTHWVDHCTTDRAF--WPPPSKKRRK-DDELIRHKTGCARTEGFYKLDVREKAK 1428
Cdd:pfam11764   15 DDEDLKFLKKVLADLLQDDNSSDIGNLEYWAWKQKEIKALnrLRGPTEKEKKiEGYYVPNPTGCARTEGYKKIPDSEKSK 94
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755  1429 HKYH---YAKA-NTEDSFNEDRSDEPTALTNHHHNKLISK-MQGISREARSNQRRLLTAFGSM-----GESELLKFNQ 1496
Cdd:pfam11764   95 YLPHrrkVQKPrETRQAHAKEDPAAAAAAAALAAEKSVSKgEISSSRVNRANNRRFVADINAQkaalgTESDLLSLNQ 172
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
255-386 6.81e-08

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 57.21  E-value: 6.81e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   255 RDKERERDRhfkERSR-HSSERSYDRDRGMREnvgtsirrrrtfyrrrssdiSPEDSRDILIMTRERSRDSDSRPRDYCR 333
Cdd:TIGR01642    1 RDEEPDRER---EKSRgRDRDRSSERPRRRSR--------------------DRSRFRDRHRRSRERSYREDSRPRDRRR 57
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 281366755   334 SRERESFRDRKRSHEKGRDQPReKREHYYNSSKDREYRGRDRDRSAEIDQRDR 386
Cdd:TIGR01642   58 YDSRSPRSLRYSSVRRSRDRPR-RRSRSVRSIEQHRRRLRDRSPSNQWRKDDK 109
RRM smart00360
RNA recognition motif;
107-174 1.77e-07

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 49.90  E-value: 1.77e-07
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755    107 NLNDNIDKQFLASMLDKCGTSDEINIYHHPITNKHLGIARIVFDSTKGARQFVEKYNQKSVMGKILDV 174
Cdd:smart00360    6 NLPPDTTEEELRELFSKFGKVESVRLVRDKETGKSKGFAFVEFESEEDAEKALEALNGKELDGRPLKV 73
PRP38_assoc pfam12871
Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region ...
304-386 2.08e-05

Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region found at the C-terminus of plant and metazoan pre-mRNA-splicing factor 38 proteins. The function is not known.


Pssm-ID: 463734 [Multi-domain]  Cd Length: 98  Bit Score: 44.77  E-value: 2.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   304 DISPEDSRDILIMTRERSRDSDSRPRDYCRSRERESFRDRKRSHEKGRDQPREKrehyynsSKDREYRGRDRDRSAEIDQ 383
Cdd:pfam12871   23 EASDESERASLSRKRRSRSRRRSSTRDRSRSRSRSRSRDRRSRGTRDRRRDRDR-------DRYRSLRSRSRDRSRDRDR 95

                   ...
gi 281366755   384 RDR 386
Cdd:pfam12871   96 DRR 98
PRK12678 PRK12678
transcription termination factor Rho; Provisional
245-387 6.19e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 44.51  E-value: 6.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755  245 PNRSRDRNWNRDKERERDRHFKERSRHSSERSYDRDRGMRENVGTSirRRRTFYRRRSSDISPEDSRDilimTRERSRDS 324
Cdd:PRK12678  145 AGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGER--GRREERGRDGDDRDRRDRRE----QGDRREER 218
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 281366755  325 DSRPRDYCRSRERESFRDRKRSHEKGRDQPREKREHYYNSSKDREYRGRDRDRsaeidQRDRG 387
Cdd:PRK12678  219 GRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDR-----RGRRG 276
 
Name Accession Description Interval E-value
SET_SETD1 cd19169
SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and ...
1490-1637 5.39e-110

SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A) and SET domain-containing protein 1B (SETD1B). These proteins are histone-lysine N-methyltransferases that specifically methylate 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated.


Pssm-ID: 380946  Cd Length: 148  Bit Score: 345.09  E-value: 5.39e-110
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1490 ELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDA 1569
Cdd:cd19169     1 DLLKFNQLKFRKKQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQVIRQSVADEREKRYEAIGIGSSYLFRVDDDTIIDA 80
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755 1570 TKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCR 1637
Cdd:cd19169    81 TKCGNLARFINHSCNPNCYAKIITVESQKKIVIYSKRPIAVNEEITYDYKFPIEDEKIPCLCGAPQCR 148
SET_SETD1A cd19204
SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and ...
1489-1641 4.37e-90

SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and similar proteins; SETD1A (EC2.1.1.43), also termed lysine N-methyltransferase 2F, or Set1/Ash2 histone methyltransferase complex subunit SET1, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me), when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Human SET domain containing protein 1A (hSETD1A) expression occurs at a high rate in hepatocellular carcinoma patients and controls tumor metastasis in breast cancer by activating MMP expression.


Pssm-ID: 380981 [Multi-domain]  Cd Length: 153  Bit Score: 288.85  E-value: 4.37e-90
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1489 SELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIID 1568
Cdd:cd19204     1 SDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQVVADMREKRYVQEGIGSSYLFRVDHDTIID 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 281366755 1569 ATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1641
Cdd:cd19204    81 ATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPIEDNKIPCLCGTENCRGTLN 153
SET_SETD1B cd19205
SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and ...
1489-1641 3.83e-83

SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and similar proteins; SETD1B (EC2.1.1.43), also termed lysine N-methyltransferase 2G, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Loss of SETD1B occurs in up to half the gastric and colorectal cancers, most commonly via SETD1B mutations, while de novo variants in SETD1B are associated with intellectual disability, epilepsy and autism.


Pssm-ID: 380982 [Multi-domain]  Cd Length: 153  Bit Score: 269.23  E-value: 3.83e-83
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1489 SELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIID 1568
Cdd:cd19205     1 SDLLKFNQLKFRKKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDEGIGSSYMFRVDHDTIID 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 281366755 1569 ATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1641
Cdd:cd19205    81 ATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIEDVKIPCLCGSENCRGTLN 153
SET_SETD1-like cd10518
SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), ...
1490-1637 4.95e-79

SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), histone-lysine N-methyltransferases (KMT2A/KMT2B/KMT2C/KMT2D) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A), 1B (SETD1B), as well as histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B), 2C (KMT2C), 2D (KMT2D). These proteins are histone-lysine N-methyltransferases (EC 2.1.1.43) that specifically methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380916  Cd Length: 150  Bit Score: 257.14  E-value: 4.95e-79
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1490 ELLKFNQLKFRKKQ-LKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIID 1568
Cdd:cd10518     1 MSKRFRQLRSRLKErLRVGKSGIHGWGLFAKRPIAAGEMVIEYVGEVIRPIVADKREKRYDEEGGGGTYMFRIDEDLVID 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1569 ATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLED-EKIPCLCGAQGCR 1637
Cdd:cd10518    81 ATKKGNIARFINHSCDPNCYAKIITVDGEKHIVIFAKRDIAPGEELTYDYKFPIEDeEKIPCLCGAPNCR 150
SET_SET1 cd20072
SET domain (including post-SET domain) found in catalytic component of the Saccharomyces ...
1490-1637 1.35e-76

SET domain (including post-SET domain) found in catalytic component of the Saccharomyces cerevisiae COMPASS complex and similar proteins; The family contains mostly fungal SET domains, including SET1 found in the catalytic component of the Saccharomyces cerevisiae COMPASS (complex of proteins associated with Set1). SET1 is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me), when part of the SET1 histone methyltransferase (HMT) complex. The activity of this catalytic domain is established through forming a complex with a set of core proteins; it is extensively contacted by Cps60 (Bre2), Cps50 (Swd1), and Cps30 (Swd3).


Pssm-ID: 380998  Cd Length: 148  Bit Score: 250.03  E-value: 1.35e-76
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1490 ELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDA 1569
Cdd:cd20072     1 DLLRFNQLKKRKKQLKFARSAIHNWGLYAMENISAKDMVIEYVGEVIRQQVADEREKRYLRQGIGSSYLFRIDDDTVVDA 80
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755 1570 TKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCR 1637
Cdd:cd20072    81 TKKGNIARFINHCCDPNCTAKIIKVEGEKRIVIYAKRDIAAGEELTYDYKFPREEDKIPCLCGAPNCR 148
SET_KMT2A_2B cd19170
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), ...
1508-1641 5.67e-59

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B) and similar proteins; This family includes KMT2A and KMT2B. Both KMT2A (also termed ALL-1 or CXXC7 or MLL or MLL1 or TRX1 or HRX) and KMT2B (also termed MLL4 or TRX2) act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380947 [Multi-domain]  Cd Length: 152  Bit Score: 199.93  E-value: 5.67e-59
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1508 KSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSsYLFRIDMETIIDATKCGNLARFINHSCNPNC 1587
Cdd:cd19170    20 RSPIHGRGLFCKRNIDAGEMVIEYAGEVIRSVLTDKREKYYESKGIGC-YMFRIDDDEVVDATMHGNAARFINHSCEPNC 98
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 281366755 1588 YAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1641
Cdd:cd19170    99 YSRVVNIDGKKHIVIFALRRILRGEELTYDYKFPIEDVKIPCTCGSKKCRKYLN 152
SET_KMT2C_2D cd19171
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), ...
1493-1638 5.81e-52

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), 2D (KMT2D) and similar proteins; This family includes KMT2C and KMT2D. Both, KMT2C (also termed HALR or MLL3) and KMT2D (also termed ALR or MLL2), act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me). They are subunits of MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation.


Pssm-ID: 380948 [Multi-domain]  Cd Length: 153  Bit Score: 179.93  E-value: 5.81e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1493 KFNQLKFRKKQLK----FAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSsYLFRIDMETIID 1568
Cdd:cd19171     1 KSSQYRKLKTEWRsnvyLARSRIQGLGLYAARDIEKHTMVIEYIGEIIRNEVANRREKIYESQNRGI-YMFRIDNDWVID 79
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 281366755 1569 ATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDE--KIPCLCGAQGCRG 1638
Cdd:cd19171    80 ATMTGGPARYINHSCNPNCVAEVVTFDKEKKIIIISNRRIAKGEELTYDYKFDFEDDqhKIPCLCGAPNCRK 151
RRM_Set1 cd12304
RNA recognition motif in the Set1-like family of histone-lysine N-methyltransferases; This ...
99-191 7.50e-46

RNA recognition motif in the Set1-like family of histone-lysine N-methyltransferases; This subfamily corresponds to the RRM of the Set1-like family of histone-lysine N-methyltransferases which includes Set1A and Set1B that are ubiquitously expressed vertebrates histone methyltransferases exhibiting high homology to yeast Set1. Set1A and Set1B proteins exhibit a largely non-overlapping subnuclear distribution in euchromatic nuclear speckles, strongly suggesting that they bind to a unique set of target genes and thus make non-redundant contributions to the epigenetic control of chromatin structure and gene expression. With the exception of the catalytic component, the subunit composition of the Set1A and Set1B histone methyltransferase complexes are identical. Each complex contains six human homologs of the yeast Set1/COMPASS complex, including Set1A or Set1B, Ash2 (homologous to yeast Bre2), CXXC finger protein 1 (CFP1; homologous to yeast Spp1), Rbbp5 (homologous to yeast Swd1), Wdr5 (homologous to yeast Swd3), and Wdr82 (homologous to yeast Swd2). The genomic targeting of these complexes is determined by the identity of the catalytic subunit present in each histone methyltransferase complex. Thus, the Set1A and Set1B complexes may exhibit both overlapping and non-redundant properties. Both Set1A and Set1B contain an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), an N- SET domain, and a C-terminal catalytic SET domain followed by a post-SET domain. In contrast to Set1B, Set1A additionally contains an HCF-1 binding motif that interacts with HCF-1 in vivo.


Pssm-ID: 409745 [Multi-domain]  Cd Length: 93  Bit Score: 159.82  E-value: 7.50e-46
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   99 PAVEVTIVNLNDNIDKQFLASMLDKCGTSDEINIYHHPITNKHLGIARIVFDSTKGARQFVEKYNQKSVMGKILDVFCDP 178
Cdd:cd12304     1 PPREVTFANLNDNINEGFLKDMCKKYGEVEEVKIYFHPKTGKHLGLARVVFDTTKGAKDCVEKLNQTSVMGNIIHVFLDP 80
                          90
                  ....*....|...
gi 281366755  179 FGATLKKSLESLT 191
Cdd:cd12304    81 KGRIIARLYEDIV 93
SET_KMT2A cd19206
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) ...
1492-1641 2.25e-44

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) and similar proteins; KMT2A (EC2.1.1.43; also termed lysine N-methyltransferase 2A, ALL-1, CXXC-type zinc finger protein 7 (CXXC7), myeloid/lymphoid or mixed-lineage leukemia (MLL), myeloid/lymphoid or mixed-lineage leukemia protein 1 (MLL1), trithorax-like protein (TRX1), or zinc finger protein HRX) acts as a histone methyltransferase that plays an essential role in early development and hematopoiesis. It is a catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) complex and acetylation of 'Lys-16' of histone H4 (H4K16ac).


Pssm-ID: 380983 [Multi-domain]  Cd Length: 154  Bit Score: 158.26  E-value: 2.25e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1492 LKFNQLKFRKKQ-LKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSsYLFRIDMETIIDAT 1570
Cdd:cd19206     3 MRFRHLKKTSKEaVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDSKGIGC-YMFRIDDSEVVDAT 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 281366755 1571 KCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLED--EKIPCLCGAQGCRGTLN 1641
Cdd:cd19206    82 MHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDasNKLPCNCGAKKCRKFLN 154
SET_KMT2B cd19207
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) ...
1492-1641 1.92e-39

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) and similar proteins; KMT2B (EC2.1.1.43; also termed lysine N-methyltransferase 2B, myeloid/lymphoid or mixed-lineage leukemia protein 4 (MLL2/MLL4), trithorax homolog 2 (TRX2), or WW domain-binding protein 7 (WBP-7)), acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is required during the transcriptionally active period of oocyte growth for the establishment and/or maintenance of bulk H3K4 trimethylation (H3K4me3), global transcriptional silencing that precedes resumption of meiosis, oocyte survival and normal zygotic genome activation.


Pssm-ID: 380984 [Multi-domain]  Cd Length: 154  Bit Score: 144.01  E-value: 1.92e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1492 LKFNQLKFRKKQ-LKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSsYLFRIDMETIIDAT 1570
Cdd:cd19207     3 MRFRHLKKTSKEaVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDSKGIGC-YMFRIDDFDVVDAT 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 281366755 1571 KCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLED--EKIPCLCGAQGCRGTLN 1641
Cdd:cd19207    82 MHGNAARFINHSCEPNCYSRVIHVEGQKHIVIFALRKIYRGEELTYDYKFPIEDasNKLPCNCGAKRCRRFLN 154
SET_EZH cd10519
SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar ...
1502-1618 3.50e-37

SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both, EZH1 and EZH2, can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380917  Cd Length: 117  Bit Score: 136.22  E-value: 3.50e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1502 KQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGigSSYLFRIDMETIIDATKCGNLARFINH 1581
Cdd:cd10519     1 KRLLLGKSDVAGWGLFLKEPIKKDEFIGEYTGELISQDEADRRGKIYDKYN--SSYLFNLNDQFVVDATRKGNKIRFANH 78
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 281366755 1582 SCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDY 1618
Cdd:cd10519    79 SSNPNCYAKVMMVNGDHRIGIFAKRDIEAGEELFFDY 115
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
1504-1625 9.52e-37

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 135.15  E-value: 9.52e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   1504 LKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSC 1583
Cdd:smart00317    3 LEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERPKAYDTDGAKAFYLFDIDSDLCIDARRKGNLARFINHSC 82
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|..
gi 281366755   1584 NPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDE 1625
Cdd:smart00317   83 EPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGSDYANE 124
SET_KMT2C cd19208
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) ...
1501-1637 7.66e-35

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) and similar proteins; KMT2C (EC2.1.1.43; also termed lysine N-methyltransferase 2C, homologous to ALR protein (HALR) myeloid/lymphoid, or mixed-lineage leukemia protein 3 (MLL3)), acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me) and may be involved in leukemogenesis and developmental disorder. KMT2C is a catalytic subunit of MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation. Overexpression of KMT2C is associated with estrogen receptor-positive breast cancer; KMT2C mediates the estrogen dependence of breast cancer through regulation of estrogen receptor alpha (ERalpha) enhancer function. KMT2C is frequently mutated in certain populations with diffuse-type gastric adenocarcinomas (DGA); its loss promotes epithelial-to-mesenchymal transition (EMT) and is associated with worse overall survival.


Pssm-ID: 380985 [Multi-domain]  Cd Length: 154  Bit Score: 130.90  E-value: 7.66e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1501 KKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSsYLFRIDMETIIDATKCGNLARFIN 1580
Cdd:cd19208    14 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGV-YMFRIDNDHVIDATLTGGPARYIN 92
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 281366755 1581 HSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDE--KIPCLCGAQGCR 1637
Cdd:cd19208    93 HSCAPNCVAEVVTFEKGHKIIISSSRRIQKGEELCYDYKFDFEDDqhKIPCHCGAVNCR 151
SET_SETD2-like cd10531
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), ...
1514-1637 9.56e-35

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2), ASH1-like protein (ASH1L) and similar proteins; This family includes SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2) and ASH1-like protein (ASH1L), which function as histone-lysine N-methyltransferases. SETD2 specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using demethylated 'Lys-36' (H3K36me2) as substrate. NSD2 shows histone H3 'Lys-27' (H3K27me) methyltransferase activity. ASH1L specifically methylates 'Lys-36' of histone H3 (H3K36me). The family also includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins.


Pssm-ID: 380929  Cd Length: 136  Bit Score: 130.07  E-value: 9.56e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVIT 1593
Cdd:cd10531    12 WGVKAKEDIQKGEFIIEYVGEVIDKKEFKERLDEYEELGKSNFYILSLSDDVVIDATRKGNLSRFINHSCEPNCETQKWI 91
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 281366755 1594 IESEKKIVIYSKQPIGINEEITYDYKFPLE-DEKIPCLCGAQGCR 1637
Cdd:cd10531    92 VNGEYRIGIFALRDIPAGEELTFDYNFVNYnEAKQVCLCGAQNCR 136
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
1501-1639 8.68e-34

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 127.00  E-value: 8.68e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1501 KKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIgssYLFRIDMETIIDATKCGNLARFIN 1580
Cdd:COG2940     5 HPRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERREPHKEPLHT---YLFELDDDGVIDGALGGNPARFIN 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 281366755 1581 HSCNPNCYAkvitIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGaqGCRGT 1639
Cdd:COG2940    82 HSCDPNCEA----DEEDGRIFIVALRDIAAGEELTYDYGLDYDEEEYPCRCP--NCRGT 134
SET_KMT2D cd19209
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) ...
1501-1637 1.75e-32

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) and similar proteins; KMT2D (EC2.1.1.43; also termed lysine N-methyltransferase 2D, ALL1-related protein (ALR), or myeloid/lymphoid or mixed-lineage leukemia protein 2 (MLL2)), acts as histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is a coactivator for estrogen receptor by being recruited by ESR1, thereby activating transcription. KMT2D is a subunit of MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation.


Pssm-ID: 380986 [Multi-domain]  Cd Length: 155  Bit Score: 124.04  E-value: 1.75e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1501 KKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSsYLFRIDMETIIDATKCGNLARFIN 1580
Cdd:cd19209    15 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGI-YMFRINNEHVIDATLTGGPARYIN 93
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 281366755 1581 HSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEK--IPCLCGAQGCR 1637
Cdd:cd19209    94 HSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQhkIPCHCGAWNCR 152
SET_ASHR3-like cd19175
SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 ...
1503-1640 4.34e-31

SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins; This family includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3, also termed protein SET DOMAIN GROUP 4 or protein stamen loss), ASH1 homolog 3 (ASHH3, also termed protein SET DOMAIN GROUP 7) and homolog 4 (ASHH4, also termed protein SET DOMAIN GROUP 24). They all function as histone-lysine N-methyltransferases (EC 2.1.1.43).


Pssm-ID: 380952 [Multi-domain]  Cd Length: 139  Bit Score: 119.44  E-value: 4.34e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1503 QLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 1582
Cdd:cd19175     1 KMKLVKTEKCGWGLVADEDINAGEFIIEYVGEVIDDKTCEERLWDMKHKGEKNFYMCEIDKDMVIDATFKGNLSRFINHS 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755 1583 CNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTL 1640
Cdd:cd19175    81 CDPNCELQKWQVDGETRIGVFAIRDIKKGEELTYDYQFVQFGADQDCHCGSKNCRGKL 138
SET_SETD2 cd19172
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and ...
1514-1640 4.89e-31

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and similar proteins; SETD2 (also termed HIF-1, huntingtin yeast partner B, huntingtin-interacting protein 1 (HIP-1), huntingtin-interacting protein B, lysine N-methyltransferase 3A or protein-lysine N-methyltransferase SETD2) acts as histone-lysine N-methyltransferase that specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using demethylated 'Lys-36' (H3K36me2) as substrate. It has been shown that methylation is a posttranslational modification of dynamic microtubules and that SETD2 methylates alpha-tubulin at lysine 40, the same lysine that is marked by acetylation on microtubules. Methylation of microtubules occurs during mitosis and cytokinesis and can be ablated by SETD2 deletion, which causes mitotic spindle and cytokinesis defects, micronuclei, and polyploidy.


Pssm-ID: 380949 [Multi-domain]  Cd Length: 142  Bit Score: 119.61  E-value: 4.89e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVIT 1593
Cdd:cd19172    14 WGLRAAEDLPKGTFVIEYVGEVLDEKEFKRRMKEYAREGNRHYYFMALKSDEIIDATKKGNLSRFINHSCEPNCETQKWT 93
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 281366755 1594 IESEKKIVIYSKQPIGINEEITYDYKFPL-EDEKIPCLCGAQGCRGTL 1640
Cdd:cd19172    94 VNGELRVGFFAKRDIPAGEELTFDYQFERyGKEAQKCYCGSPNCRGYI 141
SET_NSD cd19173
SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, ...
1514-1640 9.56e-29

SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, NSD2, NSD3 and similar proteins; The nuclear receptor-binding SET Domain (NSD) family of histone H3 lysine 36 methyltransferases is comprised of NSD1, NSD2, and NSD3, which are primarily known to be involved in chromatin integrity and gene expression through mono-, di-, or tri-methylating lysine 36 of histone H3 (H3K36), respectively. NSD1 (EC 2.1.1.43; also termed histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B) or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as histone-lysine N-methyltransferase with histone H3 'Lys-27' (H3K27me) methyltransferase activity. NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3.


Pssm-ID: 380950 [Multi-domain]  Cd Length: 142  Bit Score: 112.79  E-value: 9.56e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVIT 1593
Cdd:cd19173    14 WGLRTKRDIKKGDFVIEYVGELIDEEECRRRLKKAHENNITNFYMLTLDKDRIIDAGPKGNLSRFMNHSCQPNCETQKWT 93
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 281366755 1594 IESEKKIVIYSKQPIGINEEITYDYKFP-LEDEKIPCLCGAQGCRGTL 1640
Cdd:cd19173    94 VNGDTRVGLFAVRDIPAGEELTFNYNLDcLGNEKKVCRCGAPNCSGFL 141
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
1514-1619 1.76e-28

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 111.08  E-value: 1.76e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755  1514 WGLFAMEPIAADEMVIEYVG-QMIRPVVADLRETKYEAIGIG---SSYLFRID--METIIDAT--KCGNLARFINHSCNP 1585
Cdd:pfam00856    2 RGLFATEDIPKGEFIGEYVEvLLITKEEADKRELLYYDKLELrlwGPYLFTLDedSEYCIDARalYYGNWARFINHSCDP 81
                           90       100       110
                   ....*....|....*....|....*....|....
gi 281366755  1586 NCYAKVITIESEKKIVIYSKQPIGINEEITYDYK 1619
Cdd:pfam00856   82 NCEVRVVYVNGGPRIVIFALRDIKPGEELTIDYG 115
SET_ASH1L cd19174
SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ...
1514-1641 1.40e-26

SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ASH1L (EC 2.1.1.43; also termed absent small and homeotic disks protein 1 homolog, KMT2H, or lysine N-methyltransferase 2H) acts as histone-lysine N-methyltransferase that specifically methylates 'Lys-36' of histone H3 (H3K36me). It plays important roles in development; heterozygous mutation of ASH1L is associated with severe intellectual disability (ID) and multiple congenital anomaly (MCA).


Pssm-ID: 380951 [Multi-domain]  Cd Length: 141  Bit Score: 106.61  E-value: 1.40e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQmirpVVADlRETKYEAIGIGSS----YLFRIDMETIIDATKCGNLARFINHSCNPNCYA 1589
Cdd:cd19174    12 WGVRTKEPIKAGQFIIEYVGE----VVSE-QEFRRRMIEQYHNhshhYCLNLDSGMVIDGYRMGNEARFVNHSCDPNCEM 86
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 281366755 1590 KVITIESEKKIVIYSKQPIGINEEITYDYKFPL--EDEKIPCLCGAQGCRGTLN 1641
Cdd:cd19174    87 QKWSVNGVYRIGLFALKDIPAGEELTYDYNFHSfnVEKQQPCKCGSPNCRGVIG 140
N-SET pfam11764
COMPASS (Complex proteins associated with Set1p) component N; The n-SET or N-SET domain is a ...
1352-1496 5.27e-26

COMPASS (Complex proteins associated with Set1p) component N; The n-SET or N-SET domain is a component of the COMPASS complex, associated with SET1, conserved in yeasts and in other eukaryotes up to humans. The COMPASS complex functions to methylate the fourth lysine of Histone 3 and for the silencing of genes close to the telomeres of chromosomes. This domain promotes trimethylation in conjunction with an RRM domain and is necessary for binding of the Spp1 component of COMPASS into the complex.


Pssm-ID: 463344  Cd Length: 172  Bit Score: 106.22  E-value: 5.27e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755  1352 DAEDINFIKMSYLDHLHKEPYAMFLNNTHWVDHCTTDRAF--WPPPSKKRRK-DDELIRHKTGCARTEGFYKLDVREKAK 1428
Cdd:pfam11764   15 DDEDLKFLKKVLADLLQDDNSSDIGNLEYWAWKQKEIKALnrLRGPTEKEKKiEGYYVPNPTGCARTEGYKKIPDSEKSK 94
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755  1429 HKYH---YAKA-NTEDSFNEDRSDEPTALTNHHHNKLISK-MQGISREARSNQRRLLTAFGSM-----GESELLKFNQ 1496
Cdd:pfam11764   95 YLPHrrkVQKPrETRQAHAKEDPAAAAAAAALAAEKSVSKgEISSSRVNRANNRRFVADINAQkaalgTESDLLSLNQ 172
SET_EZH2 cd19218
SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43) ...
1501-1620 5.11e-25

SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43), also termed lysine N-methyltransferase 6, or ENX-1, or histone-lysine N-methyltransferase EZH2, is a catalytic subunit of the polycomb repressive complex 2 (PRC2)/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380995  Cd Length: 120  Bit Score: 101.53  E-value: 5.11e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1501 KKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGigSSYLFRIDMETIIDATKCGNLARFIN 1580
Cdd:cd19218     3 KKHLLLAPSDVAGWGIFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDKYM--CSFLFNLNNDFVVDATRKGNKIRFAN 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 281366755 1581 HSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKF 1620
Cdd:cd19218    81 HSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFDYRY 120
SET_SUV39H cd10542
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
1514-1640 8.36e-25

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homologs, SUV39H1, SUV39H2 and similar proteins; This family includes SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. Also included are Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (SUV39H homolog) and Neurospora crassa DIM-5, both of which also methylate 'Lys-9' of histone H3.


Pssm-ID: 380940 [Multi-domain]  Cd Length: 245  Bit Score: 105.07  E-value: 8.36e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIgsSYLFRIDMETI-----IDATKCGNLARFINHSCNPN-- 1586
Cdd:cd10542   100 WGVKTLEDIKKGTFVMEYVGEIITSEEAERRGKIYDANGR--TYLFDLDYNDDdceytVDAAYYGNISHFINHSCDPNla 177
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755 1587 CYAKVIT--IESEKKIVIYSKQPIGINEEITYDYKFPL------------EDEKIPCLCGAQGCRGTL 1640
Cdd:cd10542   178 VYAVWINhlDPRLPRIAFFAKRDIKAGEELTFDYLMTGtggssestipkpKDVRVPCLCGSKNCRKYL 245
SET_EZH1 cd19217
SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43) ...
1501-1624 3.30e-23

SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43), also termed ENX-2, or histone-lysine N-methyltransferase EZH1, is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380994  Cd Length: 136  Bit Score: 97.06  E-value: 3.30e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1501 KKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGigSSYLFRIDMETIIDATKCGNLARFIN 1580
Cdd:cd19217     5 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFNLNNDFVVDATRKGNKIRFAN 82
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 281366755 1581 HSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLED 1624
Cdd:cd19217    83 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQQGEELFFDYRYSQAD 126
SET_NSD2 cd19211
SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) ...
1503-1640 7.56e-22

SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) and similar proteins; NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as histone-lysine N-methyltransferase with histone H3 'Lys-36' (H3K36me) methyltransferase activity. NSD2 has been shown to mediate di- and trimethylation of H3K36 and dimethylation of H4K20 in different systems, and has been characterized as a transcriptional repressor interacting with histone deacetylase HDAC1 and histone demethylase LSD1. NSD2 mediates constitutive NF-kappaB signaling for cancer cell proliferation, survival and tumor growth. It is highly overexpressed in several types of human cancers, including small-cell lung cancers, neuroblastoma, carcinomas of stomach and colon, and bladder cancers, and its overexpression tends to be associated with tumor aggressiveness. WHSC1 is frequently deleted in Wolf-Hirschhorn syndrome (WHS).


Pssm-ID: 380988 [Multi-domain]  Cd Length: 142  Bit Score: 93.13  E-value: 7.56e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1503 QLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHS 1582
Cdd:cd19211     3 ETKIIKTEGKGWGLIAKRDIKKGEFVNEYVGELIDEEECMARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHS 82
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 281366755 1583 CNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFP-LEDEKIPCLCGAQGCRGTL 1640
Cdd:cd19211    83 CQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDcLGNEKTVCRCGAPNCSGFL 141
SET_LegAS4-like cd10522
SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and ...
1515-1618 2.19e-21

SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and similar proteins; LegAS4 is a type IV secretion system effector of Legionella pneumophila. It contains a SET domain that is involved in the modification of Lys4 of histone H3 (H3K4) in the nucleolus of the host cell, thereby enhancing heterochromatic rDNA transcription. It also contains an ankyrin repeat domain of unknown function at its C-terminal region.


Pssm-ID: 380920 [Multi-domain]  Cd Length: 122  Bit Score: 91.25  E-value: 2.19e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1515 GLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEaigiGSSYLFRIDMETI-IDATKCGNLARFINHSCNPNCYAKVIT 1593
Cdd:cd10522    16 GLFAAETIAKGEFVGEYTGEVLDRWEEDRDSVYHY----DPLYPFDLNGDILvIDAGKKGNLTRFINHSDQPNLELIVRT 91
                          90       100
                  ....*....|....*....|....*
gi 281366755 1594 IESEKKIVIYSKQPIGINEEITYDY 1618
Cdd:cd10522    92 LKGEQHIGFVAIRDIKPGEELFISY 116
SET_SETDB-like cd10538
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
1500-1618 3.75e-21

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2, and similar proteins; The family includes SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2. SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis. SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin. This family also includes the pre-SET domain, which is found in a number of histone methyltransferases (HMTase), N-terminal to the SET domain. Pre-SET domain is a zinc binding motif which contains 9 conserved cysteines that coordinate three zinc ions. It is thought that this region plays a structural role in stabilizing SET domains. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380936 [Multi-domain]  Cd Length: 217  Bit Score: 93.59  E-value: 3.75e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1500 RKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGigSSYLFRID---------METIIDAT 1570
Cdd:cd10538    87 LQARLQVFRTSKKGWGVRSLEFIPKGSFVCEYVGEVITTSEADRRGKIYDKSG--GSYLFDLDefsdsdgdgEELCVDAT 164
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 281366755 1571 KCGNLARFINHSCNPNCyaKVITIESEK------KIVIYSKQPIGINEEITYDY 1618
Cdd:cd10538   165 FCGNVSRFINHSCDPNL--FPFNVVIDHddlrypRIALFATRDILPGEELTFDY 216
RRM_Set1A cd12548
RNA recognition motif in vertebrate histone-lysine N-methyltransferase Setd1A (Set1A); This ...
99-192 2.36e-19

RNA recognition motif in vertebrate histone-lysine N-methyltransferase Setd1A (Set1A); This subgroup corresponds to the RRM of Setd1A, also termed SET domain-containing protein 1A (Set1A), or lysine N-methyltransferase 2F, or Set1/Ash2 histone methyltransferase complex subunit Set1, a ubiquitously expressed vertebrates histone methyltransferase that exhibits high homology to yeast Set1. Set1A is localized to euchromatic nuclear speckles and associates with a complex containing six human homologs of the yeast Set1/COMPASS complex, including CXXC finger protein 1 (CFP1; homologous to yeast Spp1), Rbbp5 (homologous to yeast Swd1), Ash2 (homologous to yeast Bre2), Wdr5 (homologous to yeast Swd3), and Wdr82 (homologous to yeast Swd2). Set1A contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), an N- SET domain, and a C-terminal catalytic SET domain followed by a post-SET domain. In contrast to Set1B, Set1A additionally contains an HCF-1 binding motif that interacts with HCF-1 in vivo.


Pssm-ID: 409964 [Multi-domain]  Cd Length: 95  Bit Score: 84.63  E-value: 2.36e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   99 PAVEVTIVNLNDNIDKQFLASMLDKCGTSDEINIYHHPITNKHLGIARIVFDSTKGARQFVEKYNQKSVMGKILDVFCDP 178
Cdd:cd12548     1 PLKEVTFARLNDNVREPFLADMCRKFGEVEEVEILLHPKTRKHLGLAKVLFTSTRGAKDTVKHLHNTSVMGNIIHAQLDI 80
                          90
                  ....*....|....
gi 281366755  179 FGATLKKSLESLTN 192
Cdd:cd12548    81 KGQQRMKYYDLIVS 94
RRM_Set1B cd12549
RNA recognition motif in vertebrate histone-lysine N-methyltransferase Setd1B (Set1B); This ...
99-190 3.58e-19

RNA recognition motif in vertebrate histone-lysine N-methyltransferase Setd1B (Set1B); This subgroup corresponds to the RRM of Setd1B, also termed SET domain-containing protein 1B (Set1B), or lysine N-methyltransferase 2G, a ubiquitously expressed vertebrates histone methyltransferase that exhibits high homology to yeast Set1. Set1B is localized to euchromatic nuclear speckles and associates with a complex containing six human homologs of the yeast Set1/COMPASS complex, including CXXC finger protein 1 (CFP1; homologous to yeast Spp1), Rbbp5 (homologous to yeast Swd1), Ash2 (homologous to yeast Bre2), Wdr5 (homologous to yeast Swd3), and Wdr82 (homologous to yeast Swd2). Set1B complex is a histone methyltransferase that produces trimethylated histone H3 at Lys4. Set1B contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), an N- SET domain, and a C-terminal catalytic SET domain followed by a post-SET domain.


Pssm-ID: 409965 [Multi-domain]  Cd Length: 93  Bit Score: 83.85  E-value: 3.58e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   99 PAVEVTIVNLNDNIDKQFLASMLDKCGTSDEINIYHHPITNKHLGIARIVFDSTKGARQFVEKYNQKSVMGKILDVFCDP 178
Cdd:cd12549     1 PPKQVTFAKLNDNIRENFLRDMCKKYGEVEEVEILYNPKNKKHLGIAKVVFATVKGAKDAVQHLHNTSVMGNIIHVELDT 80
                          90
                  ....*....|..
gi 281366755  179 FGATLKKSLESL 190
Cdd:cd12549    81 KGETRMRFYELL 92
SET_EZH-like cd19168
SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb ...
1502-1618 3.51e-18

SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb repressive complex 2 (PRC2), and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both EZH1 and EZH2 can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380945  Cd Length: 124  Bit Score: 82.24  E-value: 3.51e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1502 KQLKFAKSAIHD-WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAigIGSSYLFRIDMETIIDATKCGNLARFIN 1580
Cdd:cd19168     1 KAVVLGKSQLECgLGLFAAEDIKEGEFVIEYTGELISHDEGVRREHRRGD--VSYLYLFEEQEGIWVDAAIYGNLSRYIN 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 281366755 1581 HSCNP----NCYAKVITIESEKKIVIYSKQPIGINEEITYDY 1618
Cdd:cd19168    79 HATDKvktgNCMPKIMYVNHEWRIKFTAIKDIKIGEELFFNY 120
SET_NSD1 cd19210
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
1514-1640 4.67e-18

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 1 (NSD1) and similar proteins; NSD1 (EC 2.1.1.43; also termed Histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B), or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD1 is altered in approximately 10% of head and neck cancer patients with 55% decrease in risk of death in NSD1-mutated versus non-mutated patients; its disruption promotes favorable chemotherapeutic responses linked to hypomethylation.


Pssm-ID: 380987 [Multi-domain]  Cd Length: 142  Bit Score: 82.28  E-value: 4.67e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVIT 1593
Cdd:cd19210    14 WGLRCKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWT 93
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 281366755 1594 IESEKKIVIYSKQPIGINEEITYDYKFP-LEDEKIPCLCGAQGCRGTL 1640
Cdd:cd19210    94 VNGDTRVGLFALCDIKAGTELTFNYNLEcLGNGKTVCKCGAPNCSGFL 141
SET_NSD3 cd19212
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
1514-1640 4.81e-18

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 3 (NSD3) and similar proteins; NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3. NSD3 is amplified and overexpressed in multiple cancer types, including acute myeloid leukemia (AML), breast, lung, pancreatic and bladder cancers, as well as squamous cell carcinoma of the head and neck (SCCHN). NSD3 contributes to tumorigenesis by interacting with bromodomain-containing protein 4 (BRD4), the bromodomain and extraterminal (BET) protein, which is a potential therapeutic target in acute myeloid leukemia (AML). NSD3 is amplified in primary tumors and cell lines from breast carcinoma, and can promote the cell viability of small-cell lung cancer and pancreatic ductal adenocarcinoma. High NSD3 expression is implicated in poor grade and heavy smoking history in SCCHN. Thus, NSD3 may serve as a potential druggable target for selective cancer therapy.


Pssm-ID: 380989 [Multi-domain]  Cd Length: 142  Bit Score: 82.28  E-value: 4.81e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVIT 1593
Cdd:cd19212    14 WGLRTKRSIKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWT 93
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 281366755 1594 IESEKKIVIYSKQPIGINEEITYDYKFP-LEDEKIPCLCGAQGCRGTL 1640
Cdd:cd19212    94 VNGDVRVGLFALCDIPAGMELTFNYNLDcLGNGRTECHCGADNCSGFL 141
SET_SETMAR cd10544
SET domain (including pre-SET and post-SET domains) found in SET domain and mariner ...
1502-1640 5.80e-18

SET domain (including pre-SET and post-SET domains) found in SET domain and mariner transposase fusion protein (SETMAR) and similar proteins; SETMAR (also termed metnase) is a DNA-binding protein that is indirectly recruited to sites of DNA damage through protein-protein interactions. It has a sequence-specific DNA-binding activity recognizing the 19-mer core of the 5'-terminal inverted repeats (TIRs) of the Hsmar1 element and displays a DNA nicking and end joining activity. SETMAR also acts as a histone-lysine N-methyltransferase that methylates 'Lys-4' and 'Lys-36' of histone H3. It specifically mediates dimethylation of H3 'Lys-36' at sites of DNA double-strand break and may recruit proteins required for efficient DSB repair through non-homologous end-joining.


Pssm-ID: 380942 [Multi-domain]  Cd Length: 254  Bit Score: 85.43  E-value: 5.80e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1502 KQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMI-----RPVVADLRETKyeaigigSSYLFRID--------METIID 1568
Cdd:cd10544    90 FKLQVFKTPKKGWGLRTLEFIPKGRFVCEYAGEVIgfeeaRRRTKSQTKGD-------MNYIIVLRehlssgkvLETFVD 162
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1569 ATKCGNLARFINHSCNPNCYAKVITIESE-KKIVIYSKQPIGINEEITYDY-----------KFPLEDE---KIPCLCGA 1633
Cdd:cd10544   163 PTYIGNIGRFLNHSCEPNLFMVPVRVDSMvPKLALFAARDIVAGEELSFDYsgefsnsvesvTLARQDEsksRKPCLCGA 242

                  ....*..
gi 281366755 1634 QGCRGTL 1640
Cdd:cd10544   243 ENCRGFL 249
SET_SETD5-like cd10529
SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine ...
1515-1618 5.21e-17

SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. KMT2E (also termed inactive lysine N-methyltransferase 2E or myeloid/lymphoid or mixed-lineage leukemia protein 5 (MLL5)) associates with chromatin regions downstream of transcriptional start sites of active genes and thus regulates gene transcription. The family also includes Saccharomyces cerevisiae SET domain-containing proteins, SET3 and SET4, and Schizosaccharomyces pombe SET3. Most of these family members contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380927  Cd Length: 127  Bit Score: 78.86  E-value: 5.21e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1515 GLFAMEPIAADEMVIEYVGQMIrpvvadLRE---TKYEAIGIGSSYLFRI----DMETIIDATKCGNLARFINHSCNPNC 1587
Cdd:cd10529    18 GLVATEDISPGEPILEYKGEVS------LRSefkEDNGFFKRPSPFVFFYdgfeGLPLCVDARKYGNEARFIRRSCRPNA 91
                          90       100       110
                  ....*....|....*....|....*....|....
gi 281366755 1588 YAKVITIE-SEKKIVIYSKQPIGINEEIT--YDY 1618
Cdd:cd10529    92 ELRHVVVSnGELRLFIFALKDIRKGTEITipFDY 125
SET_SETD8 cd10528
SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2. ...
1515-1618 3.52e-16

SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2.1.1.43; also termed N-lysine methyltransferase KMT5A, H4-K20-HMTase KMT5A, lysine N-methyltransferase 5A, lysine-specific methylase 5A, PR/SET domain-containing protein 07, PR-Set7 or PR/SET07) is a nucleosomal histone-lysine N-methyltransferase that specifically monomethylates 'Lys-20' of histone H4 (H4K20me1). It plays a central role in the silencing of euchromatic genes.


Pssm-ID: 380926 [Multi-domain]  Cd Length: 141  Bit Score: 76.85  E-value: 3.52e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1515 GLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYL--FRIDMETI-IDATK-CGNLARFINHSC-NPNCYA 1589
Cdd:cd10528    30 GVIATRPFEKGDFVVEYHGDLITITEAKKREALYAKDPSTGCYMyyFQYKGKTYcVDATKeSGRLGRLINHSKkKPNLKT 109
                          90       100
                  ....*....|....*....|....*....
gi 281366755 1590 KVITIESEKKIVIYSKQPIGINEEITYDY 1618
Cdd:cd10528   110 KLLVIDGVPHLILVAKRDIKPGEELLYDY 138
SET_SUV39H2 cd10532
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
1514-1640 5.82e-16

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 2 (SUV39H2) and similar proteins; SUV39H2 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B (KMT1B), or Su(var)3-9 homolog 2) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380930 [Multi-domain]  Cd Length: 243  Bit Score: 79.16  E-value: 5.82e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIgsSYLFRIDMET---IIDATKCGNLARFINHSCNPNCYAK 1590
Cdd:cd10532    97 WGVKTLQKIKKNSFVMEYVGEVITSEEAERRGQFYDSKGI--TYLFDLDYESdefTVDAARYGNVSHFVNHSCDPNLQVF 174
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 281366755 1591 VITIESEK----KIVIYSKQPIGINEEITYDYKF---------------PLEDEKIPCLCGAQGCRGTL 1640
Cdd:cd10532   175 NVFIDNLDtrlpRIALFSTRTIKAGEELTFDYQMkgsgdlssdsidnspAKKRVRTVCKCGAVTCRGYL 243
SET_EHMT cd10543
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
1514-1637 8.11e-15

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase EHMT1, EHMT2 and similar proteins; This family includes EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380941 [Multi-domain]  Cd Length: 231  Bit Score: 75.45  E-value: 8.11e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETkyeaigigSSYLFRID---MET-IIDATKCGNLARFINHSCNPNCYA 1589
Cdd:cd10543   103 WGVRALQDIPKGTFVCEYIGELISDSEADSRED--------DSYLFDLDnkdGETyCIDARRYGNISRFINHLCEPNLIP 174
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 281366755 1590 KVITIESEK----KIVIYSKQPIGINEEITYDY--KF-PLEDEKIPCLCGAQGCR 1637
Cdd:cd10543   175 VRVFVEHQDlrfpRIAFFASRDIKAGEELGFDYgeKFwRIKGKYFTCRCGSPKCK 229
SET_SUV39H_Clr4-like cd20073
SET domain (including pre-SET and post-SET domains) found in of Schizosaccharomyces pombe H3K9 ...
1500-1640 8.22e-15

SET domain (including pre-SET and post-SET domains) found in of Schizosaccharomyces pombe H3K9 methyltransferase Clr4, and similar proteins; This subfamily contains fission yeast Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (also known as Suv39h), the sole homolog of the mammalian SUV39H1 and SUV39H2 enzymes, that has a critical role in preventing aberrant heterochromatin formation. It is known to di- and tri-methylate Lys-9 of histone H3, a central heterochromatic histone modification, with its specificity profile most similar to that of the human SUV39H2 homolog.


Pssm-ID: 380999 [Multi-domain]  Cd Length: 259  Bit Score: 76.46  E-value: 8.22e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1500 RKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIgsSYLFRIDM-------ETIIDATKC 1572
Cdd:cd20073    91 RKLPLEIFKTKHKGWGLRCPRFIKAGTFIGVYLGEVITQSEAEIRGKKYDNVGV--TYLFDLDLfedqvdeYYTVDAQYC 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1573 GNLARFINHSCNPNCYAKVITIESEKK----IVIYSKQPIGINEEITYDY------------------KFPLEDEKIPCL 1630
Cdd:cd20073   169 GDVTRFINHSCDPNLAIYSVLRDKSDSkiydLAFFAIKDIPALEELTFDYsgrnnfdqlgfignrsnsKYINLKNKRPCY 248
                         170
                  ....*....|
gi 281366755 1631 CGAQGCRGTL 1640
Cdd:cd20073   249 CGSANCRGWL 258
SET_EHMT1 cd10535
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
1495-1637 2.25e-14

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 1 (EHMT1) and similar proteins; EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, or lysine N-methyltransferase 1D (KMT1D)) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380933 [Multi-domain]  Cd Length: 231  Bit Score: 74.20  E-value: 2.25e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1495 NQLKFRkkqLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETkyeaigigSSYLFRIDMET----IIDAT 1570
Cdd:cd10535    87 NGLRAR---LQLYRTRDMGWGVRSLQDIPPGTFVCEYVGELISDSEADVREE--------DSYLFDLDNKDgevyCIDAR 155
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 281366755 1571 KCGNLARFINHSCNPNCYAKVITIESEK----KIVIYSKQPIGINEEITYDYKFPLEDEK---IPCLCGAQGCR 1637
Cdd:cd10535   156 FYGNVSRFINHHCEPNLVPVRVFMAHQDlrfpRIAFFSTRLIEAGEQLGFDYGERFWDIKgklFSCRCGSPKCR 229
SET_SUV39H1 cd10525
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
1514-1640 2.44e-14

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 1 (SUV39H1) and similar proteins; SUV39H1 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A (KMT1A), position-effect variegation 3-9 homolog (SUV39H), or Su(var)3-9 homolog 1) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380923 [Multi-domain]  Cd Length: 255  Bit Score: 74.93  E-value: 2.44e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGigSSYLFRID-METI--IDATKCGNLARFINHSCNPNCYAK 1590
Cdd:cd10525    99 WGVRTLEKIRKNSFVMEYVGEIITSEEAERRGQIYDRQG--ATYLFDLDyVEDVytVDAAYYGNISHFVNHSCDPNLQVY 176
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755 1591 VITI----ESEKKIVIYSKQPIGINEEITYDYKF------------------------PLEDEKIPCLCGAQGCRGTL 1640
Cdd:cd10525   177 NVFIdnldERLPRIALFATRTIRAGEELTFDYNMqvdpvdaestkmdsnfglaglpgsPKKRVRIECKCGVRSCRKYL 254
SET_EHMT2 cd10533
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
1501-1637 2.68e-14

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 2 (EHMT2) and similar proteins; EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C (KMT1C), or protein G9a) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380931 [Multi-domain]  Cd Length: 239  Bit Score: 74.28  E-value: 2.68e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1501 KKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETkyeaigigSSYLFRIDMET----IIDATKCGNLA 1576
Cdd:cd10533    90 KVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVRED--------DSYLFDLDNKDgevyCIDARYYGNIS 161
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755 1577 RFINHSCNPNCY-AKVITIESE---KKIVIYSKQPIGINEEITYDYK---FPLEDEKIPCLCGAQGCR 1637
Cdd:cd10533   162 RFINHLCDPNIIpVRVFMLHQDlrfPRIAFFSSRDIRTGEELGFDYGdrfWDIKSKYFTCQCGSEKCK 229
SET cd08161
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, ...
1503-1618 3.17e-14

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, Enhancer-of-zeste, Trithorax (SET) domain superfamily corresponds to SET domain-containing lysine methyltransferases, which catalyze site and state-specific methylation of lysine residues in histones that are fundamental in epigenetic regulation of gene activation and silencing in eukaryotic organisms. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains has been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as N-SET and C-SET. C-SET forms an unusual and conserved knot-like structure of probable functional importance. In addition to N-SET and C-SET, an insert region (I-SET) and flanking regions of high structural variability form part of the overall structure. Some family members contain a pre-SET domain, which is found in a number of histone methyltransferases (HMTase), and a post-SET domain, which harbors a zinc-binding site.


Pssm-ID: 380914 [Multi-domain]  Cd Length: 72  Bit Score: 68.82  E-value: 3.17e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1503 QLKFAKSAIHDWGLFAMEPIAADEMVIeyvgqmirpvvadlretkyeaigigssylfridmetiidatkcgnLARFINHS 1582
Cdd:cd08161     1 EIRPSTIPGAGFGLFATRDIPKGEVIG---------------------------------------------LARFINHS 35
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 281366755 1583 CNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDY 1618
Cdd:cd08161    36 CEPNCEFEEVYVGGKPRVFIVALRDIKAGEELTVDY 71
SET_SETDB1 cd10517
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
1514-1640 3.88e-14

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes.


Pssm-ID: 380915 [Multi-domain]  Cd Length: 288  Bit Score: 75.02  E-value: 3.88e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYeaigiGSSYLFRID-MET-----------------IIDATKCGNL 1575
Cdd:cd10517   141 WGIRCLDDIPKGSFVCIYAGQILTEDEANEEGLQY-----GDEYFAELDyIEVveklkegyesdveehcyIIDAKSEGNL 215
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 281366755 1576 ARFINHSCNPNCYAKVITIESEK----KIVIYSKQPIGINEEITYDYKFPL---EDEKIPCLCGAQGCRGTL 1640
Cdd:cd10517   216 GRYLNHSCSPNLFVQNVFVDTHDlrfpWVAFFASRYIRAGTELTWDYNYEVgsvPGKVLYCYCGSSNCRGRL 287
SET_SpSET3-like cd19183
SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET ...
1513-1622 4.14e-13

SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET domain-containing protein 3 (SETD3) and similar proteins; Schizosaccharomyces pombe SETD3 functions as a transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. It is required for both, gene activation and repression.


Pssm-ID: 380960  Cd Length: 173  Bit Score: 68.97  E-value: 4.14e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1513 DWGLFAMEPIAADEMVIEYVGQMIR--PVVADlRETKYEAIGIGSSYLFRIDMETI-IDATKCGNLARFINHSCNPNCya 1589
Cdd:cd19183    13 RFGLFADRPIPAGDPIQELLGEIGLqsEYIAD-PENQYQILGAPKPHVFFHPQSPLyIDTRRSGSVARFIRRSCRPNA-- 89
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 281366755 1590 KVITIESEK----KIVIYSKQPIGINEEITYDYKFPL 1622
Cdd:cd19183    90 ELVTVASDSgsvlKFVLYASRDISPGEEITIGWDWDN 126
SET_SETDB2 cd10523
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) ...
1567-1640 4.35e-12

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) and similar proteins; SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380921 [Multi-domain]  Cd Length: 266  Bit Score: 68.32  E-value: 4.35e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1567 IDATKCGNLARFINHSCNPNCYAKVITIESEKK----IVIYSKQPIGINEEITYDYKF---PLEDEKIPCLCGAQGCRGT 1639
Cdd:cd10523   186 LDASKEGNVGRFLNHSCCPNLFVQNVFVDTHDKnfpwVAFFTNRVVKAGTELTWDYSYdagTSPEQEIPCLCGVNKCQKK 265

                  .
gi 281366755 1640 L 1640
Cdd:cd10523   266 I 266
SET_AtSUVH-like cd10545
SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar ...
1501-1618 1.68e-11

SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar proteins; Arabidopsis thaliana SUVH protein (also termed suppressor of variegation 3-9 homolog protein) is a histone-lysine N-methyltransferase that methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression. Some family members contain a post-SET domain which binds a Zn2+ ion. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380943 [Multi-domain]  Cd Length: 232  Bit Score: 65.89  E-value: 1.68e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1501 KKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRetkyeaiGIGSSYLFRID------------------ 1562
Cdd:cd10545    85 RYRLEVFKTAERGWGVRSWDSIPAGSFICEYVGELLDTSEADTR-------SGNDDYLFDIDnrqtnrgwdggqrldvgm 157
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 281366755 1563 -------------METIIDATKCGNLARFINHSCNPNCYAKVITIESEK----KIVIYSKQPIGINEEITYDY 1618
Cdd:cd10545   158 sdgerssaedeesSEFTIDAGSFGNVARFINHSCSPNLFVQCVLYDHNDlrlpRVMLFAADNIPPLQELTYDY 230
SET_SETDB cd10541
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), ...
1503-1640 3.60e-11

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), SET domain bifurcated 2 (SETDB2), and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380939 [Multi-domain]  Cd Length: 236  Bit Score: 64.87  E-value: 3.60e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1503 QLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADlretkYEAIGIGSSYLFRIDMET----IIDATKCGNLARF 1578
Cdd:cd10541    93 RLQLFKTQNKGWGIRCLDDIAKGTFVCIYAGKILTDDFAD-----KEGLEMGDEYFANLDHIEescyIIDAKLEGNLGRY 167
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 281366755 1579 INHSCNPNCYAKVITIESEKK----IVIYSKQPIGINEEITYDYKF---PLEDEKIPCLCGAQGCRGTL 1640
Cdd:cd10541   168 LNHSCSPNLFVQNVFVDTHDLrfpwVAFFASKRIKAGTELTWDYNYevgSVEGKELLCCCGSNECRGRL 236
SET_SUV39H_DIM5-like cd19473
SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; ...
1513-1640 5.75e-11

SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; This subfamily contains Neurospora crassa DIM-5 (also termed H3-K9-HMTase dim-5, or HKMT) which functions as histone-lysine N-methyltransferase that specifically trimethylates histone H3 to form H3K9me3.


Pssm-ID: 380996 [Multi-domain]  Cd Length: 274  Bit Score: 65.03  E-value: 5.75e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1513 DWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMETIIDATKcGNLA---------------R 1577
Cdd:cd19473   117 GWGVRSTVDIKRGQFVDCYVGEIITPEEAQRRRDAATIAQRKDVYLFALDKFSDPDSLD-PRLRgdpyeidgefmsgptR 195
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1578 FINHSCNPN--CYAKViTIESEKKI---VIYSKQPIGINEEITYDY--------KFPLEDEKI----PCLCGAQGCRGTL 1640
Cdd:cd19473   196 FINHSCDPNlrIFARV-GDHADKHIhdlAFFAIKDIPRGTELTFDYvdgvtgldDDAGDEEKEkemtKCLCGSPKCRGYL 274
SET_SMYD cd20071
SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing ...
1514-1618 2.30e-08

SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing protein, and similar proteins; The family includes SET and MYND domain-containing proteins, SMYD1-SYMD5. SMYD1 (EC 2.1.1.43; also termed BOP) is a heart and muscle specific SET-MYND domain containing protein, which functions as a histone methyltransferase and regulates downstream gene transcription. It methylates histone H3 at 'Lys-4' (H3K4me), seems able to perform both mono-, di-, and trimethylation. SMYD2 (also termed HSKM-B, or lysine N-methyltransferase 3C (KMT3C)) functions as a histone methyltransferase that methylates both histones and non-histone proteins, including p53/TP53 and RB1. It specifically methylates histone H3 'Lys-4' (H3K4me) and dimethylates histone H3 'Lys-36' (H3K36me2). SMYD3 (also termed zinc finger MYND domain-containing protein 1) functions as a histone methyltransferase that specifically methylates 'Lys-4' of histone H3, inducing di- and tri-methylation, but not monomethylation. It also methylates 'Lys-5' of histone H4. SMYD3 plays an important role in transcriptional activation as a member of an RNA polymerase complex. SMYD4 functions as a potential tumor suppressor that plays a critical role in breast carcinogenesis at least partly through inhibiting the expression of PDGFR-alpha. SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions.


Pssm-ID: 380997 [Multi-domain]  Cd Length: 122  Bit Score: 53.92  E-value: 2.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1514 WGLFAMEPIAADEMVIEyvgqmIRPVVADL-----RETKYEAIGIGssyLFRidmetiidatkcgnLARFINHSCNPNCy 1588
Cdd:cd20071    11 RGLVATRDIEPGELILV-----EKPLVSVPsnsfsLTDGLNEIGVG---LFP--------------LASLLNHSCDPNA- 67
                          90       100       110
                  ....*....|....*....|....*....|
gi 281366755 1589 akVITIESEKKIVIYSKQPIGINEEITYDY 1618
Cdd:cd20071    68 --VVVFDGNGTLRVRALRDIKAGEELTISY 95
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
255-386 6.81e-08

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 57.21  E-value: 6.81e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   255 RDKERERDRhfkERSR-HSSERSYDRDRGMREnvgtsirrrrtfyrrrssdiSPEDSRDILIMTRERSRDSDSRPRDYCR 333
Cdd:TIGR01642    1 RDEEPDRER---EKSRgRDRDRSSERPRRRSR--------------------DRSRFRDRHRRSRERSYREDSRPRDRRR 57
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 281366755   334 SRERESFRDRKRSHEKGRDQPReKREHYYNSSKDREYRGRDRDRSAEIDQRDR 386
Cdd:TIGR01642   58 YDSRSPRSLRYSSVRRSRDRPR-RRSRSVRSIEQHRRRLRDRSPSNQWRKDDK 109
RRM smart00360
RNA recognition motif;
107-174 1.77e-07

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 49.90  E-value: 1.77e-07
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281366755    107 NLNDNIDKQFLASMLDKCGTSDEINIYHHPITNKHLGIARIVFDSTKGARQFVEKYNQKSVMGKILDV 174
Cdd:smart00360    6 NLPPDTTEEELRELFSKFGKVESVRLVRDKETGKSKGFAFVEFESEEDAEKALEALNGKELDGRPLKV 73
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
255-386 3.31e-07

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 54.93  E-value: 3.31e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   255 RDKERERDRhfkersrhSSERSYDRDRgmrenvgtsirrrrtfyrrrSSDISPEDSRDilimtRERSRDSdSRPRDYCRS 334
Cdd:TIGR01622    3 RDRERERLR--------DSSSAGDRDR--------------------RRDKGRERSRD-----RSRDRER-SRSRRRDRH 48
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 281366755   335 RERESFRDRKRSH----EKGRDQPREKREHYYNSSkdREYRGRDRDRSAEIDQRDR 386
Cdd:TIGR01622   49 RDRDYYRGRERRSrsrrPNRRYRPREKRRRRGDSY--RRRRDDRRSRREKPRARDG 102
SET_SETD5 cd19181
SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and ...
1484-1636 3.50e-07

SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. SETD5 loss-of-function mutations are a likely cause of a familial syndromic intellectual disability with variable phenotypic expression.


Pssm-ID: 380958  Cd Length: 150  Bit Score: 51.55  E-value: 3.50e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1484 GSMGESELLKFNQLKFRKKQLKFAKSaihdwglfamepIAADEMVIEYVGQMIrpvvadLREtKYEAIG--IGSSYLFRI 1561
Cdd:cd19181     1 GSQMQLQLGRVTRVQKHRKILRAARD------------LALDTLIIEYRGKVM------LRQ-QFEVNGhfFKRPYPFVL 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1562 ------DMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEIT--YDYKFPLEDEKIPCLC-- 1631
Cdd:cd19181    62 fyskfnGVEMCVDARTFGNDARFIRRSCTPNAEVRHMIADGMIHLCIYAVAAIAKDAEVTiaFDYEYSNCNYKVDCAChk 141

                  ....*
gi 281366755 1632 GAQGC 1636
Cdd:cd19181   142 GNRNC 146
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
247-386 5.55e-07

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 54.15  E-value: 5.55e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   247 RSRDRNWNRDKERERDRHfKERSRhssERSYDRDRGmrenvgtsirrrrtfyrrrssdispedsrdilimtRERSRDSdS 326
Cdd:TIGR01622    9 RLRDSSSAGDRDRRRDKG-RERSR---DRSRDRERS-----------------------------------RSRRRDR-H 48
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 281366755   327 RPRDYCRSRERESFRDRKRSHEKGRDQPREKREHYYNSSKDREYR---GRDRDRSAEI---DQRDR 386
Cdd:TIGR01622   49 RDRDYYRGRERRSRSRRPNRRYRPREKRRRRGDSYRRRRDDRRSRrekPRARDGTPEPlteDERDR 114
RRM_SF cd00590
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
103-174 1.60e-06

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


Pssm-ID: 409669 [Multi-domain]  Cd Length: 72  Bit Score: 47.28  E-value: 1.60e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 281366755  103 VTIVNLNDNIDKQFLASMLDKCGTSDEINIYHHPiTNKHLGIARIVFDSTKGARQFVEKYNQKSVMGKILDV 174
Cdd:cd00590     1 LFVGNLPPDTTEEDLRELFSKFGEVVSVRIVRDR-DGKSKGFAFVEFESPEDAEKALEALNGTELGGRPLKV 71
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
233-360 3.33e-06

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 51.82  E-value: 3.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   233 KDIYTTQTNHEIPNRSRDRNWNRDKERERDR-HFKERSRHSSERSY---DRDRGMRENVGTSIRRRRTFYRRRSSDISPE 308
Cdd:TIGR01642    1 RDEEPDREREKSRGRDRDRSSERPRRRSRDRsRFRDRHRRSRERSYredSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRR 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 281366755   309 DSRDilimtreRSRDSDSRPRDYCRSRERESFRDRKRSheKGRDQPREKREH 360
Cdd:TIGR01642   81 RSRS-------VRSIEQHRRRLRDRSPSNQWRKDDKKR--SLWDIKPPGYEL 123
SET_Suv4-20-like cd10524
SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of ...
1576-1631 3.58e-06

SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of variegation 4-20 (Suv4-20) and similar proteins; Suv4-20 (also termed Su(var)4-20) is a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-20' of histone H4. It acts as a dominant suppressor of position-effect variegation. The family also includes Suv4-20 homologs, lysine N-methyltransferase 5B (KMT5B) and lysine N-methyltransferase 5C (KMT5C). Both KMT5B (also termed lysine-specific methyltransferase 5B, or suppressor of variegation 4-20 homolog 1, or Su(var)4-20 homolog 1, or Suv4-20h1) and KMT5C (also termed lysine-specific methyltransferase 5C, or suppressor of variegation 4-20 homolog 2, or Su(var)4-20 homolog 2, or Suv4-20h2) are histone methyltransferases that specifically trimethylate 'Lys-20' of histone H4 (H4K20me3). They play central roles in the establishment of constitutive heterochromatin in pericentric heterochromatin regions.


Pssm-ID: 380922 [Multi-domain]  Cd Length: 141  Bit Score: 48.05  E-value: 3.58e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 281366755 1576 ARFINHSCNPNCyaKVITIeSEKKIVIYSKQPIGINEEITYDYK---FPLEDEKipCLC 1631
Cdd:cd10524    77 AAFINHDCRPNC--KFVPT-GKSTACVKVLRDIEPGEEITVYYGdnyFGENNEE--CEC 130
SET_SpSet7-like cd10540
SET domain found in Schizossacharomyces pombe Set7 and similar proteins; Schizosaccharomyces ...
1515-1618 1.08e-05

SET domain found in Schizossacharomyces pombe Set7 and similar proteins; Schizosaccharomyces pombe Set7 is a novel histone-lysine N-methyltransferase. The family also includes a viral histone H3 lysine 27 methyltransferase from Paramecium bursaria Chlorella virus 1 (PBCV-1).


Pssm-ID: 380938  Cd Length: 112  Bit Score: 46.09  E-value: 1.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1515 GLFAMEPIAADEmVIEYVGQMIRPvvadLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLArfiNHSCNPNCYaKVITI 1594
Cdd:cd10540    13 GVFATRPIKKGE-VIEEAPVIVLP----KEEYQHLCKTVLDHYVFSWGDGCLALALGYGSMF---NHSYTPNAE-YEIDF 83
                          90       100
                  ....*....|....*....|....
gi 281366755 1595 ESeKKIVIYSKQPIGINEEITYDY 1618
Cdd:cd10540    84 EN-QTIVFYALRDIEAGEELTINY 106
SET_KMT2E cd19182
SET domain found in inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar ...
1516-1620 2.06e-05

SET domain found in inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar proteins; KMT2E (also termed inactive lysine N-methyltransferase 2E, myeloid/lymphoid or mixed-lineage leukemia protein 5 (MLL5)) plays a key role in hematopoiesis, spermatogenesis and cell cycle progression. It associates with chromatin regions downstream of transcriptional start sites of active genes and thus regulates gene transcription. Lack of key residues in the SET domain as well as the presence of an unusually large loop in the SET-I subdomain preclude the interaction of MLL5 SET with its cofactor and substrate thus making MLL5 devoid of any in vitro methyltransferase activity on full-length histones and histone H3 peptide.


Pssm-ID: 380959  Cd Length: 129  Bit Score: 45.65  E-value: 2.06e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755 1516 LFAMEPIAADEMVIEYVGQMIrpvvadLREtKYEAIG--IGSSYLFRI------DMETIIDATKCGNLARFINHSCNPNC 1587
Cdd:cd19182    21 LKAAKDLPPDTLIIEYRGKFM------LRE-QFEANGyfFKRPYPFVLfyskfhGLEMCVDARTFGNEARFIRRSCTPNA 93
                          90       100       110
                  ....*....|....*....|....*....|...
gi 281366755 1588 YAKVITIESEKKIVIYSKQPIGINEEITYDYKF 1620
Cdd:cd19182    94 EVRHVIEDGTIHLYIYSIRSIPKGTEITIAFDF 126
PRP38_assoc pfam12871
Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region ...
304-386 2.08e-05

Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region found at the C-terminus of plant and metazoan pre-mRNA-splicing factor 38 proteins. The function is not known.


Pssm-ID: 463734 [Multi-domain]  Cd Length: 98  Bit Score: 44.77  E-value: 2.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   304 DISPEDSRDILIMTRERSRDSDSRPRDYCRSRERESFRDRKRSHEKGRDQPREKrehyynsSKDREYRGRDRDRSAEIDQ 383
Cdd:pfam12871   23 EASDESERASLSRKRRSRSRRRSSTRDRSRSRSRSRSRDRRSRGTRDRRRDRDR-------DRYRSLRSRSRDRSRDRDR 95

                   ...
gi 281366755   384 RDR 386
Cdd:pfam12871   96 DRR 98
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
318-458 2.09e-05

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 49.12  E-value: 2.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   318 RERSRDSDsRPRDycRSRERESFRDRKRSHEKGRDQPREKRehyynsSKDREYRG--RDRDRSAEiDQRDRGSLKYCSRy 395
Cdd:TIGR01642    3 EEPDRERE-KSRG--RDRDRSSERPRRRSRDRSRFRDRHRR------SRERSYREdsRPRDRRRY-DSRSPRSLRYSSV- 71
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 281366755   396 slheyietdvRRSSNTISSyYSASSLPIASHGFNSCSFPSIENIKTWSDRRA-WTAFQPDFHPV 458
Cdd:TIGR01642   72 ----------RRSRDRPRR-RSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSlWDIKPPGYELV 124
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
245-384 4.13e-05

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 47.99  E-value: 4.13e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   245 PNRSRDRNWNRDKERERDR-HFKERSRHsSERSYDRDRgmrenvgtsirrrrtfyrrrssdispedsrdiliMTRERSRD 323
Cdd:TIGR01622   21 RRRDKGRERSRDRSRDRERsRSRRRDRH-RDRDYYRGR----------------------------------ERRSRSRR 65
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 281366755   324 SDSRPrdycrsRERESFRDRKRSHEKGRDQPREKREhyynssKDREYRGRDRDRS-AEIDQR 384
Cdd:TIGR01622   66 PNRRY------RPREKRRRRGDSYRRRRDDRRSRRE------KPRARDGTPEPLTeDERDRR 115
PRP38_assoc pfam12871
Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region ...
303-378 4.26e-04

Pre-mRNA-splicing factor 38-associated hydrophilic C-term; This domain is a hydrophilic region found at the C-terminus of plant and metazoan pre-mRNA-splicing factor 38 proteins. The function is not known.


Pssm-ID: 463734 [Multi-domain]  Cd Length: 98  Bit Score: 41.30  E-value: 4.26e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 281366755   303 SDISPEDSRDILIMTRERSRDSdSRPRDYCRSRERESFRDRKRSHEKGRDQPREKREHYYNSSKDREyRGRDRDRS 378
Cdd:pfam12871   25 SDESERASLSRKRRSRSRRRSS-TRDRSRSRSRSRSRDRRSRGTRDRRRDRDRDRYRSLRSRSRDRS-RDRDRDRR 98
PRK12678 PRK12678
transcription termination factor Rho; Provisional
245-387 6.19e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 44.51  E-value: 6.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755  245 PNRSRDRNWNRDKERERDRHFKERSRHSSERSYDRDRGMRENVGTSirRRRTFYRRRSSDISPEDSRDilimTRERSRDS 324
Cdd:PRK12678  145 AGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGER--GRREERGRDGDDRDRRDRRE----QGDRREER 218
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 281366755  325 DSRPRDYCRSRERESFRDRKRSHEKGRDQPREKREHYYNSSKDREYRGRDRDRsaeidQRDRG 387
Cdd:PRK12678  219 GRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDR-----RGRRG 276
RSRP pfam17069
Arginine/Serine-Rich protein 1; RSRP1 is an eukaryotic protein family. Its function is unknown.
266-391 7.40e-04

Arginine/Serine-Rich protein 1; RSRP1 is an eukaryotic protein family. Its function is unknown.


Pssm-ID: 293674 [Multi-domain]  Cd Length: 299  Bit Score: 43.61  E-value: 7.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   266 KERSRHSSERSYDRDRGMRENVGTSIRRRRTFYRRRSSDISpedsrdilimTRERSRDSDSRPRDYCRSRERESFRDRKR 345
Cdd:pfam17069   14 QEKKSPSTSSSGSSSRLSSRSRSRSSSRSSRSHSRSSSRFS----------SRSRSRPRRSRSRSRSRRRHQRKYRRYSR 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 281366755   346 SHEKGRDqpREKREHYYNSSK----DREYRGRDRDRSAEiDQRDRGSLKY 391
Cdd:pfam17069   84 SYSRSRS--RSRRRRYYRRSRyrysRRYYRSPSRSRSRS-RSRSRGRSYY 130
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
320-407 5.69e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 41.06  E-value: 5.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281366755   320 RSRDsdsRPRDYCRSRERESFRDRKRSHEKGRDQPREkREHYYNSSKDREyRGRDRD--RSAEIDQRDRGSLKYCSRYSL 397
Cdd:TIGR01622    1 RYRD---RERERLRDSSSAGDRDRRRDKGRERSRDRS-RDRERSRSRRRD-RHRDRDyyRGRERRSRSRRPNRRYRPREK 75
                           90
                   ....*....|
gi 281366755   398 HEYIETDVRR 407
Cdd:TIGR01622   76 RRRRGDSYRR 85
PostSET smart00508
Cysteine-rich motif following a subset of SET domains;
1625-1641 6.08e-03

Cysteine-rich motif following a subset of SET domains;


Pssm-ID: 214703  Cd Length: 17  Bit Score: 35.46  E-value: 6.08e-03
                            10
                    ....*....|....*..
gi 281366755   1625 EKIPCLCGAQGCRGTLN 1641
Cdd:smart00508    1 KKQPCLCGAPNCRGFLG 17
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH