Conserved domains on  [gi|795311138|ref|XP_011925457|]
PREDICTED: LOW QUALITY PROTEIN: zinc finger protein 354A-like [Cercocebus atys]

Protein Classification

KRAB domain-containing zinc finger protein (domain architecture ID 12204556)

KRAB (Kruppel-associated box) domain-containing zinc finger proteins (KRAB-ZFPs) play important roles in cell differentiation and organ development, and in regulating viral replication and transcription.

List of domain hits

Name Accession Description Interval E-value
KRAB smart00349
krueppel associated box;
360-418 5.56e-31

krueppel associated box;


Pssm-ID: 214630 [Multi-domain]  Cd Length: 61  Bit Score: 115.77  E-value: 5.56e-31
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 795311138   360 LTFEDVAVLFTRDEWRKLAPSQRNLYRDVMLENYRNLVALGLPFTKPKVISLLQQGEDP 418
Cdd:smart00349   1 VTFEDVAVYFTQEEWEQLDPAQKNLYRDVMLENYSNLVSLGFQVPKPDLISQLEQGEEP 59
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
586-946 2.36e-14

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 76.66  E-value: 2.36e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 586 KLFKCKECSKAFSQSSALIQHQITHTGEKPYIC--KECGKAFTLSTSLYKHLRTHTVEKSYRCK---------------- 647
Cdd:COG5048   32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCsySGCDKSFSRPLELSRHLRTHHNNPSDLNSkslplsnskassssls 111
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 648 ECGKSFSRRSGLFIH---------------QKIHAEENPCK-YNPGRKASSCSTSLS---------------GCQRIHSR 696
Cdd:COG5048  112 SSSSNSNDNNLLSSHslppssrdpqlpdllSISNLRNNPLPgNNSSSVNTPQSNSLHpplpanslskdpssnLSLLISSN 191
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 697 K----KSYLCNECGNTFKSSSSLRYHQRIHTGEKPFKCSECGRAFSQSASLIQHERIHTGEKPYRCNECGKGFTSISRLN 772
Cdd:COG5048  192 VstsiPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQ 271
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 773 RHRIIHTGE-------KFYNCNECGKALSSHSTLIIHER--IHTGE--KPCKCKV--CGKAFRQSSALIQHQRMHTGERP 839
Cdd:COG5048  272 SSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKRHILLHTSISP 351
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 840 YKC--NECGKTFR-----CNSSLSNHQRIHTGEKPYRCE--ECGISFGQSSALIQHRRIHTGEKP--FKCNTCGKTFRQS 908
Cdd:COG5048  352 AKEklLNSSSKFSpllnnEPPQSLQQYKDLKNDKKSETLsnSCIRNFKRDSNLSLHIITHLSFRPynCKNPPCSKSFNRH 431
                        410       420       430
                 ....*....|....*....|....*....|....*...
gi 795311138 909 SSRIAHQRIHTGEKPYECNTCGklFNHRSSLTNHYKIH 946
Cdd:COG5048  432 YNLIPHKKIHTNHAPLLCSILK--SFRRDLDLSNHGKD 467
PHA03247 super family cl33720
large tegument protein UL36; Provisional
110-315 2.01e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.88  E-value: 2.01e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  110 PESWRPPVDSISQRPPR-LPRPS----WSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPA-------VPEPPACGVRPGR 177
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRpAPRPSepavTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSplppdthAPDPPPPSPSPAA 2635
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  178 RGPGTPElreVSGGGGPGREEAAPGAGTACSRRGSESLCR---------RWR---APPRLSPASA---PGPGLSHPRGAL 242
Cdd:PHA03247 2636 NEPDPHP---PPTVPPPERPRDDPAPGRVSRPRRARRLGRaaqassppqRPRrraARPTVGSLTSladPPPPPPTPEPAP 2712
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 795311138  243 PRRCRCTRVRL--AAARRGRGAWRFLPGPPCPLRVGLVESGHRDPEGRADAAGPRFPSPSPGRPPLPSRLGTPSA 315
Cdd:PHA03247 2713 HALVSATPLPPgpAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
SFP1 super family cl25788
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
501-639 2.12e-04

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


The actual alignment was detected with superfamily member COG5189:

Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 44.71  E-value: 2.12e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 501 IERSHKNTELSQNFSPKSvlirqQILPRDKTPPKCEIQGNSFKQNSHLLNQP----KITADKRYKCSM--CEKTFINTSS 574
Cdd:COG5189  292 IHKSVGNKEIRGGISTGE-----MIDVRKLPCTNSSSNGKLAHGGERNIDTPsrmlKVKDGKPYKCPVegCNKKYKNQNG 366
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 795311138 575 LRKHEKNhsgeklfkcKECSKAFSQSSALIQHQITHTGEKPYICKECGKAFTLSTSLYKHlRTHT 639
Cdd:COG5189  367 LKYHMLH---------GHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYH-RKHS 421
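
Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal Python sketch for pulling those fields out of text copied from this page. The `HitSummary` type and `parse_summary` function are hypothetical names introduced here for illustration; they are not part of any NCBI tool, and the regular expression assumes the line layout seen in this listing.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical helper type)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the lines in this listing:
# "Pssm-ID: <int> [Multi-domain]  Cd Length: <int>  Bit Score: <float>  E-value: <float>"
# The "[Multi-domain]" tag is optional.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed fields, or None if the line is not a summary line."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["score"]),
        e_value=float(m["evalue"]),
    )


# Summary lines copied from the four hits listed above.
lines = [
    "Pssm-ID: 214630 [Multi-domain]  Cd Length: 61  Bit Score: 115.77  E-value: 5.56e-31",
    "Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 76.66  E-value: 2.36e-14",
    "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.88  E-value: 2.01e-09",
    "Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 44.71  E-value: 2.12e-04",
]
hits = [parse_summary(line) for line in lines]

# List the hits from most to least significant (smallest E-value first).
for hit in sorted(hits, key=lambda h: h.e_value):
    print(hit.pssm_id, hit.cd_length, hit.bit_score, hit.e_value)
```

Sorting by E-value reproduces the order in which the hits appear in the list, which is a quick sanity check that the fields were read correctly.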
 
Name Accession Description Interval E-value
KRAB smart00349
krueppel associated box;
360-418 5.56e-31

krueppel associated box;


Pssm-ID: 214630 [Multi-domain]  Cd Length: 61  Bit Score: 115.77  E-value: 5.56e-31
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 795311138   360 LTFEDVAVLFTRDEWRKLAPSQRNLYRDVMLENYRNLVALGLPFTKPKVISLLQQGEDP 418
Cdd:smart00349   1 VTFEDVAVYFTQEEWEQLDPAQKNLYRDVMLENYSNLVSLGFQVPKPDLISQLEQGEEP 59
KRAB pfam01352
KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc ...
359-400 1.96e-21

KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.


Pssm-ID: 460171  Cd Length: 42  Bit Score: 87.91  E-value: 1.96e-21
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 795311138  359 SLTFEDVAVLFTRDEWRKLAPSQRNLYRDVMLENYRNLVALG 400
Cdd:pfam01352   1 SVTFEDVAVDFTQEEWALLDPAQRNLYRDVMLENYRNLVSLG 42
KRAB_A-box cd07765
KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression ...
360-399 4.32e-19

KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression; the B- and C-boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through recruiting corepressors. A key mechanism appears to be the following: KRAB-ZFPs tethered to DNA recruit, via their KRAB domain, the repressor KAP1 (KRAB-associated protein-1, also known as transcription intermediary factor 1 beta, KRAB-A interacting protein, and tripartite motif protein 28). The KAP1/KRAB-ZFP complex in turn recruits the heterochromatin protein 1 (HP1) family, and other chromatin modulating proteins, leading to transcriptional repression through heterochromatin formation.


Pssm-ID: 143639  Cd Length: 40  Bit Score: 81.06  E-value: 4.32e-19
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|
gi 795311138 360 LTFEDVAVLFTRDEWRKLAPSQRNLYRDVMLENYRNLVAL 399
Cdd:cd07765    1 VTFEDVAVYFSQEEWELLDPAQRDLYRDVMLENYENLVSL 40
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
586-946 2.36e-14

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 76.66  E-value: 2.36e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 586 KLFKCKECSKAFSQSSALIQHQITHTGEKPYIC--KECGKAFTLSTSLYKHLRTHTVEKSYRCK---------------- 647
Cdd:COG5048   32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCsySGCDKSFSRPLELSRHLRTHHNNPSDLNSkslplsnskassssls 111
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 648 ECGKSFSRRSGLFIH---------------QKIHAEENPCK-YNPGRKASSCSTSLS---------------GCQRIHSR 696
Cdd:COG5048  112 SSSSNSNDNNLLSSHslppssrdpqlpdllSISNLRNNPLPgNNSSSVNTPQSNSLHpplpanslskdpssnLSLLISSN 191
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 697 K----KSYLCNECGNTFKSSSSLRYHQRIHTGEKPFKCSECGRAFSQSASLIQHERIHTGEKPYRCNECGKGFTSISRLN 772
Cdd:COG5048  192 VstsiPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQ 271
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 773 RHRIIHTGE-------KFYNCNECGKALSSHSTLIIHER--IHTGE--KPCKCKV--CGKAFRQSSALIQHQRMHTGERP 839
Cdd:COG5048  272 SSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKRHILLHTSISP 351
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 840 YKC--NECGKTFR-----CNSSLSNHQRIHTGEKPYRCE--ECGISFGQSSALIQHRRIHTGEKP--FKCNTCGKTFRQS 908
Cdd:COG5048  352 AKEklLNSSSKFSpllnnEPPQSLQQYKDLKNDKKSETLsnSCIRNFKRDSNLSLHIITHLSFRPynCKNPPCSKSFNRH 431
                        410       420       430
                 ....*....|....*....|....*....|....*...
gi 795311138 909 SSRIAHQRIHTGEKPYECNTCGklFNHRSSLTNHYKIH 946
Cdd:COG5048  432 YNLIPHKKIHTNHAPLLCSILK--SFRRDLDLSNHGKD 467
PHA03247 PHA03247
large tegument protein UL36; Provisional
110-315 2.01e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.88  E-value: 2.01e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  110 PESWRPPVDSISQRPPR-LPRPS----WSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPA-------VPEPPACGVRPGR 177
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRpAPRPSepavTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSplppdthAPDPPPPSPSPAA 2635
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  178 RGPGTPElreVSGGGGPGREEAAPGAGTACSRRGSESLCR---------RWR---APPRLSPASA---PGPGLSHPRGAL 242
Cdd:PHA03247 2636 NEPDPHP---PPTVPPPERPRDDPAPGRVSRPRRARRLGRaaqassppqRPRrraARPTVGSLTSladPPPPPPTPEPAP 2712
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 795311138  243 PRRCRCTRVRL--AAARRGRGAWRFLPGPPCPLRVGLVESGHRDPEGRADAAGPRFPSPSPGRPPLPSRLGTPSA 315
Cdd:PHA03247 2713 HALVSATPLPPgpAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
114-315 3.03e-06

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 51.22  E-value: 3.03e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 114 RPPvdsiSQRPPRLPRPSWSPARRGVSGAGPEVAQTPPG--RSLVPRLQSGPAVPEPPACGVRPGRRGP-------GTPE 184
Cdd:COG5180  225 RPE----AASSPKVDPPSTSEARSRPATVDAQPEMRPPAdaKERRRAAIGDTPAAEPPGLPVLEAGSEPqsdapeaETAR 300
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 185 LREVSG------GGGPGReeAAPGAGTACSRRGSESLCRRWRAPPRLSPASAPGPGLSHPRGALPRRCrctrvrlAAARR 258
Cdd:COG5180  301 PIDVKGvasappATRPVR--PPGGARDPGTPRPGQPTERPAGVPEAASDAGQPPSAYPPAEEAVPGKP-------LEQGA 371
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 795311138 259 GRGAWRFLPGPPCPLRVGLV--ESGHRDPEGRADAAGPRFPSPSPGRPPLPSRLGTPSA 315
Cdd:COG5180  372 PRPGSSGGDGAPFQPPNGAPqpGLGRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAG 430
zf-H2C2_2 pfam13465
Zinc-finger double domain;
714-739 6.76e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.51  E-value: 6.76e-06
                          10        20
                  ....*....|....*....|....*.
gi 795311138  714 SLRYHQRIHTGEKPFKCSECGRAFSQ 739
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
810-862 1.37e-05

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 44.08  E-value: 1.37e-05
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....
gi 795311138 810 KPcKCKVCGKAFRQSSALIQHQRMHTgerpYKCNECGKTFRCNSSLSNH-QRIH 862
Cdd:cd20908    1 KP-WCYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHcLQVH 49
fvmX7 pfam20005
FtsH ternary system domain X7; Uncharacterized domain found in the FtsH ternary system, a ...
85-315 9.10e-05

FtsH ternary system domain X7; Uncharacterized domain found in the FtsH ternary system, a class of NTP-dependent biological conflict systems.


Pssm-ID: 466254  Cd Length: 422  Bit Score: 46.01  E-value: 9.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   85 RGPLVACGRLTAQPGRRRKRMELHFPESWR--PPVDSISQRPPRlPRPSwsparrgvsgAGPEVAQTPPG---RSLVPRL 159
Cdd:pfam20005  50 RRVGIALVRLGVRFAGAPRPGGPHTLPCWAelLIADPVARLPPA-PPPL----------GHREVLVLTPDallSALVRRL 118
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  160 ------QSGPAVPEPPACGVRPGRRGPGtPELREVSGGGGPGREEAAPGAG--TACSRRGSESLCRRWRAP-PRLSPASa 230
Cdd:pfam20005 119 lrlgaeVAGLRLRTRPLAGGAPDRGTAG-GTLLTGSGQVPAALLVAVGGPPytVVLRAIDPDSPVRAFLERaPRVWTDV- 196
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  231 pgpGLSHPrgaLPRRCRCTRVRLAAARRGRGAWRFLP-GPPCPLRVGLVESGhrdPEGRADAAGPRFPSPSPGRPPLPSR 309
Cdd:pfam20005 197 ---GLQHP---LADRLTVPDGTLLLLDPPDACWRLEPlGTFTDIAAALLFAG---PAPPRDTPPAATPAPAPARIPVPVR 267

                  ....*..
gi 795311138  310 L-GTPSA 315
Cdd:pfam20005 268 LvRAPRA 274
SFP1 COG5189
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
501-639 2.12e-04

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 44.71  E-value: 2.12e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 501 IERSHKNTELSQNFSPKSvlirqQILPRDKTPPKCEIQGNSFKQNSHLLNQP----KITADKRYKCSM--CEKTFINTSS 574
Cdd:COG5189  292 IHKSVGNKEIRGGISTGE-----MIDVRKLPCTNSSSNGKLAHGGERNIDTPsrmlKVKDGKPYKCPVegCNKKYKNQNG 366
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 795311138 575 LRKHEKNhsgeklfkcKECSKAFSQSSALIQHQITHTGEKPYICKECGKAFTLSTSLYKHlRTHT 639
Cdd:COG5189  367 LKYHMLH---------GHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYH-RKHS 421
KLF14_N cd21576
N-terminal domain of Kruppel-like factor 14; Kruppel-like factor 14 (KLF14; also known as ...
122-315 1.18e-03

N-terminal domain of Kruppel-like factor 14; Kruppel-like factor 14 (KLF14; also known as Krueppel-like factor 14 or basic transcription element-binding protein 5/BTEB5) is a protein that in humans is encoded by the KLF14 gene. KLF14 regulates the transcription of various genes, including TGFbetaRII (the type II receptor for TGFbeta). KLF14 is expressed in many tissues, lacks introns, and is subject to parent-specific expression. It also appears to be a master regulator of gene expression in adipose tissue. KLF14 is associated with coronary artery disease, hypercholesterolemia, and type 2 diabetes. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. KLF14 belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF14.


Pssm-ID: 409238 [Multi-domain]  Cd Length: 195  Bit Score: 41.34  E-value: 1.18e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 122 QRPPRlPRPSWSPARRGVSGAGPEVAQTPPGrslVPRLQSGPAVPEPPACGVRPGRRGP---GTPELREVSGGGGPGREE 198
Cdd:cd21576   26 RRAPD-PEGAGGAAGSEVGAAPPESALPGPG---PPGPAWVPPLLQVPAPSPGAGGAAPhllAASVLADLRGGAGEGSRE 101
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 199 ---AAPGAGTACSrrgseslcrrwRAPPRLSPASAPGPGLSHPRGALPrrcrctrvrlaAARRGRGAWRFLPGPPCPlrv 275
Cdd:cd21576  102 dsgEAPRASSGSS-----------DPARGSSPTLGSEPAPASGEDAVS-----------GPESSFGAPAIPSAPAAP--- 156
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|.
gi 795311138 276 GLVESGHRDPEGrADAAGPR-FPSPSPGRPPLpsrlgTPSA 315
Cdd:cd21576  157 GAPAVSGEVPGG-APGAGPApAAGPAPRRRPV-----TPAA 191
PHA00733 PHA00733
hypothetical protein
867-914 6.88e-03

hypothetical protein


Pssm-ID: 177301  Cd Length: 128  Bit Score: 37.93  E-value: 6.88e-03
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*...
gi 795311138 867 PYRCEECGISFGQSSALIQHRRIHTGEKpfKCNTCGKTFRQSSSRIAH 914
Cdd:PHA00733  73 PYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDH 118
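
The pairwise alignments in this list can be summarised by the number of identical residues over the aligned columns. The sketch below is one way to compute that in Python for a query/model row pair. The function name `alignment_identity` is hypothetical, and the column rule is an assumption made for this example: a column counts as aligned only when neither row has a gap (`-`) and both residues are upper case, with lower-case letters treated as unaligned residues.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two aligned rows.

    Assumption for this sketch: columns with a gap ('-') or a lower-case
    residue in either row are treated as unaligned and skipped.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned residue: not a comparable column
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the KRAB_A-box (cd07765) alignment above, residues 360-399.
query_row = "LTFEDVAVLFTRDEWRKLAPSQRNLYRDVMLENYRNLVAL"
model_row = "VTFEDVAVYFSQEEWELLDPAQRDLYRDVMLENYENLVSL"

identical, aligned = alignment_identity(query_row, model_row)
print(f"{identical}/{aligned} identical ({identical / aligned:.0%})")
```

Identity computed this way is only a rough descriptor; the bit score and E-value reported with each hit come from the position-specific scoring model and are not derived from it.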
 
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
642-970 3.26e-11

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 66.64  E-value: 3.26e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 642 KSYRCKECGKSFSRRSGLFIHQKIHAEENP--CKYNPGRKASSCSTSLSGCQRIHSRKKSYLCNECGNTFKSSSSlrYHQ 719
Cdd:COG5048   32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPsqCSYSGCDKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKAS--SSS 109
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 720 RIHTGEKPFK---CSECGRAFSQSASLIQHERIHTGEKPYRCNECGKGFTSISRLNRHRIIHTGekfyncNECGKALSSH 796
Cdd:COG5048  110 LSSSSSNSNDnnlLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPA------NSLSKDPSSN 183
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 797 STLIIHERIHTGEKPCKCKVCGKAFRQSSALIQHQRMHTGERPYKCNECGKTFRCNSSLSNHQRIHTGEKPYRCEECGIS 876
Cdd:COG5048  184 LSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRS 263
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 877 FGQSSALIQHRRIHTGE-------KPFKCNTCGKTFRQSSSRIAHQR--IHTGE--KPYEC--NTCGKLFNHRSSLTNHY 943
Cdd:COG5048  264 SLPTASSQSSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCpySLCGKLFSRNDALKRHI 343
                        330       340
                 ....*....|....*....|....*..
gi 795311138 944 KIHIEEDS*KVDLCVCESLKPKLIKEC 970
Cdd:COG5048  344 LLHTSISPAKEKLLNSSSKFSPLLNNE 370
PHA03247 PHA03247
large tegument protein UL36; Provisional
87-307 3.04e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 3.04e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   87 PLVACGRLT-AQPGRRRKRMELHFPESWRPPVDSISQRPPRLPRPSWSPARRGVSG-------AGPEVAQTPPGRSLVPR 158
Cdd:PHA03247 2742 PAVPAGPATpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESlpspwdpADPPAAVLAPAAALPPA 2821
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  159 LQSGPAVPEPPACGVRPGRRGPGTPELREVSGGGgpgreeAAPGAGTacsrrgseslcrRWRAPPRlSPASAPgpglshp 238
Cdd:PHA03247 2822 ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS------VAPGGDV------------RRRPPSR-SPAAKP------- 2875
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 795311138  239 rgALPRRCRCTRVRLAAARRGRGAWRFLPGPPCPLRVGLVESGHRDPEGRADAAGPRFPSPSPGRPPLP 307
Cdd:PHA03247 2876 --AAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
PHA03247 PHA03247
large tegument protein UL36; Provisional
110-315 5.92e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.34  E-value: 5.92e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  110 PESWRPP-----VDSISQ--RPPRLPRPSwSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPAVPEPPACGV------RPG 176
Cdd:PHA03247 2680 PQRPRRRaarptVGSLTSlaDPPPPPPTP-EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAtpggpaRPA 2758
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  177 RR----GPGTPELREVSGGGGPGREEAAPGAGTACSRrgsESLCRRWRAPPRLSPASAPGPGL---SHPRGALPRRcrcT 249
Cdd:PHA03247 2759 RPpttaGPPAPAPPAAPAAGPPRRLTRPAVASLSESR---ESLPSPWDPADPPAAVLAPAAALppaASPAGPLPPP---T 2832
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  250 RVRLAAARRGRGAwrflPGPPCPLRVGLVESG--HRDPEGRADAAGP---------RFPSPSPGR--------PPLPSRL 310
Cdd:PHA03247 2833 SAQPTAPPPPPGP----PPPSLPLGGSVAPGGdvRRRPPSRSPAAKPaaparppvrRLARPAVSRstesfalpPDQPERP 2908

                  ....*
gi 795311138  311 GTPSA 315
Cdd:PHA03247 2909 PQPQA 2913
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
113-309 2.17e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 58.07  E-value: 2.17e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 113 WRPPVDSISQRPPRLPRPSWSPArrGVSGAGPEVAQTPPGRSLVPRLQSGPAVPEPPACGVRPGRRGPGTPELREVSGGG 192
Cdd:PRK07764 598 EGPPAPASSGPPEEAARPAAPAA--PAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGG 675
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 193 GPGreEAAPGAGTACSRRGSESlcrrwRAPPRLSPASAPGPglsHPRGALPRRCRCTRVRLAAARRGRGAWRFLPGPPCP 272
Cdd:PRK07764 676 AAP--AAPPPAPAPAAPAAPAG-----AAPAQPAPAPAATP---PAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEP 745
                        170       180       190
                 ....*....|....*....|....*....|....*..
gi 795311138 273 LRvglvESGHRDPEGRADAAGPRFPSPSPGRPPLPSR 309
Cdd:PRK07764 746 DD----PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
96-347 2.58e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 58.26  E-value: 2.58e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   96 AQPGRRRKRMELHFPESWRPPVDSISQRPPRLPRPSWSPARrgVSGAGPEVAQTP--PGRSLVPRLQSGPAVPEPPACGv 173
Cdd:PHA03307  134 LSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAAL--PLSSPEETARAPssPPAEPPPSTPPAAASPRPPRRS- 210
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  174 RPGRRGPGTPelrevsgGGGPGREEAAPGAGTACSRRGSESLCRRWrAPPRLSPASAPGPGLSHPR-----GALPRRCRC 248
Cdd:PHA03307  211 SPISASASSP-------APAPGRSAADDAGASSSDSSSSESSGCGW-GPENECPLPRPAPITLPTRiweasGWNGPSSRP 282
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  249 TRVRLAAARRGRGAwRFLPG----PPCPLRVGLVESGHRDPEGRADAAGPRFPSPSP---GRPPLPSRLGTPSAflFKAV 321
Cdd:PHA03307  283 GPASSSSSPRERSP-SPSPSspgsGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGaavSPGPSPSRSPSPSR--PPPP 359
                         250       260
                  ....*....|....*....|....*.
gi 795311138  322 FWPGSVAHACNPSTlGGQGGRITRGS 347
Cdd:PHA03307  360 ADPSSPRKRPRPSR-APSSPAASAGR 384
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
81-326 9.03e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 56.33  E-value: 9.03e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   81 GPAHRGPLVACGRLTAQP--------GRRRKRMELHFPESWRPPVDSISQRPPRLPRPSWSPAR--RGVSGAGPEVAQTP 150
Cdd:PHA03307  205 RPPRRSSPISASASSPAPapgrsaadDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRiwEASGWNGPSSRPGP 284
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  151 PGRSLVPRLQSGPAVPEPPACGVRPGRR------GPGTPELREVSGGGGPGREEAAPGAGTACSRRGSESLCRRWRAPPR 224
Cdd:PHA03307  285 ASSSSSPRERSPSPSPSSPGSGPAPSSPrassssSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSS 364
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  225 LSPASAPGPGLSHPRGALPRRCRcTRVRLAAARRGRGAwrflpgppcplrvglvESGHRDPEGRAdAAGPRFPSPSPGRP 304
Cdd:PHA03307  365 PRKRPRPSRAPSSPAASAGRPTR-RRARAAVAGRARRR----------------DATGRFPAGRP-RPSPLDAGAASGAF 426
                         250       260
                  ....*....|....*....|..
gi 795311138  305 PLPSRLGTPSAFLfkavfWPGS 326
Cdd:PHA03307  427 YARYPLLTPSGEP-----WPGS 443
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
174-313 9.74e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 52.96  E-value: 9.74e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 174 RPGRRGpgtpelrevsGGGGPGREEAAPGAGTACSRRGSESLCRRWRAPPRLSPASAPGPGLSHPRGALPRRCRCTRVRL 253
Cdd:PRK12323 364 RPGQSG----------GGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAL 433
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 795311138 254 AAAR----RGRGAWRFLPGPPCPLRVGLVESGHRDPEGRADAAGPRFPSPSPGRPPLPSRLGTP 313
Cdd:PRK12323 434 AAARqasaRGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP 497
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
115-315 1.35e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.48  E-value: 1.35e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  115 PPVDSISQRPPRLPRPSWSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPAVPEPPA---CGVRPGRRGPGTPELREVSGG 191
Cdd:PHA03307  126 PPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEEtarAPSSPPAEPPPSTPPAAASPR 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  192 GGPGREEAAPGAGTACSRRGSESLCRRWRAPPRLSPASAPGPGlSHPRGALPRRcRCTRVRLAAARRGRGAWRfLPGPPC 271
Cdd:PHA03307  206 PPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCG-WGPENECPLP-RPAPITLPTRIWEASGWN-GPSSRP 282
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 795311138  272 PlrvglvesghrdPEGRADAAGPRFPSPSPGRPPLPSRLGTPSA 315
Cdd:PHA03307  283 G------------PASSSSSPRERSPSPSPSSPGSGPAPSSPRA 314
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
114-315 3.03e-06

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 51.22  E-value: 3.03e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 114 RPPvdsiSQRPPRLPRPSWSPARRGVSGAGPEVAQTPPG--RSLVPRLQSGPAVPEPPACGVRPGRRGP-------GTPE 184
Cdd:COG5180  225 RPE----AASSPKVDPPSTSEARSRPATVDAQPEMRPPAdaKERRRAAIGDTPAAEPPGLPVLEAGSEPqsdapeaETAR 300
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 185 LREVSG------GGGPGReeAAPGAGTACSRRGSESLCRRWRAPPRLSPASAPGPGLSHPRGALPRRCrctrvrlAAARR 258
Cdd:COG5180  301 PIDVKGvasappATRPVR--PPGGARDPGTPRPGQPTERPAGVPEAASDAGQPPSAYPPAEEAVPGKP-------LEQGA 371
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 795311138 259 GRGAWRFLPGPPCPLRVGLV--ESGHRDPEGRADAAGPRFPSPSPGRPPLPSRLGTPSA 315
Cdd:COG5180  372 PRPGSSGGDGAPFQPPNGAPqpGLGRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAG 430
PHA03247 PHA03247
large tegument protein UL36; Provisional
81-315 3.78e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 3.78e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   81 GPAHRGPLVACGRLTAQPGRRRKRMELHFPEswRPPVDSISQRPPRlpRPSWSPARRGVSG-AGPEvaqtPPGRSLVPR- 158
Cdd:PHA03247 2641 HPPPTVPPPERPRDDPAPGRVSRPRRARRLG--RAAQASSPPQRPR--RRAARPTVGSLTSlADPP----PPPPTPEPAp 2712
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  159 LQSGPAVPEPPAcgvrPGRRGPGTPELREVSGGGGPGREEAAPGAGTACSRRGSESlcrrwrAPPRLSPASAPgpglshP 238
Cdd:PHA03247 2713 HALVSATPLPPG----PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA------GPPAPAPPAAP------A 2776
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 795311138  239 RGALPRRCRCTRVRLAAARRGrgawrfLPGPPCPlrvglveSGHRDPEGRADAAGPRFPSPSPGRPPLPSRLGTPSA 315
Cdd:PHA03247 2777 AGPPRRLTRPAVASLSESRES------LPSPWDP-------ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
PHA03378 PHA03378
EBNA-3B; Provisional
81-315 4.54e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 50.84  E-value: 4.54e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  81 GPAHRGPLVACGRLTAQPGRRRKRM-ELHFPESWRPPVDSISQRPPRL----------PRPSWSP-----------ARRG 138
Cdd:PHA03378 591 SYAQTPWPVPHPSQTPEPPTTQSHIpETSAPRQWPMPLRPIPMRPLRMqpitfnvlvfPTPHQPPqveitpykptwTQIG 670
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 139 VSGAGPevaqTPPGRSLVPRLQSGPAVPEPPACGvrPGRRGPgtPElrevsggGGPGREEAAPGA-GTACSRRGSESLCR 217
Cdd:PHA03378 671 HIPYQP----SPTGANTMLPIQWAPGTMQPPPRA--PTPMRP--PA-------APPGRAQRPAAAtGRARPPAAAPGRAR 735
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 218 RWRAPPRLSPASAPGPGLSHPRGALPrrcrcTRVRLAAARRGRGAWRFLP-GPPCPLRvglvesghrDPEGradaaGPRf 296
Cdd:PHA03378 736 PPAAAPGRARPPAAAPGRARPPAAAP-----GRARPPAAAPGAPTPQPPPqAPPAPQQ---------RPRG-----APT- 795
                        250
                 ....*....|....*....
gi 795311138 297 PSPSPGRPPLPSRLGTPSA 315
Cdd:PHA03378 796 PQPPPQAGPTSMQLMPRAA 814
zf-H2C2_2 pfam13465
Zinc-finger double domain;
714-739 6.76e-06

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 43.51  E-value: 6.76e-06
                          10        20
                  ....*....|....*....|....*.
gi 795311138  714 SLRYHQRIHTGEKPFKCSECGRAFSQ 739
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
154-315 7.02e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.17  E-value: 7.02e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  154 SLVP-RLQSGPAVPEPPACGVRPGRRGPGTPElREVSGGGGPGREEAAPGAGTACSRRGSESLCRRWRAPPRLSPASAPG 232
Cdd:PHA03307  762 SLVPaKLAEALALLEPAEPQRGAGSSPPVRAE-AAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPG 840
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  233 PGlSHPRGALPRRCRCTRVRLAAARRGRGAWRFLPGPPCPLRVGLVESGHRDPEGRADAAGPRFPSPSPGRPPL-PSRLG 311
Cdd:PHA03307  841 AA-ARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLgPMPPG 919

                  ....
gi 795311138  312 TPSA 315
Cdd:PHA03307  920 GPDP 923
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
810-862 1.37e-05

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. In the flowering plant Arabidopsis, SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 44.08  E-value: 1.37e-05
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....
gi 795311138 810 KPcKCKVCGKAFRQSSALIQHQRMHTgerpYKCNECGKTFRCNSSLSNH-QRIH 862
Cdd:cd20908    1 KP-WCYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHcLQVH 49
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
66-328 1.46e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 49.10  E-value: 1.46e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  66 MYLLRFGYLLEKWNLGPAHRGPLVACGRLTAQPGRRRKRMELHFPESWRPPVDSISQRPPRLPRPSWSPARRGVSGAGPE 145
Cdd:PRK12323 355 MTLLRMLAFRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALA 434
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 146 VAQTPPGRSLVPRLQSGPAVPEPPACGVRPGRRGPGTPelrEVSGGGGPGREEAAPGAGTACSrrgseslcrrwRAPP-R 224
Cdd:PRK12323 435 AARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPV---AAAAAAAPARAAPAAAPAPADD-----------DPPPwE 500
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 225 LSPASAPGPGLSHPRGALPRRCRCTRVRLAAArrgrgawrflpgPPCPLRVGLVESGHRDPEGRADAAGPRFPSPspgRP 304
Cdd:PRK12323 501 ELPPEFASPAPAQPDAAPAGWVAESIPDPATA------------DPDDAFETLAPAPAAAPAPRAAAATEPVVAP---RP 565
                        250       260
                 ....*....|....*....|....
gi 795311138 305 PLPSRLGTPSAFlfkAVFWPGSVA 328
Cdd:PRK12323 566 PRASASGLPDMF---DGDWPALAA 586
zf-H2C2_2 pfam13465
Zinc-finger double domain;
826-851 2.21e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 41.97  E-value: 2.21e-05
                          10        20
                  ....*....|....*....|....*.
gi 795311138  826 ALIQHQRMHTGERPYKCNECGKTFRC 851
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
124-317 2.38e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 48.31  E-value: 2.38e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 124 PPRLPR--PSWSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPAVPEPPACGVRPGRRgPGTPELREVSGGGGPGreeAAP 201
Cdd:PRK07003 373 PARVAGavPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA-PAPPATADRGDDAADG---DAP 448
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 202 GAGTACSRRGSESLCRRWRAPPRLSPASAPGPGLSHPRGAlprrcrctrvRLAAARRGRGAWRFLPGPPCPLRVGLVESG 281
Cdd:PRK07003 449 VPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDA----------AFEPAPRAAAPSAATPAAVPDARAPAAASR 518
                        170       180       190
                 ....*....|....*....|....*....|....*.
gi 795311138 282 HRDPEGRADAAgPRFPSPSPGRPPLPSRLGTPSAFL 317
Cdd:PRK07003 519 EDAPAAAAPPA-PEARPPTPAAAAPAARAGGAAAAL 553
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
110-315 2.44e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.63  E-value: 2.44e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  110 PESWRPPVDSISQRPPRLPRPSWSPARRGvSGAGPEVAQTPPGRSLVPRlqSGPAVPEPPACGVRPGRRGPGTPElrevS 189
Cdd:PHA03307   82 NESRSTPTWSLSTLAPASPAREGSPTPPG-PSSPDPPPPTPPPASPPPS--PAPDLSEMLRPVGSPGPPPAASPP----A 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  190 GGGGPGREEAAPGagtacSRRGSESLCRRWRAPPRLSPASAPGPGLSHPRGALPRRCRcTRVRLAAARRGRGAWRFLPGP 269
Cdd:PHA03307  155 AGASPAAVASDAA-----SSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPP-RRSSPISASASSPAPAPGRSA 228
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
gi 795311138  270 PCPLRVGLVESGHRDPEGRADAAGPRFPSPSPGRPPLPSRLGTPSA 315
Cdd:PHA03307  229 ADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASG 274
dnaA PRK14086
chromosomal replication initiator protein DnaA;
110-239 2.68e-05

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 47.90  E-value: 2.68e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 110 PESWRPPVDsisqrpprLPRPSWSPARRGVSGAGPEvaQTPPGRSLVPRlqsgpavPEPPACGVRPGRRGPGTPELREVS 189
Cdd:PRK14086 190 QERDREPYD--------AGRPEYDQRRRDYDHPRPD--WDRPRRDRTDR-------PEPPPGAGHVHRGGPGPPERDDAP 252
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 795311138 190 ggggpgREEAAPGAGTacsrrgseslcrrwraPPRLSPASAPGPGLSHPR 239
Cdd:PRK14086 253 ------VVPIRPSAPG----------------PLAAQPAPAPGPGEPTAR 280
zf-H2C2_2 pfam13465
Zinc-finger double domain;
742-767 2.69e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 41.59  E-value: 2.69e-05
                          10        20
                  ....*....|....*....|....*.
gi 795311138  742 SLIQHERIHTGEKPYRCNECGKGFTS 767
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
85-271 2.82e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.24  E-value: 2.82e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   85 RGPLVACGRLTAQPGRRRKRMELHFPESWRPPVDSISQRPPRLPRPSwSPARRGVSGAGPEVAQTPPGRSLVPRlQSGPA 164
Cdd:PHA03307  759 SNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRS-GPAADAASRTASKRKSRSHTPDGGSE-SSGPA 836
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  165 VPEPPACGVRPGRRGPGTPELREVSGGGGPGREEAAPGAGTACSRRGSEslcrrwrAPPRLSPASAPGPGLSHPRGALPR 244
Cdd:PHA03307  837 RPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGA-------AAPPKAAAAAPPAGAPAPRPRPAP 909
                         170       180
                  ....*....|....*....|....*..
gi 795311138  245 RCRCTRvRLAAARRGRGAWRFLPGPPC 271
Cdd:PHA03307  910 RVKLGP-MPPGGPDPRGGFRRVPPGDL 935
zf-H2C2_2 pfam13465
Zinc-finger double domain;
630-655 2.85e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 41.59  E-value: 2.85e-05
                          10        20
                  ....*....|....*....|....*.
gi 795311138  630 SLYKHLRTHTVEKSYRCKECGKSFSR 655
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
110-354 4.21e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 47.54  E-value: 4.21e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 110 PESWRPPVDSISQRPPRLPRPSWSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPAVPEPPACGVRPGRRGPGTPELrevs 189
Cdd:PRK07003 412 PKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPD---- 487
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 190 ggggPGREEAAPGAGTACSRRGSESLCRRWRAPPRLSPASAPGPGLSHPRGALPRRCRctrvrlaAARRGRGAWRFLPgp 269
Cdd:PRK07003 488 ----AAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAA-------PAARAGGAAAALD-- 554
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 270 pcPLR-VGL-VESGH-RDPEGRADAAGPRFPSPSPGRPPLPSRLGTPSAFLFKAVFWPGSVAHACNPSTlggqggriTRG 346
Cdd:PRK07003 555 --VLRnAGMrVSSDRgARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAE--------SRG 624

                 ....*...
gi 795311138 347 SWVSWNDL 354
Cdd:PRK07003 625 APPPWEDI 632
zf-H2C2_2 pfam13465
Zinc-finger double domain;
882-907 4.39e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 41.20  E-value: 4.39e-05
                          10        20
                  ....*....|....*....|....*.
gi 795311138  882 ALIQHRRIHTGEKPFKCNTCGKTFRQ 907
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
101-257 4.47e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 4.47e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 101 RRKRMELHFPESWRPPVDSISQRPPRLPRPSWSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPAVP---EPPACGVRPGR 177
Cdd:PRK07764 377 RLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPspaGNAPAGGAPSP 456
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 178 RGPGTPELREVSGGGGPGREEAAPGAGTACSRRGSEslcrrwrAPPRLSPASAPGPGLSHP---------RGALPRRCRC 248
Cdd:PRK07764 457 PPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAA-------APAAPAAPAAPAGADDAAtlrerwpeiLAAVPKRSRK 529
                        170
                 ....*....|
gi 795311138 249 T-RVRLAAAR 257
Cdd:PRK07764 530 TwAILLPEAT 539
fvmX7 pfam20005
FtsH ternary system domain X7; Uncharacterized domain found in the FtsH ternary system, a ...
85-315 9.10e-05

FtsH ternary system domain X7; Uncharacterized domain found in the FtsH ternary system, a class of NTP-dependent biological conflict systems.


Pssm-ID: 466254  Cd Length: 422  Bit Score: 46.01  E-value: 9.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   85 RGPLVACGRLTAQPGRRRKRMELHFPESWR--PPVDSISQRPPRlPRPSwsparrgvsgAGPEVAQTPPG---RSLVPRL 159
Cdd:pfam20005  50 RRVGIALVRLGVRFAGAPRPGGPHTLPCWAelLIADPVARLPPA-PPPL----------GHREVLVLTPDallSALVRRL 118
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  160 ------QSGPAVPEPPACGVRPGRRGPGtPELREVSGGGGPGREEAAPGAG--TACSRRGSESLCRRWRAP-PRLSPASa 230
Cdd:pfam20005 119 lrlgaeVAGLRLRTRPLAGGAPDRGTAG-GTLLTGSGQVPAALLVAVGGPPytVVLRAIDPDSPVRAFLERaPRVWTDV- 196
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  231 pgpGLSHPrgaLPRRCRCTRVRLAAARRGRGAWRFLP-GPPCPLRVGLVESGhrdPEGRADAAGPRFPSPSPGRPPLPSR 309
Cdd:pfam20005 197 ---GLQHP---LADRLTVPDGTLLLLDPPDACWRLEPlGTFTDIAAALLFAG---PAPPRDTPPAATPAPAPARIPVPVR 267

                  ....*..
gi 795311138  310 L-GTPSA 315
Cdd:pfam20005 268 LvRAPRA 274
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
726-774 1.30e-04

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. In the flowering plant Arabidopsis, SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 41.39  E-value: 1.30e-04
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*....
gi 795311138 726 KPFkCSECGRAFSQSASLIQHERIHTgekpYRCNECGKGFTSISRLNRH 774
Cdd:cd20908    1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
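Each hit on this page carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of how such a line could be read into a record. The function name `parse_summary`, the field names, and the regular expression are illustrative assumptions based only on the lines shown here, not an NCBI-provided parser.

```python
import re

# Hypothetical pattern inferred from the summary lines on this page.
# The bracketed hit type (e.g. "[Multi-domain]") is treated as optional,
# because at least one hit on the page omits it.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "hit_type": m["hit_type"],  # None when no bracketed type is present
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 41.39  E-value: 1.30e-04"
    print(parse_summary(line))
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance, although the threshold to use is a choice left to the reader.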
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
85-348 1.34e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.75  E-value: 1.34e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  85 RGPLVACGRLTAQPGRRRKRMELHFPESWRPPVDSISQRPPRlPRPSWSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPA 164
Cdd:PRK07764 383 RRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPA-PAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAA 461
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 165 VPEPPACGVRPGRRGPGTPELREVSGGGGPGREEAAPGAGTACSRRGSESLCRRW-------------RAPPRLSPASAP 231
Cdd:PRK07764 462 PSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWpeilaavpkrsrkTWAILLPEATVL 541
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 232 GPG-----LSHPRGALPRRCRCTR----VRLAAARRGRGAWRFL--PGPPcplRVGLVESGHRDPEGRADAAGPRFP--S 298
Cdd:PRK07764 542 GVRgdtlvLGFSTGGLARRFASPGnaevLVTALAEELGGDWQVEavVGPA---PGAAGGEGPPAPASSGPPEEAARPaaP 618
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|
gi 795311138 299 PSPGRPPLPSRLGTPSAflfkavfwPGSVAHACNPSTLGGQGGRITRGSW 348
Cdd:PRK07764 619 AAPAAPAAPAPAGAAAA--------PAEASAAPAPGVAAPEHHPKHVAVP 660
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
97-247 1.34e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 45.82  E-value: 1.34e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  97 QPGRRRKRMELHFPESWRPPVDSISQRPPRLPRPSWSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPAVPEPPACG--VR 174
Cdd:COG5180  333 QPTERPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNGAPQPGLGRRGAPGPPMGAGdlVQ 412
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 795311138 175 PGRRGPGTPELREVSGGGGPGRE-EAAPGAGTACSRRGSESLCRRWRAPPRLSPA-----SAPGPGLSHPRGALPRRCR 247
Cdd:COG5180  413 AALDGGGRETASLGGAAGGAGQGpKADFVPGDAESVSGPAGLADQAGAAASTAMAdfvapVTDATPVDVADVLGVRPDA 491
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
644-666 1.91e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.21  E-value: 1.91e-04
                          10        20
                  ....*....|....*....|...
gi 795311138  644 YRCKECGKSFSRRSGLFIHQKIH 666
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
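The zinc finger pattern quoted in the zf-C2H2 description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be written as a regular expression. The sketch below is one possible translation, not the model CD-Search uses (hits on this page come from position-specific scoring matrices). It assumes that the positions marked # accept any residue, since the description does not list which residues they allow. The names `C2H2_RE` and `is_c2h2` are hypothetical.

```python
import re

# One regex fragment per element of the pattern
# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' (fold-stabilising position) is matched as any residue.
C2H2_RE = re.compile("".join([
    ".",       # '#'
    ".",       # X
    "C",       # first zinc-coordinating cysteine
    ".{1,5}",  # X(1-5)
    "C",       # second zinc-coordinating cysteine
    ".{3}",    # X3
    ".",       # '#'
    ".{5}",    # X5
    ".",       # '#'
    ".{2}",    # X2
    "H",       # first zinc-coordinating histidine
    ".{3,6}",  # X(3-6)
    "[HC]",    # final position: His or Cys
]))


def is_c2h2(segment):
    """True if the whole segment fits the simplified C2H2 pattern."""
    return C2H2_RE.fullmatch(segment.upper()) is not None


if __name__ == "__main__":
    # Query residues 644-666 and the pfam00096 consensus from the alignment above.
    print(is_c2h2("YRCKECGKSFSRRSGLFIHQKIH"))
    print(is_c2h2("YKCPDCGKSFSRKSNLKRHLRTH"))
```

Because the # positions are left unconstrained, this expression is more permissive than the pattern in the description and will accept some segments that do not form a stable finger.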
zf-H2C2_2 pfam13465
Zinc-finger double domain;
574-599 2.07e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 39.28  E-value: 2.07e-04
                          10        20
                  ....*....|....*....|....*.
gi 795311138  574 SLRKHEKNHSGEKLFKCKECSKAFSQ 599
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SFP1 COG5189
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
501-639 2.12e-04

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 44.71  E-value: 2.12e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 501 IERSHKNTELSQNFSPKSvlirqQILPRDKTPPKCEIQGNSFKQNSHLLNQP----KITADKRYKCSM--CEKTFINTSS 574
Cdd:COG5189  292 IHKSVGNKEIRGGISTGE-----MIDVRKLPCTNSSSNGKLAHGGERNIDTPsrmlKVKDGKPYKCPVegCNKKYKNQNG 366
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 795311138 575 LRKHEKNhsgeklfkcKECSKAFSQSSALIQHQITHTGEKPYICKECGKAFTLSTSLYKHlRTHT 639
Cdd:COG5189  367 LKYHMLH---------GHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYH-RKHS 421
PHA03247 PHA03247
large tegument protein UL36; Provisional
99-255 2.22e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 2.22e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   99 GRRRKRMELHFPE----SWRPP--VDSIS---QRPPRLPRPSW---------SPARRGVSGAGPEVAQTP-PGRSLVPRL 159
Cdd:PHA03247  341 PRPRQHYPLGFPKrrrpTWTPPssLEDLSagrHHPKRASLPTRkrrsarhaaTPFARGPGGDDQTRPAAPvPASVPTPAP 420
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  160 QSGPAvPEPPACGVRPGRRGPGTPELREVSGGGGPGRE--EAAPGAGTACSRRGSESLcrRWRAPPRlspasAPGPGLSH 237
Cdd:PHA03247  421 TPVPA-SAPPPPATPLPSAEPGSDDGPAPPPERQPPAPatEPAPDDPDDATRKALDAL--RERRPPE-----PPGADLAE 492
                         170
                  ....*....|....*...
gi 795311138  238 PRGALPrRCRCTRVRLAA 255
Cdd:PHA03247  493 LLGRHP-DTAGTVVRLAA 509
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
134-315 2.60e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 44.84  E-value: 2.60e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 134 PARRGVSGAGPEVAQTPPGRSLVPRLQSGPAVPEPPACGVRPGRRGPGTPeLREVSGGGGPGREEAAPGAGTACSRRGSE 213
Cdd:PRK07003 360 PAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAA-LAPKAAAAAAATRAEAPPAAPAPPATADR 438
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 214 SLCRRWRAPPRLSPASAPGPGLSHPRGALPRRCRCTRVRLAAARRGRGAWRFLPGPPCPLRVGLVESGHRDPEgrADAAG 293
Cdd:PRK07003 439 GDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDAR--APAAA 516
                        170       180
                 ....*....|....*....|...
gi 795311138 294 PRFPSPSPGRPPLP-SRLGTPSA 315
Cdd:PRK07003 517 SREDAPAAAAPPAPeARPPTPAA 539
zf-H2C2_2 pfam13465
Zinc-finger double domain;
854-877 2.75e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 2.75e-04
                          10        20
                  ....*....|....*....|....
gi 795311138  854 SLSNHQRIHTGEKPYRCEECGISF 877
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSF 24
zf-H2C2_2 pfam13465
Zinc-finger double domain;
914-935 4.76e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.12  E-value: 4.76e-04
                          10        20
                  ....*....|....*....|..
gi 795311138  914 HQRIHTGEKPYECNTCGKLFNH 935
Cdd:pfam13465   5 HMRTHTGEKPYKCPECGKSFKS 26
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
118-315 5.23e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.01  E-value: 5.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  118 DSISQRPPRLPRPSWSPARRGVSGAGPEVAQTPPGRSLVPRLQSGPAVPEPPACGVRPGrrGPGT----PELREVSGGGG 193
Cdd:PHA03307   15 AEGGEFFPRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPP--GPGTeapaNESRSTPTWSL 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  194 PGREEAAPGAGTACSRRGSESLCRRWRAPPRLSPASAPGPGLSHPRgalprrcrctRVRLAAARRGRGAWRFLPGPPCPL 273
Cdd:PHA03307   93 STLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML----------RPVGSPGPPPAASPPAAGASPAAV 162
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|...
gi 795311138  274 RVGLVESGH-RDPEGRADAAGPRFPSPSPGRPPLPSRLGTPSA 315
Cdd:PHA03307  163 ASDAASSRQaALPLSSPEETARAPSSPPAEPPPSTPPAAASPR 205
PHA03247 PHA03247
large tegument protein UL36; Provisional
98-238 6.94e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 6.94e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   98 PGRRRKrmelhfPESWRPPVDSISQRPP--RLPRPSWSPARRGVSGAGPEV-------AQTPPGRSLVPRLQSGPAVPEP 168
Cdd:PHA03247 2861 DVRRRP------PSRSPAAKPAAPARPPvrRLARPAVSRSTESFALPPDQPerppqpqAPPPPQPQPQPPPPPQPQPPPP 2934
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 795311138  169 PacgvrPGRRGPGTPELREVSGGGGPGREEAAPGAGTACSRRGSESlcrRWR----APPRLSPASAPGPGLSHP 238
Cdd:PHA03247 2935 P-----PPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP---RFRvpqpAPSREAPASSTPPLTGHS 3000
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
812-834 8.80e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 37.28  E-value: 8.80e-04
                          10        20
                  ....*....|....*....|...
gi 795311138  812 CKCKVCGKAFRQSSALIQHQRMH 834
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
SFP1 COG5189
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
808-887 8.90e-04

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 42.78  E-value: 8.90e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 808 GEKPCKCKV--CGKAFRQSSALIQHqRMHtgerpykcNECGKTFRCNSSLSNHQRIHTGEKPYRCEECGISFGQSSALIQ 885
Cdd:COG5189  346 DGKPYKCPVegCNKKYKNQNGLKYH-MLH--------GHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY 416

                 ..
gi 795311138 886 HR 887
Cdd:COG5189  417 HR 418
SFP1 COG5189
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
698-775 9.13e-04

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 42.78  E-value: 9.13e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 698 KSYLCN--ECGNTFKSSSSLRYHqRIHtgekpfkcSECGRAFSQSASLIQHERIHTGEKPYRCNECGKGFTSISRLNRHR 775
Cdd:COG5189  348 KPYKCPveGCNKKYKNQNGLKYH-MLH--------GHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418
PHA03377 PHA03377
EBNA-3C; Provisional
79-323 1.06e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 43.12  E-value: 1.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138   79 NLGPAHRGPLVACGRLTAQPGRRRKrmelhFPESWRPPVDSI----SQRPPRLPRPSWSPARRGVSGAGPEVAQ----TP 150
Cdd:PHA03377  396 NMEPVQQRPVMFVSRVPWRKPRTLP-----WPTPKTHPVKRTlvktSGRSDEAEQAQSTPERPGPSDQPSVPVEpahlTP 470
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  151 PGRSLVPrLQSGPAVPE-------PPACGVRPGRRGPGTPELREV---------SGGGGPGREEAAPGAGTACSRRGses 214
Cdd:PHA03377  471 VEHTTVI-LHQPPQSPPtvaikpaPPPSRRRRGACVVYDDDIIEVidvetteeeESVTQPAKPHRKVQDGFQRSGRR--- 546
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  215 lcRRWRAPPRLSPASAPGPGLSHPRGALPRRCRCTRVRLAAARRGRGAWRFLPGPPCPLRVGLVESGHRDPEGRadAAGP 294
Cdd:PHA03377  547 --QKRATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPP--SSAP 622
                         250       260
                  ....*....|....*....|....*....
gi 795311138  295 RFPSPSPGRPPLPSRLGTPSAFLFKAVFW 323
Cdd:PHA03377  623 RDMAPSVVRMFLRERLLEQSTGPKPKSFW 651
KLF14_N cd21576
N-terminal domain of Kruppel-like factor 14; Kruppel-like factor 14 (KLF14; also known as ...
122-315 1.18e-03

N-terminal domain of Kruppel-like factor 14; Kruppel-like factor 14 (KLF14; also known as Krueppel-like factor 14 or basic transcription element-binding protein 5/BTEB5) is a protein that in humans is encoded by the KLF14 gene. KLF14 regulates the transcription of various genes, including TGFbetaRII (the type II receptor for TGFbeta). KLF14 is expressed in many tissues, lacks introns, and is subject to parent-specific expression. It also appears to be a master regulator of gene expression in adipose tissue. KLF14 is associated with coronary artery disease, hypercholesterolemia, and type 2 diabetes. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. KLF14 belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF14.


Pssm-ID: 409238 [Multi-domain]  Cd Length: 195  Bit Score: 41.34  E-value: 1.18e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 122 QRPPRlPRPSWSPARRGVSGAGPEVAQTPPGrslVPRLQSGPAVPEPPACGVRPGRRGP---GTPELREVSGGGGPGREE 198
Cdd:cd21576   26 RRAPD-PEGAGGAAGSEVGAAPPESALPGPG---PPGPAWVPPLLQVPAPSPGAGGAAPhllAASVLADLRGGAGEGSRE 101
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 199 ---AAPGAGTACSrrgseslcrrwRAPPRLSPASAPGPGLSHPRGALPrrcrctrvrlaAARRGRGAWRFLPGPPCPlrv 275
Cdd:cd21576  102 dsgEAPRASSGSS-----------DPARGSSPTLGSEPAPASGEDAVS-----------GPESSFGAPAIPSAPAAP--- 156
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|.
gi 795311138 276 GLVESGHRDPEGrADAAGPR-FPSPSPGRPPLpsrlgTPSA 315
Cdd:cd21576  157 GAPAVSGEVPGG-APGAGPApAAGPAPRRRPV-----TPAA 191
zf-H2C2_2 pfam13465
Zinc-finger double domain;
602-626 1.59e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 36.58  E-value: 1.59e-03
                          10        20
                  ....*....|....*....|....*
gi 795311138  602 ALIQHQITHTGEKPYICKECGKAFT 626
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFK 25
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
896-918 2.03e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 2.03e-03
                          10        20
                  ....*....|....*....|...
gi 795311138  896 FKCNTCGKTFRQSSSRIAHQRIH 918
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
PHA03321 PHA03321
tegument protein VP11/12; Provisional
147-240 2.23e-03

tegument protein VP11/12; Provisional


Pssm-ID: 223041 [Multi-domain]  Cd Length: 694  Bit Score: 41.87  E-value: 2.23e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 147 AQTPPGRSLVPRLQSGPAVP-----EPPACGVRPGRRGPGTPELREVSGGGGPGR--EEAAPGAGTACSRRGSESLCRRW 219
Cdd:PHA03321 427 SRQPPGAPAPRRDNDPPPPPrarpgSTPACARRARAQRARDAGPEYVDPLGALRRlpAGAAPPPEPAAAPSPATYYTRMG 506
                         90       100
                 ....*....|....*....|.
gi 795311138 220 RAPPRLSPASAPGPGLSHPRG 240
Cdd:PHA03321 507 GGPPRLPPRNRATETLRPDWG 527
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
119-243 2.61e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 2.61e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 119 SISQRPPRLPRpswspaRRGVSGAGPEVAQT-PPGRSLVPRLQSGPAVPEPPAcgvRPGRRGPGTPElrevsGGGGPGRE 197
Cdd:PRK07764 373 GLLARLERLER------RLGVAGGAGAPAAAaPSAAAAAPAAAPAPAAAAPAA---AAAPAPAAAPQ-----PAPAPAPA 438
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*.
gi 795311138 198 EAAPGAGTACSRRGSESLcrrwRAPPRLSPASAPGPGLSHPRGALP 243
Cdd:PRK07764 439 PAPPSPAGNAPAGGAPSP----PPAAAPSAQPAPAPAAAPEPTAAP 480
zf-H2C2_2 pfam13465
Zinc-finger double domain;
802-823 3.02e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 35.81  E-value: 3.02e-03
                          10        20
                  ....*....|....*....|..
gi 795311138  802 HERIHTGEKPCKCKVCGKAFRQ 823
Cdd:pfam13465   5 HMRTHTGEKPYKCPECGKSFKS 26
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
124-245 3.32e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 41.24  E-value: 3.32e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 124 PPRLPRPSWSPARRGVSGAGPEVAQTPPGRSLVPrlQSGPAVPEPPACGVRPGRRGPGTPElrevsggggPGrEEAAPGA 203
Cdd:PRK14951 381 PARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAP--AAPPAAAPPAPVAAPAAAAPAAAPA---------AA-PAAVALA 448
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 795311138 204 GTACSRRGSESLCRRWRAPPRLSPASAPGPGLSHPRGALPRR 245
Cdd:PRK14951 449 PAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
728-750 3.55e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.74  E-value: 3.55e-03
                          10        20
                  ....*....|....*....|...
gi 795311138  728 FKCSECGRAFSQSASLIQHERIH 750
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
616-638 4.49e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.35  E-value: 4.49e-03
                          10        20
                  ....*....|....*....|...
gi 795311138  616 YICKECGKAFTLSTSLYKHLRTH 638
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
614-684 5.17e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 36.77  E-value: 5.17e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 795311138 614 KPYiCKECGKAFTLSTSLYKHLRTHTveksYRCKECGKSFSRRSGLFIH-QKIHAEENPCKYN--PGRKASSCS 684
Cdd:cd20908    1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHcLQVHKETLTKVPNalPGRDDPEIE 69
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
868-890 5.57e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.35  E-value: 5.57e-03
                          10        20
                  ....*....|....*....|...
gi 795311138  868 YRCEECGISFGQSSALIQHRRIH 890
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
840-862 6.08e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 34.97  E-value: 6.08e-03
                          10        20
                  ....*....|....*....|...
gi 795311138  840 YKCNECGKTFRCNSSLSNHQRIH 862
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
COG3903 COG3903
Predicted ATPase [General function prediction only];
80-286 6.10e-03

Predicted ATPase [General function prediction only];


Pssm-ID: 443109 [Multi-domain]  Cd Length: 933  Bit Score: 40.39  E-value: 6.10e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138  80 LGPAHRGPLVACGRLTAQPGRRRKRMELHFPESWRPPVDSISQR----PPRLPRPSWSPARRGVSGAGPEVAQTPPGRSL 155
Cdd:COG3903    6 AAAAAAAAAALALLALAAAAAAAAAAAALAAALEALRAALALLLlllaALALALAALALLLAAAALLLRLLLLLLAARLL 85
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 795311138 156 VPRLQSGPAVPEPPACGVRPGRRGPGTPELREVSGGGGPGREEAAPGAGTACSRRGSESLCRRWRAPPRLSPASAPGPGL 235
Cdd:COG3903   86 ARLAAAAAAALARAAAAALALLLRLRLAARRLLLARALAAAALAAAAAAAAAAAAAPAPPPPAPPPPAPLAALARRAAAL 165
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 795311138 236 SHPRGALPRRCRC-----------TRVRLAAARRGRGAWrflpgppcPLRVGLVE-SGHRDPE 286
Cdd:COG3903  166 AAAARALLSAARLvtltgpggvgkTRLALEVAHRLADRF--------PDGVWFVDlAGVTDPA 220
PHA00733 PHA00733
hypothetical protein
867-914 6.88e-03


Pssm-ID: 177301  Cd Length: 128  Bit Score: 37.93  E-value: 6.88e-03
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*...
gi 795311138 867 PYRCEECGISFGQSSALIQHRRIHTGEKpfKCNTCGKTFRQSSSRIAH 914
Cdd:PHA00733  73 PYVCPLCLMPFSSSVSLKQHIRYTEHSK--VCPVCGKEFRNTDSTLDH 118
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
924-946 8.01e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 34.58  E-value: 8.01e-03
                          10        20
                  ....*....|....*....|...
gi 795311138  924 YECNTCGKLFNHRSSLTNHYKIH 946
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-H2C2_2 pfam13465
Zinc-finger double domain;
770-792 8.38e-03


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 34.65  E-value: 8.38e-03
                          10        20
                  ....*....|....*....|...
gi 795311138  770 RLNRHRIIHTGEKFYNCNECGKA 792
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKS 23
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
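
The E-value threshold of 0.01 explains why every hit listed above reports an E-value below that cutoff. The sketch below parses the per-hit summary lines from this part of the page and checks them against the threshold. The regular expression, the function name parse_summary and the use of "<=" for the comparison are assumptions made for illustration; they do not describe how CD-Search applies its cutoff internally.

```python
import re

# Threshold taken from the "Preset Options" line on this page.
E_VALUE_THRESHOLD = 0.01

# Summary lines copied from the last five hits shown above.
SUMMARY_LINES = [
    "Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 34.97  E-value: 6.08e-03",
    "Pssm-ID: 443109 [Multi-domain]  Cd Length: 933  Bit Score: 40.39  E-value: 6.10e-03",
    "Pssm-ID: 177301  Cd Length: 128  Bit Score: 37.93  E-value: 6.88e-03",
    "Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 34.58  E-value: 8.01e-03",
    "Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 34.65  E-value: 8.38e-03",
]

# Hypothetical parser for the summary-line layout used on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Extract the numeric fields from one 'Pssm-ID: ...' summary line."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognised summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


hits = [parse_summary(line) for line in SUMMARY_LINES]

# Assumption: a hit is reported when its E-value does not exceed the threshold.
passing = [hit for hit in hits if hit["e_value"] <= E_VALUE_THRESHOLD]

if __name__ == "__main__":
    for hit in passing:
        print(hit)
```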
