Conserved domains on  [gi|568958319|ref|XP_006531621|]

zinc finger protein 469 isoform X1 [Mus musculus]

Protein Classification

C2H2-type zinc finger protein (domain architecture ID 10442881)

Cys2His2 (C2H2)-type zinc finger protein may be involved in transcriptional regulation


List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
3281-3701 1.06e-06


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.33  E-value: 1.06e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3281 TLRSVKRPGVPRRKTRVSQDVLPSKQNRLMAPFSPPelstDRIPSTTSPTPSEVSLPALPLAPSlildqpssqenpvdqa 3360
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPDPPPP---------------- 2629
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3361 DHSPRGNNLPLSGQDLPPPSLSPFSAASAEGTGGCCKLNRTLEKPEHEASLGSLEPCKWQALVGEKRALHLFPGKHKSPG 3440
Cdd:PHA03247 2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPE 2709
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3441 NGDKCAPGCSPGHPSQLQERLV---TTHHMAPEGRIEGPSQKGNATKPGAYSSTSHHRAAEPTKKALKPPAP--PRKPGG 3515
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQAspaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRrlTRPAVA 2789
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3516 MGIPAAELVLSPEDRVKPNTSkgklrgTPQSSGGLQPGTQTGGGSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQPPP 3595
Cdd:PHA03247 2790 SLSESRESLPSPWDPADPPAA------VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR 2863
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3596 RahtkgstRGPGDAVHQGVQVHSSPREKReshgrqrkgqalgLGRHGSVGNTGKAPLAPDKSSRAPRKQA----TPSRVP 3671
Cdd:PHA03247 2864 R-------RPPSRSPAAKPAAPARPPVRR-------------LARPAVSRSTESFALPPDQPERPPQPQAppppQPQPQP 2923
                         410       420       430
                  ....*....|....*....|....*....|.
gi 568958319 3672 PVKSRPSGQ-SSRARPQPSAQRKGDPGHTSE 3701
Cdd:PHA03247 2924 PPPPQPQPPpPPPPRPQPPLAPTTDPAGAGE 2954
PHA03247 super family cl33720
large tegument protein UL36; Provisional
142-618 2.51e-05


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 2.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  142 PGIPRAKALPSPEENSS---QRCFQEASSSFTSTNCTSPSATPGSLPRRAPQSDGTSPHRHASGTNLQAIGTNPWPPAAE 218
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDRSvppPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  219 NSFPGANFGVSSAEPKPFPDGSRPSspqgvsapyPFPVETVQHERAAetmlftfhQPLVAWSEEALGTNPAYPSLPCNPG 298
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDD---------PAPGRVSRPRRAR--------RLGRAAQASSPPQRPRRRAARPTVG 2693
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  299 PSGGASAPSDLGGALSPPGAARLLPSPfhdslhksltkgIPEGPLPARDGLGSPRGLPNPPPQRHFPGQGYEANGVGTSP 378
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATP------------LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  379 ASLDTELPTP-----GPPPTHLPQLWDTTAAPPYPTSTLDPAAAARTAFFESQQQLCLPHSPPLPWSPVLTTPGPNSHqm 453
Cdd:PHA03247 2762 TTAGPPAPAPpaapaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP-- 2839
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  454 gvlSQLTFPRGSSEWQGDSPGTLGALNTIPRPGESALRSSPGQPSSSPRLLAYGglKDPGTQPLFFGGAQPQMSPQGALS 533
Cdd:PHA03247 2840 ---PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPA--VSRSTESFALPPDQPERPPQPQAP 2914
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  534 LPPPRVVGASPSESPLPSPATNTASSSTCSSLSPPSSSPANPSSEDSQQPGPLRSPAFFLPPTHSQETSSPFPSPEPTYT 613
Cdd:PHA03247 2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                  ....*
gi 568958319  614 LPTRY 618
Cdd:PHA03247 2995 PLTGH 2999
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1933-2445 1.83e-04


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 1.83e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 1933 QKEPAERSPEKA-ASPQPLFSQEN-----PAPSNrdLAACVFSTRPQATPTPS-------DLEPMPQEDPETRVKPSKPL 1999
Cdd:PHA03247 2481 RRPAEARFPFAAgAAPDPGGGGPPdpdapPAPSR--LAPAILPDEPVGEPVHPrmltwirGLEELASDDAGDPPPPLPPA 2558
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 2000 APSSYRDLPSPDDQPTcpvlvPLGASYGLTTKEAEP-----PASPTLLVTSCCGPEEPLSQHSLLGTSSPKDPPVGSLGS 2074
Cdd:PHA03247 2559 APPAAPDRSVPPPRPA-----PRPSEPAVTSRARRPdappqSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSP 2633
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 2075 ISFSAPVLLERNSPKGIAVRTLEDSGKEELR------LSPAHSSAPPLGdPSSPKMTIEAAPLTSIApkDGLDSGETLEv 2148
Cdd:PHA03247 2634 AANEPDPHPPPTVPPPERPRDDPAPGRVSRPrrarrlGRAAQASSPPQR-PRRRAARPTVGSLTSLA--DPPPPPPTPE- 2709
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 2149 PAPHCMgAPSLSNPERTYSKGPSLGPVSSTPCPGhgegrgiiAVPTDLATLETtgpdsqicqedgadvsikeqdnPETPG 2228
Cdd:PHA03247 2710 PAPHAL-VSATPLPPGPAAARQASPALPAAPAPP--------AVPAGPATPGG----------------------PARPA 2758
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 2229 TRHCNVTKVARANARGMPTGLHLTLETPLSGTSSDSRSDSPQYHISISHRPPQKNFSDPQDHKRRPRGLNKKPEHAEQT- 2307
Cdd:PHA03247 2759 RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTa 2838
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 2308 ----PAELPETCQLCSA-----SFRSKAGLSRHKARKHRPQREPRSLLS---------PMPVPACQPSDPMTKACQTPGK 2369
Cdd:PHA03247 2839 ppppPGPPPPSLPLGGSvapggDVRRRPPSRSPAAKPAAPARPPVRRLArpavsrsteSFALPPDQPERPPQPQAPPPPQ 2918
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568958319 2370 KSHKVSEKGRPSRPALGAGRSSGPPPLQDTMGPEILKRTSEKSEGAGTLdTPLSQHPPTLGLSEQGESAEVPASKP 2445
Cdd:PHA03247 2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL-VPGRVAVPRFRVPQPAPSREAPASST 2993
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3171-3191 7.02e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino-terminal part of the helix binds the major groove in DNA-binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
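The pattern in the description above can be tried out as a regular expression. The following is a minimal sketch, not part of the CDD output. It makes two assumptions: the '#' positions are treated like X (any residue), because the description does not say which residues are allowed there, and the pattern is started at the first cysteine, because the reported hit begins at model position 3. The names C2H2_CORE and find_c2h2 are hypothetical.

```python
import re

# Pattern from the description, starting at the first Cys:
#   C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' (fold-stabilising) positions are matched as any residue.
C2H2_CORE = re.compile(r"C.{1,5}C.{3}..{5}..{2}H.{3,6}[HC]")


def find_c2h2(seq, first_residue=1):
    """Return (start, end, text) for each non-overlapping pattern match.

    first_residue is the sequence coordinate of seq[0], so that the
    reported positions can be compared with the Interval column.
    """
    hits = []
    for m in C2H2_CORE.finditer(seq):
        start = first_residue + m.start()
        end = first_residue + m.end() - 1
        hits.append((start, end, m.group()))
    return hits


# Query segment reported for this hit (residues 3171-3191).
query_segment = "CHHCGKQFPKPFKLQRHLAVH"
print(find_c2h2(query_segment, first_residue=3171))
```

A pattern this permissive will also match sequences that are not zinc fingers, so it is best read as an illustration of the spacing rules rather than as a substitute for the profile search that produced the hit.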


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 7.02e-03
                           10        20
                   ....*....|....*....|.
gi 568958319  3171 CHHCGKQFPKPFKLQRHLAVH 3191
Cdd:pfam00096    3 CPDCGKSFSRKSNLKRHLRTH 23
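The alignment rows above follow a fixed layout: a label, a start coordinate, the aligned text, and an end coordinate. As a rough sanity check, the sketch below parses such rows and confirms that the letters in each row account for every position between start and end. It assumes that rows are whitespace-separated, that '-' marks a gap, and that lowercase letters are still residues of the sequence (they are counted, but not scored as identities). The function names are hypothetical.

```python
def parse_row(line):
    """Split one alignment row into (label, start, aligned_text, end)."""
    *label, start, aligned, end = line.split()
    return " ".join(label), int(start), aligned, int(end)


def residues_consistent(line):
    """True if the letters in the row cover every position start..end."""
    _, start, aligned, end = parse_row(line)
    n_residues = sum(ch.isalpha() for ch in aligned)  # '-' is a gap
    return n_residues == end - start + 1


def count_identities(query_line, model_line):
    """Count columns where both rows show the same uppercase residue."""
    q = parse_row(query_line)[2]
    m = parse_row(model_line)[2]
    return sum(1 for a, b in zip(q, m) if a == b and a.isupper())


query_row = "gi 568958319  3171 CHHCGKQFPKPFKLQRHLAVH 3191"
model_row = "Cdd:pfam00096    3 CPDCGKSFSRKSNLKRHLRTH 23"

print(residues_consistent(query_row), residues_consistent(model_row))
print(count_identities(query_row, model_row), "identical columns")
```

The same check can be applied to the longer PHA03247 rows, one query/model pair of lines at a time.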
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-451 3.52e-04


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 3.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319    5 RPPTLPRDLQPCQIARSLGCPSQHPLKDHGSASRTTQGMRDDGSKAQGSPEAQLSQAKDVEQEDLILRVQAPAaRSYAHV 84
Cdd:PHA03247 2599 RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG-RAAQAS 2677
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319   85 YPWPASRMESGHPQLHSLSPsrirciLGEPLKDLRHEAPQ----VSDTKVPQGQKTRARHRPGIPRAKALPSPEENSSQR 160
Cdd:PHA03247 2678 SPPQRPRRRAARPTVGSLTS------LADPPPPPPTPEPAphalVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  161 CfQEASSSFTSTNCTSPSATPGSLPRRAPQSDGTSPHRhASGTNLQAIGTNPWPPAAensfpganfgVSSAEPKPFPDGS 240
Cdd:PHA03247 2752 G-GPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPAD----------PPAAVLAPAAALP 2819
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  241 RPSSPQGVSAPYPFPVetvqheraaetmlftfhqplvawseealgtnPAYPSLPCNPGPSggasaPSDLGGALSPPGAAR 320
Cdd:PHA03247 2820 PAASPAGPLPPPTSAQ-------------------------------PTAPPPPPGPPPP-----SLPLGGSVAPGGDVR 2863
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  321 LLPSPFHDSLHKSLTKGIPEGPLPARDGLGSPRGLPNPPPQRHFPGQGYEANGVGTSPASLDTELPTPGPPPTHLPQlwd 400
Cdd:PHA03247 2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ--- 2940
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568958319  401 ttaAPPYPTSTLDPAAAARTAFFESQQQLCLPHSPPLPW------SPVLTTPGPNSH 451
Cdd:PHA03247 2941 ---PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRfrvpqpAPSREAPASSTP 2994
PHA03247 PHA03247
large tegument protein UL36; Provisional
280-653 5.92e-04


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 5.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  280 SEEALGTNPAYPSLPCNPGPSGGASAPSDLGGA----LSPPGAARLLPSPFHD-----SLH-------------KSLTKG 337
Cdd:PHA03247 2470 LGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGppdpDAPPAPSRLAPAILPDepvgePVHprmltwirgleelASDDAG 2549
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  338 IPEGPLPARDGLGSP-RGLPNPPPQRHFPGQGYEAN----GVGTSPASLDTELPTPGPPPTHLPqlwdTTAAPPYPTSTL 412
Cdd:PHA03247 2550 DPPPPLPPAAPPAAPdRSVPPPRPAPRPSEPAVTSRarrpDAPPQSARPRAPVDDRGDPRGPAP----PSPLPPDTHAPD 2625
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  413 DPAAAARTAFFESQQQLCLPhSPPLPWSPVLTTPGPNSHQMGVLSQLTFPRGSSEWQG----DSPGTLGALNTIPRPGES 488
Cdd:PHA03247 2626 PPPPSPSPAANEPDPHPPPT-VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRprrrAARPTVGSLTSLADPPPP 2704
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  489 ALRSSPGQPSSSPRL-LAYGGLKDPGTQPLFFGGAQPQMSPQGALSLPPPRVVGASPSESPLPSPATNTASSSTCSSLSP 567
Cdd:PHA03247 2705 PPTPEPAPHALVSATpLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT 2784
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  568 PSSSPANPSSEDS--QQPGPLRSPAFFLPPTHSQETSSPFPSPEPTYTLPTRYQSETAKAFPLPTEGPGAEDAfksqEGA 645
Cdd:PHA03247 2785 RPAVASLSESRESlpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA----PGG 2860

                  ....*...
gi 568958319  646 PFSHKSPS 653
Cdd:PHA03247 2861 DVRRRPPS 2868
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3375-3733 8.33e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 8.33e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3375 DLPPPSLSPFSAASAEGTGGCCKLNRTleKPEHEASLGSLEPCKWQALVGEKRALHlfpgkhkSPGNGDKCAPGCSPGHP 3454
Cdd:PHA03307   69 TGPPPGPGTEAPANESRSTPTWSLSTL--APASPAREGSPTPPGPSSPDPPPPTPP-------PASPPPSPAPDLSEMLR 139
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3455 SQLQERLVTTHHMAPEGRIEGPSQKGNATKPGAYSSTSHHRAAEPTKKALKPPAPPRKPGG--------MGIPAAELVLS 3526
Cdd:PHA03307  140 PVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAaasprpprRSSPISASASS 219
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3527 PEDRVKPNTSKGKLRGTPQSSGGLQPGTQTGG---------GSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQ---PP 3594
Cdd:PHA03307  220 PAPAPGRSAADDAGASSSDSSSSESSGCGWGPenecplprpAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERspsPS 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3595 PRAHTKGSTRGPGDAVHQGVQVHSSPREKRESHGRQRKGQALGLGRHGSVGNTGKAPlAPDKSSRAPRKQATPSRVPPVK 3674
Cdd:PHA03307  300 PSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP-PPPADPSSPRKRPRPSRAPSSP 378
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568958319 3675 SRPSGQSSRARPQPSAQRKGDPGHTSekGSLPQARALSRPYKrvrALHVSGVAPMEPRD 3733
Cdd:PHA03307  379 AASAGRPTRRRARAAVAGRARRRDAT--GRFPAGRPRPSPLD---AGAASGAFYARYPL 432
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
3324-3667 1.72e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 1.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3324 PSTTSPTPSEVSLPALPLAPSLILDQPSSQENPVDQADHSPRGNNLPlSGQDLPPPSLSPfSAASAEGTGGCCKLNRTLE 3403
Cdd:PRK07764  436 APAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAP-APPAAPAPAAAP-AAPAAPAAPAGADDAATLR 513
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3404 K--PEHEASLGSLEPCKWQALVGEKRALHLFPG----KHKSPGNGDKCApgcSPGHPSQLQERL--VTTHHMAPEGRIEG 3475
Cdd:PRK07764  514 ErwPEILAAVPKRSRKTWAILLPEATVLGVRGDtlvlGFSTGGLARRFA---SPGNAEVLVTALaeELGGDWQVEAVVGP 590
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3476 PSQKGNATKPGA-YSSTSHHRAAEPTKKALKPPAPPRKPGGMGIPAAELVLSPEDRVKPNTSKGKLRGTPQSSGGLQPGT 3554
Cdd:PRK07764  591 APGAAGGEGPPApASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3555 QTGGGSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQPPPRAHTKGSTRGPGDAVHQGVQVHSSPREKR---ESHGRQR 3631
Cdd:PRK07764  671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPlppEPDDPPD 750
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 568958319 3632 KGQALGLGRHGSVGNTGKAPLAPDKSSRAPRKQATP 3667
Cdd:PRK07764  751 PAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3171-3191 7.02e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino-terminal part of the helix binds the major groove in DNA-binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 7.02e-03
                           10        20
                   ....*....|....*....|.
gi 568958319  3171 CHHCGKQFPKPFKLQRHLAVH 3191
Cdd:pfam00096    3 CPDCGKSFSRKSNLKRHLRTH 23
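
The C2H2 pattern quoted in the pfam00096 description can be checked against the two aligned rows above with a regular expression. This is a minimal sketch rather than the method CD-Search itself uses (CD-Search scores position-specific matrices, not regular expressions). It assumes that the positions marked # accept any residue and that matching starts at the first conserved Cys, because the alignment begins there; the names `C2H2_CORE` and `find_c2h2_cores` are hypothetical.

```python
import re

# Pattern from the pfam00096 description, starting at the first conserved Cys:
#   C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' positions are treated as "any residue", so the run
# X3 + # + X5 + # + X2 collapses to 12 unconstrained positions.
AA = "[A-Z]"
C2H2_CORE = re.compile(rf"C{AA}{{1,5}}C{AA}{{12}}H{AA}{{3,6}}[HC]")


def find_c2h2_cores(sequence):
    """Return (start, end, text) for each non-overlapping core match.

    Coordinates are 0-based with an exclusive end, relative to `sequence`.
    """
    return [
        (m.start(), m.end(), m.group())
        for m in C2H2_CORE.finditer(sequence.upper())
    ]


# The two rows of the zf-C2H2 alignment in this report.
query_row = "CHHCGKQFPKPFKLQRHLAVH"   # gi 568958319, residues 3171-3191
model_row = "CPDCGKSFSRKSNLKRHLRTH"   # Cdd:pfam00096, positions 3-23

for label, row in (("query", query_row), ("model", model_row)):
    print(label, find_c2h2_cores(row))
```

Both rows satisfy the pattern over their full 21 residues. Because a regular expression of this kind ignores the residue preferences encoded in the profile, a match is only a sanity check and says nothing about the reported E-value.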
dnaA PRK14086
chromosomal replication initiator protein DnaA;
233-414 7.10e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.12  E-value: 7.10e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  233 PKPFPDGSRPSSPQGvSAPYPFPVETVQHERAaetmlfTFHQPLVAWSEEALGTNPAYPSLPCNPGPSGGASAPSDLGGA 312
Cdd:PRK14086   96 APPPPHARRTSEPEL-PRPGRRPYEGYGGPRA------DDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGWQ 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319  313 LSPPG--AARLLPSPFHDSLHKSLTK-----GIPEGPLPARDGLGS------PRGLPNPPPQRHfPGQGYEANGVGTSPA 379
Cdd:PRK14086  169 QQRLGfpPRAPYASPASYAPEQERDRepydaGRPEYDQRRRDYDHPrpdwdrPRRDRTDRPEPP-PGAGHVHRGGPGPPE 247
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 568958319  380 SLDTELPTPGPP-PTHLPQLWDTTAAPPYPTSTLDP 414
Cdd:PRK14086  248 RDDAPVVPIRPSaPGPLAAQPAPAPGPGEPTARLNP 283
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3434-3758 9.71e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 9.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3434 GKHKSPGNGDKCAPGCSPGHPSQLQERLVTTHHMAPEGRIEGPSQKGNATKPGAYSSTSHHRAAEPTKKAlkPPAPPRKP 3513
Cdd:PHA03307   58 GAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP--PSPAPDLS 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3514 GGMGIPAAELVLSPEDRVKPNTSKGKLRGTPQSSGGLQPGTQTGGGSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQP 3593
Cdd:PHA03307  136 EMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISA 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3594 PPRAHTKGSTRGPGDAVHQGVQVHSSPREK-----------RESHGRQRKGQALGLGRHGSVGNTGKAPLAPDKSSRAPR 3662
Cdd:PHA03307  216 SASSPAPAPGRSAADDAGASSSDSSSSESSgcgwgpenecpLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERS 295
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568958319 3663 KQATPSR--VPPVKSRPSGQSSRARPQPS---------------AQRKGDPGHTSEKGSLPQARALSRPYKRVRALHVSG 3725
Cdd:PHA03307  296 PSPSPSSpgSGPAPSSPRASSSSSSSRESsssstssssessrgaAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAP 375
                         330       340       350
                  ....*....|....*....|....*....|...
gi 568958319 3726 VAPMEPRDRRTAEAQSDLLSQLFGQKLTSFRIP 3758
Cdd:PHA03307  376 SSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
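
The E-value threshold in the search parameters can be compared with the per-hit summary lines of this report. The sketch below assumes those "Pssm-ID: ..." lines are available as plain text in the layout shown above; `SUMMARY` and `parse_summaries` are hypothetical names, not part of any NCBI tool.

```python
import re

# Matches one per-hit summary line as it appears in this report, e.g.
# "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 5.92e-04"
SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_summaries(text):
    """Extract Pssm-ID, Cd length, bit score and E-value from each summary line."""
    return [
        {
            "pssm_id": int(m.group(1)),
            "cd_length": int(m.group(2)),
            "bit_score": float(m.group(3)),
            "e_value": float(m.group(4)),
        }
        for m in SUMMARY.finditer(text)
    ]


# Summary lines copied from the hits listed in this section.
report = """
Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 5.92e-04
Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 8.33e-04
Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 1.72e-03
Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 7.02e-03
Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.12  E-value: 7.10e-03
Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 9.71e-03
"""

THRESHOLD = 0.01  # "E-value threshold" from the preset options
hits = parse_summaries(report)
for hit in hits:
    status = "within" if hit["e_value"] <= THRESHOLD else "above"
    print(hit["pssm_id"], hit["e_value"], status, "threshold")
```

Every hit listed here falls at or below the 0.01 threshold, with the last PHA03307 hit (9.71e-03) closest to the cutoff.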
