View
Concise Results
Standard Results
Full Results
zinc finger protein 469 [Mus musculus]
Protein Classification
C2H2-type zinc finger protein ( domain architecture ID 10442881 )
Cys2His2 (C2H2)-type zinc finger protein may be involved in transcriptional regulation
List of domain hits
Name
Accession
Description
Interval
E-value
PHA03247 super family
cl33720
large tegument protein UL36; Provisional
3281-3701
1.06e-06
large tegument protein UL36; Provisional
The actual alignment was detected with superfamily member PHA03247 :Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 55.33
E-value: 1.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3281 TL R SVK RP GV P RRKT R VSQDVL P SKQN R LM AP FSPP elst DRIPSTTS P T P SEVSLP A LPLA P S lildqpssqenpvdqa 3360
Cdd:PHA03247 2570 PP R PAP RP SE P AVTS R ARRPDA P PQSA R PR AP VDDR ---- GDPRGPAP P S P LPPDTH A PDPP P P ---------------- 2629
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3361 DH SP RG N NLPLSGQDLP PP SLS P FSAASAEGTGGCCKLN R TLEKPEHEASLGSLEPCKWQAL VG EKRA L HLF P GKHKS P G 3440
Cdd:PHA03247 2630 SP SP AA N EPDPHPPPTV PP PER P RDDPAPGRVSRPRRAR R LGRAAQASSPPQRPRRRAARPT VG SLTS L ADP P PPPPT P E 2709
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3441 NGDKCAPGCS P GH P SQLQE R LV --- TTHHM AP EGRIE GP SQK G NATK P GAYSS T SHHR A AE P TKKALKP P AP -- P R KPGG 3515
Cdd:PHA03247 2710 PAPHALVSAT P LP P GPAAA R QA spa LPAAP AP PAVPA GP ATP G GPAR P ARPPT T AGPP A PA P PAAPAAG P PR rl T R PAVA 2789
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3516 MGIPAA E LVL SP E D RVK P NTS kgklrg TPQSSGG L Q P GTQTG G GSQ P QPTSGQLQSEMAST P TE PS C P SWA S ST P DQPPP 3595
Cdd:PHA03247 2790 SLSESR E SLP SP W D PAD P PAA ------ VLAPAAA L P P AASPA G PLP P PTSAQPTAPPPPPG P PP PS L P LGG S VA P GGDVR 2863
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3596 R ahtkgst R G P GDAVHQGVQVHSS P REK R eshgrqrkgqalg L G R HGSVGN T GKAP L A PD KSS R A P RK QA ---- T P SRV P 3671
Cdd:PHA03247 2864 R ------- R P P SRSPAAKPAAPAR P PVR R ------------- L A R PAVSRS T ESFA L P PD QPE R P P QP QA pppp Q P QPQ P 2923
410 420 430
....*....|....*....|....*....|.
gi 1385123368 3672 P VKSR P SGQ - SSRA RPQP SAQRKG DP GHTS E 3701
Cdd:PHA03247 2924 P PPPQ P QPP p PPPP RPQP PLAPTT DP AGAG E 2954
PHA03247 super family
cl33720
large tegument protein UL36; Provisional
142-618
2.51e-05
large tegument protein UL36; Provisional
The actual alignment was detected with superfamily member PHA03247 :Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 50.71
E-value: 2.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 142 P GI P RAK A L P SPEENS S --- Q R CFQEA S SSFTSTNCTS P S A T P G S LPR RAP QS D GTS P HRH A SGTN L QAIGTN P W PP AAE 218
Cdd:PHA03247 2551 P PP P LPP A A P PAAPDR S vpp P R PAPRP S EPAVTSRARR P D A P P Q S ARP RAP VD D RGD P RGP A PPSP L PPDTHA P D PP PPS 2630
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 219 N S FPGANFGVSSAEPK P F P DGS R PS spqgvsapy P F P VETVQHE RA A etmlftfh QPLV A WSEEALGTN P AYPSLPCNP G 298
Cdd:PHA03247 2631 P S PAANEPDPHPPPTV P P P ERP R DD --------- P A P GRVSRPR RA R -------- RLGR A AQASSPPQR P RRRAARPTV G 2693
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 299 PSGGASA P SDLGGALS P PGA A RLLPS P fhdslhksltkg I P E GP LP AR DGLGSPRGL P N PP PQRHF P GQGYEANGVGTS P 378
Cdd:PHA03247 2694 SLTSLAD P PPPPPTPE P APH A LVSAT P ------------ L P P GP AA AR QASPALPAA P A PP AVPAG P ATPGGPARPARP P 2761
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 379 ASLDTEL P T P ----- GP PP TH L PQLWDTTAAPPYPTSTLDPAA A ART A FFESQQQLCL P HSP P LPWS P VL T TPG P NSH qm 453
Cdd:PHA03247 2762 TTAGPPA P A P paapa AG PP RR L TRPAVASLSESRESLPSPWDP A DPP A AVLAPAAALP P AAS P AGPL P PP T SAQ P TAP -- 2839
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 454 gvl SQLTF P RGS S EWQ G D S PGTL G ALNTI P RPGES A LRSSPGQPSSSP RL LAYG gl KDPG T QPLFFGGA QP QMS PQ GALS 533
Cdd:PHA03247 2840 --- PPPPG P PPP S LPL G G S VAPG G DVRRR P PSRSP A AKPAAPARPPVR RL ARPA -- VSRS T ESFALPPD QP ERP PQ PQAP 2914
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 534 L PP PRVVGAS P SES P L P S P ATNTASSSTCSSLSP P SSSPANPSSEDSQQP G P L RSPAFFL P PTHSQETSSPFPS P EPTYT 613
Cdd:PHA03247 2915 P PP QPQPQPP P PPQ P Q P P P PPPPRPQPPLAPTTD P AGAGEPSGAVPQPWL G A L VPGRVAV P RFRVPQPAPSREA P ASSTP 2994
....*
gi 1385123368 614 LP T RY 618
Cdd:PHA03247 2995 PL T GH 2999
PHA03247 super family
cl33720
large tegument protein UL36; Provisional
1933-2445
1.83e-04
large tegument protein UL36; Provisional
The actual alignment was detected with superfamily member PHA03247 :Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 48.01
E-value: 1.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 1933 QKEPAE R S P EK A - A S P Q P LFSQEN ----- PAPS N rd LA ACVFSTR P QAT P TPS ------- D LE PMPQE D PETRVK P SK P L 1999
Cdd:PHA03247 2481 RRPAEA R F P FA A g A A P D P GGGGPP dpdap PAPS R -- LA PAILPDE P VGE P VHP rmltwir G LE ELASD D AGDPPP P LP P A 2558
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2000 AP SSYR D LPS P DDQ P T cpvlv P LGASYGL T TKEAE P ----- P A S P TLL V TSCCG P EE P LSQHS L LGTSSPK DPP VG S LGS 2074
Cdd:PHA03247 2559 AP PAAP D RSV P PPR P A ----- P RPSEPAV T SRARR P dappq S A R P RAP V DDRGD P RG P APPSP L PPDTHAP DPP PP S PSP 2633
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2075 ISFSAPVLLERNS P KGIAV R TLEDS G KEELR ------ LSP A HS S A PP LG d P SSPKMTIEAAP LTS I A pk D GLDSGE T L E v 2148
Cdd:PHA03247 2634 AANEPDPHPPPTV P PPERP R DDPAP G RVSRP rrarrl GRA A QA S S PP QR - P RRRAARPTVGS LTS L A -- D PPPPPP T P E - 2709
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2149 PAPH CM g APSLSN P ERTYSKGPSLGPVSST P C P G hgegrgii AVP TDL AT LET tgpdsqicqedgadvsikeqdn P ET P G 2228
Cdd:PHA03247 2710 PAPH AL - VSATPL P PGPAAARQASPALPAA P A P P -------- AVP AGP AT PGG ---------------------- P AR P A 2758
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2229 TRHCNVTKV A R A NARGMPT G LHLT L ET P LSGTS S D SR SDS P QYHISISHRPPQKNFSDPQDHKRR P R G LNKK P EH A EQ T - 2307
Cdd:PHA03247 2759 RPPTTAGPP A P A PPAAPAA G PPRR L TR P AVASL S E SR ESL P SPWDPADPPAAVLAPAAALPPAAS P A G PLPP P TS A QP T a 2838
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2308 ---- P AEL P ETCQ L CSA ----- SF R SKAGLSRHK A RKHR P Q R E P RSL L S --------- PMPV P AC QP SD P MTKACQT P GK 2369
Cdd:PHA03247 2839 pppp P GPP P PSLP L GGS vapgg DV R RRPPSRSPA A KPAA P A R P P VRR L A rpavsrste SFAL P PD QP ER P PQPQAPP P PQ 2918
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1385123368 2370 KSHKVSEKGR P SR P ALGAG R SSG P PPLQDTMGPEILKRTSEKSEGA G T L d T P LSQHP P TLGLSEQGE S A E V PAS KP 2445
Cdd:PHA03247 2919 PQPQPPPPPQ P QP P PPPPP R PQP P LAPTTDPAGAGEPSGAVPQPWL G A L - V P GRVAV P RFRVPQPAP S R E A PAS ST 2993
zf-C2H2
pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3171-3191
7.02e-03
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
:Pssm-ID: 395048 [Multi-domain]
Cd Length: 23
Bit Score: 36.51
E-value: 7.02e-03
Name
Accession
Description
Interval
E-value
PHA03247
PHA03247
large tegument protein UL36; Provisional
3281-3701
1.06e-06
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 55.33
E-value: 1.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3281 TL R SVK RP GV P RRKT R VSQDVL P SKQN R LM AP FSPP elst DRIPSTTS P T P SEVSLP A LPLA P S lildqpssqenpvdqa 3360
Cdd:PHA03247 2570 PP R PAP RP SE P AVTS R ARRPDA P PQSA R PR AP VDDR ---- GDPRGPAP P S P LPPDTH A PDPP P P ---------------- 2629
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3361 DH SP RG N NLPLSGQDLP PP SLS P FSAASAEGTGGCCKLN R TLEKPEHEASLGSLEPCKWQAL VG EKRA L HLF P GKHKS P G 3440
Cdd:PHA03247 2630 SP SP AA N EPDPHPPPTV PP PER P RDDPAPGRVSRPRRAR R LGRAAQASSPPQRPRRRAARPT VG SLTS L ADP P PPPPT P E 2709
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3441 NGDKCAPGCS P GH P SQLQE R LV --- TTHHM AP EGRIE GP SQK G NATK P GAYSS T SHHR A AE P TKKALKP P AP -- P R KPGG 3515
Cdd:PHA03247 2710 PAPHALVSAT P LP P GPAAA R QA spa LPAAP AP PAVPA GP ATP G GPAR P ARPPT T AGPP A PA P PAAPAAG P PR rl T R PAVA 2789
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3516 MGIPAA E LVL SP E D RVK P NTS kgklrg TPQSSGG L Q P GTQTG G GSQ P QPTSGQLQSEMAST P TE PS C P SWA S ST P DQPPP 3595
Cdd:PHA03247 2790 SLSESR E SLP SP W D PAD P PAA ------ VLAPAAA L P P AASPA G PLP P PTSAQPTAPPPPPG P PP PS L P LGG S VA P GGDVR 2863
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3596 R ahtkgst R G P GDAVHQGVQVHSS P REK R eshgrqrkgqalg L G R HGSVGN T GKAP L A PD KSS R A P RK QA ---- T P SRV P 3671
Cdd:PHA03247 2864 R ------- R P P SRSPAAKPAAPAR P PVR R ------------- L A R PAVSRS T ESFA L P PD QPE R P P QP QA pppp Q P QPQ P 2923
410 420 430
....*....|....*....|....*....|.
gi 1385123368 3672 P VKSR P SGQ - SSRA RPQP SAQRKG DP GHTS E 3701
Cdd:PHA03247 2924 P PPPQ P QPP p PPPP RPQP PLAPTT DP AGAG E 2954
PHA03247
PHA03247
large tegument protein UL36; Provisional
142-618
2.51e-05
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 50.71
E-value: 2.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 142 P GI P RAK A L P SPEENS S --- Q R CFQEA S SSFTSTNCTS P S A T P G S LPR RAP QS D GTS P HRH A SGTN L QAIGTN P W PP AAE 218
Cdd:PHA03247 2551 P PP P LPP A A P PAAPDR S vpp P R PAPRP S EPAVTSRARR P D A P P Q S ARP RAP VD D RGD P RGP A PPSP L PPDTHA P D PP PPS 2630
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 219 N S FPGANFGVSSAEPK P F P DGS R PS spqgvsapy P F P VETVQHE RA A etmlftfh QPLV A WSEEALGTN P AYPSLPCNP G 298
Cdd:PHA03247 2631 P S PAANEPDPHPPPTV P P P ERP R DD --------- P A P GRVSRPR RA R -------- RLGR A AQASSPPQR P RRRAARPTV G 2693
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 299 PSGGASA P SDLGGALS P PGA A RLLPS P fhdslhksltkg I P E GP LP AR DGLGSPRGL P N PP PQRHF P GQGYEANGVGTS P 378
Cdd:PHA03247 2694 SLTSLAD P PPPPPTPE P APH A LVSAT P ------------ L P P GP AA AR QASPALPAA P A PP AVPAG P ATPGGPARPARP P 2761
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 379 ASLDTEL P T P ----- GP PP TH L PQLWDTTAAPPYPTSTLDPAA A ART A FFESQQQLCL P HSP P LPWS P VL T TPG P NSH qm 453
Cdd:PHA03247 2762 TTAGPPA P A P paapa AG PP RR L TRPAVASLSESRESLPSPWDP A DPP A AVLAPAAALP P AAS P AGPL P PP T SAQ P TAP -- 2839
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 454 gvl SQLTF P RGS S EWQ G D S PGTL G ALNTI P RPGES A LRSSPGQPSSSP RL LAYG gl KDPG T QPLFFGGA QP QMS PQ GALS 533
Cdd:PHA03247 2840 --- PPPPG P PPP S LPL G G S VAPG G DVRRR P PSRSP A AKPAAPARPPVR RL ARPA -- VSRS T ESFALPPD QP ERP PQ PQAP 2914
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 534 L PP PRVVGAS P SES P L P S P ATNTASSSTCSSLSP P SSSPANPSSEDSQQP G P L RSPAFFL P PTHSQETSSPFPS P EPTYT 613
Cdd:PHA03247 2915 P PP QPQPQPP P PPQ P Q P P P PPPPRPQPPLAPTTD P AGAGEPSGAVPQPWL G A L VPGRVAV P RFRVPQPAPSREA P ASSTP 2994
....*
gi 1385123368 614 LP T RY 618
Cdd:PHA03247 2995 PL T GH 2999
PHA03247
PHA03247
large tegument protein UL36; Provisional
1933-2445
1.83e-04
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 48.01
E-value: 1.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 1933 QKEPAE R S P EK A - A S P Q P LFSQEN ----- PAPS N rd LA ACVFSTR P QAT P TPS ------- D LE PMPQE D PETRVK P SK P L 1999
Cdd:PHA03247 2481 RRPAEA R F P FA A g A A P D P GGGGPP dpdap PAPS R -- LA PAILPDE P VGE P VHP rmltwir G LE ELASD D AGDPPP P LP P A 2558
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2000 AP SSYR D LPS P DDQ P T cpvlv P LGASYGL T TKEAE P ----- P A S P TLL V TSCCG P EE P LSQHS L LGTSSPK DPP VG S LGS 2074
Cdd:PHA03247 2559 AP PAAP D RSV P PPR P A ----- P RPSEPAV T SRARR P dappq S A R P RAP V DDRGD P RG P APPSP L PPDTHAP DPP PP S PSP 2633
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2075 ISFSAPVLLERNS P KGIAV R TLEDS G KEELR ------ LSP A HS S A PP LG d P SSPKMTIEAAP LTS I A pk D GLDSGE T L E v 2148
Cdd:PHA03247 2634 AANEPDPHPPPTV P PPERP R DDPAP G RVSRP rrarrl GRA A QA S S PP QR - P RRRAARPTVGS LTS L A -- D PPPPPP T P E - 2709
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2149 PAPH CM g APSLSN P ERTYSKGPSLGPVSST P C P G hgegrgii AVP TDL AT LET tgpdsqicqedgadvsikeqdn P ET P G 2228
Cdd:PHA03247 2710 PAPH AL - VSATPL P PGPAAARQASPALPAA P A P P -------- AVP AGP AT PGG ---------------------- P AR P A 2758
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2229 TRHCNVTKV A R A NARGMPT G LHLT L ET P LSGTS S D SR SDS P QYHISISHRPPQKNFSDPQDHKRR P R G LNKK P EH A EQ T - 2307
Cdd:PHA03247 2759 RPPTTAGPP A P A PPAAPAA G PPRR L TR P AVASL S E SR ESL P SPWDPADPPAAVLAPAAALPPAAS P A G PLPP P TS A QP T a 2838
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2308 ---- P AEL P ETCQ L CSA ----- SF R SKAGLSRHK A RKHR P Q R E P RSL L S --------- PMPV P AC QP SD P MTKACQT P GK 2369
Cdd:PHA03247 2839 pppp P GPP P PSLP L GGS vapgg DV R RRPPSRSPA A KPAA P A R P P VRR L A rpavsrste SFAL P PD QP ER P PQPQAPP P PQ 2918
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1385123368 2370 KSHKVSEKGR P SR P ALGAG R SSG P PPLQDTMGPEILKRTSEKSEGA G T L d T P LSQHP P TLGLSEQGE S A E V PAS KP 2445
Cdd:PHA03247 2919 PQPQPPPPPQ P QP P PPPPP R PQP P LAPTTDPAGAGEPSGAVPQPWL G A L - V P GRVAV P RFRVPQPAP S R E A PAS ST 2993
zf-C2H2
pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3171-3191
7.02e-03
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
Pssm-ID: 395048 [Multi-domain]
Cd Length: 23
Bit Score: 36.51
E-value: 7.02e-03
Name
Accession
Description
Interval
E-value
PHA03247
PHA03247
large tegument protein UL36; Provisional
3281-3701
1.06e-06
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 55.33
E-value: 1.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3281 TL R SVK RP GV P RRKT R VSQDVL P SKQN R LM AP FSPP elst DRIPSTTS P T P SEVSLP A LPLA P S lildqpssqenpvdqa 3360
Cdd:PHA03247 2570 PP R PAP RP SE P AVTS R ARRPDA P PQSA R PR AP VDDR ---- GDPRGPAP P S P LPPDTH A PDPP P P ---------------- 2629
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3361 DH SP RG N NLPLSGQDLP PP SLS P FSAASAEGTGGCCKLN R TLEKPEHEASLGSLEPCKWQAL VG EKRA L HLF P GKHKS P G 3440
Cdd:PHA03247 2630 SP SP AA N EPDPHPPPTV PP PER P RDDPAPGRVSRPRRAR R LGRAAQASSPPQRPRRRAARPT VG SLTS L ADP P PPPPT P E 2709
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3441 NGDKCAPGCS P GH P SQLQE R LV --- TTHHM AP EGRIE GP SQK G NATK P GAYSS T SHHR A AE P TKKALKP P AP -- P R KPGG 3515
Cdd:PHA03247 2710 PAPHALVSAT P LP P GPAAA R QA spa LPAAP AP PAVPA GP ATP G GPAR P ARPPT T AGPP A PA P PAAPAAG P PR rl T R PAVA 2789
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3516 MGIPAA E LVL SP E D RVK P NTS kgklrg TPQSSGG L Q P GTQTG G GSQ P QPTSGQLQSEMAST P TE PS C P SWA S ST P DQPPP 3595
Cdd:PHA03247 2790 SLSESR E SLP SP W D PAD P PAA ------ VLAPAAA L P P AASPA G PLP P PTSAQPTAPPPPPG P PP PS L P LGG S VA P GGDVR 2863
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3596 R ahtkgst R G P GDAVHQGVQVHSS P REK R eshgrqrkgqalg L G R HGSVGN T GKAP L A PD KSS R A P RK QA ---- T P SRV P 3671
Cdd:PHA03247 2864 R ------- R P P SRSPAAKPAAPAR P PVR R ------------- L A R PAVSRS T ESFA L P PD QPE R P P QP QA pppp Q P QPQ P 2923
410 420 430
....*....|....*....|....*....|.
gi 1385123368 3672 P VKSR P SGQ - SSRA RPQP SAQRKG DP GHTS E 3701
Cdd:PHA03247 2924 P PPPQ P QPP p PPPP RPQP PLAPTT DP AGAG E 2954
PHA03247
PHA03247
large tegument protein UL36; Provisional
142-618
2.51e-05
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 50.71
E-value: 2.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 142 P GI P RAK A L P SPEENS S --- Q R CFQEA S SSFTSTNCTS P S A T P G S LPR RAP QS D GTS P HRH A SGTN L QAIGTN P W PP AAE 218
Cdd:PHA03247 2551 P PP P LPP A A P PAAPDR S vpp P R PAPRP S EPAVTSRARR P D A P P Q S ARP RAP VD D RGD P RGP A PPSP L PPDTHA P D PP PPS 2630
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 219 N S FPGANFGVSSAEPK P F P DGS R PS spqgvsapy P F P VETVQHE RA A etmlftfh QPLV A WSEEALGTN P AYPSLPCNP G 298
Cdd:PHA03247 2631 P S PAANEPDPHPPPTV P P P ERP R DD --------- P A P GRVSRPR RA R -------- RLGR A AQASSPPQR P RRRAARPTV G 2693
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 299 PSGGASA P SDLGGALS P PGA A RLLPS P fhdslhksltkg I P E GP LP AR DGLGSPRGL P N PP PQRHF P GQGYEANGVGTS P 378
Cdd:PHA03247 2694 SLTSLAD P PPPPPTPE P APH A LVSAT P ------------ L P P GP AA AR QASPALPAA P A PP AVPAG P ATPGGPARPARP P 2761
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 379 ASLDTEL P T P ----- GP PP TH L PQLWDTTAAPPYPTSTLDPAA A ART A FFESQQQLCL P HSP P LPWS P VL T TPG P NSH qm 453
Cdd:PHA03247 2762 TTAGPPA P A P paapa AG PP RR L TRPAVASLSESRESLPSPWDP A DPP A AVLAPAAALP P AAS P AGPL P PP T SAQ P TAP -- 2839
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 454 gvl SQLTF P RGS S EWQ G D S PGTL G ALNTI P RPGES A LRSSPGQPSSSP RL LAYG gl KDPG T QPLFFGGA QP QMS PQ GALS 533
Cdd:PHA03247 2840 --- PPPPG P PPP S LPL G G S VAPG G DVRRR P PSRSP A AKPAAPARPPVR RL ARPA -- VSRS T ESFALPPD QP ERP PQ PQAP 2914
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 534 L PP PRVVGAS P SES P L P S P ATNTASSSTCSSLSP P SSSPANPSSEDSQQP G P L RSPAFFL P PTHSQETSSPFPS P EPTYT 613
Cdd:PHA03247 2915 P PP QPQPQPP P PPQ P Q P P P PPPPRPQPPLAPTTD P AGAGEPSGAVPQPWL G A L VPGRVAV P RFRVPQPAPSREA P ASSTP 2994
....*
gi 1385123368 614 LP T RY 618
Cdd:PHA03247 2995 PL T GH 2999
PHA03247
PHA03247
large tegument protein UL36; Provisional
1933-2445
1.83e-04
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 48.01
E-value: 1.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 1933 QKEPAE R S P EK A - A S P Q P LFSQEN ----- PAPS N rd LA ACVFSTR P QAT P TPS ------- D LE PMPQE D PETRVK P SK P L 1999
Cdd:PHA03247 2481 RRPAEA R F P FA A g A A P D P GGGGPP dpdap PAPS R -- LA PAILPDE P VGE P VHP rmltwir G LE ELASD D AGDPPP P LP P A 2558
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2000 AP SSYR D LPS P DDQ P T cpvlv P LGASYGL T TKEAE P ----- P A S P TLL V TSCCG P EE P LSQHS L LGTSSPK DPP VG S LGS 2074
Cdd:PHA03247 2559 AP PAAP D RSV P PPR P A ----- P RPSEPAV T SRARR P dappq S A R P RAP V DDRGD P RG P APPSP L PPDTHAP DPP PP S PSP 2633
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2075 ISFSAPVLLERNS P KGIAV R TLEDS G KEELR ------ LSP A HS S A PP LG d P SSPKMTIEAAP LTS I A pk D GLDSGE T L E v 2148
Cdd:PHA03247 2634 AANEPDPHPPPTV P PPERP R DDPAP G RVSRP rrarrl GRA A QA S S PP QR - P RRRAARPTVGS LTS L A -- D PPPPPP T P E - 2709
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2149 PAPH CM g APSLSN P ERTYSKGPSLGPVSST P C P G hgegrgii AVP TDL AT LET tgpdsqicqedgadvsikeqdn P ET P G 2228
Cdd:PHA03247 2710 PAPH AL - VSATPL P PGPAAARQASPALPAA P A P P -------- AVP AGP AT PGG ---------------------- P AR P A 2758
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2229 TRHCNVTKV A R A NARGMPT G LHLT L ET P LSGTS S D SR SDS P QYHISISHRPPQKNFSDPQDHKRR P R G LNKK P EH A EQ T - 2307
Cdd:PHA03247 2759 RPPTTAGPP A P A PPAAPAA G PPRR L TR P AVASL S E SR ESL P SPWDPADPPAAVLAPAAALPPAAS P A G PLPP P TS A QP T a 2838
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2308 ---- P AEL P ETCQ L CSA ----- SF R SKAGLSRHK A RKHR P Q R E P RSL L S --------- PMPV P AC QP SD P MTKACQT P GK 2369
Cdd:PHA03247 2839 pppp P GPP P PSLP L GGS vapgg DV R RRPPSRSPA A KPAA P A R P P VRR L A rpavsrste SFAL P PD QP ER P PQPQAPP P PQ 2918
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1385123368 2370 KSHKVSEKGR P SR P ALGAG R SSG P PPLQDTMGPEILKRTSEKSEGA G T L d T P LSQHP P TLGLSEQGE S A E V PAS KP 2445
Cdd:PHA03247 2919 PQPQPPPPPQ P QP P PPPPP R PQP P LAPTTDPAGAGEPSGAVPQPWL G A L - V P GRVAV P RFRVPQPAP S R E A PAS ST 2993
PHA03247
PHA03247
large tegument protein UL36; Provisional
5-451
3.52e-04
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 46.86
E-value: 3.52e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 5 R P P TLP R DLQPCQIAR S LGC P SQ H PLKDHGSASRTTQGMR D DGSKAQGS P EAQLSQAKDVEQEDLIL R VQAPA a R SYAHV 84
Cdd:PHA03247 2599 R A P VDD R GDPRGPAPP S PLP P DT H APDPPPPSPSPAANEP D PHPPPTVP P PERPRDDPAPGRVSRPR R ARRLG - R AAQAS 2677
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 85 Y P WPAS R MESGH P QLH SL SP srirci L GE P LKDLRHEA P Q ---- VS D T KV P Q G QKTRARHR P GI P R A K A L P SPEENSSQR 160
Cdd:PHA03247 2678 S P PQRP R RRAAR P TVG SL TS ------ L AD P PPPPPTPE P A phal VS A T PL P P G PAAARQAS P AL P A A P A P P AVPAGPATP 2751
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 161 C f QE A SSSFTS T NCTS P SAT P GSL P RRA P QSDG T S P HR h AS GTNLQAIGTN PW P PA A ensfpganfg VSS A EPK P FPDGS 240
Cdd:PHA03247 2752 G - GP A RPARPP T TAGP P APA P PAA P AAG P PRRL T R P AV - AS LSESRESLPS PW D PA D ---------- PPA A VLA P AAALP 2819
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 241 RPS SP Q G VSA P YPFPV etvqheraaetmlftfhqplvawseealgtn P AY P SL P CN P G P S ggasa PSD LGG ALS P P G AA R 320
Cdd:PHA03247 2820 PAA SP A G PLP P PTSAQ ------------------------------- P TA P PP P PG P P P P ----- SLP LGG SVA P G G DV R 2863
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 321 LL P SPFHDSLHKSLTKGI P EGP L PARDGLG S PRGLPN PP P Q RHF P G Q GYEANGVGTS P ASLDTEL P T P G PPP THL PQ lwd 400
Cdd:PHA03247 2864 RR P PSRSPAAKPAAPARP P VRR L ARPAVSR S TESFAL PP D Q PER P P Q PQAPPPPQPQ P QPPPPPQ P Q P P PPP PPR PQ --- 2940
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*..
gi 1385123368 401 tta A P PY PT STLDP A AAART A FFESQQQLCL P HSPPL P W ------ S P VLTT P GPNSH 451
Cdd:PHA03247 2941 --- P P LA PT TDPAG A GEPSG A VPQPWLGALV P GRVAV P R frvpqp A P SREA P ASSTP 2994
PHA03247
PHA03247
large tegument protein UL36; Provisional
280-653
5.92e-04
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain]
Cd Length: 3151
Bit Score: 46.08
E-value: 5.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 280 SE E ALGTN P A Y PSLPCNPG P SGGAS AP SDL GG A ---- LS PP GAA RL L P SPFH D ----- SL H ------------- K S LTK G 337
Cdd:PHA03247 2470 LG E LFPGA P V Y RRPAEARF P FAAGA AP DPG GG G ppdp DA PP APS RL A P AILP D epvge PV H prmltwirgleel A S DDA G 2549
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 338 I P EG PLP ARDGLGS P - R GL P N P P P QRHFPGQGYEAN ---- GVGTSP A SLDTELPTP G P P PTHL P qlwd TTAA PP YPTSTL 412
Cdd:PHA03247 2550 D P PP PLP PAAPPAA P d R SV P P P R P APRPSEPAVTSR arrp DAPPQS A RPRAPVDDR G D P RGPA P ---- PSPL PP DTHAPD 2625
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 413 D P AAAART A FF E SQQQLCLP h S PP LPWSPVLTT PG PN S HQMGVLSQLTFPRG SS EW Q G ---- DSPG T L G A L NTIPR P GES 488
Cdd:PHA03247 2626 P P PPSPSP A AN E PDPHPPPT - V PP PERPRDDPA PG RV S RPRRARRLGRAAQA SS PP Q R prrr AARP T V G S L TSLAD P PPP 2704
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 489 ALRSS P GQPSSSPRL - L AY G GLKDPGTQ P LFFGGAQ P QMS P Q G ALSLPP P RVVGAS P SESPL P S PA TNT A SSSTCSSLSP 567
Cdd:PHA03247 2705 PPTPE P APHALVSAT p L PP G PAAARQAS P ALPAAPA P PAV P A G PATPGG P ARPARP P TTAGP P A PA PPA A PAAGPPRRLT 2784
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 568 PSSSPANPS S ED S -- QQPG P LRS PA FF L P P THSQETSSPFPS P E P TY T LPTRYQSETAKAF P L P TEGP G AED A fksq E G A 645
Cdd:PHA03247 2785 RPAVASLSE S RE S lp SPWD P ADP PA AV L A P AAALPPAASPAG P L P PP T SAQPTAPPPPPGP P P P SLPL G GSV A ---- P G G 2860
....*...
gi 1385123368 646 PFSHKS PS 653
Cdd:PHA03247 2861 DVRRRP PS 2868
PHA03307
PHA03307
transcriptional regulator ICP4; Provisional
3375-3733
8.33e-04
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain]
Cd Length: 1352
Bit Score: 45.55
E-value: 8.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3375 DL PPP SLSPFSA A SAEGTGGCCK L NRT le K P EHE A SL GS LE P CKWQALVGEKRALH lfpgkhk SPGNGDKC AP GC S PGHP 3454
Cdd:PHA03307 69 TG PPP GPGTEAP A NESRSTPTWS L STL -- A P ASP A RE GS PT P PGPSSPDPPPPTPP ------- PASPPPSP AP DL S EMLR 139
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3455 SQLQERLVTTHHMAPE G RIEGPSQKGN A TKPG A YSST S HHRAAEPTKKALKPPA PP RK P GG -------- MGI P AAELVL S 3526
Cdd:PHA03307 140 PVGSPGPPPAASPPAA G ASPAAVASDA A SSRQ A ALPL S SPEETARAPSSPPAEP PP ST P PA aasprppr RSS P ISASAS S 219
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3527 P EDRVKPNTSKGKLRGTPQ SS GGLQP G TQT G G --------- GSQPQ PT SGQLQ S EMASTPTE P SCP S WA SS TPDQ --- P P 3594
Cdd:PHA03307 220 P APAPGRSAADDAGASSSD SS SSESS G CGW G P enecplprp APITL PT RIWEA S GWNGPSSR P GPA S SS SS PRER sps P S 299
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3595 P RAHTK G STRGPGD A VHQGVQVHS S PREKRE S HGRQRK G Q A LGL G RHG S VGNTGKA P l A P DKSSRA PRK QAT PSR V P PVK 3674
Cdd:PHA03307 300 P SSPGS G PAPSSPR A SSSSSSSRE S SSSSTS S SSESSR G A A VSP G PSP S RSPSPSR P - P P PADPSS PRK RPR PSR A P SSP 378
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 1385123368 3675 SRPS G QSS R A R PQPSAQRKGDPGHTS ek G SL P QA R ALSR P YK rvr A LHV SG VAPMEPRD 3733
Cdd:PHA03307 379 AASA G RPT R R R ARAAVAGRARRRDAT -- G RF P AG R PRPS P LD --- A GAA SG AFYARYPL 432
PRK07764
PRK07764
DNA polymerase III subunits gamma and tau; Validated
3324-3667
1.72e-03
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain]
Cd Length: 824
Bit Score: 44.59
E-value: 1.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3324 PSTTS P TPSEVSL PA LPLAPSLILDQ PS S Q EN P VDQ A DHS P RGNNL P l SGQDL P P P SLS P f S A AS A EGTGGCCKLNR TL E 3403
Cdd:PRK07764 436 APAPA P PSPAGNA PA GGAPSPPPAAA PS A Q PA P APA A APE P TAAPA P - APPAA P A P AAA P - A A PA A PAAPAGADDAA TL R 513
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3404 K -- PE HE A SLGSLEPCK W QA L VG E KRA L HLFPG ---- KHKSP G NGDKC A pgc SPG HPSQ L QER L -- VTTHHMAP E GRIEG 3475
Cdd:PRK07764 514 E rw PE IL A AVPKRSRKT W AI L LP E ATV L GVRGD tlvl GFSTG G LARRF A --- SPG NAEV L VTA L ae ELGGDWQV E AVVGP 590
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3476 PSQKGNATK P G A - Y SS TSHHR AA E P TKK A LKPPAPPRK P G G MGIPA AE LVLS P EDR V KPNTSKG K LRGT P QS S G G LQPGT 3554
Cdd:PRK07764 591 APGAAGGEG P P A p A SS GPPEE AA R P AAP A APAAPAAPA P A G AAAAP AE ASAA P APG V AAPEHHP K HVAV P DA S D G GDGWP 670
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3555 QTG GG SQ P QPTSGQLQSEMAST P TEPSCPSW A SSTPDQ PP PRAHTKGSTRG P GD A VHQGVQVHSSPREKR --- E SHGRQR 3631
Cdd:PRK07764 671 AKA GG AA P AAPPPAPAPAAPAA P AGAAPAQP A PAPAAT PP AGQADDPAAQP P QA A QGASAPSPAADDPVP lpp E PDDPPD 750
330 340 350
....*....|....*....|....*....|....*.
gi 1385123368 3632 KGQ A LGLGRHGSVGNTGK AP L A PDKS S RAPRKQATP 3667
Cdd:PRK07764 751 PAG A PAQPPPPPAPAPAA AP A A APPP S PPSEEEEMA 786
zf-C2H2
pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3171-3191
7.02e-03
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
Pssm-ID: 395048 [Multi-domain]
Cd Length: 23
Bit Score: 36.51
E-value: 7.02e-03
dnaA
PRK14086
chromosomal replication initiator protein DnaA;
233-414
7.10e-03
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain]
Cd Length: 617
Bit Score: 42.12
E-value: 7.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 233 PK P F P DGS R P S S P QG v SA P YPF P V E TVQHE RA aetmlf TFHQ P LVAWSEEALGTN PAYP SLPCN P G P SGGAS A PS D L G GA 312
Cdd:PRK14086 96 AP P P P HAR R T S E P EL - PR P GRR P Y E GYGGP RA ------ DDRP P GLPRQDQLPTAR PAYP AYQQR P E P GAWPR A AD D Y G WQ 168
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 313 LSPP G -- AARLLP SP FHDSLHKSLTK ----- G I PE GPLPA RD GLGS ------ PR GLPNPP P QRH f PG Q G YEAN G VGTS P A 379
Cdd:PRK14086 169 QQRL G fp PRAPYA SP ASYAPEQERDR epyda G R PE YDQRR RD YDHP rpdwdr PR RDRTDR P EPP - PG A G HVHR G GPGP P E 247
170 180 190
....*....|....*....|....*....|....*.
gi 1385123368 380 SL D TELPTPG P P - P TH L PQLWDTTAA P PY PT ST L D P 414
Cdd:PRK14086 248 RD D APVVPIR P S a P GP L AAQPAPAPG P GE PT AR L N P 283
PHA03307
PHA03307
transcriptional regulator ICP4; Provisional
3434-3758
9.71e-03
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain]
Cd Length: 1352
Bit Score: 42.08
E-value: 9.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3434 G KHKSPGNGDKCA P GCS PG HPSQLQ E RLV T THHMAPEGRIEG P SQK G NA T K PG AY S STSHHRAAE P TKKA lk P PAP P RKP 3513
Cdd:PHA03307 58 G AAACDRFEPPTG P PPG PG TEAPAN E SRS T PTWSLSTLAPAS P ARE G SP T P PG PS S PDPPPPTPP P ASPP -- P SPA P DLS 135
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3514 GGMGIPAAELVLSPEDRVKPNT S KGKLRGTPQ SS GGLQPGTQTGGGSQPQ P T S GQLQSEMAST P TEP S CPSWAS S T P DQP 3593
Cdd:PHA03307 136 EMLRPVGSPGPPPAASPPAAGA S PAAVASDAA SS RQAALPLSSPEETARA P S S PPAEPPPSTP P AAA S PRPPRR S S P ISA 215
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3594 PPRAHTKGST R GPG D AVHQGVQVH SS PREK ----------- RESHGRQRKGQALGLGRHGSVGNTGKA P LAPDK S S R APR 3662
Cdd:PHA03307 216 SASSPAPAPG R SAA D DAGASSSDS SS SESS gcgwgpenecp LPRPAPITLPTRIWEASGWNGPSSRPG P ASSSS S P R ERS 295
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3663 KQAT PS R -- VP P VK S R P SGQ SS RARPQP S --------------- A QRK G DPGHT S EKG S L P QAR A LSRPYKRVRALHVSG 3725
Cdd:PHA03307 296 PSPS PS S pg SG P AP S S P RAS SS SSSSRE S sssstssssessrga A VSP G PSPSR S PSP S R P PPP A DPSSPRKRPRPSRAP 375
330 340 350
....*....|....*....|....*....|...
gi 1385123368 3726 VA P MEPRD R R T AEAQSDLLSQLFGQKLTSF R I P 3758
Cdd:PHA03307 376 SS P AASAG R P T RRRARAAVAGRARRRDATG R F P 408
Blast search parameters
Data Source:
Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd Low complexity filter: no Composition Based Adjustment: yes E-value threshold: 0.01