| Name | Accession | Description | Interval | E-value |
| HSH155 | COG5181 | U2 snRNP spliceosome subunit [RNA processing and modification]; | 457-1312 | 0e+00 |
|
U2 snRNP spliceosome subunit [RNA processing and modification];
Pssm-ID: 227508 [Multi-domain] Cd Length: 975 Bit Score: 1203.23 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 457 MQTEDRSMKQVNDQPS-GNLPFLKPDDIQYFDKLLVEVDESTLSPEEQKERKIMKLLLKIKNGTPPMRKAALRQITDKAR 535
Cdd:COG5181 118 MCLPARGYKALTDFHGyADLGFFKVEDLKYFADDEKDFFMPLLEDREGDERDVYRLLLKVKNGGKRMRMEGLRILTDKAV 197
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 536 EFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDLVRPYVHKILVVIEPLLIDEDYYARVEGREIISNLAKAA 615
Cdd:COG5181 198 NFGAAAVFNKVLPMLMSRELEDQERHLVVKLIDRLLYGLDDLKVPYVHKILVVVGPLLIDEDLKRRCMGREIILNLVYRC 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 616 GLATMISTMRPDIDNMDEYVRNTTARAFAVVASALGIPSLLPFLKAVCKSKKSWQARHTGIKIVQQIAILMGCAILPHLR 695
Cdd:COG5181 278 GLGFSVSSMRPDITSKDEYVRNVTGRAVGVVADALGVEELLPFLEALCGSRKSWEARHTGIRIAQQICELLGRSRLSHLG 357
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 696 SLVEIIEHGLVDEQQKVRTISALAIAALAEAATPYGIESFDSVLKPLWKGIRQHRGKGLAAFLKAIGYLIPLMDAEYANY 775
Cdd:COG5181 358 PLLKCISKLLKDRSRFVRIDTANALSYLAELVGPYGIEQFDEVLCPLWEGASQHRGKELVSFLKAMGFIIPLMSPEYACH 437
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 776 YTREVMLILIREFQSPDEEMKKIVLKVVKQCCATDGVEANYIKTEILPPFFKHFWQHRMALDRRNYRQLVDTTVELANKV 855
Cdd:COG5181 438 DTREHMEIVFREFKSPDEEMKKDLLVVERICDKVGTDTPWKLRDQVSPEFFSPFWRRRSAGDRRSYKQVVLTTVILAKMG 517
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 856 GAAEIISRIVDDLKDEAEQYRKMVMETIEKIMGNLGAADIDHKLEEQLIDGILYAFQEQTTEDSVMLNGFGTVVNALGKR 935
Cdd:COG5181 518 GDPRVSRKILEYYSDEPEPYRKMNAGLVSRIFSRLGRLGFDERLEERLYDSILNAFQEQDTTVGLILPCFSTVLVSLEFR 597
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 936 VKPYLPQICGTVLWRLNNKSAKVRQQAADLISRTAVVMKTCQEEKLMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVI 1015
Cdd:COG5181 598 GKPHLSMIVSTILKLLRSKPPDVRIRAADLMGSLAKVLKACGETKELAKLGNILYENLGEDYPEVLGSILKAICSIYSVH 677
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 1016 GMHKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGAEYVSAREWMRICFELLELLKAHKKAIRRATVNTFGYI 1095
Cdd:COG5181 678 RFRSMQPPISGILPSLTPILRNKHQKVVANTIALVGTICMNSPEYIGVREWMRICFELVDSLKSWNKEIRRNATETFGCI 757
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 1096 AKAIGPHDVLATLLNNLKVQERQNRVCTTVAIAIVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKD 1175
Cdd:COG5181 758 SRAIGPQDVLDILLNNLKVQERQQRVCTSVAISIVAEYCGPFSVLPTLMSDYETPEANVQNGVLKAMCFMFEYIGQASLD 837
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 1176 YIYAVTPLLEDALMDRDLVHRQTASAVVQHMSLGVYGFGCEDSLNHLLNYVWPNVFETSPHVIQAVMGALEGLRVAIGPC 1255
Cdd:COG5181 838 YVYSITPLLEDALTDRDPVHRQTAMNVIRHLVLNCPGTGDEDAAIHLLNLLWPNILEPSPHVIQSFDEGMESFATVLGSG 917
|
810 820 830 840 850
....*....|....*....|....*....|....*....|....*....|....*..
gi 68366436 1256 RMLQYCLQGLFHPARKVRDVYWKIYNSIYIGSQDALIAHYPLIfnDEKNSYVRYELE 1312
Cdd:COG5181 918 AMMKYVQQGLFHPSSTVRKRYWTVYNIMYVFDSDAMVPCYPVE--EDLNPELARTLH 972
|
|
| SF3b1 | pfam08920 | Splicing factor 3B subunit 1; This family consists of several eukaryotic splicing factor 3B ... | 343-453 | 7.18e-66 |
|
Splicing factor 3B subunit 1; This family consists of several eukaryotic splicing factor 3B subunit 1 proteins, which associate with p14 through a C-terminus beta-strand that interacts with beta-3 of the p14 RNA recognition motif (RRM) beta-sheet, which is in turn connected to an alpha-helix by a loop that makes extensive contacts with both the shorter C-terminal helix and RRM of p14. This subunit is required for 'A' splicing complex assembly (formed by the stable binding of U2 snRNP to the branchpoint sequence in pre-mRNA) and 'E' splicing complex assembly.
Pssm-ID: 462634 [Multi-domain] Cd Length: 114 Bit Score: 217.63 E-value: 7.18e-66
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 343 SKRKSRWDETPASQM---GSSTPLLTPGK-TPLGtpAMNMATPTPGHLMSMTPEQLQAWRWEREIDERNRPLTDEELDAM 418
Cdd:pfam08920 1 SKRRSRWDETPANAGsgpGGATPGETPGRqTPVG--AMGMATPTPGALGPMTPEQMQAFRWEKEIDERNRPLTDEELDAM 78
|
90 100 110
....*....|....*....|....*....|....*.
gi 68366436 419 FP-EGYKVLPPPAGYVPIRTPARKLAATPTPIGGMT 453
Cdd:pfam08920 79 LPgEGYKILDPPAGYVPIRTPARKLLATPTPMGGTG 114
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 188-383 | 3.39e-08 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 58.26 E-value: 3.39e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 188 GSAASQAAAKRKRRWDQTADQTPSNSTPKKVSSWDQAD---GGSETPGHTPGHTPSNSRWDEtPGRPKGSETPGATPSTR 264
Cdd:PHA03307 216 SASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENecpLPRPAPITLPTRIWEASGWNG-PSSRPGPASSSSSPRER 294
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 265 MWEPTPSHTPAGAATPGRDTPGHATPGHGGATSSVRknrwdETPKTERE-------------TPGHGSGWAETPRTDRGD 331
Cdd:PHA03307 295 SPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTS-----SSSESSRGaavspgpspsrspSPSRPPPPADPSSPRKRP 369
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 68366436 332 ESVGETPTPGASK-----RKSRWDETPASQMGSSTPLLTPGKTPLGTPAMNMATPTP 383
Cdd:PHA03307 370 RPSRAPSSPAASAgrptrRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAF 426
|
|
| PHA03377 | PHA03377 | EBNA-3C; Provisional | 204-448 | 8.46e-08 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 56.98 E-value: 8.46e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 204 QTADQTPSNSTP----KKVSSWDQADGGSETPGHTPGHTPSNSrwdetpGRPKGSETPGATPSTRMWEPTPSHTPAGAAT 279
Cdd:PHA03377 518 ETTEEEESVTQPakphRKVQDGFQRSGRRQKRATPPKVSPSDR------GPPKASPPVMAPPSTGPRVMATPSTGPRDMA 591
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 280 PGRDTPGHATPGHGGATSSVRKnrwDETPKTERETPGHGSGWAETPRTDRGDESVGETPTP----GASKRKSRWDETPAS 355
Cdd:PHA03377 592 PPSTGPRQQAKCKDGPPASGPH---EKQPPSSAPRDMAPSVVRMFLRERLLEQSTGPKPKSfwemRAGRDGSGIQQEPSS 668
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 356 QMGSSTPLLTPGKTPLGT----PAMNMATPTP---GHLMSMTP-------EQLQAWRWEREIDERNRPLTD--------- 412
Cdd:PHA03377 669 RRQPATQSTPPRPSWLPSvfvlPSVDAGRAQPseeSHLSSMSPtqpisheEQPRYEDPDDPLDLSLHPDQApppshqapy 748
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 68366436 413 ---EEL---DAMFPEGYKVLPPPAGYVPIRTP-ARKLAATPTP 448
Cdd:PHA03377 749 sghEEPqaqQAPYPGYWEPRPPQAPYLGYQEPqAQGVQVSSYP 791
|
|
| CTD | smart01104 | Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ... | 231-393 | 1.09e-07 |
|
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.
Pssm-ID: 215026 [Multi-domain] Cd Length: 121 Bit Score: 51.75 E-value: 1.09e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 231 PGHTPGHTPSNSRwdeTPGRpkGSETPGATPstrmWEPTPSHTPAGAATPGRDTPGHATPGHGGATSSVRKnrwdetpkt 310
Cdd:smart01104 1 GGRTPAWGASGSK---TPAW--GSRTPGTAA----GGAPTARGGSGSRTPAWGGAGSRTPAWGGAGPTGSR--------- 62
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 311 ereTPGHGSGWAETPRTDRGDESvgetptpgaskrksrwdeTPASQMGSSTPLLTPGKTPLGTPamnMATPTPGHLMSMT 390
Cdd:smart01104 63 ---TPAWGGASAWGNKSSEGSAS------------------SWAAGPGGAYGAPTPGYGGTPSA---YGPATPGGGAMAG 118
|
...
gi 68366436 391 PEQ 393
Cdd:smart01104 119 SAS 121
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 131-448 | 1.17e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 53.25 E-value: 1.17e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 131 SPERHDPFADGGKTPDPkvqvrtymdvmkeqqlskeereirlqmvekAKAGELKAVNGSAASQAAAKRkrrwDQTADQTP 210
Cdd:PHA03307 206 PPRRSSPISASASSPAP------------------------------APGRSAADDAGASSSDSSSSE----SSGCGWGP 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 211 SNSTPKkvsswdqadgGSETPGHTPGHTPSNSRWDEtPGRPKGSETPGATPSTRMWEPTPSHTPAGAATPGRDTPGHATP 290
Cdd:PHA03307 252 ENECPL----------PRPAPITLPTRIWEASGWNG-PSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSS 320
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 291 GHGGATSSVRknrwdETPKTERETPGH-GSGWAETPRTDRGDEsvgetPTPGASKRKSrwdetPASQMGSSTPLLTPGKt 369
Cdd:PHA03307 321 SRESSSSSTS-----SSSESSRGAAVSpGPSPSRSPSPSRPPP-----PADPSSPRKR-----PRPSRAPSSPAASAGR- 384
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 370 plGTPAMNMATPTPGHLMSMTPEQLQAWRwereidERNRPLTDEELDAMFPEGYKVL-----------PPPAGYV----- 433
Cdd:PHA03307 385 --PTRRRARAAVAGRARRRDATGRFPAGR------PRPSPLDAGAASGAFYARYPLLtpsgepwpgspPPPPGRVryggl 456
|
330 340
....*....|....*....|....*.
gi 68366436 434 ----------P-IRTPARKLAATPTP 448
Cdd:PHA03307 457 gdsrpglwdaPeVREAAARYEASPGP 482
|
|
| CTD | smart01104 | Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ... | 178-298 | 3.87e-06 |
|
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.
Pssm-ID: 215026 [Multi-domain] Cd Length: 121 Bit Score: 47.13 E-value: 3.87e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 178 AKAGELKAVNGSAASQAAAKRKRRWDQTADQTPSnstpkkvssWDQAdgGSETPGHTpGHTPSNSRwdeTPGRPKGSETP 257
Cdd:smart01104 9 ASGSKTPAWGSRTPGTAAGGAPTARGGSGSRTPA---------WGGA--GSRTPAWG-GAGPTGSR---TPAWGGASAWG 73
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 68366436 258 --GATPSTRMWEPTPSHTPaGAATPG----RDTPGHATPGHGGATSS 298
Cdd:smart01104 74 nkSSEGSASSWAAGPGGAY-GAPTPGyggtPSAYGPATPGGGAMAGS 119
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 189-473 | 1.68e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 49.40 E-value: 1.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 189 SAASQAAAKRKRRWDqtaDQTPSNST------PKKVSSWDQADGGSETPGHTPGHTPSNSRWDETPGRPKGSETPGATPS 262
Cdd:PHA03307 55 VVAGAAACDRFEPPT---GPPPGPGTeapaneSRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPA 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 263 TRMWEPTPSHTPAGAATPGRDTPGHATPGHGGATSSVRKNRWDETPKTERETPGHGSGWAETPrtdrgdesvGETPTPGA 342
Cdd:PHA03307 132 PDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPP---------PSTPPAAA 202
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 343 SKRKSRWDETPASQMGSSTPLltPGKTPlgtpamnmATPTPGHLMSMTPEQLQAWRWEREiDERNRP---LTDEELDAMF 419
Cdd:PHA03307 203 SPRPPRRSSPISASASSPAPA--PGRSA--------ADDAGASSSDSSSSESSGCGWGPE-NECPLPrpaPITLPTRIWE 271
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 68366436 420 PEGYKVLPPPAGYVPIRTPARKLAATPTPIGGMTGFHMQTEDRSMKQVNDQPSG 473
Cdd:PHA03307 272 ASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESS 325
|
|
| PHA03378 | PHA03378 | EBNA-3B; Provisional | 200-460 | 2.72e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 48.91 E-value: 2.72e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 200 RRWDQTADQTPSNSTPKKVSSWDQAdgGSETPGHTP--GHTPSNSRWDETPGRPKGSETPGATPSTRM-WEPTPSHTPA- 275
Cdd:PHA03378 622 RQWPMPLRPIPMRPLRMQPITFNVL--VFPTPHQPPqvEITPYKPTWTQIGHIPYQPSPTGANTMLPIqWAPGTMQPPPr 699
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 276 ------------GAATPGRDTPGHATPGHGGATSSVRKNRWDETPKTERETPGHGSGWAETPRTDRGDESVGETPTPgas 343
Cdd:PHA03378 700 aptpmrppaappGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTP--- 776
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 344 krksrwdeTPASQMGsSTPLLTP--GKTPLGTPamnMATPTPGHLM--SMTPEQLQAWRWEREI----DERNRPLTD--- 412
Cdd:PHA03378 777 --------QPPPQAP-PAPQQRPrgAPTPQPPP---QAGPTSMQLMprAAPGQQGPTKQILRQLltggVKRGRPSLKkpa 844
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 413 --EELDAMFPE-------GYKVLPPPAGYVPIRTPA---RKLaATPTPIGGMTGFHMQTE 460
Cdd:PHA03378 845 alERQAAAGPTpspgsgtSDKIVQAPVFYPPVLQPIqvmRQL-GSVRAAAASTVTQAPTE 903
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 224-454 | 8.50e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 8.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 224 ADGGSETPGHTPGHTPSNSrwdETPGRPKGS----ETPGATPSTRMW---------------EPTPSHTPAG-AATPGRD 283
Cdd:PHA03247 2491 AAGAAPDPGGGGPPDPDAP---PAPSRLAPAilpdEPVGEPVHPRMLtwirgleelasddagDPPPPLPPAApPAAPDRS 2567
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 284 TP-GHATPGHGGATSSVRKNRWDETPKTERetpghgsgwAETPRTDRGDESVGETPTPgASKRKSRWDETPASQMGSSTP 362
Cdd:PHA03247 2568 VPpPRPAPRPSEPAVTSRARRPDAPPQSAR---------PRAPVDDRGDPRGPAPPSP-LPPDTHAPDPPPPSPSPAANE 2637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 363 LLTPGKTPLGTPAMNMATPTPGHLmSMTPEQLQAWRWEREIDERNRPLTDEELDAMFPEGYKVLPPPAGYVPIRTPARKL 442
Cdd:PHA03247 2638 PDPHPPPTVPPPERPRDDPAPGRV-SRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALV 2716
|
250
....*....|..
gi 68366436 443 AATPTPIGGMTG 454
Cdd:PHA03247 2717 SATPLPPGPAAA 2728
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 189-448 | 1.81e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 1.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 189 SAASQAAAKRKRRWDQTADQTPSNSTPKKVSSWDQADGGSETPGHTPGHTPSNSrwDETPGRPKGSETPgATPSTRMWEP 268
Cdd:PHA03247 2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS--PAGPLPPPTSAQP-TAPPPPPGPP 2846
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 269 TPSHTPAGAATPGRDTPGHATPGHGGATSSVRKNrwdetPKTERetpghgsgwAETPRTDRGDESVGETPTPGASKRksr 348
Cdd:PHA03247 2847 PPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPAR-----PPVRR---------LARPAVSRSTESFALPPDQPERPP--- 2909
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 349 wdeTPASQMgSSTPLLTPGKTPLGTPAMnmatPTPGhlmsMTPEQLQAWRWEREIDERNRPLTDEELDAMFPEGYKVlpp 428
Cdd:PHA03247 2910 ---QPQAPP-PPQPQPQPPPPPQPQPPP----PPPP----RPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV--- 2974
|
250 260
....*....|....*....|
gi 68366436 429 PAGYVPIRTPARKLAATPTP 448
Cdd:PHA03247 2975 PRFRVPQPAPSREAPASSTP 2994
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 190-447 | 3.35e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.31 E-value: 3.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 190 AASQAAAKRKRRWDQTADQTPSNSTPKK----------VSSWDQADGGSETPGHTPGHTPSNSRWDETPGRPKGSETPGA 259
Cdd:PHA03247 2655 DPAPGRVSRPRRARRLGRAAQASSPPQRprrraarptvGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA 2734
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 260 TPSTrmwePTPSHTPAGAATPGRDTPGHATPGHGGATSSvrknrwdetpktereTPGHGSGWAETPRTdrgdesvgeTPT 339
Cdd:PHA03247 2735 LPAA----PAPPAVPAGPATPGGPARPARPPTTAGPPAP---------------APPAAPAAGPPRRL---------TRP 2786
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 340 PGASKRKSRwDETPASQMGSSTPLLTPGKTPLGTPAMNMATPTPghlmsMTPEQLQAwrwereiderNRPLTDEELDAMF 419
Cdd:PHA03247 2787 AVASLSESR-ESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP-----PPTSAQPT----------APPPPPGPPPPSL 2850
|
250 260
....*....|....*....|....*...
gi 68366436 420 PEGYKVLppPAGYVPIRTPARKLAATPT 447
Cdd:PHA03247 2851 PLGGSVA--PGGDVRRRPPSRSPAAKPA 2876
|
|
| PRK07764 | PRK07764 | DNA polymerase III subunits gamma and tau; Validated | 223-448 | 7.34e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 43.82 E-value: 7.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 223 QADGGSETPGHTPGHTPSNSRWDETPGRPKGSETPGAtpstrmwepTPSHTPAGAATPGRDTPGHATPGHGGATSSVRkn 302
Cdd:PRK07764 587 VVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAA---------PAAPAPAGAAAAPAEASAAPAPGVAAPEHHPK-- 655
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 303 rwdetpKTERETPGHGSGWAETPRTDRGDESVGETPTPGAskrkSRWDETPASQMGSSTPLLTPGKTPLGTPAMNMATPT 382
Cdd:PRK07764 656 ------HVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAA----PAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAA 725
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 68366436 383 pghlmsmtpeqlQAWRWEREIDERNRPLTDEELDAMFPEGYKVLPPPAGYVPIRTPArklAATPTP 448
Cdd:PRK07764 726 ------------QGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAP---AAAPPP 776
|
|
| HEAT_EZ | pfam13513 | HEAT-like repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats ... | 998-1053 | 1.07e-03 |
|
HEAT-like repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). These EZ repeats are found in subunits of cyanobacterial phycocyanin lyase and other proteins and probably carry out a scaffolding role.
Pssm-ID: 463906 [Multi-domain] Cd Length: 55 Bit Score: 38.50 E-value: 1.07e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 68366436 998 PEVLGSILGALKAIVNViGMHKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRI 1053
Cdd:pfam13513 1 WRVREAAALALGSLAEG-GPDLLAPAVPELLPALLPLLNDDSDLVREAAAWALGRL 55
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 189-346 | 1.13e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 43.62 E-value: 1.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 189 SAASQAAAKRKRRWDQTAD-QTPSNSTPKKVSSWDQADGGSETPGHT--PGHTPSNSRWDETPGRPKGSETPGATPSTRM 265
Cdd:PHA03307 790 VRAEAAFRRPGRLRRSGPAaDAASRTASKRKSRSHTPDGGSESSGPArpPGAAARPPPARSSESSKSKPAAAGGRARGKN 869
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 266 wEPTPSHTPAGAATPGRDTPGHATPGHGGATSSVRKNRWDETPKTER---ETPGHGSGWAETPRtdrgdesvGE--TPTP 340
Cdd:PHA03307 870 -GRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPmppGGPDPRGGFRRVPP--------GDlhTPAP 940
|
....*.
gi 68366436 341 GASKRK 346
Cdd:PHA03307 941 SAAALA 946
|
|
| PRK07764 | PRK07764 | DNA polymerase III subunits gamma and tau; Validated | 185-417 | 2.16e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 42.28 E-value: 2.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 185 AVNGSAASQAAAKRKRRWDQTADQTPSNSTPKKVSSWDQADGGSETPGHTPGHTPSNSRWDETPGRPKGSETPGATPSTR 264
Cdd:PRK07764 604 ASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP 683
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 265 MWEPTPSHTPAGAATPGRDTPGHATPGHGGATSSVRKNRwdetpkteretpghGSGWAETPRTDRGDESVGETPTPGAsk 344
Cdd:PRK07764 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP--------------QAAQGASAPSPAADDPVPLPPEPDD-- 747
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 68366436 345 rksrwDETPASQMGSSTPLLTPGKTPLGTPAMNMATPTPGHLMSMTPEQLQAWRWEREIDERNRPLTDEELDA 417
Cdd:PRK07764 748 -----PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLEEELGA 815
|
|
| dnaA | PRK14086 | chromosomal replication initiator protein DnaA; | 203-384 | 4.36e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 41.35 E-value: 4.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 203 DQTADQTPSNSTPKKVSSWDQ--ADGGSETPGHtPGHTPSnsrwDETPGRPKGSETPGATPS--TRMWEPTPSHTPAGAA 278
Cdd:PRK14086 89 DPSAGEPAPPPPHARRTSEPElpRPGRRPYEGY-GGPRAD----DRPPGLPRQDQLPTARPAypAYQQRPEPGAWPRAAD 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 68366436 279 TPGRDTPGHATPG---HGGATSSVR-KNRWDETPKTERETPGHGSGWAETPRTD----RGDESVGETPTPGASKRKSRWD 350
Cdd:PRK14086 164 DYGWQQQRLGFPPrapYASPASYAPeQERDREPYDAGRPEYDQRRRDYDHPRPDwdrpRRDRTDRPEPPPGAGHVHRGGP 243
|
170 180 190
....*....|....*....|....*....|....
gi 68366436 351 ETPASQMGSSTPLLTPGKTPLgtPAMNMATPTPG 384
Cdd:PRK14086 244 GPPERDDAPVVPIRPSAPGPL--AAQPAPAPGPG 275
|
|
|
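Each hit above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ..." and an interval such as "457-1312" on the query. The following is a minimal Python sketch for pulling those fields out of the text, assuming the line layout stays exactly as shown in this listing. The names `HitStats`, `parse_stats`, and `interval_length` are hypothetical helpers introduced here for illustration and are not part of any NCBI tool.

```python
import re
from typing import NamedTuple, Optional


class HitStats(NamedTuple):
    """Fields from one 'Pssm-ID: ...' statistics line (hypothetical container)."""
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, copied from the listing above:
# "Pssm-ID: 227508 [Multi-domain] Cd Length: 975 Bit Score: 1203.23 E-value: 0e+00"
STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_stats(line: str) -> Optional[HitStats]:
    """Return the parsed statistics, or None if the line has a different shape."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return HitStats(
        pssm_id=int(match.group(1)),
        cd_length=int(match.group(2)),
        bit_score=float(match.group(3)),
        e_value=float(match.group(4)),  # float() accepts forms like "7.18e-66" and "0e+00"
    )


def interval_length(interval: str) -> int:
    """Number of query residues covered by an inclusive interval such as '343-453'."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


if __name__ == "__main__":
    stats = parse_stats(
        "Pssm-ID: 462634 [Multi-domain] Cd Length: 114 Bit Score: 217.63 E-value: 7.18e-66"
    )
    print(stats)
    print(interval_length("343-453"))
```

Parsed this way, the hits can be sorted or filtered by E-value. For example, a cutoff such as 1e-10 (an arbitrary illustrative threshold) would keep only the HSH155 and SF3b1 hits from this listing.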