|
Name |
Accession |
Description |
Interval |
E-value |
| Treslin_N |
pfam15292 |
Treslin N-terminus; This family represents the N-terminus of treslin, a checkpoint regulator ... |
208-1004 |
0e+00 |
|
Treslin N-terminus; This family represents the N-terminus of treslin, a checkpoint regulator which plays a role in DNA replication preinitiation complex formation. :
Pssm-ID: 464618 Cd Length: 793 Bit Score: 1380.93 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 208 FYWVDTTEWSKLWESPDHLGYWTVCELLHHGGGTVLPSESFSWDFAQAGEMLLRSGIKLSSEPHLSPWISMLPTDATLNR 287
Cdd:pfam15292 1 LHWVDTTEYSKLWESPDHLGYWTVSEVLQQVGGTILPSETALLDLSSAGESLLSGGRKGSPAPHLSPWISALPFDSTLNY 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 288 LLYNSPEYEASFPRMEGMLFLPVEaGKEIQETWTVTLEPLAMHQRHFQKPVRIFLKGSVAQWSLPTSSTLGTDSWMLGSP 367
Cdd:pfam15292 81 LLSSEPVYRAAFPQLEGVLFWPQE-GKEEQQSCAVTLEPVAMRQRHLQEPVRIFLKGVLTQWDAPSLSQLGTESWILQSS 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 368 EESTATQRLLFQQLVSRLTAEELHLVADVDPGEGRPPITGVISPLSASAMILTVCRTKEAEFQRHVLQTAVADSPRDTAS 447
Cdd:pfam15292 160 EEEDSEQAALFQQLLRRLSAEELHMVAEVDPGEGGPPCTAVLSPLSASTALLTVLQPEEAQFQQLLLTTVVTESTQDTSS 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 448 LFSDVVDSILNQTHDSLADTASA----ASPVPEWAQQELGHTTPWSPAVVEKWFPFCNISGASSDLMESFGLLQAASANK 523
Cdd:pfam15292 240 DLPDVVSSVLNVVYDIMEEDPAAdeieDPPVPEWAQQELSRTSPWSTAVVEGWFPLSDQSGASSHLMESFRLLQAVPEEK 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 524 EESSKT-EGELIHCLAELYQRKSREEStiAHQEDSKKKRGVPRTPVRQKMNTMCRSLKMLNVARLNVKAQKLHPDGSPDV 602
Cdd:pfam15292 320 EESSKTlEQELTSCLSELYQRKSREES--ASQEDRGKKRGVPRTPVRQKMKTMSRSLQMLNVARLNVKAQKLQPEGEPDG 397
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 603 AGEKGIQKIPSGRTVDKLEDRGRTLRSskPKDFKTEEELLSYIRENYQKTVATGEIMLYACARNMISTVKMFLKSKGTkE 682
Cdd:pfam15292 398 AGEKGPQKPGKRRSSDRLEPRGRTLRS--PKDFKTEEELLSHLKENYQKTVAEGESSLLTCAQNLISTVKAFLKSKGT-D 474
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 683 LEVNCLNQVKSSLLKTSKSLRQNLGKKLDKEDKVRECQLQVFLRLEMCLQCPSINESTDDMEQVVEEVTDLLRMVCLTED 762
Cdd:pfam15292 475 LEANCLNLVKNHLLKTSKSIRQQYGSALDKESKVRECQLQVFLRLELCLQCPSLQSDSDDMEQLVEEVTDMLRIISLTKD 554
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 763 SAYLAEFL-EEILRLYIDSIPKTLGNLYNSLGFVIPQKLAGVLPTDFFSDDSMTQENKSPLLSVPFLSSARRSVSGspeS 841
Cdd:pfam15292 555 PAYLARFLqEEILPLYLDSIPKTLGDLYHSLGTQIPEKLAAVLPADFFSDDSMTQDSISPSLSSSLLSSASLSSSG---E 631
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 842 DELQELRTRSAKKRRKNALIRHKSIAEVSQNLRQIEIPKVSKRATKKENSHPAPQQPSQ---PVKDTVQEVTKVRRNLFN 918
Cdd:pfam15292 632 DQLEELRTRSAKKRRKNALTRHRSMTESSQNLRQIEIPKKSKRATKSENSHSLLKTAVQqppPQKDTVQEVTKVRRNLFN 711
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 919 QELLSPSKRSLkrgLPRSHSVSAVDGLEDKLDNFKKNkGYHKLLTKSVAETPVHKQISKRLLHRQIKGRSSDPGPDIGVV 998
Cdd:pfam15292 712 QEIVSPSKRSK---LPRSQSVSAVEGLKHKRSSEKEE-DYHKLLTKKVAETPLHKQVSRRLLHRQIKGRSSDPGPDICIV 787
|
....*.
gi 118421085 999 EESPEK 1004
Cdd:pfam15292 788 EESPEK 793
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
1086-1504 |
1.07e-06 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1086 RMKKRSRNTLDSEVPAAYQTPKKSHQKSLSfSKTTPRRISHTPQTPLYTPERLQKSPAKMTPTKQAAFKESLKDSSSPGH 1165
Cdd:PHA03247 2660 RVSRPRRARRLGRAAQASSPPQRPRRRAAR-PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAA 2738
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1166 DSPLDSKITPQKRHTQAGEGTSLETKTPRTPKRQGTQPPGFLPNCTWPHSVNSSPESPSCPAPPTSSTAQPRRECLTPIR 1245
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1246 DPLRTPpraAAFMGTPqnqthqqPHVLRAARAEEPAQKLKDKAIKTPKRPGNSTVTSSPPVTPKKLFTSPlcdvsKKSPF 1325
Cdd:PHA03247 2819 PPAASP---AGPLPPP-------TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPV 2883
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1326 RKSKIECPSPGELDQKEPQMSPSVAASLSCPVPSTPPELSQRATLDTVPPPPPSkvgkrcrktsdprrsivecQPDASAT 1405
Cdd:PHA03247 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP-------------------RPQPPLA 2944
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1406 PGVGTADSPAAPTDSRDDQKGL----------SLSPQ----------SPPERRGYPGPGLrSDWHASSPLLITSDTEHVT 1465
Cdd:PHA03247 2945 PTTDPAGAGEPSGAVPQPWLGAlvpgrvavprFRVPQpapsreapasSTPPLTGHSLSRV-SSWASSLALHEETDPPPVS 3023
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 118421085 1466 LL------SEAEHhgiGDLKSNVLSVEEGEGLRTADAEKSSLSHP 1504
Cdd:PHA03247 3024 LKqtlwppDDTED---SDADSLFDSDSERSDLEALDPLPPEPHDP 3065
|
|
| PHA03307 super family |
cl33723 |
transcriptional regulator ICP4; Provisional |
1323-1707 |
1.54e-03 |
|
transcriptional regulator ICP4; Provisional The actual alignment was detected with superfamily member PHA03307:
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 43.62 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1323 SPFRKSKIECPSPGELDQKEPQMSPSVAASLSCPVPSTPPELSQRATLDTVPPPPPSkvgkrcrkTSDPrrsiveCQPDA 1402
Cdd:PHA03307 63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPP--------TPPP------ASPPP 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1403 SATPGVGTADSPAAPTDSRDDQkglslSPQSPPERRGYPGPGLRSDWHASSPLLITSDTEHVtllseaehhGIGDLKSNV 1482
Cdd:PHA03307 129 SPAPDLSEMLRPVGSPGPPPAA-----SPPAAGASPAAVASDAASSRQAALPLSSPEETARA---------PSSPPAEPP 194
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1483 LSVEEGEGLRTADAEKSSLSHPGippsppscgPGSPLMPSRDvhcttdgrqcqasaQLDNLPASAWHSTDSASPQTYEVE 1562
Cdd:PHA03307 195 PSTPPAAASPRPPRRSSPISASA---------SSPAPAPGRS--------------AADDAGASSSDSSSSESSGCGWGP 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1563 LEMQASGLPKLrikKIDPSSSLEAEPLSKEESSLGEESFLPALSMPRASRSLSKPEPTYVSPPCPRLSHSTP------GK 1636
Cdd:PHA03307 252 ENECPLPRPAP---ITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSsresssSS 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 118421085 1637 SRGQTYICQACTPTHGPSSTPSPFQTDGVPWTP-------SPKHSGKTTPDIIKDWPRRKRAVGCGAGSSSGRGEVGA 1707
Cdd:PHA03307 329 TSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADpssprkrPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGR 406
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Treslin_N |
pfam15292 |
Treslin N-terminus; This family represents the N-terminus of treslin, a checkpoint regulator ... |
208-1004 |
0e+00 |
|
Treslin N-terminus; This family represents the N-terminus of treslin, a checkpoint regulator which plays a role in DNA replication preinitiation complex formation.
Pssm-ID: 464618 Cd Length: 793 Bit Score: 1380.93 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 208 FYWVDTTEWSKLWESPDHLGYWTVCELLHHGGGTVLPSESFSWDFAQAGEMLLRSGIKLSSEPHLSPWISMLPTDATLNR 287
Cdd:pfam15292 1 LHWVDTTEYSKLWESPDHLGYWTVSEVLQQVGGTILPSETALLDLSSAGESLLSGGRKGSPAPHLSPWISALPFDSTLNY 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 288 LLYNSPEYEASFPRMEGMLFLPVEaGKEIQETWTVTLEPLAMHQRHFQKPVRIFLKGSVAQWSLPTSSTLGTDSWMLGSP 367
Cdd:pfam15292 81 LLSSEPVYRAAFPQLEGVLFWPQE-GKEEQQSCAVTLEPVAMRQRHLQEPVRIFLKGVLTQWDAPSLSQLGTESWILQSS 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 368 EESTATQRLLFQQLVSRLTAEELHLVADVDPGEGRPPITGVISPLSASAMILTVCRTKEAEFQRHVLQTAVADSPRDTAS 447
Cdd:pfam15292 160 EEEDSEQAALFQQLLRRLSAEELHMVAEVDPGEGGPPCTAVLSPLSASTALLTVLQPEEAQFQQLLLTTVVTESTQDTSS 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 448 LFSDVVDSILNQTHDSLADTASA----ASPVPEWAQQELGHTTPWSPAVVEKWFPFCNISGASSDLMESFGLLQAASANK 523
Cdd:pfam15292 240 DLPDVVSSVLNVVYDIMEEDPAAdeieDPPVPEWAQQELSRTSPWSTAVVEGWFPLSDQSGASSHLMESFRLLQAVPEEK 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 524 EESSKT-EGELIHCLAELYQRKSREEStiAHQEDSKKKRGVPRTPVRQKMNTMCRSLKMLNVARLNVKAQKLHPDGSPDV 602
Cdd:pfam15292 320 EESSKTlEQELTSCLSELYQRKSREES--ASQEDRGKKRGVPRTPVRQKMKTMSRSLQMLNVARLNVKAQKLQPEGEPDG 397
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 603 AGEKGIQKIPSGRTVDKLEDRGRTLRSskPKDFKTEEELLSYIRENYQKTVATGEIMLYACARNMISTVKMFLKSKGTkE 682
Cdd:pfam15292 398 AGEKGPQKPGKRRSSDRLEPRGRTLRS--PKDFKTEEELLSHLKENYQKTVAEGESSLLTCAQNLISTVKAFLKSKGT-D 474
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 683 LEVNCLNQVKSSLLKTSKSLRQNLGKKLDKEDKVRECQLQVFLRLEMCLQCPSINESTDDMEQVVEEVTDLLRMVCLTED 762
Cdd:pfam15292 475 LEANCLNLVKNHLLKTSKSIRQQYGSALDKESKVRECQLQVFLRLELCLQCPSLQSDSDDMEQLVEEVTDMLRIISLTKD 554
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 763 SAYLAEFL-EEILRLYIDSIPKTLGNLYNSLGFVIPQKLAGVLPTDFFSDDSMTQENKSPLLSVPFLSSARRSVSGspeS 841
Cdd:pfam15292 555 PAYLARFLqEEILPLYLDSIPKTLGDLYHSLGTQIPEKLAAVLPADFFSDDSMTQDSISPSLSSSLLSSASLSSSG---E 631
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 842 DELQELRTRSAKKRRKNALIRHKSIAEVSQNLRQIEIPKVSKRATKKENSHPAPQQPSQ---PVKDTVQEVTKVRRNLFN 918
Cdd:pfam15292 632 DQLEELRTRSAKKRRKNALTRHRSMTESSQNLRQIEIPKKSKRATKSENSHSLLKTAVQqppPQKDTVQEVTKVRRNLFN 711
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 919 QELLSPSKRSLkrgLPRSHSVSAVDGLEDKLDNFKKNkGYHKLLTKSVAETPVHKQISKRLLHRQIKGRSSDPGPDIGVV 998
Cdd:pfam15292 712 QEIVSPSKRSK---LPRSQSVSAVEGLKHKRSSEKEE-DYHKLLTKKVAETPLHKQVSRRLLHRQIKGRSSDPGPDICIV 787
|
....*.
gi 118421085 999 EESPEK 1004
Cdd:pfam15292 788 EESPEK 793
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1086-1504 |
1.07e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1086 RMKKRSRNTLDSEVPAAYQTPKKSHQKSLSfSKTTPRRISHTPQTPLYTPERLQKSPAKMTPTKQAAFKESLKDSSSPGH 1165
Cdd:PHA03247 2660 RVSRPRRARRLGRAAQASSPPQRPRRRAAR-PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAA 2738
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1166 DSPLDSKITPQKRHTQAGEGTSLETKTPRTPKRQGTQPPGFLPNCTWPHSVNSSPESPSCPAPPTSSTAQPRRECLTPIR 1245
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1246 DPLRTPpraAAFMGTPqnqthqqPHVLRAARAEEPAQKLKDKAIKTPKRPGNSTVTSSPPVTPKKLFTSPlcdvsKKSPF 1325
Cdd:PHA03247 2819 PPAASP---AGPLPPP-------TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPV 2883
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1326 RKSKIECPSPGELDQKEPQMSPSVAASLSCPVPSTPPELSQRATLDTVPPPPPSkvgkrcrktsdprrsivecQPDASAT 1405
Cdd:PHA03247 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP-------------------RPQPPLA 2944
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1406 PGVGTADSPAAPTDSRDDQKGL----------SLSPQ----------SPPERRGYPGPGLrSDWHASSPLLITSDTEHVT 1465
Cdd:PHA03247 2945 PTTDPAGAGEPSGAVPQPWLGAlvpgrvavprFRVPQpapsreapasSTPPLTGHSLSRV-SSWASSLALHEETDPPPVS 3023
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 118421085 1466 LL------SEAEHhgiGDLKSNVLSVEEGEGLRTADAEKSSLSHP 1504
Cdd:PHA03247 3024 LKqtlwppDDTED---SDADSLFDSDSERSDLEALDPLPPEPHDP 3065
|
|
| GGN |
pfam15685 |
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the ... |
1335-1502 |
4.40e-04 |
|
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the maturation of sperm and is expressed virtually only in the testis. It is found to be associated with the intracellular membrane, binds with GGNBP1 and may be involved in vesicular trafficking.
Pssm-ID: 434857 [Multi-domain] Cd Length: 668 Bit Score: 45.14 E-value: 4.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1335 PGELDQKEPQMSPS-VAASLSCPVPST-PPELSQRATLDTVPPPP-------PSKvGKRCRKTSDPR-RSIVECQPDASA 1404
Cdd:pfam15685 59 PGLLVPPEPQASPSpLPLTLELPLPVTpPPEEAAAAAVSTAPPPAvgsllpaPSK-WRKPTGTAVARiRGLLEASHRGQG 137
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1405 TPGVGTADSPAAPTD--SRDDQKGLSLSPQSPPERRGYPGPGLRSDWHASS----PLLITSDTEHVTllSEAEHHGIGDL 1478
Cdd:pfam15685 138 DPLSLRPLLPLLPRQliEKDPAPGAPAPPPPTPLEPRKPPPLPPSDRQPPNrgitPALATSATSPTD--SQAKHIAEGKT 215
|
170 180
....*....|....*....|....*....
gi 118421085 1479 KSNVL-----SVEEGEGLRTAdAEKSSLS 1502
Cdd:pfam15685 216 AGGACggappQAGEGEMARFA-ASESGLS 243
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1323-1707 |
1.54e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 43.62 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1323 SPFRKSKIECPSPGELDQKEPQMSPSVAASLSCPVPSTPPELSQRATLDTVPPPPPSkvgkrcrkTSDPrrsiveCQPDA 1402
Cdd:PHA03307 63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPP--------TPPP------ASPPP 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1403 SATPGVGTADSPAAPTDSRDDQkglslSPQSPPERRGYPGPGLRSDWHASSPLLITSDTEHVtllseaehhGIGDLKSNV 1482
Cdd:PHA03307 129 SPAPDLSEMLRPVGSPGPPPAA-----SPPAAGASPAAVASDAASSRQAALPLSSPEETARA---------PSSPPAEPP 194
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1483 LSVEEGEGLRTADAEKSSLSHPGippsppscgPGSPLMPSRDvhcttdgrqcqasaQLDNLPASAWHSTDSASPQTYEVE 1562
Cdd:PHA03307 195 PSTPPAAASPRPPRRSSPISASA---------SSPAPAPGRS--------------AADDAGASSSDSSSSESSGCGWGP 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1563 LEMQASGLPKLrikKIDPSSSLEAEPLSKEESSLGEESFLPALSMPRASRSLSKPEPTYVSPPCPRLSHSTP------GK 1636
Cdd:PHA03307 252 ENECPLPRPAP---ITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSsresssSS 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 118421085 1637 SRGQTYICQACTPTHGPSSTPSPFQTDGVPWTP-------SPKHSGKTTPDIIKDWPRRKRAVGCGAGSSSGRGEVGA 1707
Cdd:PHA03307 329 TSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADpssprkrPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGR 406
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Treslin_N |
pfam15292 |
Treslin N-terminus; This family represents the N-terminus of treslin, a checkpoint regulator ... |
208-1004 |
0e+00 |
|
Treslin N-terminus; This family represents the N-terminus of treslin, a checkpoint regulator which plays a role in DNA replication preinitiation complex formation.
Pssm-ID: 464618 Cd Length: 793 Bit Score: 1380.93 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 208 FYWVDTTEWSKLWESPDHLGYWTVCELLHHGGGTVLPSESFSWDFAQAGEMLLRSGIKLSSEPHLSPWISMLPTDATLNR 287
Cdd:pfam15292 1 LHWVDTTEYSKLWESPDHLGYWTVSEVLQQVGGTILPSETALLDLSSAGESLLSGGRKGSPAPHLSPWISALPFDSTLNY 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 288 LLYNSPEYEASFPRMEGMLFLPVEaGKEIQETWTVTLEPLAMHQRHFQKPVRIFLKGSVAQWSLPTSSTLGTDSWMLGSP 367
Cdd:pfam15292 81 LLSSEPVYRAAFPQLEGVLFWPQE-GKEEQQSCAVTLEPVAMRQRHLQEPVRIFLKGVLTQWDAPSLSQLGTESWILQSS 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 368 EESTATQRLLFQQLVSRLTAEELHLVADVDPGEGRPPITGVISPLSASAMILTVCRTKEAEFQRHVLQTAVADSPRDTAS 447
Cdd:pfam15292 160 EEEDSEQAALFQQLLRRLSAEELHMVAEVDPGEGGPPCTAVLSPLSASTALLTVLQPEEAQFQQLLLTTVVTESTQDTSS 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 448 LFSDVVDSILNQTHDSLADTASA----ASPVPEWAQQELGHTTPWSPAVVEKWFPFCNISGASSDLMESFGLLQAASANK 523
Cdd:pfam15292 240 DLPDVVSSVLNVVYDIMEEDPAAdeieDPPVPEWAQQELSRTSPWSTAVVEGWFPLSDQSGASSHLMESFRLLQAVPEEK 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 524 EESSKT-EGELIHCLAELYQRKSREEStiAHQEDSKKKRGVPRTPVRQKMNTMCRSLKMLNVARLNVKAQKLHPDGSPDV 602
Cdd:pfam15292 320 EESSKTlEQELTSCLSELYQRKSREES--ASQEDRGKKRGVPRTPVRQKMKTMSRSLQMLNVARLNVKAQKLQPEGEPDG 397
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 603 AGEKGIQKIPSGRTVDKLEDRGRTLRSskPKDFKTEEELLSYIRENYQKTVATGEIMLYACARNMISTVKMFLKSKGTkE 682
Cdd:pfam15292 398 AGEKGPQKPGKRRSSDRLEPRGRTLRS--PKDFKTEEELLSHLKENYQKTVAEGESSLLTCAQNLISTVKAFLKSKGT-D 474
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 683 LEVNCLNQVKSSLLKTSKSLRQNLGKKLDKEDKVRECQLQVFLRLEMCLQCPSINESTDDMEQVVEEVTDLLRMVCLTED 762
Cdd:pfam15292 475 LEANCLNLVKNHLLKTSKSIRQQYGSALDKESKVRECQLQVFLRLELCLQCPSLQSDSDDMEQLVEEVTDMLRIISLTKD 554
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 763 SAYLAEFL-EEILRLYIDSIPKTLGNLYNSLGFVIPQKLAGVLPTDFFSDDSMTQENKSPLLSVPFLSSARRSVSGspeS 841
Cdd:pfam15292 555 PAYLARFLqEEILPLYLDSIPKTLGDLYHSLGTQIPEKLAAVLPADFFSDDSMTQDSISPSLSSSLLSSASLSSSG---E 631
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 842 DELQELRTRSAKKRRKNALIRHKSIAEVSQNLRQIEIPKVSKRATKKENSHPAPQQPSQ---PVKDTVQEVTKVRRNLFN 918
Cdd:pfam15292 632 DQLEELRTRSAKKRRKNALTRHRSMTESSQNLRQIEIPKKSKRATKSENSHSLLKTAVQqppPQKDTVQEVTKVRRNLFN 711
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 919 QELLSPSKRSLkrgLPRSHSVSAVDGLEDKLDNFKKNkGYHKLLTKSVAETPVHKQISKRLLHRQIKGRSSDPGPDIGVV 998
Cdd:pfam15292 712 QEIVSPSKRSK---LPRSQSVSAVEGLKHKRSSEKEE-DYHKLLTKKVAETPLHKQVSRRLLHRQIKGRSSDPGPDICIV 787
|
....*.
gi 118421085 999 EESPEK 1004
Cdd:pfam15292 788 EESPEK 793
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1086-1504 |
1.07e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1086 RMKKRSRNTLDSEVPAAYQTPKKSHQKSLSfSKTTPRRISHTPQTPLYTPERLQKSPAKMTPTKQAAFKESLKDSSSPGH 1165
Cdd:PHA03247 2660 RVSRPRRARRLGRAAQASSPPQRPRRRAAR-PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAA 2738
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1166 DSPLDSKITPQKRHTQAGEGTSLETKTPRTPKRQGTQPPGFLPNCTWPHSVNSSPESPSCPAPPTSSTAQPRRECLTPIR 1245
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1246 DPLRTPpraAAFMGTPqnqthqqPHVLRAARAEEPAQKLKDKAIKTPKRPGNSTVTSSPPVTPKKLFTSPlcdvsKKSPF 1325
Cdd:PHA03247 2819 PPAASP---AGPLPPP-------TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPV 2883
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1326 RKSKIECPSPGELDQKEPQMSPSVAASLSCPVPSTPPELSQRATLDTVPPPPPSkvgkrcrktsdprrsivecQPDASAT 1405
Cdd:PHA03247 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP-------------------RPQPPLA 2944
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1406 PGVGTADSPAAPTDSRDDQKGL----------SLSPQ----------SPPERRGYPGPGLrSDWHASSPLLITSDTEHVT 1465
Cdd:PHA03247 2945 PTTDPAGAGEPSGAVPQPWLGAlvpgrvavprFRVPQpapsreapasSTPPLTGHSLSRV-SSWASSLALHEETDPPPVS 3023
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 118421085 1466 LL------SEAEHhgiGDLKSNVLSVEEGEGLRTADAEKSSLSHP 1504
Cdd:PHA03247 3024 LKqtlwppDDTED---SDADSLFDSDSERSDLEALDPLPPEPHDP 3065
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1079-1443 |
1.20e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.20e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1079 PSEKGSARMKKRSRNTLDSEVPAAYQTPKKSHQKSLSFSKTTPRriSHTPQTPLYTPeRLQKSPAKMTPTKQAAFKESLK 1158
Cdd:PHA03247 2577 PSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPD--THAPDPPPPSP-SPAANEPDPHPPPTVPPPERPR 2653
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1159 DSSSPGHDSPlDSKITPQKRHTQAgegtsleTKTPRTPKRQGTQPP-GFLPNCTWPHSVNSSPESPSCPA----PPTSST 1233
Cdd:PHA03247 2654 DDPAPGRVSR-PRRARRLGRAAQA-------SSPPQRPRRRAARPTvGSLTSLADPPPPPPTPEPAPHALvsatPLPPGP 2725
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1234 AQPRRECLTPIRDPL-RTPPRAAAFMGTPqnqthqqphvlraARAEEPAqklkdkaikTPKRPGNSTVTSSPPVTPKKLF 1312
Cdd:PHA03247 2726 AAARQASPALPAAPApPAVPAGPATPGGP-------------ARPARPP---------TTAGPPAPAPPAAPAAGPPRRL 2783
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1313 TSPlcDVSKKSPFRKSKIECPSPGELDQKEPQMSPSVAASLSCPVPSTPPELSQRATLDTVPPPPPSKV---GKRCRKTS 1389
Cdd:PHA03247 2784 TRP--AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLplgGSVAPGGD 2861
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 118421085 1390 DPRRSIVECQPDASATPGVGTADSPAAPTDSRDDQKgLSLSPQSPPERRGYPGP 1443
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTES-FALPPDQPERPPQPQAP 2914
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1151-1444 |
2.76e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 52.87 E-value: 2.76e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1151 AAFKESLKDSSSPGHDSPLDSKITPQKRHT----QAGEGTSLETKTPRTPKRQGTQPPGFLPNCTWPHSVNSSPESPscP 1226
Cdd:PHA03307 15 AEGGEFFPRPPATPGDAADDLLSGSQGQLVsdsaELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWS--L 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1227 APPTSSTAQPRRECLTPIRDPLRTPPRAAAFMGTPQNQTHQQPHVLRAARAEEPAQKLKDKAIKTPKRPGNSTVTSSPPV 1306
Cdd:PHA03307 93 STLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQA 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1307 TPKKLFTSPL--CDVSKKSPFRKSkiecPSPGELDQKEPQMSPSVAASLSCPVPSTP--PELSQRATLDTVPPPPPSkvg 1382
Cdd:PHA03307 173 ALPLSSPEETarAPSSPPAEPPPS----TPPAAASPRPPRRSSPISASASSPAPAPGrsAADDAGASSSDSSSSESS--- 245
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 118421085 1383 krcRKTSDPRRSIVECQPDASATPGVGTADSPAAPTDSRddqKGLSLSPQSPPERRGYPGPG 1444
Cdd:PHA03307 246 ---GCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSR---PGPASSSSSPRERSPSPSPS 301
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
910-1408 |
3.51e-06 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 52.38 E-value: 3.51e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 910 TKVRRNLFNQELLSPSKRSlKRGLPrshSVSAVDglEDKLDNFKKNKGYHKLLTKSvaetPVHKQiskrllhrqikGRSS 989
Cdd:PTZ00449 474 TRISKIQFTQEIKKLIKKS-KKKLA---PIEEED--SDKHDEPPEGPEASGLPPKA----PGDKE-----------GEEG 532
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 990 DPGPDIGVvEESPEKGDEISLRRSPRIKQLSFSRTHSASfysvsqpKSRSVQRVHSFQQD-KSDQRENSPvqsiRSPKSl 1068
Cdd:PTZ00449 533 EHEDSKES-DEPKEGGKPGETKEGEVGKKPGPAKEHKPS-------KIPTLSKKPEFPKDpKHPKDPEEP----KKPKR- 599
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1069 lfgamseMISPSEKGSARMKKRSRNtldSEVPAAYQTPKkshqkslsfSKTTPRRiSHTPQTPLyTPERLQ--------K 1140
Cdd:PTZ00449 600 -------PRSAQRPTRPKSPKLPEL---LDIPKSPKRPE---------SPKSPKR-PPPPQRPS-SPERPEgpkiikspK 658
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1141 SPAKMTPTKQAAFKESLKDSSSPGHDSPLDSKITPQKRHTQAGEGTSLETKTPRTPKRQGTQPPGFLPncTWPHSVNSSP 1220
Cdd:PTZ00449 659 PPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLP--RDEEFPFEPI 736
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1221 ESPSCPAPPTSSTAQPRRECLTPIRDPLRTPPRAAAFmgtpqNQTHQQPHVlrAARAEEPaqklkDKAIKTPKRPG--NS 1298
Cdd:PTZ00449 737 GDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDIL-----AEEFKEEDI--HAETGEP-----DEAMKRPDSPSehED 804
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1299 TVTSSPPVTPKKLFTSPLCDVS----KKSPFRKSKIECPSPGEL-------DQKEPQMSPSVAASLSCPV---------- 1357
Cdd:PTZ00449 805 KPPGDHPSLPKKRHRLDGLALSttdlESDAGRIAKDASGKIVKLkrsksfdDLTTVEEAEEMGAEARKIVvdddgteadd 884
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|..
gi 118421085 1358 -PSTPPELSQRATLDTVPPPPPSKVGKRCRKTSDPRRsivecqPDASATPGV 1408
Cdd:PTZ00449 885 eDTHPPEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKK------PDSAFIPSI 930
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1196-1443 |
4.25e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 4.25e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1196 PKRQGTQPPGflPNCTWPHSVNSSPESPSCPAPPTSSTAQPRRECLTPIRDPLRTPPRAAAFMGTPQNQTHQQPHVLRAA 1275
Cdd:PHA03247 2570 PPRPAPRPSE--PAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1276 RAEEPAQKLKDKAIKTPKR---PGNSTVTSSPPVTPKKLFTSPlcDVSKKSPFRKSKIECPSPgeldqkEPQMSPSVAAS 1352
Cdd:PHA03247 2648 PPERPRDDPAPGRVSRPRRarrLGRAAQASSPPQRPRRRAARP--TVGSLTSLADPPPPPPTP------EPAPHALVSAT 2719
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1353 LSCPVPSTPPELSQRATLDTVPPPPPSKVGKRCRKTSDPRRSIVECQPDASATPGVGTADSPAAPTDSRDDQKGLSLSPQ 1432
Cdd:PHA03247 2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLP 2799
|
250
....*....|.
gi 118421085 1433 SPPERRGYPGP 1443
Cdd:PHA03247 2800 SPWDPADPPAA 2810
|
|
| GGN |
pfam15685 |
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the ... |
1335-1502 |
4.40e-04 |
|
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the maturation of sperm and is expressed virtually only in the testis. It is found to be associated with the intracellular membrane, binds with GGNBP1 and may be involved in vesicular trafficking.
Pssm-ID: 434857 [Multi-domain] Cd Length: 668 Bit Score: 45.14 E-value: 4.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1335 PGELDQKEPQMSPS-VAASLSCPVPST-PPELSQRATLDTVPPPP-------PSKvGKRCRKTSDPR-RSIVECQPDASA 1404
Cdd:pfam15685 59 PGLLVPPEPQASPSpLPLTLELPLPVTpPPEEAAAAAVSTAPPPAvgsllpaPSK-WRKPTGTAVARiRGLLEASHRGQG 137
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1405 TPGVGTADSPAAPTD--SRDDQKGLSLSPQSPPERRGYPGPGLRSDWHASS----PLLITSDTEHVTllSEAEHHGIGDL 1478
Cdd:pfam15685 138 DPLSLRPLLPLLPRQliEKDPAPGAPAPPPPTPLEPRKPPPLPPSDRQPPNrgitPALATSATSPTD--SQAKHIAEGKT 215
|
170 180
....*....|....*....|....*....
gi 118421085 1479 KSNVL-----SVEEGEGLRTAdAEKSSLS 1502
Cdd:pfam15685 216 AGGACggappQAGEGEMARFA-ASESGLS 243
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
923-1443 |
6.37e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.76 E-value: 6.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 923 SPSKRSLKRGLPRSHSVSAVDGLEDKLDNFKKNKgyhkllTKSVAETPVHKQISKRllhRQIKGRSSDPGPDIGVVEESp 1002
Cdd:pfam03154 33 SPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSS------KKIKEEAPSPLKSAKR---QREKGASDTEEPERATAKKS- 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1003 eKGDEISLRRSPRIKQLSFSRTHSASFYSVSQPK-----SRSVQRVHSFQQDKSDQRENSPVQSIR-------------- 1063
Cdd:pfam03154 103 -KTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKdidqdNRSTSPSIPSPQDNESDSDSSAQQQILqtqppvlqaqsgaa 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1064 SPKSLLFGAMSEMISPSEKGSARMKKRSRNTLDSEVPAAYQTPKKSHQKSLSFSKTTPRRIS--HTPQTPLYTPERLQKS 1141
Cdd:pfam03154 182 SPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPspHPPLQPMTQPPPPSQV 261
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1142 PAKMTP-----TKQAAFKESLKDSSS----PGHDSPL-------DSKITPQKRHTQAGEGTSLETKTPRTPKRQGTQPPG 1205
Cdd:pfam03154 262 SPQPLPqpslhGQMPPMPHSLQTGPShmqhPVPPQPFpltpqssQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPR 341
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1206 FLPNCTWPHSVnsspesPSCPAPPTsstaqprreclTPIrdplrtPPraaafMGTPQNQTHqQPHVLRAARAEEPAQKLK 1285
Cdd:pfam03154 342 EQPLPPAPLSM------PHIKPPPT-----------TPI------PQ-----LPNPQSHKH-PPHLSGPSPFQMNSNLPP 392
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1286 DKAIKtpkrPGNSTVTSSPPvtpkKLFTSPLCDVSKKSPFRKSKIECPSPGELDQKEPQMSPSVAASLSCPVPSTPPels 1365
Cdd:pfam03154 393 PPALK----PLSSLSTHHPP----SAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSP--- 461
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1366 qRATLDTVPPPPPSKVGKRCRKTSDPrRSIVECQPDASATPGV-----GTADSPAAPTDSRDDQKGLSLSPQSPPERRGY 1440
Cdd:pfam03154 462 -FPQHPFVPGGPPPITPPSGPPTSTS-SAMPGIQPPSSASVSSsgpvpAAVSCPLPPVQIKEEALDEAEEPESPPPPPRS 539
|
...
gi 118421085 1441 PGP 1443
Cdd:pfam03154 540 PSP 542
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1323-1707 |
1.54e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 43.62 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1323 SPFRKSKIECPSPGELDQKEPQMSPSVAASLSCPVPSTPPELSQRATLDTVPPPPPSkvgkrcrkTSDPrrsiveCQPDA 1402
Cdd:PHA03307 63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPP--------TPPP------ASPPP 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1403 SATPGVGTADSPAAPTDSRDDQkglslSPQSPPERRGYPGPGLRSDWHASSPLLITSDTEHVtllseaehhGIGDLKSNV 1482
Cdd:PHA03307 129 SPAPDLSEMLRPVGSPGPPPAA-----SPPAAGASPAAVASDAASSRQAALPLSSPEETARA---------PSSPPAEPP 194
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1483 LSVEEGEGLRTADAEKSSLSHPGippsppscgPGSPLMPSRDvhcttdgrqcqasaQLDNLPASAWHSTDSASPQTYEVE 1562
Cdd:PHA03307 195 PSTPPAAASPRPPRRSSPISASA---------SSPAPAPGRS--------------AADDAGASSSDSSSSESSGCGWGP 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1563 LEMQASGLPKLrikKIDPSSSLEAEPLSKEESSLGEESFLPALSMPRASRSLSKPEPTYVSPPCPRLSHSTP------GK 1636
Cdd:PHA03307 252 ENECPLPRPAP---ITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSsresssSS 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 118421085 1637 SRGQTYICQACTPTHGPSSTPSPFQTDGVPWTP-------SPKHSGKTTPDIIKDWPRRKRAVGCGAGSSSGRGEVGA 1707
Cdd:PHA03307 329 TSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADpssprkrPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGR 406
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
1031-1420 |
2.02e-03 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 43.12 E-value: 2.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1031 SVSQPKSRSVQRVHSFQQDKSDQRENSPvqsirsPKsllfgamsemISPSEKGSARMKKRSRNTLDSEVPAAyqTPKKSH 1110
Cdd:PHA03377 525 SVTQPAKPHRKVQDGFQRSGRRQKRATP------PK----------VSPSDRGPPKASPPVMAPPSTGPRVM--ATPSTG 586
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1111 QKSLSFSKTTPRRISHTPQTPLYTPERLQK----SPAKMTPTKqaaFKESLKDSSSPGHDSPlDSKITPQKRHTQAGEGT 1186
Cdd:PHA03377 587 PRDMAPPSTGPRQQAKCKDGPPASGPHEKQppssAPRDMAPSV---VRMFLRERLLEQSTGP-KPKSFWEMRAGRDGSGI 662
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1187 SLETKTPRTPKRQGTQP-PGFLPN--------CTWPHSVNSSPESPSCPAPPTSSTAQPRRE--------CLTPIRDPLr 1249
Cdd:PHA03377 663 QQEPSSRRQPATQSTPPrPSWLPSvfvlpsvdAGRAQPSEESHLSSMSPTQPISHEEQPRYEdpddpldlSLHPDQAPP- 741
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1250 tPPRAAAFMGTPQNQTHQQPHvlrAARAEEPAQKLKDKAIKTPKRPGNSTVTSSPPVTPKKLFTSPLCDVSKKSPFRKSK 1329
Cdd:PHA03377 742 -PSHQAPYSGHEEPQAQQAPY---PGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYP 817
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1330 IECPSPGELDQKEPQMSPSVAASL--------------SCPVPSTP-----PELSQRATLDTVPPPPPSkvgkrcrktSD 1390
Cdd:PHA03377 818 GHGHPQGPWAPRPPHLPPQWDGSAghgqdqvsqfphlqSETGPPRLqlsqvPQLPYSQTLVSSSAPSWS---------SP 888
|
410 420 430
....*....|....*....|....*....|
gi 118421085 1391 PRRSIVECQPDASATPGVGTADSPAAPTDS 1420
Cdd:PHA03377 889 QPRAPIRPIPTRFPPPPMPLQDSMAVGCDS 918
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
1181-1400 |
2.45e-03 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 42.74 E-value: 2.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1181 QAGEGTSLETKT---PRTPKRQGTQPPGFLPNCTWPHSVNSSPE--SPSCPAPPTSSTAQPrreclTPIRD-------PL 1248
Cdd:PHA03379 393 RAGKLTERAREAlekASEPTYGTPRPPVEKPRPEVPQSLETATShgSAQVPEPPPVHDLEP-----GPLHDqhsmapcPV 467
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1249 RTPPRAAAFMGTPQNqthQQPHVLRAAR-AEEPAQKLKDKAIktpkRPGNSTVTSSPPVTPKKLFTSPLCDVSKKSP-FR 1326
Cdd:PHA03379 468 AQLPPGPLQDLEPGD---QLPGVVQDGRpACAPVPAPAGPIV----RPWEASLSQVPGVAFAPVMPQPMPVEPVPVPtVA 540
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 118421085 1327 KSKIECPSPGELDQKEP-QMSPSVAASLSCPVPSTPPElsqratldtvPPPPPSKVGKRC-----RKTSDPRRSIVECQP 1400
Cdd:PHA03379 541 LERPVCPAPPLIAMQGPgETSGIVRVRERWRPAPWTPN----------PPRSPSQMSVRDrlarlRAEAQPYQASVEVQP 610
|
|
|