|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
456-1055 |
1.59e-14 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 79.60 E-value: 1.59e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247 2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247 2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 616 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 693
Cdd:PHA03247 2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 694 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLkedPVTT 773
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPL---PPPT 2832
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 774 IDLERVTDLETPVAFRTEAPGTTLVPAvvlEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREP 853
Cdd:PHA03247 2833 SAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER 2907
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 854 vtlrtetwvtTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPsalsTEVPATVVLATALTPvtlrtkAPKTTTL 933
Cdd:PHA03247 2908 ----------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP----TTDPAGAGEPSGAVP------QPWLGAL 2967
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 934 APN---VQRTRRPHPRP--KTTASTGVSESKSAPTEL----QSLVLKPVTSPS-LEIIQSQSVSDDLElvafstespqKT 1003
Cdd:PHA03247 2968 VPGrvaVPRFRVPQPAPsrEAPASSTPPLTGHSLSRVsswaSSLALHEETDPPpVSLKQTLWPPDDTE----------DS 3037
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|..
gi 1907118310 1004 IAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1055
Cdd:PHA03247 3038 DADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1360-1451 |
3.06e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 3.06e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1360 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1437
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1907118310 1438 LGEGPASNTVAFST 1451
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
1361-1441 |
9.23e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.08 E-value: 9.23e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1361 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1438
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1907118310 1439 GEG 1441
Cdd:smart00060 81 GEG 83
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1361-1444 |
1.18e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 47.79 E-value: 1.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1361 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1437
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1907118310 1438 LGEGPAS 1444
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
915-1306 |
3.30e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 3.30e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 915 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 994
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 995 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1064
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1065 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1142
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1143 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1213
Cdd:PHA03247 2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1214 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1293
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
|
410
....*....|...
gi 1907118310 1294 EEEFGTTTDFSSS 1306
Cdd:PHA03247 2990 ASSTPPLTGHSLS 3002
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
384-758 |
7.46e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.45 E-value: 7.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154 171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154 251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154 327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEP 678
Cdd:pfam03154 407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 679 PLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154 484 STSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1345-1456 |
1.05e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.92 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1345 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1424
Cdd:COG3401 220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
|
90 100 110
....*....|....*....|....*....|...
gi 1907118310 1425 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1456
Cdd:COG3401 295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
116-195 |
1.87e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.63 E-value: 1.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1907118310 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
114-195 |
2.68e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.06 E-value: 2.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060 1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69
|
....*.
gi 1907118310 190 YEFGVK 195
Cdd:smart00060 70 YEFRVR 75
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
114-195 |
3.02e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 41.33 E-value: 3.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71
|
....
gi 1907118310 192 FGVK 195
Cdd:cd00063 72 FRVR 75
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
491-703 |
4.26e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 44.76 E-value: 4.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839 286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839 365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118310 648 THRQRTKYKttqspkiPHSKPADLGPITSEPPLASTTKKVRRP---RPKPQTTPHPEVP 703
Cdd:NF033839 431 VKPQPEKPK-------PEVKPQPEKPKPEVKPQPETPKPEVKPqpeKPKPEVKPQPEKP 482
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
428-770 |
3.31e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 42.34 E-value: 3.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665 208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665 288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665 364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665 416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665 496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
307-575 |
3.47e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 3.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247 2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247 2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 464 RTAE---QPRATLAPiealfeSRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtrPSAQTTKAPR 540
Cdd:PHA03247 2893 RSTEsfaLPPDQPER------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE--PSGAVPQPWL 2964
|
250 260 270
....*....|....*....|....*....|....*
gi 1907118310 541 KTKKPGHHRLRRpktTRSPEVPKSKPALEPATVTP 575
Cdd:PHA03247 2965 GALVPGRVAVPR---FRVPQPAPSREAPASSTPPL 2996
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
456-1055 |
1.59e-14 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 79.60 E-value: 1.59e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247 2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247 2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 616 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 693
Cdd:PHA03247 2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 694 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLkedPVTT 773
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPL---PPPT 2832
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 774 IDLERVTDLETPVAFRTEAPGTTLVPAvvlEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREP 853
Cdd:PHA03247 2833 SAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER 2907
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 854 vtlrtetwvtTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPsalsTEVPATVVLATALTPvtlrtkAPKTTTL 933
Cdd:PHA03247 2908 ----------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP----TTDPAGAGEPSGAVP------QPWLGAL 2967
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 934 APN---VQRTRRPHPRP--KTTASTGVSESKSAPTEL----QSLVLKPVTSPS-LEIIQSQSVSDDLElvafstespqKT 1003
Cdd:PHA03247 2968 VPGrvaVPRFRVPQPAPsrEAPASSTPPLTGHSLSRVsswaSSLALHEETDPPpVSLKQTLWPPDDTE----------DS 3037
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|..
gi 1907118310 1004 IAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1055
Cdd:PHA03247 3038 DADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
509-851 |
4.79e-12 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 71.26 E-value: 4.79e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449 542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 580 PKIVPKPPQKPKATRRPevpqvkpahepvtfgseapalaivtttdiepvitrtkasvttlaPKPPRPRTHRQRTKYKTTQ 659
Cdd:PTZ00449 620 IPKSPKRPESPKSPKRP--------------------------------------------PPPQRPSSPERPEGPKIIK 655
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 660 SPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHP 739
Cdd:PTZ00449 656 SPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFE 734
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 740 KPKTTRSPAASPTELVPTPVFEPVtplkedpvttidlervtdletpvaFRTEAPGTTLVPAVVLEPVTlRPEVQVTTLAP 819
Cdd:PTZ00449 735 PIGDPDAEQPDDIEFFTPPEEERT------------------------FFHETPADTPLPDILAEEFK-EEDIHAETGEP 789
|
330 340 350
....*....|....*....|....*....|..
gi 1907118310 820 QKTQKKHRPSPKPKPVPspevTESKPVLPRVR 851
Cdd:PTZ00449 790 DEAMKRPDSPSEHEDKP----PGDHPSLPKKR 817
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
398-765 |
9.52e-12 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 70.74 E-value: 9.52e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 398 PRGSWASSEEPWVVPGAKTSEDSRVVQPQTAtydvissSTTSDETEIEIHTATRDPILDSVPPKTSRtaeqPRATLAPIE 477
Cdd:PHA03247 2604 DRGDPRGPAPPSPLPPDTHAPDPPPPSPSPA-------ANEPDPHPPPTVPPPERPRDDPAPGRVSR----PRRARRLGR 2672
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 478 ALFESRNVEIFTSPEVRPTTaAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGhhrlrRPKTTR 557
Cdd:PHA03247 2673 AAQASSPPQRPRRRAARPTV-GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPA-----PPAVPA 2746
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 558 SPEVPKS--KPALEPATVTPEILVPKIVPKPPQKPKATRrpevPQVKPAHEPVTFGSEAPALAIVTTtdiePVITRTKAS 635
Cdd:PHA03247 2747 GPATPGGpaRPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRESLPSPWDPADPPA----AVLAPAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 636 VTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKP-----ADLGPITSEPPLASTTKKV-----------------RRPRPK 693
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvAPGGDVRRRPPSRSPAAKPaaparppvrrlarpavsRSTESF 2898
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907118310 694 PQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPhPKPKTTRSPAASPTELVPTPVFEPVTP 765
Cdd:PHA03247 2899 ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQPWLGALVP 2969
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
507-1079 |
1.07e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 67.27 E-value: 1.07e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPghhrlrRPKTTRSPEVPKSKPALEPAtvtpeilvpkivP 584
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAP------------P 2614
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 585 KPPqkPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEpvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIP 664
Cdd:PHA03247 2615 SPL--PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRD-------------DPAPGRVSRPRRARRLGRAAQASSP 2679
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 665 HSKPAdlgPITSEPPLASTTKKVRRPRP--KPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPhPKPK 742
Cdd:PHA03247 2680 PQRPR---RRAARPTVGSLTSLADPPPPppTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP-GGPA 2755
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 743 TTRSPAASPTELVPTPVFEPVTPlkedPVTTIDLERVTDLETPVAFRTEAPGTTLVPAVVLEPVTLRPEVQ--VTTLAPQ 820
Cdd:PHA03247 2756 RPARPPTTAGPPAPAPPAAPAAG----PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspAGPLPPP 2831
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 821 KTQKKHRPSPKPKPVPSPEVTESKpVLPrvREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPEtpltkpvaatdlE 900
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGS-VAP--GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRST------------E 2896
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 901 PSALSTEVPATVVLATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTAST-GVSESKSAPTELQSLVLKPVTSPSLE 979
Cdd:PHA03247 2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPaGAGEPSGAVPQPWLGALVPGRVAVPR 2976
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 980 IIQSQSvSDDLELVAFSTESPQKTIAPR------------QTTSMPPKLK--------TPHSRMPAKEPVPKEPLHTTSK 1039
Cdd:PHA03247 2977 FRVPQP-APSREAPASSTPPLTGHSLSRvsswasslalheETDPPPVSLKqtlwppddTEDSDADSLFDSDSERSDLEAL 3055
|
570 580 590 600
....*....|....*....|....*....|....*....|
gi 1907118310 1040 PKMPPSPEvaDTTSVPKDERLSLKPDPEVTHSETVLPPVT 1079
Cdd:PHA03247 3056 DPLPPEPH--DPFAHEPDPATPEAGARESPSSQFGPPPLS 3093
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
545-1165 |
1.75e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 66.50 E-value: 1.75e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 545 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-------------TFG 611
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasdDAG 2549
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 612 SEAPALAivttTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPR 691
Cdd:PHA03247 2550 DPPPPLP----PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 692 PKPQT-TPHP-EVPHTILVPATSLEPFIITEAPGTTLVPK---LPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 766
Cdd:PHA03247 2626 PPPPSpSPAAnEPDPHPPPTVPPPERPRDDPAPGRVSRPRrarRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPP 2705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 767 KE-DPVTTIDLERVTDLETPVAFRTEAPGTTLVPAVvlepvtlrPEVQVTTLAPQKTQKKHRPSPKPkpvpspevTESKP 845
Cdd:PHA03247 2706 PTpEPAPHALVSATPLPPGPAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTA--------GPPAP 2769
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 846 VLPRVREPVTLRTETwVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPSALSTEVPATVVLATALTPVTLRT 925
Cdd:PHA03247 2770 APPAAPAAGPPRRLT-RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 926 KAPKTTTLAPNVQRTRRPHPRPKTTASTGVSESKSaptelqSLVLKPVTSPSLEiiqsqsvsddlelvafSTESPQKTIA 1005
Cdd:PHA03247 2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV------RRLARPAVSRSTE----------------SFALPPDQPE 2906
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1006 PRQTTSMPPKLKTPHSRMPAKEPVPKEPlhttsKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT-FRVEP 1084
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPP-----PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQ 2981
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1085 PKTTIAPLETRGIPLIPVISPRPSQEELQTAM-EETDQSTQELFTT-KIPRTTELAKTTQA----PHRLHTAPVRPrIPG 1158
Cdd:PHA03247 2982 PAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTlWPPDDTEDSDADSLfdsdSERSDLEALDP-LPP 3060
|
....*..
gi 1907118310 1159 RPHGRPA 1165
Cdd:PHA03247 3061 EPHDPFA 3067
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1360-1451 |
3.06e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 3.06e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1360 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1437
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1907118310 1438 LGEGPASNTVAFST 1451
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
381-784 |
3.72e-08 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 58.55 E-value: 3.72e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 381 KRFPEFPEAKTAFPLEKPRGSWASSEEPwVVPGAKTSEdSRVVQPQTATYDVISSSTTSDETEIEI---------HTATR 451
Cdd:PTZ00449 494 KKLAPIEEEDSDKHDEPPEGPEASGLPP-KAPGDKEGE-EGEHEDSKESDEPKEGGKPGETKEGEVgkkpgpakeHKPSK 571
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 452 DPILDSVP-----PKTSRTAEQPRATLAPIEAlfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRvkpAPE 526
Cdd:PTZ00449 572 IPTLSKKPefpkdPKHPKDPEEPKKPKRPRSA-----------QRPTRPKSPKLPELLDIPKSPKRPESPKSPK---RPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 527 PETRPSAQttkaprktkkpghhrlRRPKTTRSPEVPKSKPAlepatvtpeilvpkivPKPPQKPKATRRPEVPQVKPAHE 606
Cdd:PTZ00449 638 PPQRPSSP----------------ERPEGPKIIKSPKPPKS----------------PKPPFDPKFKEKFYDDYLDAAAK 685
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 607 PVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRThrqrtkykttqsPKIPHSKPADlgpitsePPLASTTK 685
Cdd:PTZ00449 686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPkLPRD------------EEFPFEPIGD-------PDAEQPDD 746
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 686 KVRRPRPKPQTTPHPEvphtilVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVT 764
Cdd:PTZ00449 747 IEFFTPPEEERTFFHE------TPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSL 813
|
410 420
....*....|....*....|
gi 1907118310 765 PLKEDPVTTIDLErVTDLET 784
Cdd:PTZ00449 814 PKKRHRLDGLALS-TTDLES 832
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
511-1045 |
4.72e-08 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 58.16 E-value: 4.72e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 511 KRQSTPKPPRVKPAPE--PETRPSAQTTKAPRK-TKKPGHHRlrRPKTTRSPEVPKsKPAlePATVTPEILVPKIVPKP- 586
Cdd:PTZ00449 506 KHDEPPEGPEASGLPPkaPGDKEGEEGEHEDSKeSDEPKEGG--KPGETKEGEVGK-KPG--PAKEHKPSKIPTLSKKPe 580
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 587 -PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRTHRQRTKYKTTQSPKIP 664
Cdd:PTZ00449 581 fPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPqRPSSPERPEGPKIIKSPKPP 660
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 665 HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHPKPKTT 744
Cdd:PTZ00449 661 KSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFEPIGDP 739
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 745 RSPAASPTELVPTPVFEPVtplkedpvttidlervtdletpvaFRTEAPGTTLVPAVVLEPVTlRPEVQVTTLAPQKTQK 824
Cdd:PTZ00449 740 DAEQPDDIEFFTPPEEERT------------------------FFHETPADTPLPDILAEEFK-EEDIHAETGEPDEAMK 794
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 825 KhrpspkpkpvpspevteskPVLPRVREPVTlrtetwvTTKAPKTPKRTRRPRPKPqttptpetpltkpVAATDLEPSal 904
Cdd:PTZ00449 795 R-------------------PDSPSEHEDKP-------PGDHPSLPKKRHRLDGLA-------------LSTTDLESD-- 833
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 905 stevPATVVLATALTPVTLRtkapktttlapnvqrtrrphprpkttastgvsESKSApTELQSLVLKPVTSPSLEIIqsq 984
Cdd:PTZ00449 834 ----AGRIAKDASGKIVKLK--------------------------------RSKSF-DDLTTVEEAEEMGAEARKI--- 873
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907118310 985 SVSDDlelvafSTESPQKTIAPrqttSMPPKLKTPHSRMPAKEPVPKEPLHTTSKPKMPPS 1045
Cdd:PTZ00449 874 VVDDD------GTEADDEDTHP----PEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDS 924
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
1361-1441 |
9.23e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.08 E-value: 9.23e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1361 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1438
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1907118310 1439 GEG 1441
Cdd:smart00060 81 GEG 83
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1361-1444 |
1.18e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 47.79 E-value: 1.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1361 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1437
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1907118310 1438 LGEGPAS 1444
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
915-1306 |
3.30e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 3.30e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 915 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 994
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 995 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1064
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1065 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1142
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1143 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1213
Cdd:PHA03247 2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1214 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1293
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
|
410
....*....|...
gi 1907118310 1294 EEEFGTTTDFSSS 1306
Cdd:PHA03247 2990 ASSTPPLTGHSLS 3002
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
517-703 |
5.72e-06 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 51.21 E-value: 5.72e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 517 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 593
Cdd:PHA03377 414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 594 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPK---IPHSKPAD 670
Cdd:PHA03377 489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGFQRSGRRQKratPPKVSPSD 559
|
170 180 190
....*....|....*....|....*....|...
gi 1907118310 671 LGPITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:PHA03377 560 RGPPKASPPVMAPPSTGPRVMATPSTGPRDMAP 592
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1143-1363 |
4.00e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 4.00e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1143 APHRLHTAPVRPRIPGRPHGRP---ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVDSHATRKPGSVSG 1219
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPsepAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1220 TRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGT 1299
Cdd:PHA03247 2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907118310 1300 TTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1363
Cdd:PHA03247 2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
384-758 |
7.46e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.45 E-value: 7.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154 171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154 251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154 327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEP 678
Cdd:pfam03154 407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 679 PLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154 484 STSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1012-1362 |
1.04e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 1.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1012 MPPKLK--TPHSRMPAKEPVPKEP-LHTTSKPKMPPSPEVADTTSVPKDERlslkPDPEVTHSETVLPPVTFRVEPPKTT 1088
Cdd:PHA03247 2555 LPPAAPpaAPDRSVPPPRPAPRPSePAVTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1089 IAPLETRGIPLIPVISPRPSQEELQTAMEETDQSTQelfTTKIPRTTELAKTTQAPHRLHTAP--------VRPRIPGR- 1159
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR---ARRLGRAAQASSPPQRPRRRAARPtvgsltslADPPPPPPt 2707
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1160 PHGRP---------------ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPP 1224
Cdd:PHA03247 2708 PEPAPhalvsatplppgpaaARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1225 IPHRHSSTRPVSPERRPLPPNNVTgkPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGttTDFS 1304
Cdd:PHA03247 2788 VASLSESRESLPSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG--GDVR 2863
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118310 1305 SSPTKETDPLGKPRFIGPHVRYIPKPENKPCSIT-----DSVRRFPTEEATEGNATSPPQNPP 1362
Cdd:PHA03247 2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalppDQPERPPQPQAPPPPQPQPQPPPP 2926
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1345-1456 |
1.05e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.92 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1345 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1424
Cdd:COG3401 220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
|
90 100 110
....*....|....*....|....*....|...
gi 1907118310 1425 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1456
Cdd:COG3401 295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1345-1499 |
1.44e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.53 E-value: 1.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1345 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1422
Cdd:COG3401 314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118310 1423 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1499
Cdd:COG3401 388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
116-195 |
1.87e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.63 E-value: 1.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1907118310 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
114-195 |
2.68e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.06 E-value: 2.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060 1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69
|
....*.
gi 1907118310 190 YEFGVK 195
Cdd:smart00060 70 YEFRVR 75
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
114-195 |
3.02e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 41.33 E-value: 3.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71
|
....
gi 1907118310 192 FGVK 195
Cdd:cd00063 72 FRVR 75
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
491-703 |
4.26e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 44.76 E-value: 4.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839 286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839 365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118310 648 THRQRTKYKttqspkiPHSKPADLGPITSEPPLASTTKKVRRP---RPKPQTTPHPEVP 703
Cdd:NF033839 431 VKPQPEKPK-------PEVKPQPEKPKPEVKPQPETPKPEVKPqpeKPKPEVKPQPEKP 482
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
515-626 |
4.86e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 44.80 E-value: 4.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 594
Cdd:PRK14950 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
|
90 100 110
....*....|....*....|....*....|..
gi 1907118310 595 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 626
Cdd:PRK14950 434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
491-705 |
1.09e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 43.90 E-value: 1.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 491 PEVRPTTAapQQTTSIPSTPKRqSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGH----HRLRRPKTTRSPEV---PK 563
Cdd:PHA03378 576 PLTSPTTS--QLASSAPSYAQT-PWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRpipmRPLRMQPITFNVLVfptPH 652
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 564 SKPALEPATVTPEILVPKIVP-----------KPPQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRT 632
Cdd:PHA03378 653 QPPQVEITPYKPTWTQIGHIPyqpsptgantmLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPG 732
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118310 633 KASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHT 705
Cdd:PHA03378 733 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPT 805
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
553-773 |
1.22e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 43.38 E-value: 1.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 553 PKTTRSPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATRRPEVP-----QVKPAHEPVTFGSEAPALAIVTTTDIEP 627
Cdd:PLN03209 330 PKESDAADGPKPVP---TKPVTPEAPSPPIEEEPPQPKAVVPRPLSPytayeDLKPPTSPIPTPPSSSPASSKSVDAVAK 406
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 628 VITRTKASVTTLAPKPPRPRTHRQRTKyktTQSPKIPHSKPADLGPITSepplasttkkvrrPRPKPQTTPHPEVPHTIL 707
Cdd:PLN03209 407 PAEPDVVPSPGSASNVPEVEPAQVEAK---KTRPLSPYARYEDLKPPTS-------------PSPTAPTGVSPSVSSTSS 470
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 708 VPATSLEP----FIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTT 773
Cdd:PLN03209 471 VPAVPDTApataATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTA 540
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
420-747 |
1.45e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.54 E-value: 1.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 420 SRVVQPQTATYDVI---------SSSTTSDETEIEIHTATRDPILDSVPPKTSRTA-EQPRATLAPIEALFESRNVeIFT 489
Cdd:PRK10263 297 NRATQPEYDEYDPLlngapitepVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPpAQPTVAWQPVPGPQTGEPV-IAP 375
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 490 SPEVRPTTAAPQQTTSIPSTPKRQSTP--KPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:PRK10263 376 APEGYPQQSQYAQPAVQYNEPLQQPVQpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQST 455
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 568 LEP-ATVTPEILVPKIVPKPP---------QKPKATRRPEVPQVKPAHEPVTFGSEapalaivtttdIEPVITRTKASVT 637
Cdd:PRK10263 456 FAPqSTYQTEQTYQQPAAQEPlyqqpqpveQQPVVEPEPVVEETKPARPPLYYFEE-----------VEEKRAREREQLA 524
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 638 TLAPKPPRPrthrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRrprpkpQTTPHPEVPHTILVPATSLepfi 717
Cdd:PRK10263 525 AWYQPIPEP------VKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVK------KATLATGAAATVAAPVFSL---- 588
|
330 340 350
....*....|....*....|....*....|
gi 1907118310 718 iteAPGTTLVPKLPQQPDYPHPKPKTTRSP 747
Cdd:PRK10263 589 ---ANSGGPRPQVKEGIGPQLPRPKRIRVP 615
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
458-771 |
1.47e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 43.30 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 458 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 527
Cdd:PRK07003 372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 528 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 596
Cdd:PRK07003 451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 597 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTK--------ASVTTLAPKPPRPRTHRQrtkyktTQSPKIPHS 666
Cdd:PRK07003 531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAraaaaakpAAAPAAAPKPAAPRVAVQ------VPTPRARAA 604
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 667 KPADLGPITSEPPLASTTkkvRRPRPkpqttPHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRS 746
Cdd:PRK07003 605 TGDAPPNGAARAEQAAES---RGAPP-----PWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADA 673
|
330 340
....*....|....*....|....*
gi 1907118310 747 PAAsPTELVPTPvfePVTPLkeDPV 771
Cdd:PRK07003 674 PAP-PVDTRPLP---PAIPL--DAI 692
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1000-1251 |
1.50e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 43.52 E-value: 1.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1000 PQKTIAPRQTTSMPPKLKTPhsRMPAKEPVPKEPLhTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT 1079
Cdd:PTZ00449 569 PSKIPTLSKKPEFPKDPKHP--KDPEEPKKPKRPR-SAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSP 645
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1080 FRVEPPKT--TIAPLETRGIPLIPVISPRPSQEELQTAMEETDQST----QELFTTKIPRTTELAKTTQAPHRLHTAPVR 1153
Cdd:PTZ00449 646 ERPEGPKIikSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTtvvlDESFESILKETLPETPGTPFTTPRPLPPKL 725
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1154 PRIPGRPHGRPAlnktttRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHRhsstR 1233
Cdd:PTZ00449 726 PRDEEFPFEPIG------DPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMK----R 795
|
250
....*....|....*....
gi 1907118310 1234 PVSP-ERRPLPPNNVTGKP 1251
Cdd:PTZ00449 796 PDSPsEHEDKPPGDHPSLP 814
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
490-618 |
1.84e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 42.93 E-value: 1.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 490 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 568
Cdd:PRK07994 370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118310 569 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 618
Cdd:PRK07994 450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
491-766 |
2.14e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.83 E-value: 2.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154 172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPvtfgseapalaivtttdiepvitrtkasvttlAPKPPRPRT 648
Cdd:pfam03154 252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQ--------------------------------HPVPPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 649 hrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPkPQTTPHPEVPhtilVPATSLEPfiiteaPGTTLVP 728
Cdd:pfam03154 300 ----LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-PREQPLPPAP----LSMPHIKP------PPTTPIP 364
|
250 260 270
....*....|....*....|....*....|....*...
gi 1907118310 729 KLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 766
Cdd:pfam03154 365 QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSL 402
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
480-628 |
2.16e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 42.49 E-value: 2.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 480 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 559
Cdd:PRK14950 351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118310 560 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 628
Cdd:PRK14950 427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
515-718 |
2.50e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 42.51 E-value: 2.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKP----GHHRL--RRPKTTRSPEVPKSKPALEPATVTPEilvPKIVPKPP- 587
Cdd:PRK14086 87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRAddRPPGLPRQDQLPTARPAYPAYQQRPE---PGAWPRAAd 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 588 ----QKPKATRRPEVPQVKPAHEPVTFGSEAPALAivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKI 663
Cdd:PRK14086 164 dygwQQQRLGFPPRAPYASPASYAPEQERDREPYD-----AGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHV 238
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1907118310 664 PHSKPADLGPItSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFII 718
Cdd:PRK14086 239 HRGGPGPPERD-DAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKYTFDTFVI 292
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
467-870 |
3.27e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.36 E-value: 3.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 467 EQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPstpkrQSTPKPPRVKPAPEP-ETRPSAQTTKAPR----- 540
Cdd:PHA03378 345 EAVRLPDDPIIVEDDDESEEIESECDPDEDKSGAEALASIP-----QTLPDPPTVYGRPKVfARKADLKSTKKCRaivtd 419
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 541 ---------KTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKP--PQKPKATrrpevPQVKPA--HEP 607
Cdd:PHA03378 420 psvikaieeEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQPLEGPTGPLSVQAPlePWQPLPH-----PQVTPVilHQP 494
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 608 VTFGSEAP-ALAIVTTTDIEPVITRTKAsvTTLAPKPPRPRTHRQ-----------RTKYKTTQSPKIPHSKPAD-LGPI 674
Cdd:PHA03378 495 PAQGVQAHgSMLDLLEKDDEDMEQRVMA--TLLPPSPPQPRAGRRapcvytedldiESDEPASTEPVHDQLLPAPgLGPL 572
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 675 TSEPPLASTTKKVR--------RPRPKPQTTPHPEVPHT-ILVPATSLEPFIITEAPGTTLVPKLPQQPDYPHP-KPKTT 744
Cdd:PHA03378 573 QIQPLTSPTTSQLAssapsyaqTPWPVPHPSQTPEPPTTqSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLvFPTPH 652
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 745 RSPAASPTELVPTPVFEPVTPLKEDPVTTIDLERVTDLETPVAFRTEAPGTTLVPAVvlEPVTLRPEVQVTTLAPQKTQK 824
Cdd:PHA03378 653 QPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAA--PPGRAQRPAAATGRARPPAAA 730
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 1907118310 825 KHRPSPKPKPVPSPEVTESKPvlPRVREPVTLRTETWVTTKAPKTP 870
Cdd:PHA03378 731 PGRARPPAAAPGRARPPAAAP--GRARPPAAAPGRARPPAAAPGAP 774
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
428-770 |
3.31e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 42.34 E-value: 3.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665 208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665 288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665 364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665 416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665 496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
307-575 |
3.47e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 3.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247 2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247 2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 464 RTAE---QPRATLAPiealfeSRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtrPSAQTTKAPR 540
Cdd:PHA03247 2893 RSTEsfaLPPDQPER------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE--PSGAVPQPWL 2964
|
250 260 270
....*....|....*....|....*....|....*
gi 1907118310 541 KTKKPGHHRLRRpktTRSPEVPKSKPALEPATVTP 575
Cdd:PHA03247 2965 GALVPGRVAVPR---FRVPQPAPSREAPASSTPPL 2996
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
469-600 |
3.54e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.90 E-value: 3.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 469 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 544
Cdd:PRK07764 371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118310 545 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 600
Cdd:PRK07764 449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
|
|
| PRK14954 |
PRK14954 |
DNA polymerase III subunits gamma and tau; Provisional |
515-612 |
4.18e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184918 [Multi-domain] Cd Length: 620 Bit Score: 41.85 E-value: 4.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 515 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 594
Cdd:PRK14954 385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
|
90
....*....|....*...
gi 1907118310 595 RPEVPQVKPAhepVTFGS 612
Cdd:PRK14954 451 PRNVASGKPG---VDLGS 465
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
491-673 |
4.95e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.40 E-value: 4.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PRK12323 383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 571 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 640
Cdd:PRK12323 463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
|
170 180 190
....*....|....*....|....*....|...
gi 1907118310 641 PKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGP 673
Cdd:PRK12323 543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
|
|
| COG3979 |
COG3979 |
Chitodextrinase [Carbohydrate transport and metabolism]; |
1357-1456 |
5.75e-03 |
|
Chitodextrinase [Carbohydrate transport and metabolism];
Pssm-ID: 443178 [Multi-domain] Cd Length: 369 Bit Score: 40.91 E-value: 5.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1357 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1435
Cdd:COG3979 2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
|
90 100
....*....|....*....|.
gi 1907118310 1436 nplgeGPASNTVAFSTESADP 1456
Cdd:COG3979 72 -----DAAGNVSAASGTSTAM 87
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
489-751 |
6.00e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 41.31 E-value: 6.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 489 TSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPK----- 563
Cdd:PHA03307 88 PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVasdaa 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 564 ----------SKPALEPATVTPEILVPKIVPKPPQKPKAtRRPEVPQVKPAHEPVTFGSEAPALAIVTTTD--IEPVITR 631
Cdd:PHA03307 168 ssrqaalplsSPEETARAPSSPPAEPPPSTPPAAASPRP-PRRSSPISASASSPAPAPGRSAADDAGASSSdsSSSESSG 246
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 632 TKASVTTLAPKpPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKvRRPRPKPQTTPHPEVPHTILVPAT 711
Cdd:PHA03307 247 CGWGPENECPL-PRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSP-SSPGSGPAPSSPRASSSSSSSRES 324
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 1907118310 712 SLE-PFIITEAPGTTLVPklPQQPDYPHPKPKTTRSPAASP 751
Cdd:PHA03307 325 SSSsTSSSSESSRGAAVS--PGPSPSRSPSPSRPPPPADPS 363
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
471-748 |
8.80e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 40.68 E-value: 8.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 471 ATLAPIEALFESrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtRPSAQttkAPRKTKKPGHHRL 550
Cdd:PLN03209 311 APLTPMEELLAK-------IPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPP-QPKAV---VPRPLSPYTAYED 379
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 551 RRPKTTRSPEVPKSKPA----------LEPATVTPEILVPKIVP--KPPQKPKATRRPEVPQVK-PAHEPVTFGSEAPAL 617
Cdd:PLN03209 380 LKPPTSPIPTPPSSSPAssksvdavakPAEPDVVPSPGSASNVPevEPAQVEAKKTRPLSPYARyEDLKPPTSPSPTAPT 459
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 618 AIVTTTDIEPVITRT------KASVTTLAPKPPRPRthrqrtkykttqsPKIPHSKPADLGPITSEPPLAsttkkvrrPR 691
Cdd:PLN03209 460 GVSPSVSSTSSVPAVpdtapaTAATDAAAPPPANMR-------------PLSPYAVYDDLKPPTSPSPAA--------PV 518
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907118310 692 PKPQTTPHPEVPhtilvPATSLEPFIITEAPGTTLVPKlpQQPDYPHP-----KPKTTRSPA 748
Cdd:PLN03209 519 GKVAPSSTNEVV-----KVGNSAPPTALADEQHHAQPK--PRPLSPYTmyedlKPPTSPTPS 573
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
669-1164 |
9.99e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 9.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 669 ADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPAtslePFIITEAPGTTLVPKLPQQP-DYPHPKPKTTRSP 747
Cdd:PRK10263 335 APVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPA----PEGYPQQSQYAQPAVQYNEPlQQPVQPQQPYYAP 410
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 748 AASPTELVPTPVFEPVTPLKEDPVTtidlERVTDLETPVAFRTEAPGTTLVPAVVLEPVTLRPEvqvTTLAPQKTQKKHR 827
Cdd:PRK10263 411 AAEQPAQQPYYAPAPEQPAQQPYYA----PAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQ---PAAQEPLYQQPQP 483
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 828 PSPKPKPVPSPEVTESKPVLP-----------RVREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVA- 895
Cdd:PRK10263 484 VEQQPVVEPEPVVEETKPARPplyyfeeveekRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSp 563
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 896 -ATDLEPSALSTEVPATVVlATALTPVTLRTKAPKTTT-LAPNVQRTRRPH-PRPKTTASTGVS-ESKSAPTElqslvlK 971
Cdd:PRK10263 564 lASGVKKATLATGAAATVA-APVFSLANSGGPRPQVKEgIGPQLPRPKRIRvPTRRELASYGIKlPSQRAAEE------K 636
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 972 PVTSPSLEIIQSQSVSDDlELVAFSTESPQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKEP------LHTTSKPKMPPS 1045
Cdd:PRK10263 637 AREAQRNQYDSGDQYNDD-EIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAelarqfAQTQQQRYSGEQ 715
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1046 PEVADTTSVPKDERLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPLETRGIPLIPViSPRPSQEELQTAMEETDQSTQE 1125
Cdd:PRK10263 716 PAGANPFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPV-APQPQYQQPQQPVAPQPQYQQP 794
|
490 500 510
....*....|....*....|....*....|....*....
gi 1907118310 1126 LFTTKIPRTTELAKTTQAPHRLHTAPVRPRIPGRPHGRP 1164
Cdd:PRK10263 795 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 833
|
|
|