|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
507-889 |
3.82e-12 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 71.89 E-value: 3.82e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 654 KYKTTQSPKIP-------HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 726
Cdd:PHA03247 2713 HALVSATPLPPgpaaarqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 727 VPKLPQQPDYPHPKPKT--------TRSPAASPTELVPTPV-------------FEPVTPL-----------KEDPVTTI 774
Cdd:PHA03247 2792 SESRESLPSPWDPADPPaavlapaaALPPAASPAGPLPPPTsaqptapppppgpPPPSLPLggsvapggdvrRRPPSRSP 2871
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 775 ASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVPSPE 854
Cdd:PHA03247 2872 AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951
|
410 420 430 440
....*....|....*....|....*....|....*....|....*..
gi 1907118306 855 VTESKPVLPRVREPVTLRTETWVT------------TKAPKTPKRTR 889
Cdd:PHA03247 2952 AGEPSGAVPQPWLGALVPGRVAVPrfrvpqpapsreAPASSTPPLTG 2998
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1375-1466 |
3.02e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 3.02e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1375 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1452
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1907118306 1453 LGEGPASNTVAFST 1466
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
1376-1456 |
9.14e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.08 E-value: 9.14e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1376 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1453
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1907118306 1454 GEG 1456
Cdd:smart00060 81 GEG 83
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1376-1459 |
1.21e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 47.79 E-value: 1.21e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1376 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1452
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1907118306 1453 LGEGPAS 1459
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
954-1397 |
7.22e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 7.22e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 954 RTRRPHPRPK-TTASTGVSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSddlelvAFSTESPQKTIAPR----QTTSMP 1028
Cdd:PHA03247 2585 RARRPDAPPQsARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPA------ANEPDPHPPPTVPPperpRDDPAP 2658
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1029 PKLKTPH-SRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVTF--RVEPPKTTIA 1105
Cdd:PHA03247 2659 GRVSRPRrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaaRQASPALPAA 2738
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1106 PLeTRGIPLIPVI--SPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQAPHRlhTAPVRPRIPGrPHGRPALNK 1183
Cdd:PHA03247 2739 PA-PPAVPAGPATpgGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR--ESLPSPWDPA-DPPAAVLAP 2814
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1184 TTTRPDKTKPRGTSHKNGvgTGTKQAPKPPSPGRNASVDSHATRKPGSvSGTRRPPiphrhSSTRPVSPERRPLPPNNVT 1263
Cdd:PHA03247 2815 AAALPPAASPAGPLPPPT--SAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPP-----SRSPAAKPAAPARPPVRRL 2886
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1264 GKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHvryiP 1343
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ----P 2962
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 1907118306 1344 KPENKPCSITDSVRRFPTEEATEGNATSPPQNPPTNLTVVTVEGCPSFVILDWE 1397
Cdd:PHA03247 2963 WLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEE 3016
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
384-758 |
9.97e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.07 E-value: 9.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154 171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154 251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154 327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEP 678
Cdd:pfam03154 407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 679 PLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154 484 STSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1360-1471 |
1.04e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.92 E-value: 1.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1360 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1439
Cdd:COG3401 220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
|
90 100 110
....*....|....*....|....*....|...
gi 1907118306 1440 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1471
Cdd:COG3401 295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
116-195 |
1.89e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.63 E-value: 1.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1907118306 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
114-195 |
2.62e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.45 E-value: 2.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060 1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69
|
....*.
gi 1907118306 190 YEFGVK 195
Cdd:smart00060 70 YEFRVR 75
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
114-195 |
2.97e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 41.33 E-value: 2.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71
|
....
gi 1907118306 192 FGVK 195
Cdd:cd00063 72 FRVR 75
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
491-703 |
6.48e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 44.37 E-value: 6.48e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839 286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839 365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118306 648 THRQRTKYKTTQSPKIPHSKPAdlgpITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:NF033839 431 VKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKPEVKPQPEKPKPEVKPQPEKP 482
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
428-770 |
4.07e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 41.96 E-value: 4.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665 208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665 288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665 364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665 416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665 496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
507-889 |
3.82e-12 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 71.89 E-value: 3.82e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 654 KYKTTQSPKIP-------HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 726
Cdd:PHA03247 2713 HALVSATPLPPgpaaarqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 727 VPKLPQQPDYPHPKPKT--------TRSPAASPTELVPTPV-------------FEPVTPL-----------KEDPVTTI 774
Cdd:PHA03247 2792 SESRESLPSPWDPADPPaavlapaaALPPAASPAGPLPPPTsaqptapppppgpPPPSLPLggsvapggdvrRRPPSRSP 2871
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 775 ASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVPSPE 854
Cdd:PHA03247 2872 AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951
|
410 420 430 440
....*....|....*....|....*....|....*....|....*..
gi 1907118306 855 VTESKPVLPRVREPVTLRTETWVT------------TKAPKTPKRTR 889
Cdd:PHA03247 2952 AGEPSGAVPQPWLGALVPGRVAVPrfrvpqpapsreAPASSTPPLTG 2998
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
509-778 |
7.41e-12 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 70.49 E-value: 7.41e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449 542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 580 PKIVPKPPQKPKATRRPevpqvkpahepvtfgseapalaivtttdiepvitrtkasvttlaPKPPRPRTHRQRTKYKTTQ 659
Cdd:PTZ00449 620 IPKSPKRPESPKSPKRP--------------------------------------------PPPQRPSSPERPEGPKIIK 655
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 660 SPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHP 739
Cdd:PTZ00449 656 SPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFE 734
|
250 260 270
....*....|....*....|....*....|....*....
gi 1907118306 740 KPKTTRSPAASPTELVPTPVfEPVTPLKEDPVTTIASKI 778
Cdd:PTZ00449 735 PIGDPDAEQPDDIEFFTPPE-EERTFFHETPADTPLPDI 772
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
448-779 |
5.63e-11 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 68.04 E-value: 5.63e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 448 TATRDPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEP 527
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 528 ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKP-ALEPATVTPEILVPkivpkPPQKPKATRRPEVPQVKPAHE 606
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPaAALPPAASPAGPLP-----PPTSAQPTAPPPPPGPPPPSL 2850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 607 PvTFGSEAPALAIVTTTDiepviTRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKK 686
Cdd:PHA03247 2851 P-LGGSVAPGGDVRRRPP-----SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP 2924
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 687 VR-RPRPKPQTTPHPEVPhtilvPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKtTRSPAASPTELVPTPVFEPVTP 765
Cdd:PHA03247 2925 PPpQPQPPPPPPPRPQPP-----LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPR-FRVPQPAPSREAPASSTPPLTG 2998
|
330
....*....|....
gi 1907118306 766 LKEDPVTTIASKIS 779
Cdd:PHA03247 2999 HSLSRVSSWASSLA 3012
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
456-890 |
1.28e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 66.89 E-value: 1.28e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247 2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247 2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 616 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 693
Cdd:PHA03247 2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 694 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKED 769
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTA 2838
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 770 PVTT-------------IASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQK 836
Cdd:PHA03247 2839 PPPPpgppppslplggsVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQ 2918
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118306 837 TQKKHRPSPKPKPVPSPEVTESKPVLPR-----VREPVTLRTETWVTTKAPKTPKRTRR 890
Cdd:PHA03247 2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTtdpagAGEPSGAVPQPWLGALVPGRVAVPRF 2977
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1375-1466 |
3.02e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 3.02e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1375 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1452
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1907118306 1453 LGEGPASNTVAFST 1466
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
545-1180 |
6.40e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 64.57 E-value: 6.40e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 545 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-TFGSEAPALAIVTTT 623
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMlTWIRGLEELASDDAG 2549
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 624 DIEPvitrtkasvtTLAPKPPRPRTHRQrtkykttqspkIPHSKPAdlgPITSEPplaSTTKKVRRPRPKPQttphpevP 703
Cdd:PHA03247 2550 DPPP----------PLPPAAPPAAPDRS-----------VPPPRPA---PRPSEP---AVTSRARRPDAPPQ-------S 2595
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 704 HTILVPATSLEPFIITEAPGTtlvpkLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPlKEDPVTTIASKISQRTH 783
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSP-----LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERP-RDDPAPGRVSRPRRARR 2669
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 784 RPRPRPRPRPRPRPRPKATLSPQAPETKTV--PAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVPSPEVTESKPV 861
Cdd:PHA03247 2670 LGRAAQASSPPQRPRRRAARPTVGSLTSLAdpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 862 LPRVREPVTLRTetwvTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPSALSTEVPATVVLATALTPVTLR-- 939
Cdd:PHA03247 2750 TPGGPARPARPP----TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASpa 2825
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 940 TKAPKTTTLAPNVQRTRRPHPRPKTTASTGVSE----SKSAPTelQSLVLKPVTS--PSLEIIQSQSVSDDLELVAFSTE 1013
Cdd:PHA03247 2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPS--RSPAAKPAAParPPVRRLARPAVSRSTESFALPPD 2903
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1014 SPQKtiaPRQTTSMPPKLKTPHSRMPAKEPVPKEPlhttskPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPV 1093
Cdd:PHA03247 2904 QPER---PPQPQAPPPPQPQPQPPPPPQPQPPPPP------PPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV 2974
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1094 T-FRVEPPKTTIAPLETRGIPLIPVISPRPSQEELQTAM-EETDQSTQELFTT-KIPRTTELAKTTQA----PHRLHTAP 1166
Cdd:PHA03247 2975 PrFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTlWPPDDTEDSDADSLfdsdSERSDLEA 3054
|
650
....*....|....
gi 1907118306 1167 VRPrIPGRPHGRPA 1180
Cdd:PHA03247 3055 LDP-LPPEPHDPFA 3067
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
511-928 |
3.15e-09 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 62.01 E-value: 3.15e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 511 KRQSTPKPPRVKPAPE--PETRPSAQTTKAPRK-TKKPGHHRlrRPKTTRSPEVPKsKPAlePATVTPEILVPKIVPKP- 586
Cdd:PTZ00449 506 KHDEPPEGPEASGLPPkaPGDKEGEEGEHEDSKeSDEPKEGG--KPGETKEGEVGK-KPG--PAKEHKPSKIPTLSKKPe 580
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 587 -PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRTHRQRTKYKTTQSPKIP 664
Cdd:PTZ00449 581 fPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPqRPSSPERPEGPKIIKSPKPP 660
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 665 HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHPKPKTT 744
Cdd:PTZ00449 661 KSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFEPIGDP 739
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 745 RSPAASPTELVPTPVfEPVTPLKEDPVTTIASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETK---TVPAVV---- 817
Cdd:PTZ00449 740 DAEQPDDIEFFTPPE-EERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSEHEDKppgDHPSLPkkrh 818
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 818 ----LEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREPVTLR-------TETWVTTKAPKTPK 886
Cdd:PTZ00449 819 rldgLALSTTDLESDAGRIAKDASGKIVKLKRSKSFDDLTTVEEAEEMGAEARKIVVDDdgteaddEDTHPPEEKHKSEV 898
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 1907118306 887 RTRRPRPKPQTTPTPETPLTKPVAATDLEPSALSTEVPATVV 928
Cdd:PTZ00449 899 RRRRPPKKPSKPKKPSKPKKPKKPDSAFIPSIIAIFLVSLIV 940
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
1376-1456 |
9.14e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.08 E-value: 9.14e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1376 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1453
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1907118306 1454 GEG 1456
Cdd:smart00060 81 GEG 83
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1376-1459 |
1.21e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 47.79 E-value: 1.21e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1376 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1452
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1907118306 1453 LGEGPAS 1459
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
494-817 |
5.15e-06 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 51.61 E-value: 5.15e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 494 RPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEpetrpSAQTTKAPRKTKKPGhhRLRRPKTTRSPEVPKSKPALEPATV 573
Cdd:PTZ00449 560 KPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPE-----EPKKPKRPRSAQRPT--RPKSPKLPELLDIPKSPKRPESPKS 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 574 tpeilvPKiVPKPPQKPKATRRPEVPQV----KPAHEP-VTFG-----------SEAPALAIVTTTDIEPVITRTKASVT 637
Cdd:PTZ00449 633 ------PK-RPPPPQRPSSPERPEGPKIikspKPPKSPkPPFDpkfkekfyddyLDAAAKSKETKTTVVLDESFESILKE 705
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 638 TLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADlgpitsePPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFI 717
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGD-------PDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFK 778
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 718 ITEAPGTTLVP----KLPQQPDYPHPKPKTT-----------------------------RSPAASPTELVPTPVFEPVT 764
Cdd:PTZ00449 779 EEDIHAETGEPdeamKRPDSPSEHEDKPPGDhpslpkkrhrldglalsttdlesdagriaKDASGKIVKLKRSKSFDDLT 858
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118306 765 PLKEDPVT--------------------TIASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVV 817
Cdd:PTZ00449 859 TVEEAEEMgaearkivvdddgteaddedTHPPEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPSII 931
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
517-703 |
5.93e-06 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 51.21 E-value: 5.93e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 517 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 593
Cdd:PHA03377 414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 594 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPK---IPHSKPAD 670
Cdd:PHA03377 489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGFQRSGRRQKratPPKVSPSD 559
|
170 180 190
....*....|....*....|....*....|...
gi 1907118306 671 LGPITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:PHA03377 560 RGPPKASPPVMAPPSTGPRVMATPSTGPRDMAP 592
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
954-1397 |
7.22e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 7.22e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 954 RTRRPHPRPK-TTASTGVSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSddlelvAFSTESPQKTIAPR----QTTSMP 1028
Cdd:PHA03247 2585 RARRPDAPPQsARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPA------ANEPDPHPPPTVPPperpRDDPAP 2658
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1029 PKLKTPH-SRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVTF--RVEPPKTTIA 1105
Cdd:PHA03247 2659 GRVSRPRrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaaRQASPALPAA 2738
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1106 PLeTRGIPLIPVI--SPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQAPHRlhTAPVRPRIPGrPHGRPALNK 1183
Cdd:PHA03247 2739 PA-PPAVPAGPATpgGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR--ESLPSPWDPA-DPPAAVLAP 2814
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1184 TTTRPDKTKPRGTSHKNGvgTGTKQAPKPPSPGRNASVDSHATRKPGSvSGTRRPPiphrhSSTRPVSPERRPLPPNNVT 1263
Cdd:PHA03247 2815 AAALPPAASPAGPLPPPT--SAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPP-----SRSPAAKPAAPARPPVRRL 2886
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1264 GKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHvryiP 1343
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ----P 2962
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 1907118306 1344 KPENKPCSITDSVRRFPTEEATEGNATSPPQNPPTNLTVVTVEGCPSFVILDWE 1397
Cdd:PHA03247 2963 WLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEE 3016
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1158-1378 |
4.15e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 4.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1158 APHRLHTAPVRPRIPGRPHGRP---ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVDSHATRKPGSVSG 1234
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPsepAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1235 TRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGT 1314
Cdd:PHA03247 2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907118306 1315 TTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1378
Cdd:PHA03247 2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
690-1322 |
4.95e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 4.95e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 690 PRPKPQTTPHPEVPHTILVPAtslePFIITEAPGTTLVPklPQQPDYPHPKPKTTRSPAASPTELVPtPVFEPVTPLKED 769
Cdd:PHA03247 2496 PDPGGGGPPDPDAPPAPSRLA----PAILPDEPVGEPVH--PRMLTWIRGLEELASDDAGDPPPPLP-PAAPPAAPDRSV 2568
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 770 PVTTIASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKkhrPSPKPKP 849
Cdd:PHA03247 2569 PPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEP---DPHPPPT 2645
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 850 VPSPEVTESKPVLPRVREPVTLR-----------TETWVTTKAPKT------------PKRTRRPRPKPQttptpetplt 906
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARrlgraaqasspPQRPRRRAARPTvgsltsladpppPPPTPEPAPHAL---------- 2715
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 907 kpVAATDLEPSALStevpatvvlATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGVSESKSAP-TELQSLVL 985
Cdd:PHA03247 2716 --VSATPLPPGPAA---------ARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPaAGPPRRLT 2784
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 986 KPVTSPSLEIIQSQSVSDDLELVAFSTESPQKTIAPRQTTSMPpkLKTPHSRMPAKEPVPKEPLHTTSKP---------- 1055
Cdd:PHA03247 2785 RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP--LPPPTSAQPTAPPPPPGPPPPSLPLggsvapggdv 2862
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1056 -KMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPLEtrgiplipvisPRPSQEELQTAMEET 1134
Cdd:PHA03247 2863 rRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPP-----------PQPQPQPPPPPQPQP 2931
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1135 DQSTQELFTTKIPRTTELAKTTQAPHRLHTAPVRPRIPGRPhgrPALNKTTTRPDKTKPrgtshkngvgtgtkqAPKPPS 1214
Cdd:PHA03247 2932 PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRV---AVPRFRVPQPAPSRE---------------APASST 2993
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1215 PGRNASVDSHATRKPGSVSgtrrppiPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSpplKATLHPIGTATA 1294
Cdd:PHA03247 2994 PPLTGHSLSRVSSWASSLA-------LHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSD---LEALDPLPPEPH 3063
|
650 660
....*....|....*....|....*...
gi 1907118306 1295 RPGAEQKEPTAPASEEEFGTTTDFSSSP 1322
Cdd:PHA03247 3064 DPFAHEPDPATPEAGARESPSSQFGPPP 3091
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
384-758 |
9.97e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.07 E-value: 9.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154 171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154 251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154 327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEP 678
Cdd:pfam03154 407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 679 PLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154 484 STSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1360-1471 |
1.04e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.92 E-value: 1.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1360 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1439
Cdd:COG3401 220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
|
90 100 110
....*....|....*....|....*....|...
gi 1907118306 1440 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1471
Cdd:COG3401 295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1360-1514 |
1.36e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.53 E-value: 1.36e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1360 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1437
Cdd:COG3401 314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118306 1438 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1514
Cdd:COG3401 388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
116-195 |
1.89e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.63 E-value: 1.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1907118306 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
553-777 |
2.50e-04 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 45.69 E-value: 2.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 553 PKTTRSPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATRRPEVP-----QVKPAHEPVTFGSEAPALAIVTTTDIEP 627
Cdd:PLN03209 330 PKESDAADGPKPVP---TKPVTPEAPSPPIEEEPPQPKAVVPRPLSPytayeDLKPPTSPIPTPPSSSPASSKSVDAVAK 406
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 628 VITRTKASVTTLAPKPPRPRTHRQRTKyktTQSPKIPHSKPADLGPITSepplasttkkvrrPRPKPQTTPHPEVPHTIL 707
Cdd:PLN03209 407 PAEPDVVPSPGSASNVPEVEPAQVEAK---KTRPLSPYARYEDLKPPTS-------------PSPTAPTGVSPSVSSTSS 470
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907118306 708 VPATSLEP----FIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIASK 777
Cdd:PLN03209 471 VPAVPDTApataATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADE 544
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
114-195 |
2.62e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.45 E-value: 2.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060 1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69
|
....*.
gi 1907118306 190 YEFGVK 195
Cdd:smart00060 70 YEFRVR 75
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
114-195 |
2.97e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 41.33 E-value: 2.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71
|
....
gi 1907118306 192 FGVK 195
Cdd:cd00063 72 FRVR 75
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
515-626 |
5.12e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 44.80 E-value: 5.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 594
Cdd:PRK14950 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
|
90 100 110
....*....|....*....|....*....|..
gi 1907118306 595 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 626
Cdd:PRK14950 434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
491-703 |
6.48e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 44.37 E-value: 6.48e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839 286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839 365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118306 648 THRQRTKYKTTQSPKIPHSKPAdlgpITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:NF033839 431 VKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKPEVKPQPEKPKPEVKPQPEKP 482
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
458-771 |
1.49e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 43.30 E-value: 1.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 458 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 527
Cdd:PRK07003 372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 528 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 596
Cdd:PRK07003 451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 597 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTK--------ASVTTLAPKPPRPRTHRQrtkyktTQSPKIPHS 666
Cdd:PRK07003 531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAraaaaakpAAAPAAAPKPAAPRVAVQ------VPTPRARAA 604
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 667 KPADLGPITSEPPLASTTkkvRRPRPkpqttPHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRS 746
Cdd:PRK07003 605 TGDAPPNGAARAEQAAES---RGAPP-----PWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADA 673
|
330 340
....*....|....*....|....*
gi 1907118306 747 PAAsPTELVPTPvfePVTPLkeDPV 771
Cdd:PRK07003 674 PAP-PVDTRPLP---PAIPL--DAI 692
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
420-747 |
1.54e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.54 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 420 SRVVQPQTATYDVI---------SSSTTSDETEIEIHTATRDPILDSVPPKTSRTA-EQPRATLAPIEALFESRNVeIFT 489
Cdd:PRK10263 297 NRATQPEYDEYDPLlngapitepVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPpAQPTVAWQPVPGPQTGEPV-IAP 375
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 490 SPEVRPTTAAPQQTTSIPSTPKRQSTP--KPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:PRK10263 376 APEGYPQQSQYAQPAVQYNEPLQQPVQpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQST 455
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 568 LEP-ATVTPEILVPKIVPKPP---------QKPKATRRPEVPQVKPAHEPVTFGSEapalaivtttdIEPVITRTKASVT 637
Cdd:PRK10263 456 FAPqSTYQTEQTYQQPAAQEPlyqqpqpveQQPVVEPEPVVEETKPARPPLYYFEE-----------VEEKRAREREQLA 524
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 638 TLAPKPPRPrthrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRrprpkpQTTPHPEVPHTILVPATSLepfi 717
Cdd:PRK10263 525 AWYQPIPEP------VKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVK------KATLATGAAATVAAPVFSL---- 588
|
330 340 350
....*....|....*....|....*....|
gi 1907118306 718 iteAPGTTLVPKLPQQPDYPHPKPKTTRSP 747
Cdd:PRK10263 589 ---ANSGGPRPQVKEGIGPQLPRPKRIRVP 615
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
491-779 |
1.76e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 43.06 E-value: 1.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PHA03369 362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPT 441
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 571 ATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaivtTTDIEPVITRTKASVTTLAPKPPRPRTHR 650
Cdd:PHA03369 442 NPYVMPISMANMVYPGHPQEHGHERKRKRGGELKEELIETLKLVK------KLKEEQESLAKELEATAHKSEIKKIAESE 515
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 651 QRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT------ 724
Cdd:PHA03369 516 FKNAGAKTAAANIEPNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTaealag 595
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118306 725 ---TLVPKLPQQPDYPHpkpktTRSPAASPTELVPTPVFEPVTPLKEDPVTTIASKIS 779
Cdd:PHA03369 596 aieTLLTQASAQPAGLS-----LPAPAVPVNASTPASTPPPLAPQEPPQPGTSAPSLE 648
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
490-618 |
1.84e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 42.93 E-value: 1.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 490 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 568
Cdd:PRK07994 370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118306 569 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 618
Cdd:PRK07994 450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1015-1266 |
1.93e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 43.14 E-value: 1.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1015 PQKTIAPRQTTSMPPKLKTPhsRMPAKEPVPKEPLhTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT 1094
Cdd:PTZ00449 569 PSKIPTLSKKPEFPKDPKHP--KDPEEPKKPKRPR-SAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSP 645
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1095 FRVEPPKT--TIAPLETRGIPLIPVISPRPSQEELQTAMEETDQST----QELFTTKIPRTTELAKTTQAPHRLHTAPVR 1168
Cdd:PTZ00449 646 ERPEGPKIikSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTtvvlDESFESILKETLPETPGTPFTTPRPLPPKL 725
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1169 PRIPGRPHGRPAlnktttRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHRhsstR 1248
Cdd:PTZ00449 726 PRDEEFPFEPIG------DPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMK----R 795
|
250
....*....|....*....
gi 1907118306 1249 PVSP-ERRPLPPNNVTGKP 1266
Cdd:PTZ00449 796 PDSPsEHEDKPPGDHPSLP 814
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
480-628 |
2.18e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 42.49 E-value: 2.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 480 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 559
Cdd:PRK14950 351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118306 560 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 628
Cdd:PRK14950 427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
515-718 |
2.54e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 42.51 E-value: 2.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKP----GHHRL--RRPKTTRSPEVPKSKPALEPATVTPE--ILVPKIVPKP 586
Cdd:PRK14086 87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRAddRPPGLPRQDQLPTARPAYPAYQQRPEpgAWPRAADDYG 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 587 PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHS 666
Cdd:PRK14086 167 WQQQRLGFPPRAPYASPASYAPEQERDREPYD-----AGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRG 241
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907118306 667 KPADLGPItSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFII 718
Cdd:PRK14086 242 GPGPPERD-DAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKYTFDTFVI 292
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
491-766 |
2.94e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.45 E-value: 2.94e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154 172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPvtfgseapalaivtttdiepvitrtkasvttlAPKPPRPRT 648
Cdd:pfam03154 252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQ--------------------------------HPVPPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 649 hrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPkPQTTPHPEVPhtilVPATSLEPfiiteaPGTTLVP 728
Cdd:pfam03154 300 ----LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-PREQPLPPAP----LSMPHIKP------PPTTPIP 364
|
250 260 270
....*....|....*....|....*....|....*...
gi 1907118306 729 KLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 766
Cdd:pfam03154 365 QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSL 402
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
469-600 |
3.46e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 42.28 E-value: 3.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 469 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 544
Cdd:PRK07764 371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118306 545 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 600
Cdd:PRK07764 449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
428-770 |
4.07e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 41.96 E-value: 4.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665 208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665 288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665 364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665 416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665 496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
|
|
| PRK14954 |
PRK14954 |
DNA polymerase III subunits gamma and tau; Provisional |
515-612 |
4.37e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184918 [Multi-domain] Cd Length: 620 Bit Score: 41.85 E-value: 4.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 515 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 594
Cdd:PRK14954 385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
|
90
....*....|....*...
gi 1907118306 595 RPEVPQVKPAhepVTFGS 612
Cdd:PRK14954 451 PRNVASGKPG---VDLGS 465
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
531-781 |
4.63e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 4.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 531 PSAQTTKAPRKTKKPGHHRlrrpKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPqvkPAHEPVTF 610
Cdd:PHA03247 255 PAPPPVVGEGADRAPETAR----GATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA---PAGDAEEE 327
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 611 GSEAPALAIVTttdiePVitrtkasvttlapkpPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRp 690
Cdd:PHA03247 328 DDEDGAMEVVS-----PL---------------PRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRR- 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 691 rpkpqTTPHPEVPHTiLVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:PHA03247 387 -----SARHAATPFA-RGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATE 460
|
250
....*....|.
gi 1907118306 771 VTTIASKISQR 781
Cdd:PHA03247 461 PAPDDPDDATR 471
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
669-836 |
4.69e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.99 E-value: 4.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 669 ADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPA 748
Cdd:PRK10263 335 APVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQ 414
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 749 ASPTELVPTPVFEPVTPLKEDPVTTIASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTV-PAVVLEPVTLRPEV 827
Cdd:PRK10263 415 PAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQqPQPVEQQPVVEPEP 494
|
....*....
gi 1907118306 828 QVTTLAPQK 836
Cdd:PRK10263 495 VVEETKPAR 503
|
|
| COG3979 |
COG3979 |
Chitodextrinase [Carbohydrate transport and metabolism]; |
1372-1471 |
5.10e-03 |
|
Chitodextrinase [Carbohydrate transport and metabolism];
Pssm-ID: 443178 [Multi-domain] Cd Length: 369 Bit Score: 41.30 E-value: 5.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 1372 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1450
Cdd:COG3979 2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
|
90 100
....*....|....*....|.
gi 1907118306 1451 nplgeGPASNTVAFSTESADP 1471
Cdd:COG3979 72 -----DAAGNVSAASGTSTAM 87
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
491-673 |
5.22e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.40 E-value: 5.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PRK12323 383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118306 571 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 640
Cdd:PRK12323 463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
|
170 180 190
....*....|....*....|....*....|...
gi 1907118306 641 PKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGP 673
Cdd:PRK12323 543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
|
|
|