|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
507-1002 |
7.31e-16 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 84.22 E-value: 7.31e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 654 KYKTTQSPKIP-------HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 726
Cdd:PHA03247 2713 HALVSATPLPPgpaaarqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 727 VPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPitdLERVTDLETPVA----FRTEAPG 798
Cdd:PHA03247 2792 SESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP---PPPSLPLGGSVApggdVRRRPPS 2868
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 799 TTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVP 878
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 879 SPEVTESKPVLPRVREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLepsALSTEvpaTVAPNV 958
Cdd:PHA03247 2949 PAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSL---ALHEE---TDPPPV 3022
|
490 500 510 520
....*....|....*....|....*....|....*....|....
gi 1907118313 959 QRTRRPHPRPKTTASTGVSESKSVSDDLELVAFSTESPQKTIAP 1002
Cdd:PHA03247 3023 SLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPF 3066
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1356-1447 |
2.93e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 2.93e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1356 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1433
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1907118313 1434 LGEGPASNTVAFST 1447
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
1357-1437 |
9.03e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.08 E-value: 9.03e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1357 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1434
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1907118313 1435 GEG 1437
Cdd:smart00060 81 GEG 83
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1357-1440 |
1.11e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 48.18 E-value: 1.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1357 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1433
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1907118313 1434 LGEGPAS 1440
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
938-1378 |
1.94e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 1.94e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 938 ATDLEPSALSTEVPATVAPNVQRTRRPHPRPKttASTGVSESKSVSDDLELVAFSTESPQKTIAPRQTTsmPPKLKTPHS 1017
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPP--PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVS--RPRRARRLG 2671
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1018 RMPAKEPVPKEPLHTTSKPKMPPSPEVADttsvPKDERLSLKPDPEVTHSETVLPPVTF--RVEPPKTTIAPLeTRGIPL 1095
Cdd:PHA03247 2672 RAAQASSPPQRPRRRAARPTVGSLTSLAD----PPPPPPTPEPAPHALVSATPLPPGPAaaRQASPALPAAPA-PPAVPA 2746
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1096 IPVI--SPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQAPHRlhTAPVRPRIPGrPHGRPALNKTTTRPDKTK 1173
Cdd:PHA03247 2747 GPATpgGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR--ESLPSPWDPA-DPPAAVLAPAAALPPAAS 2823
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1174 PRGTSHKNGvgTGTKQAPKPPSPGRNASVDSHATRKPGSvSGTRRPPiphrhSSTRPVSPERRPLPPNNVTGKPGRAGIV 1253
Cdd:PHA03247 2824 PAGPLPPPT--SAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPP-----SRSPAAKPAAPARPPVRRLARPAVSRST 2895
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1254 SSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHvryiPKPENKPCSI 1333
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ----PWLGALVPGR 2971
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 1907118313 1334 TDSVRRFPTEEATEGNATSPPQNPPTNLTVVTVEGCPSFVILDWE 1378
Cdd:PHA03247 2972 VAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEE 3016
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
495-758 |
4.68e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.69 E-value: 4.68e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 495 PTTAAPQQTTSIPSTPKRQS---TPKPPRVKPAPE-----PETRPSAQT----TKAPRKTKKPGHhrLRRPKTTRSPEVP 562
Cdd:pfam03154 313 PSPAAPGQSQQRIHTPPSQSqlqSQQPPREQPLPPaplsmPHIKPPPTTpipqLPNPQSHKHPPH--LSGPSPFQMNSNL 390
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 563 KSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPahepvtfgseapalaivtttdiePVITRTKASVTTLAPK 642
Cdd:pfam03154 391 PPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP-----------------------PVLTQSQSLPPPAASH 447
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 643 PPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAP 722
Cdd:pfam03154 448 PPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA 523
|
250 260 270
....*....|....*....|....*....|....*.
gi 1907118313 723 gttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154 524 -----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1341-1452 |
9.88e-05 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.92 E-value: 9.88e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1341 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1420
Cdd:COG3401 220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
|
90 100 110
....*....|....*....|....*....|...
gi 1907118313 1421 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1452
Cdd:COG3401 295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
116-195 |
1.83e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.63 E-value: 1.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1907118313 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
114-195 |
2.59e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.45 E-value: 2.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060 1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69
|
....*.
gi 1907118313 190 YEFGVK 195
Cdd:smart00060 70 YEFRVR 75
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
114-195 |
2.85e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 41.33 E-value: 2.85e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71
|
....
gi 1907118313 192 FGVK 195
Cdd:cd00063 72 FRVR 75
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
491-703 |
5.92e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 44.37 E-value: 5.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839 286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839 365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118313 648 THRQRTKYKttqspkiPHSKPADLGPITSEPPLASTTKKVRRP---RPKPQTTPHPEVP 703
Cdd:NF033839 431 VKPQPEKPK-------PEVKPQPEKPKPEVKPQPETPKPEVKPqpeKPKPEVKPQPEKP 482
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
428-770 |
2.84e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 42.34 E-value: 2.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665 208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665 288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665 364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665 416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665 496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
507-1002 |
7.31e-16 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 84.22 E-value: 7.31e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 654 KYKTTQSPKIP-------HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 726
Cdd:PHA03247 2713 HALVSATPLPPgpaaarqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 727 VPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPitdLERVTDLETPVA----FRTEAPG 798
Cdd:PHA03247 2792 SESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP---PPPSLPLGGSVApggdVRRRPPS 2868
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 799 TTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVP 878
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 879 SPEVTESKPVLPRVREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLepsALSTEvpaTVAPNV 958
Cdd:PHA03247 2949 PAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSL---ALHEE---TDPPPV 3022
|
490 500 510 520
....*....|....*....|....*....|....*....|....
gi 1907118313 959 QRTRRPHPRPKTTASTGVSESKSVSDDLELVAFSTESPQKTIAP 1002
Cdd:PHA03247 3023 SLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPF 3066
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
509-776 |
2.17e-12 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 72.41 E-value: 2.17e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449 542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 580 PKIVPKPPQKPKATRRPevpqvkpahepvtfgseapalaivtttdiepvitrtkasvttlaPKPPRPRTHRQRTKYKTTQ 659
Cdd:PTZ00449 620 IPKSPKRPESPKSPKRP--------------------------------------------PPPQRPSSPERPEGPKIIK 655
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 660 SPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHP 739
Cdd:PTZ00449 656 SPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFE 734
|
250 260 270
....*....|....*....|....*....|....*..
gi 1907118313 740 KPKTTRSPAASPTELVPTPVfEPVTPLKEDPVTTIVP 776
Cdd:PTZ00449 735 PIGDPDAEQPDDIEFFTPPE-EERTFFHETPADTPLP 770
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
545-1161 |
9.03e-12 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 70.74 E-value: 9.03e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 545 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-------------TFG 611
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasdDAG 2549
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 612 SEAPALAivttTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPR 691
Cdd:PHA03247 2550 DPPPPLP----PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 692 PKPQT-TPHP-EVPHTILVPATSLEPFIITEAPGTTLVPK---LPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 766
Cdd:PHA03247 2626 PPPPSpSPAAnEPDPHPPPTVPPPERPRDDPAPGRVSRPRrarRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPP 2705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 767 K--EDPVTTIVPITDLERVtdletPVAFRTEAPGTTLASkiSQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVV 844
Cdd:PHA03247 2706 PtpEPAPHALVSATPLPPG-----PAAARQASPALPAAP--APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 845 LEPVTLRPEVqvttlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREPVT-LRTETWVTTKAPKTPKrtrrprpkpq 923
Cdd:PHA03247 2779 PPRRLTRPAV-----ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGpLPPPTSAQPTAPPPPP---------- 2843
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 924 ttptpetpltkpvaatdlEPSALSTEVPATVAPNVQRTRRPHPRPKTTASTGVSESKSVSDDLELVAFSTES---PQKTI 1000
Cdd:PHA03247 2844 ------------------GPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalPPDQP 2905
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1001 APRQTTSMPPKLKTPHSRMPAKEPVPKEPlhttsKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT-FRVE 1079
Cdd:PHA03247 2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPP-----PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVP 2980
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1080 PPKTTIAPLETRGIPLIPVISPRPSQEELQTAM-EETDQSTQELFTT-KIPRTTELAKTTQA----PHRLHTAPVRPrIP 1153
Cdd:PHA03247 2981 QPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTlWPPDDTEDSDADSLfdsdSERSDLEALDP-LP 3059
|
....*...
gi 1907118313 1154 GRPHGRPA 1161
Cdd:PHA03247 3060 PEPHDPFA 3067
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
448-743 |
2.74e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 65.73 E-value: 2.74e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 448 TATRDPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEP 527
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 528 ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKP-ALEPATVTPEILVP---KIVPKPPQKPKATRRPEVPQVKP 603
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPaAALPPAASPAGPLPpptSAQPTAPPPPPGPPPPSLPLGGS 2855
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 604 AHEPVTFGSEAPALAIVTTtdiepVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAST 683
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAK-----PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118313 684 TKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKL---PQQPDYPHPKPKT 743
Cdd:PHA03247 2931 PPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFrvpQPAPSREAPASST 2993
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1356-1447 |
2.93e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 58.28 E-value: 2.93e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1356 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1433
Cdd:cd00063 2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
|
90
....*....|....
gi 1907118313 1434 LGEGPASNTVAFST 1447
Cdd:cd00063 80 GGESPPSESVTVTT 93
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
494-844 |
8.44e-08 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 57.39 E-value: 8.44e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 494 RPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEpetrpSAQTTKAPRKTKKPGhhRLRRPKTTRSPEVPKSKPALEPATV 573
Cdd:PTZ00449 560 KPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPE-----EPKKPKRPRSAQRPT--RPKSPKLPELLDIPKSPKRPESPKS 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 574 tpeilvPKiVPKPPQKPKATRRPEVPQV----KPAHEP-VTFGseaPALAIVTTTDIEPVITRTKASVTTLAPKpprpRT 648
Cdd:PTZ00449 633 ------PK-RPPPPQRPSSPERPEGPKIikspKPPKSPkPPFD---PKFKEKFYDDYLDAAAKSKETKTTVVLD----ES 698
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 649 HRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTT--KKVRRPRPKPQTTPHPEVPHTILV---PATSLEPFIITEAPG 723
Cdd:PTZ00449 699 FESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEpiGDPDAEQPDDIEFFTPPEEERTFFhetPADTPLPDILAEEFK 778
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 724 TTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVTPLK----------------------EDPVTTIVPITDL 780
Cdd:PTZ00449 779 EEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSLPKKrhrldglalsttdlesdagriaKDASGKIVKLKRS 851
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 781 ERVTDLET--------PVAFR-------TEAPGT-TLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVV 844
Cdd:PTZ00449 852 KSFDDLTTveeaeemgAEARKivvdddgTEADDEdTHPPEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPSII 931
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
1357-1437 |
9.03e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 51.08 E-value: 9.03e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1357 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1434
Cdd:smart00060 3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1907118313 1435 GEG 1437
Cdd:smart00060 81 GEG 83
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
381-767 |
1.86e-07 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 56.24 E-value: 1.86e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 381 KRFPEFPEAKTAFPLEKPRGSWASSEEPwVVPGAKTSEdSRVVQPQTATYDVISSSTTSDETEIEI---------HTATR 451
Cdd:PTZ00449 494 KKLAPIEEEDSDKHDEPPEGPEASGLPP-KAPGDKEGE-EGEHEDSKESDEPKEGGKPGETKEGEVgkkpgpakeHKPSK 571
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 452 DPILDSVP-----PKTSRTAEQPRATLAPIEAlfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRvkpAPE 526
Cdd:PTZ00449 572 IPTLSKKPefpkdPKHPKDPEEPKKPKRPRSA-----------QRPTRPKSPKLPELLDIPKSPKRPESPKSPK---RPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 527 PETRPSAQttkaprktkkpghhrlRRPKTTRSPEVPKSKPAlepatvtpeilvpkivPKPPQKPKATRRPEVPQVKPAHE 606
Cdd:PTZ00449 638 PPQRPSSP----------------ERPEGPKIIKSPKPPKS----------------PKPPFDPKFKEKFYDDYLDAAAK 685
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 607 PVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRThrqrtkykttqsPKIPHSKPADlgpitsePPLASTTK 685
Cdd:PTZ00449 686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPkLPRD------------EEFPFEPIGD-------PDAEQPDD 746
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 686 KVRRPRPKPQTTPHPEvphtilVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVT 764
Cdd:PTZ00449 747 IEFFTPPEEERTFFHE------TPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSL 813
|
...
gi 1907118313 765 PLK 767
Cdd:PTZ00449 814 PKK 816
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
1357-1440 |
1.11e-06 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 48.18 E-value: 1.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1357 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1433
Cdd:pfam00041 2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78
|
....*..
gi 1907118313 1434 LGEGPAS 1440
Cdd:pfam00041 79 GGEGPPS 85
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
938-1378 |
1.94e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 1.94e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 938 ATDLEPSALSTEVPATVAPNVQRTRRPHPRPKttASTGVSESKSVSDDLELVAFSTESPQKTIAPRQTTsmPPKLKTPHS 1017
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPP--PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVS--RPRRARRLG 2671
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1018 RMPAKEPVPKEPLHTTSKPKMPPSPEVADttsvPKDERLSLKPDPEVTHSETVLPPVTF--RVEPPKTTIAPLeTRGIPL 1095
Cdd:PHA03247 2672 RAAQASSPPQRPRRRAARPTVGSLTSLAD----PPPPPPTPEPAPHALVSATPLPPGPAaaRQASPALPAAPA-PPAVPA 2746
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1096 IPVI--SPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQAPHRlhTAPVRPRIPGrPHGRPALNKTTTRPDKTK 1173
Cdd:PHA03247 2747 GPATpgGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR--ESLPSPWDPA-DPPAAVLAPAAALPPAAS 2823
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1174 PRGTSHKNGvgTGTKQAPKPPSPGRNASVDSHATRKPGSvSGTRRPPiphrhSSTRPVSPERRPLPPNNVTGKPGRAGIV 1253
Cdd:PHA03247 2824 PAGPLPPPT--SAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPP-----SRSPAAKPAAPARPPVRRLARPAVSRST 2895
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1254 SSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHvryiPKPENKPCSI 1333
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ----PWLGALVPGR 2971
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 1907118313 1334 TDSVRRFPTEEATEGNATSPPQNPPTNLTVVTVEGCPSFVILDWE 1378
Cdd:PHA03247 2972 VAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEE 3016
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
495-758 |
4.68e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.69 E-value: 4.68e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 495 PTTAAPQQTTSIPSTPKRQS---TPKPPRVKPAPE-----PETRPSAQT----TKAPRKTKKPGHhrLRRPKTTRSPEVP 562
Cdd:pfam03154 313 PSPAAPGQSQQRIHTPPSQSqlqSQQPPREQPLPPaplsmPHIKPPPTTpipqLPNPQSHKHPPH--LSGPSPFQMNSNL 390
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 563 KSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPahepvtfgseapalaivtttdiePVITRTKASVTTLAPK 642
Cdd:pfam03154 391 PPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP-----------------------PVLTQSQSLPPPAASH 447
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 643 PPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAP 722
Cdd:pfam03154 448 PPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA 523
|
250 260 270
....*....|....*....|....*....|....*.
gi 1907118313 723 gttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154 524 -----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
517-703 |
5.71e-06 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 51.21 E-value: 5.71e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 517 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 593
Cdd:PHA03377 414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 594 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPK---IPHSKPAD 670
Cdd:PHA03377 489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGFQRSGRRQKratPPKVSPSD 559
|
170 180 190
....*....|....*....|....*....|...
gi 1907118313 671 LGPITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:PHA03377 560 RGPPKASPPVMAPPSTGPRVMATPSTGPRDMAP 592
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1139-1359 |
4.06e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 4.06e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1139 APHRLHTAPVRPRIPGRPHGRP---ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVDSHATRKPGSVSG 1215
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPsepAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1216 TRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGT 1295
Cdd:PHA03247 2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907118313 1296 TTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1359
Cdd:PHA03247 2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1341-1452 |
9.88e-05 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.92 E-value: 9.88e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1341 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1420
Cdd:COG3401 220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
|
90 100 110
....*....|....*....|....*....|...
gi 1907118313 1421 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1452
Cdd:COG3401 295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
690-1303 |
1.13e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 1.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 690 PRPKPQTTPHPEVPHTILVPATSLEPfiiTEAPGTTLVPKL------------PQQPDYPHPKPKTTRSPAasPTELVPT 757
Cdd:PHA03247 2496 PDPGGGGPPDPDAPPAPSRLAPAILP---DEPVGEPVHPRMltwirgleelasDDAGDPPPPLPPAAPPAA--PDRSVPP 2570
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 758 PVFEPVTPlkEDPVTTIVPITDL-ERVTDLETPVAFRTEAPGTTLASKISQRTHRPRP-RPRPRPrprprpkATLSPQAP 835
Cdd:PHA03247 2571 PRPAPRPS--EPAVTSRARRPDApPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPpPPSPSP-------AANEPDPH 2641
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 836 ETKTVPAvvlepvtlrPEVQVTTLAPQKTqKKHRPSPKPKPVPSPEVTESKPVLPRVREPVTLRTETWVTTKAPKTPKRT 915
Cdd:PHA03247 2642 PPPTVPP---------PERPRDDPAPGRV-SRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPA 2711
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 916 RRPRPKPQTTPTPETPLTKPVAATDLEPSAlsteVPATVAPNVQRTRRPHPRPKTTASTGVSESKSVSddlelvafSTES 995
Cdd:PHA03247 2712 PHALVSATPLPPGPAAARQASPALPAAPAP----PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP--------AAGP 2779
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 996 PQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKEPLHTTSKPKMPPSPEVA-DTTSVPKDERLSLKPDPEVTHSE-TVLPP 1073
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPpPTSAQPTAPPPPPGPPPPSLPLGgSVAPG 2859
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1074 VTFRVEPPkttiapleTRGIPLIPVISPRP-----SQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQAPHRLHTAPV 1148
Cdd:PHA03247 2860 GDVRRRPP--------SRSPAAKPAAPARPpvrrlARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1149 RPRIPGRPHGRPALNKTTTRPDKTKPRGTSHKNG-VGTGTKQAPK----PPSPGRNASVDSHATRKPGSVSGTRRPPIP- 1222
Cdd:PHA03247 2932 PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGaLVPGRVAVPRfrvpQPAPSREAPASSTPPLTGHSLSRVSSWASSl 3011
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1223 --HRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSpplKATLHPIGTATARPGAEQKEPTAPASEEEFGTTTDFS 1300
Cdd:PHA03247 3012 alHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSD---LEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFG 3088
|
...
gi 1907118313 1301 SSP 1303
Cdd:PHA03247 3089 PPP 3091
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1341-1495 |
1.29e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 46.53 E-value: 1.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1341 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1418
Cdd:COG3401 314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118313 1419 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1495
Cdd:COG3401 388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
116-195 |
1.83e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 41.63 E-value: 1.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71
|
...
gi 1907118313 193 GVK 195
Cdd:pfam00041 72 RVQ 74
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
114-195 |
2.59e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 41.45 E-value: 2.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060 1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69
|
....*.
gi 1907118313 190 YEFGVK 195
Cdd:smart00060 70 YEFRVR 75
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
114-195 |
2.85e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 41.33 E-value: 2.85e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71
|
....
gi 1907118313 192 FGVK 195
Cdd:cd00063 72 FRVR 75
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
515-626 |
4.84e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 44.80 E-value: 4.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 594
Cdd:PRK14950 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
|
90 100 110
....*....|....*....|....*....|..
gi 1907118313 595 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 626
Cdd:PRK14950 434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
491-703 |
5.92e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 44.37 E-value: 5.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839 286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839 365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118313 648 THRQRTKYKttqspkiPHSKPADLGPITSEPPLASTTKKVRRP---RPKPQTTPHPEVP 703
Cdd:NF033839 431 VKPQPEKPK-------PEVKPQPEKPKPEVKPQPETPKPEVKPqpeKPKPEVKPQPEKP 482
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
553-775 |
8.63e-04 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 43.76 E-value: 8.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 553 PKTTRSPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATRRPEVP-----QVKPAHEPVTFGSEAPALAIVTTTDIEP 627
Cdd:PLN03209 330 PKESDAADGPKPVP---TKPVTPEAPSPPIEEEPPQPKAVVPRPLSPytayeDLKPPTSPIPTPPSSSPASSKSVDAVAK 406
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 628 VITRTKASVTTLAPKPPRPRTHRQRTKyktTQSPKIPHSKPADLGPITSepplasttkkvrrPRPKPQTTPHPEVPHTIL 707
Cdd:PLN03209 407 PAEPDVVPSPGSASNVPEVEPAQVEAK---KTRPLSPYARYEDLKPPTS-------------PSPTAPTGVSPSVSSTSS 470
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907118313 708 VPATSLEP----FIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIV 775
Cdd:PLN03209 471 VPAVPDTApataATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
491-705 |
1.10e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 43.90 E-value: 1.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 491 PEVRPTTAapQQTTSIPSTPKRqSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGH----HRLRRPKTTRSPEV---PK 563
Cdd:PHA03378 576 PLTSPTTS--QLASSAPSYAQT-PWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRpipmRPLRMQPITFNVLVfptPH 652
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 564 SKPALEPATVTPEILVPKIVP-----------KPPQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRT 632
Cdd:PHA03378 653 QPPQVEITPYKPTWTQIGHIPyqpsptgantmLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPG 732
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118313 633 KASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHT 705
Cdd:PHA03378 733 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPT 805
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
458-771 |
1.46e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 43.30 E-value: 1.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 458 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 527
Cdd:PRK07003 372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 528 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 596
Cdd:PRK07003 451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 597 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTK--------ASVTTLAPKPPRPRTHRQrtkyktTQSPKIPHS 666
Cdd:PRK07003 531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAraaaaakpAAAPAAAPKPAAPRVAVQ------VPTPRARAA 604
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 667 KPADLGPITSEPPLASTTkkvRRPRPkpqttPHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRS 746
Cdd:PRK07003 605 TGDAPPNGAARAEQAAES---RGAPP-----PWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADA 673
|
330 340
....*....|....*....|....*
gi 1907118313 747 PAAsPTELVPTPvfePVTPLkeDPV 771
Cdd:PRK07003 674 PAP-PVDTRPLP---PAIPL--DAI 692
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
420-747 |
1.49e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.54 E-value: 1.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 420 SRVVQPQTATYDVI---------SSSTTSDETEIEIHTATRDPILDSVPPKTSRTA-EQPRATLAPIEALFESRNVeIFT 489
Cdd:PRK10263 297 NRATQPEYDEYDPLlngapitepVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPpAQPTVAWQPVPGPQTGEPV-IAP 375
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 490 SPEVRPTTAAPQQTTSIPSTPKRQSTP--KPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:PRK10263 376 APEGYPQQSQYAQPAVQYNEPLQQPVQpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQST 455
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 568 LEP-ATVTPEILVPKIVPKPP---------QKPKATRRPEVPQVKPAHEPVTFGSEapalaivtttdIEPVITRTKASVT 637
Cdd:PRK10263 456 FAPqSTYQTEQTYQQPAAQEPlyqqpqpveQQPVVEPEPVVEETKPARPPLYYFEE-----------VEEKRAREREQLA 524
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 638 TLAPKPPRPrthrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRrprpkpQTTPHPEVPHTILVPATSLepfi 717
Cdd:PRK10263 525 AWYQPIPEP------VKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVK------KATLATGAAATVAAPVFSL---- 588
|
330 340 350
....*....|....*....|....*....|
gi 1907118313 718 iteAPGTTLVPKLPQQPDYPHPKPKTTRSP 747
Cdd:PRK10263 589 ---ANSGGPRPQVKEGIGPQLPRPKRIRVP 615
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
490-618 |
1.68e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 42.93 E-value: 1.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 490 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 568
Cdd:PRK07994 370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118313 569 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 618
Cdd:PRK07994 450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
996-1247 |
1.90e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 43.14 E-value: 1.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 996 PQKTIAPRQTTSMPPKLKTPhsRMPAKEPVPKEPLhTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT 1075
Cdd:PTZ00449 569 PSKIPTLSKKPEFPKDPKHP--KDPEEPKKPKRPR-SAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSP 645
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1076 FRVEPPKT--TIAPLETRGIPLIPVISPRPSQEELQTAMEETDQST----QELFTTKIPRTTELAKTTQAPHRLHTAPVR 1149
Cdd:PTZ00449 646 ERPEGPKIikSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTtvvlDESFESILKETLPETPGTPFTTPRPLPPKL 725
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1150 PRIPGRPHGRPAlnktttRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHRhsstR 1229
Cdd:PTZ00449 726 PRDEEFPFEPIG------DPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMK----R 795
|
250
....*....|....*....
gi 1907118313 1230 PVSP-ERRPLPPNNVTGKP 1247
Cdd:PTZ00449 796 PDSPsEHEDKPPGDHPSLP 814
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
480-628 |
2.08e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 42.49 E-value: 2.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 480 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 559
Cdd:PRK14950 351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118313 560 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 628
Cdd:PRK14950 427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
515-718 |
2.51e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 42.51 E-value: 2.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKP----GHHRL--RRPKTTRSPEVPKSKPALEPATVTPE--ILVPKIVPKP 586
Cdd:PRK14086 87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRAddRPPGLPRQDQLPTARPAYPAYQQRPEpgAWPRAADDYG 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 587 PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHS 666
Cdd:PRK14086 167 WQQQRLGFPPRAPYASPASYAPEQERDREPYD-----AGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRG 241
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907118313 667 KPADLGPItSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFII 718
Cdd:PRK14086 242 GPGPPERD-DAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKYTFDTFVI 292
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
491-779 |
2.63e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 42.29 E-value: 2.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PHA03369 362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPT 441
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 571 ATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaivtTTDIEPVITRTKASVTTLAPKPPRPRTHR 650
Cdd:PHA03369 442 NPYVMPISMANMVYPGHPQEHGHERKRKRGGELKEELIETLKLVK------KLKEEQESLAKELEATAHKSEIKKIAESE 515
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 651 QRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT------ 724
Cdd:PHA03369 516 FKNAGAKTAAANIEPNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTaealag 595
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118313 725 ---TLVPKLPQQPDYPHpkpktTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPITD 779
Cdd:PHA03369 596 aieTLLTQASAQPAGLS-----LPAPAVPVNASTPASTPPPLAPQEPPQPGTSAPSLE 648
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
491-766 |
2.71e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.45 E-value: 2.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154 172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPvtfgseapalaivtttdiepvitrtkasvttlAPKPPRPRT 648
Cdd:pfam03154 252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQ--------------------------------HPVPPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 649 hrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPkPQTTPHPEVPhtilVPATSLEPfiiteaPGTTLVP 728
Cdd:pfam03154 300 ----LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-PREQPLPPAP----LSMPHIKP------PPTTPIP 364
|
250 260 270
....*....|....*....|....*....|....*...
gi 1907118313 729 KLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 766
Cdd:pfam03154 365 QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSL 402
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
428-770 |
2.84e-03 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 42.34 E-value: 2.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665 208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665 288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665 364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665 416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665 496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
469-600 |
3.59e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.90 E-value: 3.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 469 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 544
Cdd:PRK07764 371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118313 545 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 600
Cdd:PRK07764 449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
|
|
| PRK14954 |
PRK14954 |
DNA polymerase III subunits gamma and tau; Provisional |
515-612 |
4.31e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184918 [Multi-domain] Cd Length: 620 Bit Score: 41.85 E-value: 4.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 515 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 594
Cdd:PRK14954 385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
|
90
....*....|....*...
gi 1907118313 595 RPEVPQVKPAhepVTFGS 612
Cdd:PRK14954 451 PRNVASGKPG---VDLGS 465
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
531-802 |
4.81e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 4.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 531 PSAQTTKAPRKTKKPGHHRlrrpKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPqvkPAHEPVTF 610
Cdd:PHA03247 255 PAPPPVVGEGADRAPETAR----GATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA---PAGDAEEE 327
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 611 GSEAPALAIVTttdiePVitrtkasvttlapkpPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRp 690
Cdd:PHA03247 328 DDEDGAMEVVS-----PL---------------PRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRR- 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 691 rpkpqTTPHPEVPHTiLVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:PHA03247 387 -----SARHAATPFA-RGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATE 460
|
250 260 270
....*....|....*....|....*....|...
gi 1907118313 771 VTTivPITDLERVTDLETPVAFRT-EAPGTTLA 802
Cdd:PHA03247 461 PAP--DDPDDATRKALDALRERRPpEPPGADLA 491
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
491-673 |
4.98e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.40 E-value: 4.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PRK12323 383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 571 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 640
Cdd:PRK12323 463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
|
170 180 190
....*....|....*....|....*....|...
gi 1907118313 641 PKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGP 673
Cdd:PRK12323 543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
|
|
| COG3979 |
COG3979 |
Chitodextrinase [Carbohydrate transport and metabolism]; |
1353-1452 |
5.17e-03 |
|
Chitodextrinase [Carbohydrate transport and metabolism];
Pssm-ID: 443178 [Multi-domain] Cd Length: 369 Bit Score: 40.91 E-value: 5.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 1353 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1431
Cdd:COG3979 2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
|
90 100
....*....|....*....|.
gi 1907118313 1432 nplgeGPASNTVAFSTESADP 1452
Cdd:COG3979 72 -----DAAGNVSAASGTSTAM 87
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
471-748 |
8.63e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 40.68 E-value: 8.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 471 ATLAPIEALFESrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtRPSAQttkAPRKTKKPGHHRL 550
Cdd:PLN03209 311 APLTPMEELLAK-------IPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPP-QPKAV---VPRPLSPYTAYED 379
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 551 RRPKTTRSPEVPKSKPA----------LEPATVTPEILVPKIVP--KPPQKPKATRRPEVPQVK-PAHEPVTFGSEAPAL 617
Cdd:PLN03209 380 LKPPTSPIPTPPSSSPAssksvdavakPAEPDVVPSPGSASNVPevEPAQVEAKKTRPLSPYARyEDLKPPTSPSPTAPT 459
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118313 618 AIVTTTDIEPVITRT------KASVTTLAPKPPRPRthrqrtkykttqsPKIPHSKPADLGPITSEPPLAsttkkvrrPR 691
Cdd:PLN03209 460 GVSPSVSSTSSVPAVpdtapaTAATDAAAPPPANMR-------------PLSPYAVYDDLKPPTSPSPAA--------PV 518
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907118313 692 PKPQTTPHPEVPhtilvPATSLEPFIITEAPGTTLVPKlpQQPDYPHP-----KPKTTRSPA 748
Cdd:PLN03209 519 GKVAPSSTNEVV-----KVGNSAPPTALADEQHHAQPK--PRPLSPYTmyedlKPPTSPTPS 573
|
|
|