NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568995610|ref|XP_006522325|]
View 

target of Nesh-SH3 isoform X13 [Mus musculus]

Protein Classification

fibronectin type III domain-containing protein( domain architecture ID 10440918)

fibronectin type III (FN3) domain-containing protein similar to human Target of Nesh-SH3 (Tarsh) and Drosophila melanogaster cytokine receptor (protein domeless)

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
456-1059 1.53e-14

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.53e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247 2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247 2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  616 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 693
Cdd:PHA03247 2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  694 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLKedPVTT 773
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPLP--PPTS 2833
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  774 IVPITDLERVTDLETPvafrtEAPGTTLVPAvvlEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPR 853
Cdd:PHA03247 2834 AQPTAPPPPPGPPPPS-----LPLGGSVAPG---GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPD 2903
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  854 VREPvtlrtetwvtTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPsalsTEVPATVVLATALTPvtlrtkAPK 933
Cdd:PHA03247 2904 QPER----------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP----TTDPAGAGEPSGAVP------QPW 2963
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  934 TTTLAPN---VQRTRRPHPRP--KTTASTGVSESKSAPTEL----QSLVLKPVTSPS-LEIIQSQSVSDDLElvafstes 1003
Cdd:PHA03247 2964 LGALVPGrvaVPRFRVPQPAPsrEAPASSTPPLTGHSLSRVsswaSSLALHEETDPPpVSLKQTLWPPDDTE-------- 3035
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995610 1004 pqKTIAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1059
Cdd:PHA03247 3036 --DSDADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1364-1455 3.22e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1364 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1441
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 568995610 1442 LGEGPASNTVAFST 1455
Cdd:cd00063    80 GGESPPSESVTVTT 93
PHA03247 super family cl33720
large tegument protein UL36; Provisional
919-1310 3.36e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  919 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 998
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  999 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1068
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1069 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1146
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1147 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1217
Cdd:PHA03247 2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1218 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1297
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
                         410
                  ....*....|...
gi 568995610 1298 EEEFGTTTDFSSS 1310
Cdd:PHA03247 2990 ASSTPPLTGHSLS 3002
fn3 pfam00041
Fibronectin type III domain;
116-195 1.91e-04

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 568995610   193 GVK 195
Cdd:pfam00041   72 RVQ 74
PHA03247 super family cl33720
large tegument protein UL36; Provisional
307-575 5.67e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 5.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247 2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247 2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  464 RTAEqPRATLAPiealfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTK 543
Cdd:PHA03247 2893 RSTE-SFALPPD--------------QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 568995610  544 KPGHHRLRRPKTTRSP----EVPKSKPALE-PATVTP 575
Cdd:PHA03247 2958 AVPQPWLGALVPGRVAvprfRVPQPAPSREaPASSTP 2994
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-1059 1.53e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.53e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247 2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247 2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  616 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 693
Cdd:PHA03247 2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  694 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLKedPVTT 773
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPLP--PPTS 2833
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  774 IVPITDLERVTDLETPvafrtEAPGTTLVPAvvlEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPR 853
Cdd:PHA03247 2834 AQPTAPPPPPGPPPPS-----LPLGGSVAPG---GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPD 2903
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  854 VREPvtlrtetwvtTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPsalsTEVPATVVLATALTPvtlrtkAPK 933
Cdd:PHA03247 2904 QPER----------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP----TTDPAGAGEPSGAVP------QPW 2963
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  934 TTTLAPN---VQRTRRPHPRP--KTTASTGVSESKSAPTEL----QSLVLKPVTSPS-LEIIQSQSVSDDLElvafstes 1003
Cdd:PHA03247 2964 LGALVPGrvaVPRFRVPQPAPsrEAPASSTPPLTGHSLSRVsswaSSLALHEETDPPpVSLKQTLWPPDDTE-------- 3035
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995610 1004 pqKTIAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1059
Cdd:PHA03247 3036 --DSDADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1364-1455 3.22e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1364 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1441
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 568995610 1442 LGEGPASNTVAFST 1455
Cdd:cd00063    80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1365-1445 9.72e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 9.72e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   1365 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1442
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 568995610   1443 GEG 1445
Cdd:smart00060   81 GEG 83
fn3 pfam00041
Fibronectin type III domain;
1365-1448 1.23e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  1365 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1441
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 568995610  1442 LGEGPAS 1448
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
919-1310 3.36e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  919 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 998
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  999 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1068
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1069 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1146
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1147 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1217
Cdd:PHA03247 2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1218 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1297
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
                         410
                  ....*....|...
gi 568995610 1298 EEEFGTTTDFSSS 1310
Cdd:PHA03247 2990 ASSTPPLTGHSLS 3002
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
384-758 7.48e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 7.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154  171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154  251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154  327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEP 678
Cdd:pfam03154  407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   679 PLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154  484 STSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1349-1460 1.17e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 1.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1349 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1428
Cdd:COG3401   220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                          90       100       110
                  ....*....|....*....|....*....|...
gi 568995610 1429 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1460
Cdd:COG3401   295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
fn3 pfam00041
Fibronectin type III domain;
116-195 1.91e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 568995610   193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.76e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.76e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610    114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 568995610    190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 3.00e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 3.00e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063     1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                  ....
gi 568995610  192 FGVK 195
Cdd:cd00063    72 FRVR 75
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-703 4.27e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.76  E-value: 4.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839  286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839  365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568995610  648 THRQRTKYKttqspkiPHSKPADLGPITSEPPLASTTKKVRRP---RPKPQTTPHPEVP 703
Cdd:NF033839  431 VKPQPEKPK-------PEVKPQPEKPKPEVKPQPETPKPEVKPqpeKPKPEVKPQPEKP 482
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-770 3.62e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.96  E-value: 3.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665   208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665   288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665   364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665   416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665   496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
PHA03247 PHA03247
large tegument protein UL36; Provisional
307-575 5.67e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 5.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247 2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247 2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  464 RTAEqPRATLAPiealfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTK 543
Cdd:PHA03247 2893 RSTE-SFALPPD--------------QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 568995610  544 KPGHHRLRRPKTTRSP----EVPKSKPALE-PATVTP 575
Cdd:PHA03247 2958 AVPQPWLGALVPGRVAvprfRVPQPAPSREaPASSTP 2994
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-1059 1.53e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.53e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247 2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247 2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  616 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 693
Cdd:PHA03247 2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  694 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLKedPVTT 773
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPLP--PPTS 2833
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  774 IVPITDLERVTDLETPvafrtEAPGTTLVPAvvlEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPR 853
Cdd:PHA03247 2834 AQPTAPPPPPGPPPPS-----LPLGGSVAPG---GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPD 2903
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  854 VREPvtlrtetwvtTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPsalsTEVPATVVLATALTPvtlrtkAPK 933
Cdd:PHA03247 2904 QPER----------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP----TTDPAGAGEPSGAVP------QPW 2963
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  934 TTTLAPN---VQRTRRPHPRP--KTTASTGVSESKSAPTEL----QSLVLKPVTSPS-LEIIQSQSVSDDLElvafstes 1003
Cdd:PHA03247 2964 LGALVPGrvaVPRFRVPQPAPsrEAPASSTPPLTGHSLSRVsswaSSLALHEETDPPpVSLKQTLWPPDDTE-------- 3035
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995610 1004 pqKTIAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1059
Cdd:PHA03247 3036 --DSDADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
PHA03247 PHA03247
large tegument protein UL36; Provisional
507-1037 3.35e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 78.83  E-value: 3.35e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  654 KYKTTQSPKIP-------HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 726
Cdd:PHA03247 2713 HALVSATPLPPgpaaarqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  727 VPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLKedPVTTIVPITDLERVTDLETPvafrtEAPGTTLVPAvv 806
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPLP--PPTSAQPTAPPPPPGPPPPS-----LPLGGSVAPG-- 2859
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  807 lEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREPvtlrtetwvtTKAPKTPKRTRRPRPKPQT 886
Cdd:PHA03247 2860 -GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER----------PPQPQAPPPPQPQPQPPPP 2926
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  887 TPTPETPLTKPVAATDLEPsalsTEVPATVVLATALTPvtlrtkAPKTTTLAPN---VQRTRRPHPRP--KTTASTGVSE 961
Cdd:PHA03247 2927 PQPQPPPPPPPRPQPPLAP----TTDPAGAGEPSGAVP------QPWLGALVPGrvaVPRFRVPQPAPsrEAPASSTPPL 2996
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  962 SKSAPTEL----QSLVLKPVTSPS-LEIIQSQSVSDDLElvAFSTESPQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKE 1036
Cdd:PHA03247 2997 TGHSLSRVsswaSSLALHEETDPPpVSLKQTLWPPDDTE--DSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPAT 3074

                  .
gi 568995610 1037 P 1037
Cdd:PHA03247 3075 P 3075
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
509-776 1.77e-12

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 72.41  E-value: 1.77e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449  542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  580 PKIVPKPPQKPKATRRPevpqvkpahepvtfgseapalaivtttdiepvitrtkasvttlaPKPPRPRTHRQRTKYKTTQ 659
Cdd:PTZ00449  620 IPKSPKRPESPKSPKRP--------------------------------------------PPPQRPSSPERPEGPKIIK 655
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  660 SPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHP 739
Cdd:PTZ00449  656 SPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFE 734
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 568995610  740 KPKTTRSPAASPTELVPTPVfEPVTPLKEDPVTTIVP 776
Cdd:PTZ00449  735 PIGDPDAEQPDDIEFFTPPE-EERTFFHETPADTPLP 770
PHA03247 PHA03247
large tegument protein UL36; Provisional
545-1169 5.63e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.04  E-value: 5.63e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  545 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-------------TFG 611
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasdDAG 2549
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  612 SEAPALAivttTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPitsepplasttkkvrrPR 691
Cdd:PHA03247 2550 DPPPPLP----PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD----------------PR 2609
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  692 PKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPV 771
Cdd:PHA03247 2610 GPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR 2689
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  772 TTIVPITDLERVTDLETPvafrTEAPGTTLVPAVVLEPVtlrpevqvttlapqkTQKKHRPSPKPKPVPSPEVTESKPVL 851
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPT----PEPAPHALVSATPLPPG---------------PAAARQASPALPAAPAPPAVPAGPAT 2750
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  852 PRVREPVTLRTetwvTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPSALSTEVPATVVLATALTPVTLR--T 929
Cdd:PHA03247 2751 PGGPARPARPP----TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASpaG 2826
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  930 KAPKTTTLAPNVQRTRRPHPRPKTTASTGVSE----SKSAPTelQSLVLKPVTS--PSLEIIQSQSVSDDLELVAFSTES 1003
Cdd:PHA03247 2827 PLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPS--RSPAAKPAAParPPVRRLARPAVSRSTESFALPPDQ 2904
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1004 PQKtiaPRQTTSMPPKLKTPHSRMPAKEPVPKEPlhttskPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT 1083
Cdd:PHA03247 2905 PER---PPQPQAPPPPQPQPQPPPPPQPQPPPPP------PPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP 2975
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1084 -FRVEPPKTTIAPLETRGIPLIPVISPRPSQEELQTAM-EETDQSTQELFTT-KIPRTTELAKTTQA----PHRLHTAPV 1156
Cdd:PHA03247 2976 rFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTlWPPDDTEDSDADSLfdsdSERSDLEAL 3055
                         650
                  ....*....|...
gi 568995610 1157 RPrIPGRPHGRPA 1169
Cdd:PHA03247 3056 DP-LPPEPHDPFA 3067
PHA03247 PHA03247
large tegument protein UL36; Provisional
448-743 2.76e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 65.73  E-value: 2.76e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  448 TATRDPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEP 527
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  528 ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKP-ALEPATVTPEILVP---KIVPKPPQKPKATRRPEVPQVKP 603
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPaAALPPAASPAGPLPpptSAQPTAPPPPPGPPPPSLPLGGS 2855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  604 AHEPVTFGSEAPALAIVTTtdiepVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAST 683
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAK-----PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995610  684 TKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKL---PQQPDYPHPKPKT 743
Cdd:PHA03247 2931 PPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFrvpQPAPSREAPASST 2993
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1364-1455 3.22e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1364 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1441
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 568995610 1442 LGEGPASNTVAFST 1455
Cdd:cd00063    80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1365-1445 9.72e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 9.72e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   1365 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1442
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 568995610   1443 GEG 1445
Cdd:smart00060   81 GEG 83
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
381-767 1.44e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 56.62  E-value: 1.44e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  381 KRFPEFPEAKTAFPLEKPRGSWASSEEPwVVPGAKTSEdSRVVQPQTATYDVISSSTTSDETEIEI---------HTATR 451
Cdd:PTZ00449  494 KKLAPIEEEDSDKHDEPPEGPEASGLPP-KAPGDKEGE-EGEHEDSKESDEPKEGGKPGETKEGEVgkkpgpakeHKPSK 571
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  452 DPILDSVP-----PKTSRTAEQPRATLAPIEAlfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRvkpAPE 526
Cdd:PTZ00449  572 IPTLSKKPefpkdPKHPKDPEEPKKPKRPRSA-----------QRPTRPKSPKLPELLDIPKSPKRPESPKSPK---RPP 637
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  527 PETRPSAQttkaprktkkpghhrlRRPKTTRSPEVPKSKPAlepatvtpeilvpkivPKPPQKPKATRRPEVPQVKPAHE 606
Cdd:PTZ00449  638 PPQRPSSP----------------ERPEGPKIIKSPKPPKS----------------PKPPFDPKFKEKFYDDYLDAAAK 685
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  607 PVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRThrqrtkykttqsPKIPHSKPADlgpitsePPLASTTK 685
Cdd:PTZ00449  686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPkLPRD------------EEFPFEPIGD-------PDAEQPDD 746
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  686 KVRRPRPKPQTTPHPEvphtilVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVT 764
Cdd:PTZ00449  747 IEFFTPPEEERTFFHE------TPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSL 813

                  ...
gi 568995610  765 PLK 767
Cdd:PTZ00449  814 PKK 816
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
511-1049 1.45e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 56.62  E-value: 1.45e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  511 KRQSTPKPPRVKPAPE--PETRPSAQTTKAPRK-TKKPGHHRlrRPKTTRSPEVPKsKPAlePATVTPEILVPKIVPKP- 586
Cdd:PTZ00449  506 KHDEPPEGPEASGLPPkaPGDKEGEEGEHEDSKeSDEPKEGG--KPGETKEGEVGK-KPG--PAKEHKPSKIPTLSKKPe 580
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  587 -PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRTHRQRTKYKTTQSPKIP 664
Cdd:PTZ00449  581 fPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPqRPSSPERPEGPKIIKSPKPP 660
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  665 HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHPKPKTT 744
Cdd:PTZ00449  661 KSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFEPIGDP 739
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  745 RSPAASPTELVPTPVFEPVtplkedpvttivpitdlervtdletpvaFRTEAPGTTLVPAVVLEPVTlRPEVQVTTLAPQ 824
Cdd:PTZ00449  740 DAEQPDDIEFFTPPEEERT----------------------------FFHETPADTPLPDILAEEFK-EEDIHAETGEPD 790
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  825 KTQKKhrpspkpkpvpspevteskPVLPRVREPVTlrtetwvTTKAPKTPKRTRRPRPKPqttptpetpltkpVAATDLE 904
Cdd:PTZ00449  791 EAMKR-------------------PDSPSEHEDKP-------PGDHPSLPKKRHRLDGLA-------------LSTTDLE 831
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  905 PSalstevPATVVLATALTPVTLRtkapktttlapnvqrtrrphprpkttastgvsESKSApTELQSLVLKPVTSPSLEI 984
Cdd:PTZ00449  832 SD------AGRIAKDASGKIVKLK--------------------------------RSKSF-DDLTTVEEAEEMGAEARK 872
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568995610  985 IqsqSVSDDlelvafSTESPQKTIAPrqttSMPPKLKTPHSRMPAKEPVPKEPLHTTSKPKMPPS 1049
Cdd:PTZ00449  873 I---VVDDD------GTEADDEDTHP----PEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDS 924
fn3 pfam00041
Fibronectin type III domain;
1365-1448 1.23e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  1365 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1441
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 568995610  1442 LGEGPAS 1448
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
919-1310 3.36e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  919 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 998
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  999 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1068
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1069 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1146
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1147 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1217
Cdd:PHA03247 2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1218 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1297
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
                         410
                  ....*....|...
gi 568995610 1298 EEEFGTTTDFSSS 1310
Cdd:PHA03247 2990 ASSTPPLTGHSLS 3002
PHA03377 PHA03377
EBNA-3C; Provisional
517-703 5.74e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 51.21  E-value: 5.74e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  517 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 593
Cdd:PHA03377  414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  594 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPK---IPHSKPAD 670
Cdd:PHA03377  489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGFQRSGRRQKratPPKVSPSD 559
                         170       180       190
                  ....*....|....*....|....*....|...
gi 568995610  671 LGPITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:PHA03377  560 RGPPKASPPVMAPPSTGPRVMATPSTGPRDMAP 592
PHA03247 PHA03247
large tegument protein UL36; Provisional
1147-1367 4.05e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 4.05e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1147 APHRLHTAPVRPRIPGRPHGRP---ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVDSHATRKPGSVSG 1223
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPsepAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1224 TRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGT 1303
Cdd:PHA03247 2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568995610 1304 TTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1367
Cdd:PHA03247 2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
384-758 7.48e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 7.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154  171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154  251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154  327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEP 678
Cdd:pfam03154  407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   679 PLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154  484 STSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
PHA03247 PHA03247
large tegument protein UL36; Provisional
1017-1366 1.06e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 1.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1017 PPKLKTPHSRMPAKE---------PVPKEPLhTTSKPKMPPSPEVADTTSVPKDERlslkPDPEVTHSETVLPPVTFRVE 1087
Cdd:PHA03247 2551 PPPPLPPAAPPAAPDrsvppprpaPRPSEPA-VTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPD 2625
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1088 PPKTTIAPLETRGIPLIPVISPRPSQEELQTAMEETDQSTQelfTTKIPRTTELAKTTQAPHRLHTAP--------VRPR 1159
Cdd:PHA03247 2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR---ARRLGRAAQASSPPQRPRRRAARPtvgsltslADPP 2702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1160 IPGR-PHGRP---------------ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSG 1223
Cdd:PHA03247 2703 PPPPtPEPAPhalvsatplppgpaaARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1224 TRRPPIPHRHSSTRPVSPERRPLPPNNVTgkPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGt 1303
Cdd:PHA03247 2783 LTRPAVASLSESRESLPSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG- 2859
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568995610 1304 tTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSIT-----DSVRRFPTEEATEGNATSPPQNPP 1366
Cdd:PHA03247 2860 -GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalppDQPERPPQPQAPPPPQPQPQPPPP 2926
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1349-1460 1.17e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 1.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1349 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1428
Cdd:COG3401   220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                          90       100       110
                  ....*....|....*....|....*....|...
gi 568995610 1429 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1460
Cdd:COG3401   295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1349-1503 1.59e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.53  E-value: 1.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1349 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1426
Cdd:COG3401   314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568995610 1427 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1503
Cdd:COG3401   388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
fn3 pfam00041
Fibronectin type III domain;
116-195 1.91e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 568995610   193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.76e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.76e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610    114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 568995610    190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 3.00e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 3.00e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063     1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                  ....
gi 568995610  192 FGVK 195
Cdd:cd00063    72 FRVR 75
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-703 4.27e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.76  E-value: 4.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839  286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839  365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568995610  648 THRQRTKYKttqspkiPHSKPADLGPITSEPPLASTTKKVRRP---RPKPQTTPHPEVP 703
Cdd:NF033839  431 VKPQPEKPK-------PEVKPQPEKPKPEVKPQPETPKPEVKPqpeKPKPEVKPQPEKP 482
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
515-626 5.08e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.80  E-value: 5.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 594
Cdd:PRK14950  361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
                          90       100       110
                  ....*....|....*....|....*....|..
gi 568995610  595 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 626
Cdd:PRK14950  434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
553-775 9.54e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 43.76  E-value: 9.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  553 PKTTRSPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATRRPEVP-----QVKPAHEPVTFGSEAPALAIVTTTDIEP 627
Cdd:PLN03209  330 PKESDAADGPKPVP---TKPVTPEAPSPPIEEEPPQPKAVVPRPLSPytayeDLKPPTSPIPTPPSSSPASSKSVDAVAK 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  628 VITRTKASVTTLAPKPPRPRTHRQRTKyktTQSPKIPHSKPADLGPITSepplasttkkvrrPRPKPQTTPHPEVPHTIL 707
Cdd:PLN03209  407 PAEPDVVPSPGSASNVPEVEPAQVEAK---KTRPLSPYARYEDLKPPTS-------------PSPTAPTGVSPSVSSTSS 470
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568995610  708 VPATSLEP----FIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIV 775
Cdd:PLN03209  471 VPAVPDTApataATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
PHA03378 PHA03378
EBNA-3B; Provisional
491-705 1.10e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 1.10e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  491 PEVRPTTAapQQTTSIPSTPKRqSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGH----HRLRRPKTTRSPEV---PK 563
Cdd:PHA03378  576 PLTSPTTS--QLASSAPSYAQT-PWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRpipmRPLRMQPITFNVLVfptPH 652
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  564 SKPALEPATVTPEILVPKIVP-----------KPPQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRT 632
Cdd:PHA03378  653 QPPQVEITPYKPTWTQIGHIPyqpsptgantmLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPG 732
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995610  633 KASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHT 705
Cdd:PHA03378  733 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPT 805
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
458-771 1.46e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 43.30  E-value: 1.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  458 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 527
Cdd:PRK07003  372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  528 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 596
Cdd:PRK07003  451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  597 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTK--------ASVTTLAPKPPRPRTHRQrtkyktTQSPKIPHS 666
Cdd:PRK07003  531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAraaaaakpAAAPAAAPKPAAPRVAVQ------VPTPRARAA 604
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  667 KPADLGPITSEPPLASTTkkvRRPRPkpqttPHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRS 746
Cdd:PRK07003  605 TGDAPPNGAARAEQAAES---RGAPP-----PWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADA 673
                         330       340
                  ....*....|....*....|....*
gi 568995610  747 PAAsPTELVPTPvfePVTPLkeDPV 771
Cdd:PRK07003  674 PAP-PVDTRPLP---PAIPL--DAI 692
PRK10263 PRK10263
DNA translocase FtsK; Provisional
420-747 1.50e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.54  E-value: 1.50e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  420 SRVVQPQTATYDVI---------SSSTTSDETEIEIHTATRDPILDSVPPKTSRTA-EQPRATLAPIEALFESRNVeIFT 489
Cdd:PRK10263  297 NRATQPEYDEYDPLlngapitepVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPpAQPTVAWQPVPGPQTGEPV-IAP 375
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  490 SPEVRPTTAAPQQTTSIPSTPKRQSTP--KPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:PRK10263  376 APEGYPQQSQYAQPAVQYNEPLQQPVQpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQST 455
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  568 LEP-ATVTPEILVPKIVPKPP---------QKPKATRRPEVPQVKPAHEPVTFGSEapalaivtttdIEPVITRTKASVT 637
Cdd:PRK10263  456 FAPqSTYQTEQTYQQPAAQEPlyqqpqpveQQPVVEPEPVVEETKPARPPLYYFEE-----------VEEKRAREREQLA 524
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  638 TLAPKPPRPrthrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRrprpkpQTTPHPEVPHTILVPATSLepfi 717
Cdd:PRK10263  525 AWYQPIPEP------VKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVK------KATLATGAAATVAAPVFSL---- 588
                         330       340       350
                  ....*....|....*....|....*....|
gi 568995610  718 iteAPGTTLVPKLPQQPDYPHPKPKTTRSP 747
Cdd:PRK10263  589 ---ANSGGPRPQVKEGIGPQLPRPKRIRVP 615
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1004-1255 1.55e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.14  E-value: 1.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1004 PQKTIAPRQTTSMPPKLKTPhsRMPAKEPVPKEPLhTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT 1083
Cdd:PTZ00449  569 PSKIPTLSKKPEFPKDPKHP--KDPEEPKKPKRPR-SAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSP 645
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1084 FRVEPPKT--TIAPLETRGIPLIPVISPRPSQEELQTAMEETDQST----QELFTTKIPRTTELAKTTQAPHRLHTAPVR 1157
Cdd:PTZ00449  646 ERPEGPKIikSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTtvvlDESFESILKETLPETPGTPFTTPRPLPPKL 725
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1158 PRIPGRPHGRPAlnktttRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHRhsstR 1237
Cdd:PTZ00449  726 PRDEEFPFEPIG------DPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMK----R 795
                         250
                  ....*....|....*....
gi 568995610 1238 PVSP-ERRPLPPNNVTGKP 1255
Cdd:PTZ00449  796 PDSPsEHEDKPPGDHPSLP 814
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
490-618 1.84e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.93  E-value: 1.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  490 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 568
Cdd:PRK07994  370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995610  569 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 618
Cdd:PRK07994  450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
491-766 2.13e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 2.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPvtfgseapalaivtttdiepvitrtkasvttlAPKPPRPRT 648
Cdd:pfam03154  252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQ--------------------------------HPVPPQPFP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610   649 hrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPkPQTTPHPEVPhtilVPATSLEPfiiteaPGTTLVP 728
Cdd:pfam03154  300 ----LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-PREQPLPPAP----LSMPHIKP------PPTTPIP 364
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 568995610   729 KLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 766
Cdd:pfam03154  365 QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSL 402
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
480-628 2.17e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.49  E-value: 2.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  480 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 559
Cdd:PRK14950  351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995610  560 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 628
Cdd:PRK14950  427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
dnaA PRK14086
chromosomal replication initiator protein DnaA;
515-718 2.52e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.51  E-value: 2.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKP----GHHRL--RRPKTTRSPEVPKSKPALEPATVTPE--ILVPKIVPKP 586
Cdd:PRK14086   87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRAddRPPGLPRQDQLPTARPAYPAYQQRPEpgAWPRAADDYG 166
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  587 PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHS 666
Cdd:PRK14086  167 WQQQRLGFPPRAPYASPASYAPEQERDREPYD-----AGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRG 241
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 568995610  667 KPADLGPItSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFII 718
Cdd:PRK14086  242 GPGPPERD-DAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKYTFDTFVI 292
PHA03369 PHA03369
capsid maturational protease; Provisional
491-779 2.67e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 42.29  E-value: 2.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PHA03369  362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPT 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  571 ATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaivtTTDIEPVITRTKASVTTLAPKPPRPRTHR 650
Cdd:PHA03369  442 NPYVMPISMANMVYPGHPQEHGHERKRKRGGELKEELIETLKLVK------KLKEEQESLAKELEATAHKSEIKKIAESE 515
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  651 QRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT------ 724
Cdd:PHA03369  516 FKNAGAKTAAANIEPNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTaealag 595
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 568995610  725 ---TLVPKLPQQPDYPHpkpktTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPITD 779
Cdd:PHA03369  596 aieTLLTQASAQPAGLS-----LPAPAVPVNASTPASTPPPLAPQEPPQPGTSAPSLE 648
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
469-600 3.61e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 3.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  469 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 544
Cdd:PRK07764  371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995610  545 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 600
Cdd:PRK07764  449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-770 3.62e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.96  E-value: 3.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665   208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665   288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665   364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665   416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665   496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
PHA03378 PHA03378
EBNA-3B; Provisional
467-871 4.09e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.98  E-value: 4.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  467 EQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPstpkrQSTPKPPRVKPAPEP-ETRPSAQTTKAPR----- 540
Cdd:PHA03378  345 EAVRLPDDPIIVEDDDESEEIESECDPDEDKSGAEALASIP-----QTLPDPPTVYGRPKVfARKADLKSTKKCRaivtd 419
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  541 ---------KTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKP--PQKPKATrrpevPQVKPA--HEP 607
Cdd:PHA03378  420 psvikaieeEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQPLEGPTGPLSVQAPlePWQPLPH-----PQVTPVilHQP 494
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  608 VTFGSEAP-ALAIVTTTDIEPVITRTKAsvTTLAPKPPRPRTHRQ-----------RTKYKTTQSPKIPHSKPAD-LGPI 674
Cdd:PHA03378  495 PAQGVQAHgSMLDLLEKDDEDMEQRVMA--TLLPPSPPQPRAGRRapcvytedldiESDEPASTEPVHDQLLPAPgLGPL 572
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  675 TSEPPLASTTKKVRRPRPKPQTTPHPeVPHtilvPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTEL 754
Cdd:PHA03378  573 QIQPLTSPTTSQLASSAPSYAQTPWP-VPH----PSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLV 647
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  755 VPTPVFEP-VTPLKEDPVTTIVPITDLERVTDLETPVAFRTEAPGTTLVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPS 833
Cdd:PHA03378  648 FPTPHQPPqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPP 727
                         410       420       430
                  ....*....|....*....|....*....|....*...
gi 568995610  834 PKPKPVPSPEVTESKPVLPRVREPVTLRTETWVTTKAP 871
Cdd:PHA03378  728 AAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 765
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
515-612 4.19e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.85  E-value: 4.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  515 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 594
Cdd:PRK14954  385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
                          90
                  ....*....|....*...
gi 568995610  595 RPEVPQVKPAhepVTFGS 612
Cdd:PRK14954  451 PRNVASGKPG---VDLGS 465
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
491-673 5.14e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.40  E-value: 5.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PRK12323  383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  571 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 640
Cdd:PRK12323  463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
                         170       180       190
                  ....*....|....*....|....*....|...
gi 568995610  641 PKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGP 673
Cdd:PRK12323  543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
PHA03247 PHA03247
large tegument protein UL36; Provisional
307-575 5.67e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 5.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247 2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247 2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  464 RTAEqPRATLAPiealfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTK 543
Cdd:PHA03247 2893 RSTE-SFALPPD--------------QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 568995610  544 KPGHHRLRRPKTTRSP----EVPKSKPALE-PATVTP 575
Cdd:PHA03247 2958 AVPQPWLGALVPGRVAvprfRVPQPAPSREaPASSTP 2994
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
489-751 6.02e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 6.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  489 TSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPK----- 563
Cdd:PHA03307   88 PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVasdaa 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  564 ----------SKPALEPATVTPEILVPKIVPKPPQKPKAtRRPEVPQVKPAHEPVTFGSEAPALAIVTTTD--IEPVITR 631
Cdd:PHA03307  168 ssrqaalplsSPEETARAPSSPPAEPPPSTPPAAASPRP-PRRSSPISASASSPAPAPGRSAADDAGASSSdsSSSESSG 246
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  632 TKASVTTLAPKpPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKvRRPRPKPQTTPHPEVPHTILVPAT 711
Cdd:PHA03307  247 CGWGPENECPL-PRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSP-SSPGSGPAPSSPRASSSSSSSRES 324
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|.
gi 568995610  712 SLE-PFIITEAPGTTLVPklPQQPDYPHPKPKTTRSPAASP 751
Cdd:PHA03307  325 SSSsTSSSSESSRGAAVS--PGPSPSRSPSPSRPPPPADPS 363
COG3979 COG3979
Chitodextrinase [Carbohydrate transport and metabolism];
1361-1460 6.02e-03

Chitodextrinase [Carbohydrate transport and metabolism];


Pssm-ID: 443178 [Multi-domain]  Cd Length: 369  Bit Score: 40.91  E-value: 6.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1361 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1439
Cdd:COG3979     2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
                          90       100
                  ....*....|....*....|.
gi 568995610 1440 nplgeGPASNTVAFSTESADP 1460
Cdd:COG3979    72 -----DAAGNVSAASGTSTAM 87
PHA03247 PHA03247
large tegument protein UL36; Provisional
531-776 6.07e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 6.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  531 PSAQTTKAPRKTKKPGHHRlrrpKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPqvkPAHEPVTF 610
Cdd:PHA03247  255 PAPPPVVGEGADRAPETAR----GATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA---PAGDAEEE 327
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  611 GSEAPALAIVTttdiePVitrtkasvttlapkpPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRp 690
Cdd:PHA03247  328 DDEDGAMEVVS-----PL---------------PRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRR- 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  691 rpkpqTTPHPEVPHTiLVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:PHA03247  387 -----SARHAATPFA-RGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATE 460

                  ....*.
gi 568995610  771 VTTIVP 776
Cdd:PHA03247  461 PAPDDP 466
PRK10263 PRK10263
DNA translocase FtsK; Provisional
669-1168 7.85e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 7.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  669 ADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPhtilvpatslEPFIITEAPGTTLVPKLPqQPDYPHPKPKTTRSPA 748
Cdd:PRK10263  335 APVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTG----------EPVIAPAPEGYPQQSQYA-QPAVQYNEPLQQPVQP 403
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  749 ASPTELVPTPVFEPVTPLKEDPVTTIVPITDLERVTDLETPVAFRTEAPGTTLVPAVVLEPVTLRPEvqvTTLAPQKTQK 828
Cdd:PRK10263  404 QQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQ---PAAQEPLYQQ 480
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  829 KHRPSPKPKPVPSPEVTESKPVLP-----------RVREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKP 897
Cdd:PRK10263  481 PQPVEQQPVVEPEPVVEETKPARPplyyfeeveekRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAA 560
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  898 VA--ATDLEPSALSTEVPATVVlATALTPVTLRTKAPKTTT-LAPNVQRTRRPH-PRPKTTASTGVS-ESKSAPTElqsl 972
Cdd:PRK10263  561 VSplASGVKKATLATGAAATVA-APVFSLANSGGPRPQVKEgIGPQLPRPKRIRvPTRRELASYGIKlPSQRAAEE---- 635
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  973 vlKPVTSPSLEIIQSQSVSDDlELVAFSTESPQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKEP------LHTTSKPKM 1046
Cdd:PRK10263  636 --KAREAQRNQYDSGDQYNDD-EIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAelarqfAQTQQQRYS 712
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610 1047 PPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPLETRGIPLIPViSPRPSQEELQTAMEETDQS 1126
Cdd:PRK10263  713 GEQPAGANPFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPV-APQPQYQQPQQPVAPQPQY 791
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|..
gi 568995610 1127 TQELFTTKIPRTTELAKTTQAPHRLHTAPVRPRIPGRPHGRP 1168
Cdd:PRK10263  792 QQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 833
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
471-748 9.94e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 40.29  E-value: 9.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  471 ATLAPIEALFESrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtRPSAQttkAPRKTKKPGHHRL 550
Cdd:PLN03209  311 APLTPMEELLAK-------IPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPP-QPKAV---VPRPLSPYTAYED 379
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  551 RRPKTTRSPEVPKSKPA----------LEPATVTPEILVPKIVP--KPPQKPKATRRPEVPQVK-PAHEPVTFGSEAPAL 617
Cdd:PLN03209  380 LKPPTSPIPTPPSSSPAssksvdavakPAEPDVVPSPGSASNVPevEPAQVEAKKTRPLSPYARyEDLKPPTSPSPTAPT 459
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995610  618 AIVTTTDIEPVITRT------KASVTTLAPKPPRPRthrqrtkykttqsPKIPHSKPADLGPITSEPPLAsttkkvrrPR 691
Cdd:PLN03209  460 GVSPSVSSTSSVPAVpdtapaTAATDAAAPPPANMR-------------PLSPYAVYDDLKPPTSPSPAA--------PV 518
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568995610  692 PKPQTTPHPEVPhtilvPATSLEPFIITEAPGTTLVPKlpQQPDYPHP-----KPKTTRSPA 748
Cdd:PLN03209  519 GKVAPSSTNEVV-----KVGNSAPPTALADEQHHAQPK--PRPLSPYTmyedlKPPTSPTPS 573
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH