Conserved domains on  [gi|1907118310|ref|XP_036015884|]

target of Nesh-SH3 isoform X14 [Mus musculus]

Protein Classification

fibronectin type III domain-containing protein (domain architecture ID 10440918)

fibronectin type III (FN3) domain-containing protein similar to human Target of Nesh-SH3 (Tarsh) and Drosophila melanogaster cytokine receptor (protein domeless)


List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
456-1055 1.59e-14

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.59e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247  2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247  2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  616 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 693
Cdd:PHA03247  2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  694 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLkedPVTT 773
Cdd:PHA03247  2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPL---PPPT 2832
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  774 IDLERVTDLETPVAFRTEAPGTTLVPAvvlEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREP 853
Cdd:PHA03247  2833 SAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER 2907
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  854 vtlrtetwvtTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPsalsTEVPATVVLATALTPvtlrtkAPKTTTL 933
Cdd:PHA03247  2908 ----------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP----TTDPAGAGEPSGAVP------QPWLGAL 2967
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  934 APN---VQRTRRPHPRP--KTTASTGVSESKSAPTEL----QSLVLKPVTSPS-LEIIQSQSVSDDLElvafstespqKT 1003
Cdd:PHA03247  2968 VPGrvaVPRFRVPQPAPsrEAPASSTPPLTGHSLSRVsswaSSLALHEETDPPpVSLKQTLWPPDDTE----------DS 3037
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907118310 1004 IAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1055
Cdd:PHA03247  3038 DADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1360-1451 3.06e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.



Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.06e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1360 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1437
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1907118310 1438 LGEGPASNTVAFST 1451
Cdd:cd00063     80 GGESPPSESVTVTT 93
PHA03247 super family cl33720
large tegument protein UL36; Provisional
915-1306 3.30e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.30e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  915 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 994
Cdd:PHA03247  2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  995 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1064
Cdd:PHA03247  2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1065 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1142
Cdd:PHA03247  2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1143 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1213
Cdd:PHA03247  2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1214 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1293
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
                          410
                   ....*....|...
gi 1907118310 1294 EEEFGTTTDFSSS 1306
Cdd:PHA03247  2990 ASSTPPLTGHSLS 3002
fn3 pfam00041
Fibronectin type III domain;
116-195 1.87e-04

Fibronectin type III domain;



Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1907118310  193 GVK 195
Cdd:pfam00041   72 RVQ 74
PHA03247 super family cl33720
large tegument protein UL36; Provisional
307-575 3.47e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 3.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247  2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  464 RTAE---QPRATLAPiealfeSRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtrPSAQTTKAPR 540
Cdd:PHA03247  2893 RSTEsfaLPPDQPER------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE--PSGAVPQPWL 2964
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907118310  541 KTKKPGHHRLRRpktTRSPEVPKSKPALEPATVTP 575
Cdd:PHA03247  2965 GALVPGRVAVPR---FRVPQPAPSREAPASSTPPL 2996
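Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch, assuming that exact field order and spelling as seen in this listing, of how such a line could be turned into a record. The names `SUMMARY_RE` and `parse_summary` are hypothetical helpers written for illustration; they are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# Assumption: the fields always occur in this order with these labels.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict of the summary fields, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),  # e.g. "Multi-domain"; None if the bracket is absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


# Summary line copied from the first PHA03247 hit in the listing.
line = "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.59e-14"
hit = parse_summary(line)
print(hit)
```

Collecting these records for every hit would allow, for example, sorting the hits by E-value or filtering on a score threshold without re-reading the alignments.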
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-1055 1.59e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.59e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247  2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247  2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  616 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 693
Cdd:PHA03247  2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  694 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLkedPVTT 773
Cdd:PHA03247  2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPL---PPPT 2832
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  774 IDLERVTDLETPVAFRTEAPGTTLVPAvvlEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREP 853
Cdd:PHA03247  2833 SAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER 2907
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  854 vtlrtetwvtTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPsalsTEVPATVVLATALTPvtlrtkAPKTTTL 933
Cdd:PHA03247  2908 ----------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP----TTDPAGAGEPSGAVP------QPWLGAL 2967
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  934 APN---VQRTRRPHPRP--KTTASTGVSESKSAPTEL----QSLVLKPVTSPS-LEIIQSQSVSDDLElvafstespqKT 1003
Cdd:PHA03247  2968 VPGrvaVPRFRVPQPAPsrEAPASSTPPLTGHSLSRVsswaSSLALHEETDPPpVSLKQTLWPPDDTE----------DS 3037
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907118310 1004 IAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1055
Cdd:PHA03247  3038 DADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1360-1451 3.06e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.06e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1360 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1437
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1907118310 1438 LGEGPASNTVAFST 1451
Cdd:cd00063     80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1361-1441 9.23e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 9.23e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  1361 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1438
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1907118310  1439 GEG 1441
Cdd:smart00060   81 GEG 83
fn3 pfam00041
Fibronectin type III domain;
1361-1444 1.18e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.18e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1361 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1437
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1907118310 1438 LGEGPAS 1444
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
915-1306 3.30e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.30e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  915 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 994
Cdd:PHA03247  2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  995 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1064
Cdd:PHA03247  2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1065 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1142
Cdd:PHA03247  2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1143 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1213
Cdd:PHA03247  2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1214 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1293
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
                          410
                   ....*....|...
gi 1907118310 1294 EEEFGTTTDFSSS 1306
Cdd:PHA03247  2990 ASSTPPLTGHSLS 3002
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
384-758 7.46e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 7.46e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154  171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154  251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154  327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEP 678
Cdd:pfam03154  407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  679 PLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154  484 STSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1345-1456 1.05e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 1.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1345 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1424
Cdd:COG3401    220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907118310 1425 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1456
Cdd:COG3401    295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
fn3 pfam00041
Fibronectin type III domain;
116-195 1.87e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1907118310  193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.68e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.68e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310   114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 1907118310   190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 3.02e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 3.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                   ....
gi 1907118310  192 FGVK 195
Cdd:cd00063     72 FRVR 75
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-703 4.26e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.76  E-value: 4.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839   286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839   365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118310  648 THRQRTKYKttqspkiPHSKPADLGPITSEPPLASTTKKVRRP---RPKPQTTPHPEVP 703
Cdd:NF033839   431 VKPQPEKPK-------PEVKPQPEKPKPEVKPQPETPKPEVKPqpeKPKPEVKPQPEKP 482
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-770 3.31e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 42.34  E-value: 3.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665    208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665    288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665    364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665    416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665    496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
PHA03247 PHA03247
large tegument protein UL36; Provisional
307-575 3.47e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 3.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247  2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  464 RTAE---QPRATLAPiealfeSRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtrPSAQTTKAPR 540
Cdd:PHA03247  2893 RSTEsfaLPPDQPER------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE--PSGAVPQPWL 2964
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907118310  541 KTKKPGHHRLRRpktTRSPEVPKSKPALEPATVTP 575
Cdd:PHA03247  2965 GALVPGRVAVPR---FRVPQPAPSREAPASSTPPL 2996
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
456-1055 1.59e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.59e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  456 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 535
Cdd:PHA03247  2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  536 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 615
Cdd:PHA03247  2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  616 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 693
Cdd:PHA03247  2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  694 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTElvpTPVFEPVTPLkedPVTT 773
Cdd:PHA03247  2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL---PPAASPAGPL---PPPT 2832
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  774 IDLERVTDLETPVAFRTEAPGTTLVPAvvlEPVTLRPEVQVTtlAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPRVREP 853
Cdd:PHA03247  2833 SAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRPPSRSP--AAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER 2907
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  854 vtlrtetwvtTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPsalsTEVPATVVLATALTPvtlrtkAPKTTTL 933
Cdd:PHA03247  2908 ----------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP----TTDPAGAGEPSGAVP------QPWLGAL 2967
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  934 APN---VQRTRRPHPRP--KTTASTGVSESKSAPTEL----QSLVLKPVTSPS-LEIIQSQSVSDDLElvafstespqKT 1003
Cdd:PHA03247  2968 VPGrvaVPRFRVPQPAPsrEAPASSTPPLTGHSLSRVsswaSSLALHEETDPPpVSLKQTLWPPDDTE----------DS 3037
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907118310 1004 IAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMPPSPEvADTTSVP 1055
Cdd:PHA03247  3038 DADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDPATPE-AGARESP 3083
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
509-851 4.79e-12

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 71.26  E-value: 4.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449   542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  580 PKIVPKPPQKPKATRRPevpqvkpahepvtfgseapalaivtttdiepvitrtkasvttlaPKPPRPRTHRQRTKYKTTQ 659
Cdd:PTZ00449   620 IPKSPKRPESPKSPKRP--------------------------------------------PPPQRPSSPERPEGPKIIK 655
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  660 SPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHP 739
Cdd:PTZ00449   656 SPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFE 734
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  740 KPKTTRSPAASPTELVPTPVFEPVtplkedpvttidlervtdletpvaFRTEAPGTTLVPAVVLEPVTlRPEVQVTTLAP 819
Cdd:PTZ00449   735 PIGDPDAEQPDDIEFFTPPEEERT------------------------FFHETPADTPLPDILAEEFK-EEDIHAETGEP 789
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1907118310  820 QKTQKKHRPSPKPKPVPspevTESKPVLPRVR 851
Cdd:PTZ00449   790 DEAMKRPDSPSEHEDKP----PGDHPSLPKKR 817
PHA03247 PHA03247
large tegument protein UL36; Provisional
398-765 9.52e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 70.74  E-value: 9.52e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  398 PRGSWASSEEPWVVPGAKTSEDSRVVQPQTAtydvissSTTSDETEIEIHTATRDPILDSVPPKTSRtaeqPRATLAPIE 477
Cdd:PHA03247  2604 DRGDPRGPAPPSPLPPDTHAPDPPPPSPSPA-------ANEPDPHPPPTVPPPERPRDDPAPGRVSR----PRRARRLGR 2672
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  478 ALFESRNVEIFTSPEVRPTTaAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGhhrlrRPKTTR 557
Cdd:PHA03247  2673 AAQASSPPQRPRRRAARPTV-GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPA-----PPAVPA 2746
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  558 SPEVPKS--KPALEPATVTPEILVPKIVPKPPQKPKATRrpevPQVKPAHEPVTFGSEAPALAIVTTtdiePVITRTKAS 635
Cdd:PHA03247  2747 GPATPGGpaRPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRESLPSPWDPADPPA----AVLAPAAAL 2818
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  636 VTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKP-----ADLGPITSEPPLASTTKKV-----------------RRPRPK 693
Cdd:PHA03247  2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvAPGGDVRRRPPSRSPAAKPaaparppvrrlarpavsRSTESF 2898
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907118310  694 PQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPhPKPKTTRSPAASPTELVPTPVFEPVTP 765
Cdd:PHA03247  2899 ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQPWLGALVP 2969
PHA03247 PHA03247
large tegument protein UL36; Provisional
507-1079 1.07e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.27  E-value: 1.07e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPghhrlrRPKTTRSPEVPKSKPALEPAtvtpeilvpkivP 584
Cdd:PHA03247  2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAP------------P 2614
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  585 KPPqkPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEpvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIP 664
Cdd:PHA03247  2615 SPL--PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRD-------------DPAPGRVSRPRRARRLGRAAQASSP 2679
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  665 HSKPAdlgPITSEPPLASTTKKVRRPRP--KPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPhPKPK 742
Cdd:PHA03247  2680 PQRPR---RRAARPTVGSLTSLADPPPPppTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP-GGPA 2755
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  743 TTRSPAASPTELVPTPVFEPVTPlkedPVTTIDLERVTDLETPVAFRTEAPGTTLVPAVVLEPVTLRPEVQ--VTTLAPQ 820
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAG----PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspAGPLPPP 2831
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  821 KTQKKHRPSPKPKPVPSPEVTESKpVLPrvREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPEtpltkpvaatdlE 900
Cdd:PHA03247  2832 TSAQPTAPPPPPGPPPPSLPLGGS-VAP--GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRST------------E 2896
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  901 PSALSTEVPATVVLATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTAST-GVSESKSAPTELQSLVLKPVTSPSLE 979
Cdd:PHA03247  2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPaGAGEPSGAVPQPWLGALVPGRVAVPR 2976
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  980 IIQSQSvSDDLELVAFSTESPQKTIAPR------------QTTSMPPKLK--------TPHSRMPAKEPVPKEPLHTTSK 1039
Cdd:PHA03247  2977 FRVPQP-APSREAPASSTPPLTGHSLSRvsswasslalheETDPPPVSLKqtlwppddTEDSDADSLFDSDSERSDLEAL 3055
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|
gi 1907118310 1040 PKMPPSPEvaDTTSVPKDERLSLKPDPEVTHSETVLPPVT 1079
Cdd:PHA03247  3056 DPLPPEPH--DPFAHEPDPATPEAGARESPSSQFGPPPLS 3093
PHA03247 PHA03247
large tegument protein UL36; Provisional
545-1165 1.75e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.50  E-value: 1.75e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  545 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-------------TFG 611
Cdd:PHA03247  2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasdDAG 2549
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  612 SEAPALAivttTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPR 691
Cdd:PHA03247  2550 DPPPPLP----PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  692 PKPQT-TPHP-EVPHTILVPATSLEPFIITEAPGTTLVPK---LPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 766
Cdd:PHA03247  2626 PPPPSpSPAAnEPDPHPPPTVPPPERPRDDPAPGRVSRPRrarRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPP 2705
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  767 KE-DPVTTIDLERVTDLETPVAFRTEAPGTTLVPAVvlepvtlrPEVQVTTLAPQKTQKKHRPSPKPkpvpspevTESKP 845
Cdd:PHA03247  2706 PTpEPAPHALVSATPLPPGPAAARQASPALPAAPAP--------PAVPAGPATPGGPARPARPPTTA--------GPPAP 2769
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  846 VLPRVREPVTLRTETwVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPSALSTEVPATVVLATALTPVTLRT 925
Cdd:PHA03247  2770 APPAAPAAGPPRRLT-RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  926 KAPKTTTLAPNVQRTRRPHPRPKTTASTGVSESKSaptelqSLVLKPVTSPSLEiiqsqsvsddlelvafSTESPQKTIA 1005
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV------RRLARPAVSRSTE----------------SFALPPDQPE 2906
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1006 PRQTTSMPPKLKTPHSRMPAKEPVPKEPlhttsKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT-FRVEP 1084
Cdd:PHA03247  2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPP-----PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQ 2981
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1085 PKTTIAPLETRGIPLIPVISPRPSQEELQTAM-EETDQSTQELFTT-KIPRTTELAKTTQA----PHRLHTAPVRPrIPG 1158
Cdd:PHA03247  2982 PAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTlWPPDDTEDSDADSLfdsdSERSDLEALDP-LPP 3060

                   ....*..
gi 1907118310 1159 RPHGRPA 1165
Cdd:PHA03247  3061 EPHDPFA 3067
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1360-1451 3.06e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.06e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1360 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1437
Cdd:cd00063      2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                           90
                   ....*....|....
gi 1907118310 1438 LGEGPASNTVAFST 1451
Cdd:cd00063     80 GGESPPSESVTVTT 93
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
381-784 3.72e-08

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 58.55  E-value: 3.72e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  381 KRFPEFPEAKTAFPLEKPRGSWASSEEPwVVPGAKTSEdSRVVQPQTATYDVISSSTTSDETEIEI---------HTATR 451
Cdd:PTZ00449   494 KKLAPIEEEDSDKHDEPPEGPEASGLPP-KAPGDKEGE-EGEHEDSKESDEPKEGGKPGETKEGEVgkkpgpakeHKPSK 571
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  452 DPILDSVP-----PKTSRTAEQPRATLAPIEAlfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRvkpAPE 526
Cdd:PTZ00449   572 IPTLSKKPefpkdPKHPKDPEEPKKPKRPRSA-----------QRPTRPKSPKLPELLDIPKSPKRPESPKSPK---RPP 637
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  527 PETRPSAQttkaprktkkpghhrlRRPKTTRSPEVPKSKPAlepatvtpeilvpkivPKPPQKPKATRRPEVPQVKPAHE 606
Cdd:PTZ00449   638 PPQRPSSP----------------ERPEGPKIIKSPKPPKS----------------PKPPFDPKFKEKFYDDYLDAAAK 685
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  607 PVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRThrqrtkykttqsPKIPHSKPADlgpitsePPLASTTK 685
Cdd:PTZ00449   686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPkLPRD------------EEFPFEPIGD-------PDAEQPDD 746
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  686 KVRRPRPKPQTTPHPEvphtilVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVT 764
Cdd:PTZ00449   747 IEFFTPPEEERTFFHE------TPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSL 813
                          410       420
                   ....*....|....*....|
gi 1907118310  765 PLKEDPVTTIDLErVTDLET 784
Cdd:PTZ00449   814 PKKRHRLDGLALS-TTDLES 832
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
511-1045 4.72e-08

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 58.16  E-value: 4.72e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  511 KRQSTPKPPRVKPAPE--PETRPSAQTTKAPRK-TKKPGHHRlrRPKTTRSPEVPKsKPAlePATVTPEILVPKIVPKP- 586
Cdd:PTZ00449   506 KHDEPPEGPEASGLPPkaPGDKEGEEGEHEDSKeSDEPKEGG--KPGETKEGEVGK-KPG--PAKEHKPSKIPTLSKKPe 580
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  587 -PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRTHRQRTKYKTTQSPKIP 664
Cdd:PTZ00449   581 fPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPqRPSSPERPEGPKIIKSPKPP 660
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  665 HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHPKPKTT 744
Cdd:PTZ00449   661 KSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFEPIGDP 739
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  745 RSPAASPTELVPTPVFEPVtplkedpvttidlervtdletpvaFRTEAPGTTLVPAVVLEPVTlRPEVQVTTLAPQKTQK 824
Cdd:PTZ00449   740 DAEQPDDIEFFTPPEEERT------------------------FFHETPADTPLPDILAEEFK-EEDIHAETGEPDEAMK 794
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  825 KhrpspkpkpvpspevteskPVLPRVREPVTlrtetwvTTKAPKTPKRTRRPRPKPqttptpetpltkpVAATDLEPSal 904
Cdd:PTZ00449   795 R-------------------PDSPSEHEDKP-------PGDHPSLPKKRHRLDGLA-------------LSTTDLESD-- 833
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  905 stevPATVVLATALTPVTLRtkapktttlapnvqrtrrphprpkttastgvsESKSApTELQSLVLKPVTSPSLEIIqsq 984
Cdd:PTZ00449   834 ----AGRIAKDASGKIVKLK--------------------------------RSKSF-DDLTTVEEAEEMGAEARKI--- 873
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907118310  985 SVSDDlelvafSTESPQKTIAPrqttSMPPKLKTPHSRMPAKEPVPKEPLHTTSKPKMPPS 1045
Cdd:PTZ00449   874 VVDDD------GTEADDEDTHP----PEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDS 924
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1361-1441 9.23e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 9.23e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  1361 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1438
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 1907118310  1439 GEG 1441
Cdd:smart00060   81 GEG 83
fn3 pfam00041
Fibronectin type III domain;
1361-1444 1.18e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.18e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1361 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1437
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 1907118310 1438 LGEGPAS 1444
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
915-1306 3.30e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.30e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  915 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 994
Cdd:PHA03247  2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  995 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEP-VPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKP 1064
Cdd:PHA03247  2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1065 DPEVTHSETVLPPVTFRVEPPKTTIAPLETR--GIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQ 1142
Cdd:PHA03247  2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRpaVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1143 APhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRK 1213
Cdd:PHA03247  2835 QP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1214 PGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPAS 1293
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAP 2989
                          410
                   ....*....|...
gi 1907118310 1294 EEEFGTTTDFSSS 1306
Cdd:PHA03247  2990 ASSTPPLTGHSLS 3002
PHA03377 PHA03377
EBNA-3C; Provisional
517-703 5.72e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 51.21  E-value: 5.72e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  517 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 593
Cdd:PHA03377   414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  594 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPK---IPHSKPAD 670
Cdd:PHA03377   489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGFQRSGRRQKratPPKVSPSD 559
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1907118310  671 LGPITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:PHA03377   560 RGPPKASPPVMAPPSTGPRVMATPSTGPRDMAP 592
PHA03247 PHA03247
large tegument protein UL36; Provisional
1143-1363 4.00e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 4.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1143 APHRLHTAPVRPRIPGRPHGRP---ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVDSHATRKPGSVSG 1219
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPsepAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPDTHAPDPPPPSPS 2632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1220 TRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGT 1299
Cdd:PHA03247  2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907118310 1300 TTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1363
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
384-758 7.46e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 7.46e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  384 PEFPEAKTAFPLEKPRGSWASSEEPWVVPGAKTSEDSRVVQPQTATYDVISSSTTSDETEIEIHTATRDPILDSV----- 458
Cdd:pfam03154  171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplq 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  459 -------PPKTSRTAEQPRATLAPIEALFESrnveIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPE----- 526
Cdd:pfam03154  251 pmtqpppPSQVSPQPLPQPSLHGQMPPMPHS----LQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqrih 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  527 -PETRPSAQTTKAPRKTKKP----GHHRLRRPKTTRSPEVPKSKPALEPATVT---PEILVPKIVPKPPQKPKATRRPEV 598
Cdd:pfam03154  327 tPPSQSQLQSQQPPREQPLPpaplSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpsPFQMNSNLPPPPALKPLSSLSTHH 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  599 PqvkPAHEPVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEP 678
Cdd:pfam03154  407 P---PSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  679 PLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAPgttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154  484 STSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA-----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
PHA03247 PHA03247
large tegument protein UL36; Provisional
1012-1362 1.04e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 1.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1012 MPPKLK--TPHSRMPAKEPVPKEP-LHTTSKPKMPPSPEVADTTSVPKDERlslkPDPEVTHSETVLPPVTFRVEPPKTT 1088
Cdd:PHA03247  2555 LPPAAPpaAPDRSVPPPRPAPRPSePAVTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1089 IAPLETRGIPLIPVISPRPSQEELQTAMEETDQSTQelfTTKIPRTTELAKTTQAPHRLHTAP--------VRPRIPGR- 1159
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR---ARRLGRAAQASSPPQRPRRRAARPtvgsltslADPPPPPPt 2707
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1160 PHGRP---------------ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPP 1224
Cdd:PHA03247  2708 PEPAPhalvsatplppgpaaARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1225 IPHRHSSTRPVSPERRPLPPNNVTgkPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGttTDFS 1304
Cdd:PHA03247  2788 VASLSESRESLPSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG--GDVR 2863
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118310 1305 SSPTKETDPLGKPRFIGPHVRYIPKPENKPCSIT-----DSVRRFPTEEATEGNATSPPQNPP 1362
Cdd:PHA03247  2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalppDQPERPPQPQAPPPPQPQPQPPPP 2926
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1345-1456 1.05e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 1.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1345 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1424
Cdd:COG3401    220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1907118310 1425 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1456
Cdd:COG3401    295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1345-1499 1.44e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.53  E-value: 1.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1345 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1422
Cdd:COG3401    314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907118310 1423 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1499
Cdd:COG3401    388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
fn3 pfam00041
Fibronectin type III domain;
116-195 1.87e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 1907118310  193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.68e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.68e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310   114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 1907118310   190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 3.02e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 3.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                   ....
gi 1907118310  192 FGVK 195
Cdd:cd00063     72 FRVR 75
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-703 4.26e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.76  E-value: 4.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839   286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839   365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907118310  648 THRQRTKYKttqspkiPHSKPADLGPITSEPPLASTTKKVRRP---RPKPQTTPHPEVP 703
Cdd:NF033839   431 VKPQPEKPK-------PEVKPQPEKPKPEVKPQPETPKPEVKPqpeKPKPEVKPQPEKP 482
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
515-626 4.86e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.80  E-value: 4.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 594
Cdd:PRK14950   361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1907118310  595 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 626
Cdd:PRK14950   434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
PHA03378 PHA03378
EBNA-3B; Provisional
491-705 1.09e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 1.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  491 PEVRPTTAapQQTTSIPSTPKRqSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGH----HRLRRPKTTRSPEV---PK 563
Cdd:PHA03378   576 PLTSPTTS--QLASSAPSYAQT-PWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRpipmRPLRMQPITFNVLVfptPH 652
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  564 SKPALEPATVTPEILVPKIVP-----------KPPQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRT 632
Cdd:PHA03378   653 QPPQVEITPYKPTWTQIGHIPyqpsptgantmLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPG 732
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118310  633 KASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHT 705
Cdd:PHA03378   733 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPT 805
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
553-773 1.22e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 43.38  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  553 PKTTRSPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATRRPEVP-----QVKPAHEPVTFGSEAPALAIVTTTDIEP 627
Cdd:PLN03209   330 PKESDAADGPKPVP---TKPVTPEAPSPPIEEEPPQPKAVVPRPLSPytayeDLKPPTSPIPTPPSSSPASSKSVDAVAK 406
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  628 VITRTKASVTTLAPKPPRPRTHRQRTKyktTQSPKIPHSKPADLGPITSepplasttkkvrrPRPKPQTTPHPEVPHTIL 707
Cdd:PLN03209   407 PAEPDVVPSPGSASNVPEVEPAQVEAK---KTRPLSPYARYEDLKPPTS-------------PSPTAPTGVSPSVSSTSS 470
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  708 VPATSLEP----FIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTT 773
Cdd:PLN03209   471 VPAVPDTApataATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTA 540
PRK10263 PRK10263
DNA translocase FtsK; Provisional
420-747 1.45e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.54  E-value: 1.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  420 SRVVQPQTATYDVI---------SSSTTSDETEIEIHTATRDPILDSVPPKTSRTA-EQPRATLAPIEALFESRNVeIFT 489
Cdd:PRK10263   297 NRATQPEYDEYDPLlngapitepVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPpAQPTVAWQPVPGPQTGEPV-IAP 375
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  490 SPEVRPTTAAPQQTTSIPSTPKRQSTP--KPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:PRK10263   376 APEGYPQQSQYAQPAVQYNEPLQQPVQpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQST 455
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  568 LEP-ATVTPEILVPKIVPKPP---------QKPKATRRPEVPQVKPAHEPVTFGSEapalaivtttdIEPVITRTKASVT 637
Cdd:PRK10263   456 FAPqSTYQTEQTYQQPAAQEPlyqqpqpveQQPVVEPEPVVEETKPARPPLYYFEE-----------VEEKRAREREQLA 524
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  638 TLAPKPPRPrthrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRrprpkpQTTPHPEVPHTILVPATSLepfi 717
Cdd:PRK10263   525 AWYQPIPEP------VKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVK------KATLATGAAATVAAPVFSL---- 588
                          330       340       350
                   ....*....|....*....|....*....|
gi 1907118310  718 iteAPGTTLVPKLPQQPDYPHPKPKTTRSP 747
Cdd:PRK10263   589 ---ANSGGPRPQVKEGIGPQLPRPKRIRVP 615
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
458-771 1.47e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 43.30  E-value: 1.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  458 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 527
Cdd:PRK07003   372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  528 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 596
Cdd:PRK07003   451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  597 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTK--------ASVTTLAPKPPRPRTHRQrtkyktTQSPKIPHS 666
Cdd:PRK07003   531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAraaaaakpAAAPAAAPKPAAPRVAVQ------VPTPRARAA 604
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  667 KPADLGPITSEPPLASTTkkvRRPRPkpqttPHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRS 746
Cdd:PRK07003   605 TGDAPPNGAARAEQAAES---RGAPP-----PWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADA 673
                          330       340
                   ....*....|....*....|....*
gi 1907118310  747 PAAsPTELVPTPvfePVTPLkeDPV 771
Cdd:PRK07003   674 PAP-PVDTRPLP---PAIPL--DAI 692
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1000-1251 1.50e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.52  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1000 PQKTIAPRQTTSMPPKLKTPhsRMPAKEPVPKEPLhTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT 1079
Cdd:PTZ00449   569 PSKIPTLSKKPEFPKDPKHP--KDPEEPKKPKRPR-SAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSP 645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1080 FRVEPPKT--TIAPLETRGIPLIPVISPRPSQEELQTAMEETDQST----QELFTTKIPRTTELAKTTQAPHRLHTAPVR 1153
Cdd:PTZ00449   646 ERPEGPKIikSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTtvvlDESFESILKETLPETPGTPFTTPRPLPPKL 725
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1154 PRIPGRPHGRPAlnktttRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHRhsstR 1233
Cdd:PTZ00449   726 PRDEEFPFEPIG------DPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMK----R 795
                          250
                   ....*....|....*....
gi 1907118310 1234 PVSP-ERRPLPPNNVTGKP 1251
Cdd:PTZ00449   796 PDSPsEHEDKPPGDHPSLP 814
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
490-618 1.84e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.93  E-value: 1.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  490 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 568
Cdd:PRK07994   370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118310  569 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 618
Cdd:PRK07994   450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
491-766 2.14e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 2.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPvtfgseapalaivtttdiepvitrtkasvttlAPKPPRPRT 648
Cdd:pfam03154  252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQ--------------------------------HPVPPQPFP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  649 hrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPkPQTTPHPEVPhtilVPATSLEPfiiteaPGTTLVP 728
Cdd:pfam03154  300 ----LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-PREQPLPPAP----LSMPHIKP------PPTTPIP 364
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1907118310  729 KLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 766
Cdd:pfam03154  365 QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSL 402
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
480-628 2.16e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.49  E-value: 2.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  480 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 559
Cdd:PRK14950   351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907118310  560 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 628
Cdd:PRK14950   427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
dnaA PRK14086
chromosomal replication initiator protein DnaA;
515-718 2.50e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.51  E-value: 2.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKP----GHHRL--RRPKTTRSPEVPKSKPALEPATVTPEilvPKIVPKPP- 587
Cdd:PRK14086    87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRAddRPPGLPRQDQLPTARPAYPAYQQRPE---PGAWPRAAd 163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  588 ----QKPKATRRPEVPQVKPAHEPVTFGSEAPALAivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKI 663
Cdd:PRK14086   164 dygwQQQRLGFPPRAPYASPASYAPEQERDREPYD-----AGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHV 238
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907118310  664 PHSKPADLGPItSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFII 718
Cdd:PRK14086   239 HRGGPGPPERD-DAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKYTFDTFVI 292
PHA03378 PHA03378
EBNA-3B; Provisional
467-870 3.27e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.36  E-value: 3.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  467 EQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPstpkrQSTPKPPRVKPAPEP-ETRPSAQTTKAPR----- 540
Cdd:PHA03378   345 EAVRLPDDPIIVEDDDESEEIESECDPDEDKSGAEALASIP-----QTLPDPPTVYGRPKVfARKADLKSTKKCRaivtd 419
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  541 ---------KTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKP--PQKPKATrrpevPQVKPA--HEP 607
Cdd:PHA03378   420 psvikaieeEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQPLEGPTGPLSVQAPlePWQPLPH-----PQVTPVilHQP 494
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  608 VTFGSEAP-ALAIVTTTDIEPVITRTKAsvTTLAPKPPRPRTHRQ-----------RTKYKTTQSPKIPHSKPAD-LGPI 674
Cdd:PHA03378   495 PAQGVQAHgSMLDLLEKDDEDMEQRVMA--TLLPPSPPQPRAGRRapcvytedldiESDEPASTEPVHDQLLPAPgLGPL 572
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  675 TSEPPLASTTKKVR--------RPRPKPQTTPHPEVPHT-ILVPATSLEPFIITEAPGTTLVPKLPQQPDYPHP-KPKTT 744
Cdd:PHA03378   573 QIQPLTSPTTSQLAssapsyaqTPWPVPHPSQTPEPPTTqSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLvFPTPH 652
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  745 RSPAASPTELVPTPVFEPVTPLKEDPVTTIDLERVTDLETPVAFRTEAPGTTLVPAVvlEPVTLRPEVQVTTLAPQKTQK 824
Cdd:PHA03378   653 QPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAA--PPGRAQRPAAATGRARPPAAA 730
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1907118310  825 KHRPSPKPKPVPSPEVTESKPvlPRVREPVTLRTETWVTTKAPKTP 870
Cdd:PHA03378   731 PGRARPPAAAPGRARPPAAAP--GRARPPAAAPGRARPPAAAPGAP 774
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-770 3.31e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 42.34  E-value: 3.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665    208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665    288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665    364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665    416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665    496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
PHA03247 PHA03247
large tegument protein UL36; Provisional
307-575 3.47e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 3.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  307 TLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALVLSEKTSE-TARSVLIPEFELPLSTLAPkrfpe 385
Cdd:PHA03247  2756 RPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPP----- 2830
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  386 fPEAKTAFPLEKPRGSWASSEEP--WVVPGAKtsedsrvvqpqtatydvISSSTTSDETEIEIHTATRDPILDSVPPKTS 463
Cdd:PHA03247  2831 -PTSAQPTAPPPPPGPPPPSLPLggSVAPGGD-----------------VRRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  464 RTAE---QPRATLAPiealfeSRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtrPSAQTTKAPR 540
Cdd:PHA03247  2893 RSTEsfaLPPDQPER------PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE--PSGAVPQPWL 2964
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907118310  541 KTKKPGHHRLRRpktTRSPEVPKSKPALEPATVTP 575
Cdd:PHA03247  2965 GALVPGRVAVPR---FRVPQPAPSREAPASSTPPL 2996
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
469-600 3.54e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 3.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  469 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 544
Cdd:PRK07764   371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907118310  545 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 600
Cdd:PRK07764   449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
515-612 4.18e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.85  E-value: 4.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  515 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 594
Cdd:PRK14954   385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
                           90
                   ....*....|....*...
gi 1907118310  595 RPEVPQVKPAhepVTFGS 612
Cdd:PRK14954   451 PRNVASGKPG---VDLGS 465
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
491-673 4.95e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.40  E-value: 4.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PRK12323   383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  571 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 640
Cdd:PRK12323   463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1907118310  641 PKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGP 673
Cdd:PRK12323   543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
COG3979 COG3979
Chitodextrinase [Carbohydrate transport and metabolism];
1357-1456 5.75e-03

Pssm-ID: 443178 [Multi-domain]  Cd Length: 369  Bit Score: 40.91  E-value: 5.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1357 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1435
Cdd:COG3979      2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
                           90       100
                   ....*....|....*....|.
gi 1907118310 1436 nplgeGPASNTVAFSTESADP 1456
Cdd:COG3979     72 -----DAAGNVSAASGTSTAM 87
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
489-751 6.00e-03

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 6.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  489 TSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPK----- 563
Cdd:PHA03307    88 PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVasdaa 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  564 ----------SKPALEPATVTPEILVPKIVPKPPQKPKAtRRPEVPQVKPAHEPVTFGSEAPALAIVTTTD--IEPVITR 631
Cdd:PHA03307   168 ssrqaalplsSPEETARAPSSPPAEPPPSTPPAAASPRP-PRRSSPISASASSPAPAPGRSAADDAGASSSdsSSSESSG 246
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  632 TKASVTTLAPKpPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKvRRPRPKPQTTPHPEVPHTILVPAT 711
Cdd:PHA03307   247 CGWGPENECPL-PRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSP-SSPGSGPAPSSPRASSSSSSSRES 324
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1907118310  712 SLE-PFIITEAPGTTLVPklPQQPDYPHPKPKTTRSPAASP 751
Cdd:PHA03307   325 SSSsTSSSSESSRGAAVS--PGPSPSRSPSPSRPPPPADPS 363
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
471-748 8.80e-03

Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 40.68  E-value: 8.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  471 ATLAPIEALFESrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPEtRPSAQttkAPRKTKKPGHHRL 550
Cdd:PLN03209   311 APLTPMEELLAK-------IPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPP-QPKAV---VPRPLSPYTAYED 379
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  551 RRPKTTRSPEVPKSKPA----------LEPATVTPEILVPKIVP--KPPQKPKATRRPEVPQVK-PAHEPVTFGSEAPAL 617
Cdd:PLN03209   380 LKPPTSPIPTPPSSSPAssksvdavakPAEPDVVPSPGSASNVPevEPAQVEAKKTRPLSPYARyEDLKPPTSPSPTAPT 459
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  618 AIVTTTDIEPVITRT------KASVTTLAPKPPRPRthrqrtkykttqsPKIPHSKPADLGPITSEPPLAsttkkvrrPR 691
Cdd:PLN03209   460 GVSPSVSSTSSVPAVpdtapaTAATDAAAPPPANMR-------------PLSPYAVYDDLKPPTSPSPAA--------PV 518
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907118310  692 PKPQTTPHPEVPhtilvPATSLEPFIITEAPGTTLVPKlpQQPDYPHP-----KPKTTRSPA 748
Cdd:PLN03209   519 GKVAPSSTNEVV-----KVGNSAPPTALADEQHHAQPK--PRPLSPYTmyedlKPPTSPTPS 573
PRK10263 PRK10263
DNA translocase FtsK; Provisional
669-1164 9.99e-03

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.84  E-value: 9.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  669 ADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPAtslePFIITEAPGTTLVPKLPQQP-DYPHPKPKTTRSP 747
Cdd:PRK10263   335 APVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPA----PEGYPQQSQYAQPAVQYNEPlQQPVQPQQPYYAP 410
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  748 AASPTELVPTPVFEPVTPLKEDPVTtidlERVTDLETPVAFRTEAPGTTLVPAVVLEPVTLRPEvqvTTLAPQKTQKKHR 827
Cdd:PRK10263   411 AAEQPAQQPYYAPAPEQPAQQPYYA----PAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQ---PAAQEPLYQQPQP 483
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  828 PSPKPKPVPSPEVTESKPVLP-----------RVREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPETPLTKPVA- 895
Cdd:PRK10263   484 VEQQPVVEPEPVVEETKPARPplyyfeeveekRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSp 563
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  896 -ATDLEPSALSTEVPATVVlATALTPVTLRTKAPKTTT-LAPNVQRTRRPH-PRPKTTASTGVS-ESKSAPTElqslvlK 971
Cdd:PRK10263   564 lASGVKKATLATGAAATVA-APVFSLANSGGPRPQVKEgIGPQLPRPKRIRvPTRRELASYGIKlPSQRAAEE------K 636
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310  972 PVTSPSLEIIQSQSVSDDlELVAFSTESPQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKEP------LHTTSKPKMPPS 1045
Cdd:PRK10263   637 AREAQRNQYDSGDQYNDD-EIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAelarqfAQTQQQRYSGEQ 715
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907118310 1046 PEVADTTSVPKDERLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPLETRGIPLIPViSPRPSQEELQTAMEETDQSTQE 1125
Cdd:PRK10263   716 PAGANPFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPV-APQPQYQQPQQPVAPQPQYQQP 794
                          490       500       510
                   ....*....|....*....|....*....|....*....
gi 1907118310 1126 LFTTKIPRTTELAKTTQAPHRLHTAPVRPRIPGRPHGRP 1164
Cdd:PRK10263   795 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 833
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
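
The E-value threshold above can be checked against the per-hit summary lines ("Pssm-ID: ... Bit Score: ... E-value: ...") with a small script. The following is a minimal sketch, not part of any NCBI tool: the names `parse_hit` and `passes_threshold` are hypothetical, the regular expression assumes the summary line layout shown in this listing, and treating the threshold as inclusive is an assumption.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears in this listing,
# e.g. "Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.85  E-value: 4.18e-03"
HIT_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+).*?Cd Length:\s*(?P<length>\d+)\s+"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_hit(line):
    """Return the numeric fields of a summary line, or None if it does not match."""
    m = HIT_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm"]),
        "cd_length": int(m["length"]),
        "bit_score": float(m["bits"]),
        "e_value": float(m["evalue"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.85  E-value: 4.18e-03"
    hit = parse_hit(line)
    print(hit, passes_threshold(hit))
```

Applied to the hits listed above, whose E-values range from 4.18e-03 to 9.99e-03, every hit falls under the 0.01 threshold.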
