NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568995602|ref|XP_006522321|]
View 

target of Nesh-SH3 isoform X10 [Mus musculus]

Protein Classification

fibronectin type III domain-containing protein( domain architecture ID 10440918)

fibronectin type III (FN3) domain-containing protein similar to human Target of Nesh-SH3 (Tarsh) and Drosophila melanogaster cytokine receptor (protein domeless)

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
482-891 5.32e-15

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 81.14  E-value: 5.32e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  482 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 558
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  559 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 628
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  629 KYKTTQSPKIP-------HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 701
Cdd:PHA03247 2713 HALVSATPLPPgpaaarqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  702 VPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPitdLERVTDLETPVA----FRTEAPG 773
Cdd:PHA03247 2792 SESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP---PPPSLPLGGSVApggdVRRRPPS 2868
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  774 TTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVP 853
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|
gi 568995602  854 SPEVTESKPVLPRVREPVTLRTETWVT------------TKAPKTPKRTR 891
Cdd:PHA03247 2949 PAGAGEPSGAVPQPWLGALVPGRVAVPrfrvpqpapsreAPASSTPPLTG 2998
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1377-1468 3.21e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.21e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1377 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1454
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 568995602 1455 LGEGPASNTVAFST 1468
Cdd:cd00063    80 GGESPPSESVTVTT 93
PHA03247 super family cl33720
large tegument protein UL36; Provisional
287-624 9.23e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 9.23e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  287 PTSIMLEISDALKAQLAKNETLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALAPKRFPEFPEAKT 366
Cdd:PHA03247 2712 PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  367 AFPLEKPRGSWASSEEPWVVPGAKTSEdsrvvqPQTATYDVISSSTTSDEteieihtatrdPILDSVPPKTSRTAEQPRA 446
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAAL------PPAASPAGPLPPPTSAQ-----------PTAPPPPPGPPPPSLPLGG 2854
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  447 TLAPIEALFEsrnveiftspevRPTTAAPQQTTSIPSTPKRQSTPKPPrVKPAPEPETRPSAQttkaprktkkpghhrLR 526
Cdd:PHA03247 2855 SVAPGGDVRR------------RPPSRSPAAKPAAPARPPVRRLARPA-VSRSTESFALPPDQ---------------PE 2906
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  527 RPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPeVPQVKPAHEPVTFGSEAPALAIVTTTDIEPvitr 606
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG-AGEPSGAVPQPWLGALVPGRVAVPRFRVPQ---- 2981
                         330
                  ....*....|....*...
gi 568995602  607 TKASVTTLAPKPPRPRTH 624
Cdd:PHA03247 2982 PAPSREAPASSTPPLTGH 2999
PHA03247 super family cl33720
large tegument protein UL36; Provisional
932-1323 3.54e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.54e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  932 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 1011
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1012 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEPVPKEPLHTTSK-------PKMPPSPEVADTTSVPKDE 1075
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSATPLPPGPAAArqaspalPAAPAPPAVPAGPATPGGP 2754
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1076 RLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPletrGIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELA 1155
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA----VASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP 2830
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1156 KTTQAPhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSH 1226
Cdd:PHA03247 2831 PTSAQP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPE 2906
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1227 ATRKPGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPT 1306
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
                         410
                  ....*....|....*..
gi 568995602 1307 APASEEEFGTTTDFSSS 1323
Cdd:PHA03247 2986 REAPASSTPPLTGHSLS 3002
fn3 pfam00041
Fibronectin type III domain;
116-195 1.91e-04

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 568995602   193 GVK 195
Cdd:pfam00041   72 RVQ 74
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
482-891 5.32e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 81.14  E-value: 5.32e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  482 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 558
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  559 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 628
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  629 KYKTTQSPKIP-------HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 701
Cdd:PHA03247 2713 HALVSATPLPPgpaaarqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  702 VPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPitdLERVTDLETPVA----FRTEAPG 773
Cdd:PHA03247 2792 SESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP---PPPSLPLGGSVApggdVRRRPPS 2868
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  774 TTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVP 853
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|
gi 568995602  854 SPEVTESKPVLPRVREPVTLRTETWVT------------TKAPKTPKRTR 891
Cdd:PHA03247 2949 PAGAGEPSGAVPQPWLGALVPGRVAVPrfrvpqpapsreAPASSTPPLTG 2998
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1377-1468 3.21e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.21e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1377 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1454
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 568995602 1455 LGEGPASNTVAFST 1468
Cdd:cd00063    80 GGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1378-1458 9.80e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 9.80e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   1378 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1455
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 568995602   1456 GEG 1458
Cdd:smart00060   81 GEG 83
PHA03247 PHA03247
large tegument protein UL36; Provisional
287-624 9.23e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 9.23e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  287 PTSIMLEISDALKAQLAKNETLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALAPKRFPEFPEAKT 366
Cdd:PHA03247 2712 PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  367 AFPLEKPRGSWASSEEPWVVPGAKTSEdsrvvqPQTATYDVISSSTTSDEteieihtatrdPILDSVPPKTSRTAEQPRA 446
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAAL------PPAASPAGPLPPPTSAQ-----------PTAPPPPPGPPPPSLPLGG 2854
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  447 TLAPIEALFEsrnveiftspevRPTTAAPQQTTSIPSTPKRQSTPKPPrVKPAPEPETRPSAQttkaprktkkpghhrLR 526
Cdd:PHA03247 2855 SVAPGGDVRR------------RPPSRSPAAKPAAPARPPVRRLARPA-VSRSTESFALPPDQ---------------PE 2906
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  527 RPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPeVPQVKPAHEPVTFGSEAPALAIVTTTDIEPvitr 606
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG-AGEPSGAVPQPWLGALVPGRVAVPRFRVPQ---- 2981
                         330
                  ....*....|....*...
gi 568995602  607 TKASVTTLAPKPPRPRTH 624
Cdd:PHA03247 2982 PAPSREAPASSTPPLTGH 2999
fn3 pfam00041
Fibronectin type III domain;
1378-1461 1.21e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.21e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  1378 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1454
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 568995602  1455 LGEGPAS 1461
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
932-1323 3.54e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.54e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  932 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 1011
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1012 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEPVPKEPLHTTSK-------PKMPPSPEVADTTSVPKDE 1075
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSATPLPPGPAAArqaspalPAAPAPPAVPAGPATPGGP 2754
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1076 RLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPletrGIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELA 1155
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA----VASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP 2830
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1156 KTTQAPhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSH 1226
Cdd:PHA03247 2831 PTSAQP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPE 2906
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1227 ATRKPGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPT 1306
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
                         410
                  ....*....|....*..
gi 568995602 1307 APASEEEFGTTTDFSSS 1323
Cdd:PHA03247 2986 REAPASSTPPLTGHSLS 3002
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
470-733 4.87e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 4.87e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   470 PTTAAPQQTTSIPSTPKRQS---TPKPPRVKPAPE-----PETRPSAQT----TKAPRKTKKPGHhrLRRPKTTRSPEVP 537
Cdd:pfam03154  313 PSPAAPGQSQQRIHTPPSQSqlqSQQPPREQPLPPaplsmPHIKPPPTTpipqLPNPQSHKHPPH--LSGPSPFQMNSNL 390
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   538 KSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPahepvtfgseapalaivtttdiePVITRTKASVTTLAPK 617
Cdd:pfam03154  391 PPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP-----------------------PVLTQSQSLPPPAASH 447
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   618 PPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAP 697
Cdd:pfam03154  448 PPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA 523
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 568995602   698 gttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 733
Cdd:pfam03154  524 -----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1362-1473 1.26e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.53  E-value: 1.26e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1362 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1441
Cdd:COG3401   220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                          90       100       110
                  ....*....|....*....|....*....|...
gi 568995602 1442 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1473
Cdd:COG3401   295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
fn3 pfam00041
Fibronectin type III domain;
116-195 1.91e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 568995602   193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.76e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.76e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602    114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 568995602    190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 3.03e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 3.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063     1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                  ....
gi 568995602  192 FGVK 195
Cdd:cd00063    72 FRVR 75
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
466-678 6.06e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 6.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  466 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 542
Cdd:NF033839  286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  543 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 622
Cdd:NF033839  365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995602  623 THRQRTKYKTTQSPKIPHSKPAdlgpITSEPPLASTTKKVRRPRPKPQTTPHPEVP 678
Cdd:NF033839  431 VKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKPEVKPQPEKPKPEVKPQPEKP 482
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
403-745 4.14e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.96  E-value: 4.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  403 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 469
Cdd:COG5665   208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  470 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 539
Cdd:COG5665   288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  540 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 618
Cdd:COG5665   364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  619 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 690
Cdd:COG5665   416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  691 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 745
Cdd:COG5665   496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
482-891 5.32e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 81.14  E-value: 5.32e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  482 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 558
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  559 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 628
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  629 KYKTTQSPKIP-------HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 701
Cdd:PHA03247 2713 HALVSATPLPPgpaaarqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  702 VPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPitdLERVTDLETPVA----FRTEAPG 773
Cdd:PHA03247 2792 SESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP---PPPSLPLGGSVApggdVRRRPPS 2868
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  774 TTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVP 853
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|
gi 568995602  854 SPEVTESKPVLPRVREPVTLRTETWVT------------TKAPKTPKRTR 891
Cdd:PHA03247 2949 PAGAGEPSGAVPQPWLGALVPGRVAVPrfrvpqpapsreAPASSTPPLTG 2998
PHA03247 PHA03247
large tegument protein UL36; Provisional
431-1072 1.76e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 1.76e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  431 DSVPPKTSRTAEQPRATLAPIEALFESRNveifTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSaqt 510
Cdd:PHA03247 2558 AAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS--- 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  511 tKAPRKTKKPGHHRLRRPKttrsPEVPKSKPAlePATVTPEILVPKIvPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAP 590
Cdd:PHA03247 2631 -PSPAANEPDPHPPPTVPP----PERPRDDPA--PGRVSRPRRARRL-GRAAQASSPPQRPRRRAARPTVGSLTSLADPP 2702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  591 AlaivtttdiepvitrtkasvttlAPKPPRPRTHRQRTKYKTTQSPKIPH--SKPADLGPITSEPPLASTTKKVRRPRPK 668
Cdd:PHA03247 2703 P-----------------------PPPTPEPAPHALVSATPLPPGPAAARqaSPALPAAPAPPAVPAGPATPGGPARPAR 2759
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  669 PQTTPHPEVPHTILVPATSLEPfIITEAPGTTLVPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKED 744
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTA 2838
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  745 PVTTIVPitdLERVTDLETPVA----FRTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVL 820
Cdd:PHA03247 2839 PPPPPGP---PPPSLPLGGSVApggdVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPP 2915
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  821 EPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVPSPEVTESKPVLPrvrepvtlrtETWVTTKAPKTPKRTRRPRPkpqtt 900
Cdd:PHA03247 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP----------QPWLGALVPGRVAVPRFRVP----- 2980
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  901 ptpetpltkpvaatdlePSALSTEVPAtvvlataltpvtlrtkaPKTTTlapnvqRTRRPHPRPKTTASTGVSESKSAPt 980
Cdd:PHA03247 2981 -----------------QPAPSREAPA-----------------SSTPP------LTGHSLSRVSSWASSLALHEETDP- 3019
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  981 elqslvlkpvtsPSLEIIQSQSVSDDLElvafstespqKTIAPRQTTSMPPKLKTPhsrmpAKEPVPKEPLHTTSKPKMP 1060
Cdd:PHA03247 3020 ------------PPVSLKQTLWPPDDTE----------DSDADSLFDSDSERSDLE-----ALDPLPPEPHDPFAHEPDP 3072
                         650
                  ....*....|..
gi 568995602 1061 PSPEvADTTSVP 1072
Cdd:PHA03247 3073 ATPE-AGARESP 3083
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
484-751 2.21e-12

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 72.41  E-value: 2.21e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  484 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 554
Cdd:PTZ00449  542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  555 PKIVPKPPQKPKATRRPevpqvkpahepvtfgseapalaivtttdiepvitrtkasvttlaPKPPRPRTHRQRTKYKTTQ 634
Cdd:PTZ00449  620 IPKSPKRPESPKSPKRP--------------------------------------------PPPQRPSSPERPEGPKIIK 655
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  635 SPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHP 714
Cdd:PTZ00449  656 SPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFE 734
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 568995602  715 KPKTTRSPAASPTELVPTPVfEPVTPLKEDPVTTIVP 751
Cdd:PTZ00449  735 PIGDPDAEQPDDIEFFTPPE-EERTFFHETPADTPLP 770
PHA03247 PHA03247
large tegument protein UL36; Provisional
311-718 1.39e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.89  E-value: 1.39e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  311 PAESKTPEVeklAGQPVTVTPESVSRSTKPTLSSALDTAETALAPKRF-PEFPEAKTAFPLEKPRGSWASSEepwVVPGA 389
Cdd:PHA03247 2592 PPQSARPRA---PVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANePDPHPPPTVPPPERPRDDPAPGR---VSRPR 2665
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  390 KTSEDSRVVQPQTATYDVISSSTTSDETEIeihTATRDPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR 469
Cdd:PHA03247 2666 RARRLGRAAQASSPPQRPRRRAARPTVGSL---TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPP 2742
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  470 PTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKP-ALEPATV 548
Cdd:PHA03247 2743 AVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPaAALPPAA 2822
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  549 TPEILVP---KIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTtdiepVITRTKASVTTLAPKPPRPRTHR 625
Cdd:PHA03247 2823 SPAGPLPpptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAK-----PAAPARPPVRRLARPAVSRSTES 2897
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  626 QRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKL 705
Cdd:PHA03247 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRF 2977
                         410
                  ....*....|....*.
gi 568995602  706 ---PQQPDYPHPKPKT 718
Cdd:PHA03247 2978 rvpQPAPSREAPASST 2993
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1377-1468 3.21e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.28  E-value: 3.21e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1377 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1454
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 568995602 1455 LGEGPASNTVAFST 1468
Cdd:cd00063    80 GGESPPSESVTVTT 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
520-1182 3.63e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.80  E-value: 3.63e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  520 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-------------TFG 586
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasdDAG 2549
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  587 SEAPALAivttTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPitsepplasttkkvrrPR 666
Cdd:PHA03247 2550 DPPPPLP----PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD----------------PR 2609
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  667 PKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPV 746
Cdd:PHA03247 2610 GPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR 2689
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  747 TTIVPITDLERvtdletpvafrTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVtlR 826
Cdd:PHA03247 2690 PTVGSLTSLAD-----------PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPA--R 2756
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  827 PEVQVTTLAPqktqkkhrpspkpkpvpspevteSKPVLPRVREPVTLRTETwVTTKAPKTPKRTRRPRPKPQTTPTPETP 906
Cdd:PHA03247 2757 PARPPTTAGP-----------------------PAPAPPAAPAAGPPRRLT-RPAVASLSESRESLPSPWDPADPPAAVL 2812
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  907 LTKPVAATDLEPSALSTEVPATVVLATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTgvsesksAPTElqslv 986
Cdd:PHA03247 2813 APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPA-------APAR----- 2880
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  987 lkpvtsPSLEIIQSQSVSDDLELVAFSTESPQKtiaPRQTTSMPPKLKTPhsrmpaKEPVPKEPLHTTSKPKMPPSPEVA 1066
Cdd:PHA03247 2881 ------PPVRRLARPAVSRSTESFALPPDQPER---PPQPQAPPPPQPQP------QPPPPPQPQPPPPPPPRPQPPLAP 2945
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1067 DTTSVPKDERLSLKPDPEVTHSETVLPPVT-FRVEPPKTTIAPLETRGIPLIPVISPRPSQEELQTAM-EETDQSTQELF 1144
Cdd:PHA03247 2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLK 3025
                         650       660       670       680
                  ....*....|....*....|....*....|....*....|...
gi 568995602 1145 TT-KIPRTTELAKTTQA----PHRLHTAPVRPrIPGRPHGRPA 1182
Cdd:PHA03247 3026 QTlWPPDDTEDSDADSLfdsdSERSDLEALDP-LPPEPHDPFA 3067
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
469-819 8.50e-08

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 57.39  E-value: 8.50e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  469 RPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEpetrpSAQTTKAPRKTKKPGhhRLRRPKTTRSPEVPKSKPALEPATV 548
Cdd:PTZ00449  560 KPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPE-----EPKKPKRPRSAQRPT--RPKSPKLPELLDIPKSPKRPESPKS 632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  549 tpeilvPKiVPKPPQKPKATRRPEVPQV----KPAHEP-VTFGseaPALAIVTTTDIEPVITRTKASVTTLAPKpprpRT 623
Cdd:PTZ00449  633 ------PK-RPPPPQRPSSPERPEGPKIikspKPPKSPkPPFD---PKFKEKFYDDYLDAAAKSKETKTTVVLD----ES 698
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  624 HRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTT--KKVRRPRPKPQTTPHPEVPHTILV---PATSLEPFIITEAPG 698
Cdd:PTZ00449  699 FESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEpiGDPDAEQPDDIEFFTPPEEERTFFhetPADTPLPDILAEEFK 778
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  699 TTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVTPLK----------------------EDPVTTIVPITDL 755
Cdd:PTZ00449  779 EEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSLPKKrhrldglalsttdlesdagriaKDASGKIVKLKRS 851
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  756 ERVTDLET--------PVAFR-------TEAPGT-TLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVV 819
Cdd:PTZ00449  852 KSFDDLTTveeaeemgAEARKivvdddgTEADDEdTHPPEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPSII 931
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1378-1458 9.80e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 9.80e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   1378 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1455
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 568995602   1456 GEG 1458
Cdd:smart00060   81 GEG 83
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
356-742 1.88e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 56.24  E-value: 1.88e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  356 KRFPEFPEAKTAFPLEKPRGSWASSEEPwVVPGAKTSEdSRVVQPQTATYDVISSSTTSDETEIEI---------HTATR 426
Cdd:PTZ00449  494 KKLAPIEEEDSDKHDEPPEGPEASGLPP-KAPGDKEGE-EGEHEDSKESDEPKEGGKPGETKEGEVgkkpgpakeHKPSK 571
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  427 DPILDSVP-----PKTSRTAEQPRATLAPIEAlfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRvkpAPE 501
Cdd:PTZ00449  572 IPTLSKKPefpkdPKHPKDPEEPKKPKRPRSA-----------QRPTRPKSPKLPELLDIPKSPKRPESPKSPK---RPP 637
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  502 PETRPSAQttkaprktkkpghhrlRRPKTTRSPEVPKSKPAlepatvtpeilvpkivPKPPQKPKATRRPEVPQVKPAHE 581
Cdd:PTZ00449  638 PPQRPSSP----------------ERPEGPKIIKSPKPPKS----------------PKPPFDPKFKEKFYDDYLDAAAK 685
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  582 PVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRThrqrtkykttqsPKIPHSKPADlgpitsePPLASTTK 660
Cdd:PTZ00449  686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPkLPRD------------EEFPFEPIGD-------PDAEQPDD 746
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  661 KVRRPRPKPQTTPHPEvphtilVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVT 739
Cdd:PTZ00449  747 IEFFTPPEEERTFFHE------TPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSL 813

                  ...
gi 568995602  740 PLK 742
Cdd:PTZ00449  814 PKK 816
PHA03247 PHA03247
large tegument protein UL36; Provisional
499-1027 2.12e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 2.12e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  499 APEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKskPALEPATVTPEILVPKIV-------------------P 559
Cdd:PHA03247 2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLA--PAILPDEPVGEPVHPRMLtwirgleelasddagdpppP 2554
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  560 KPPQKPKATRRPEVPQVKPAHEPvtfgSEAPALAIVTTTDIEPVITRTKASV-------TTLAPKPPRPRTHR------- 625
Cdd:PHA03247 2555 LPPAAPPAAPDRSVPPPRPAPRP----SEPAVTSRARRPDAPPQSARPRAPVddrgdprGPAPPSPLPPDTHApdpppps 2630
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  626 QRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSlepfiiteAPGTTLVpKL 705
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV--------GSLTSLA-DP 2701
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  706 PQQPDYPHPKPKTTRSpaASPTELVPTPVFEPVTPLKEDPVTTIVPITDLERVTDLETPVAFRTEAPGTTLASKISQRTH 785
Cdd:PHA03247 2702 PPPPPTPEPAPHALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  786 RPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVPSPEVTESKpVLP 865
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS-VAP 2858
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  866 rvREPVTLRTETWVTTKAPKTPKRTRRPRPKPQTTPTPEtpltkpvaatdlEPSALSTEVPATVVLATALTPVTLRTKAP 945
Cdd:PHA03247 2859 --GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRST------------ESFALPPDQPERPPQPQAPPPPQPQPQPP 2924
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  946 KTTTLAPNVQRTRRPHPRPKTTAST-GVSESKSAPTELQSLVLKPVTSPSLEIIQSQSvSDDLELVAFSTESPQKTIAPR 1024
Cdd:PHA03247 2925 PPPQPQPPPPPPPRPQPPLAPTTDPaGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP-APSREAPASSTPPLTGHSLSR 3003

                  ...
gi 568995602 1025 QTT 1027
Cdd:PHA03247 3004 VSS 3006
PHA03247 PHA03247
large tegument protein UL36; Provisional
287-624 9.23e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 9.23e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  287 PTSIMLEISDALKAQLAKNETLALPAESKTPEVEKLAGQPVTVTPESVSRSTKPTLSSALDTAETALAPKRFPEFPEAKT 366
Cdd:PHA03247 2712 PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  367 AFPLEKPRGSWASSEEPWVVPGAKTSEdsrvvqPQTATYDVISSSTTSDEteieihtatrdPILDSVPPKTSRTAEQPRA 446
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAAL------PPAASPAGPLPPPTSAQ-----------PTAPPPPPGPPPPSLPLGG 2854
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  447 TLAPIEALFEsrnveiftspevRPTTAAPQQTTSIPSTPKRQSTPKPPrVKPAPEPETRPSAQttkaprktkkpghhrLR 526
Cdd:PHA03247 2855 SVAPGGDVRR------------RPPSRSPAAKPAAPARPPVRRLARPA-VSRSTESFALPPDQ---------------PE 2906
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  527 RPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPeVPQVKPAHEPVTFGSEAPALAIVTTTDIEPvitr 606
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG-AGEPSGAVPQPWLGALVPGRVAVPRFRVPQ---- 2981
                         330
                  ....*....|....*...
gi 568995602  607 TKASVTTLAPKPPRPRTH 624
Cdd:PHA03247 2982 PAPSREAPASSTPPLTGH 2999
fn3 pfam00041
Fibronectin type III domain;
1378-1461 1.21e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.79  E-value: 1.21e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  1378 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1454
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 568995602  1455 LGEGPAS 1461
Cdd:pfam00041   79 GGEGPPS 85
PHA03247 PHA03247
large tegument protein UL36; Provisional
932-1323 3.54e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.54e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  932 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 1011
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1012 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEPVPKEPLHTTSK-------PKMPPSPEVADTTSVPKDE 1075
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSATPLPPGPAAArqaspalPAAPAPPAVPAGPATPGGP 2754
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1076 RLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPletrGIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELA 1155
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA----VASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP 2830
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1156 KTTQAPhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSH 1226
Cdd:PHA03247 2831 PTSAQP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPE 2906
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1227 ATRKPGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPT 1306
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
                         410
                  ....*....|....*..
gi 568995602 1307 APASEEEFGTTTDFSSS 1323
Cdd:PHA03247 2986 REAPASSTPPLTGHSLS 3002
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
470-733 4.87e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 4.87e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   470 PTTAAPQQTTSIPSTPKRQS---TPKPPRVKPAPE-----PETRPSAQT----TKAPRKTKKPGHhrLRRPKTTRSPEVP 537
Cdd:pfam03154  313 PSPAAPGQSQQRIHTPPSQSqlqSQQPPREQPLPPaplsmPHIKPPPTTpipqLPNPQSHKHPPH--LSGPSPFQMNSNL 390
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   538 KSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPahepvtfgseapalaivtttdiePVITRTKASVTTLAPK 617
Cdd:pfam03154  391 PPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP-----------------------PVLTQSQSLPPPAASH 447
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   618 PPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAP 697
Cdd:pfam03154  448 PPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA 523
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 568995602   698 gttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 733
Cdd:pfam03154  524 -----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
PHA03377 PHA03377
EBNA-3C; Provisional
492-678 6.52e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 51.21  E-value: 6.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  492 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 568
Cdd:PHA03377  414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  569 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPK---IPHSKPAD 645
Cdd:PHA03377  489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGFQRSGRRQKratPPKVSPSD 559
                         170       180       190
                  ....*....|....*....|....*....|...
gi 568995602  646 LGPITSEPPLASTTKKVRRPRPKPQTTPHPEVP 678
Cdd:PHA03377  560 RGPPKASPPVMAPPSTGPRVMATPSTGPRDMAP 592
PHA03247 PHA03247
large tegument protein UL36; Provisional
956-1399 7.87e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 7.87e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  956 RTRRPHPRPK-TTASTGVSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSddlelvAFSTESPQKTIAPR----QTTSMP 1030
Cdd:PHA03247 2585 RARRPDAPPQsARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPA------ANEPDPHPPPTVPPperpRDDPAP 2658
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1031 PKLKTPH-SRMPAKEPVPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVTF--RVEPPKTTIA 1107
Cdd:PHA03247 2659 GRVSRPRrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaaRQASPALPAA 2738
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1108 PLeTRGIPLIPVI--SPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQAPHRlhTAPVRPRIPGrPHGRPALNK 1185
Cdd:PHA03247 2739 PA-PPAVPAGPATpgGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR--ESLPSPWDPA-DPPAAVLAP 2814
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1186 TTTRPDKTKPRGTSHKNGvgTGTKQAPKPPSPGRNASVDSHATRKPGSvSGTRRPPiphrhSSTRPVSPERRPLPPNNVT 1265
Cdd:PHA03247 2815 AAALPPAASPAGPLPPPT--SAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPP-----SRSPAAKPAAPARPPVRRL 2886
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1266 GKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGTTTDFSSSPTKETDPLGKPRFIGPHvryiP 1345
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQ----P 2962
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....
gi 568995602 1346 KPENKPCSITDSVRRFPTEEATEGNATSPPQNPPTNLTVVTVEGCPSFVILDWE 1399
Cdd:PHA03247 2963 WLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEE 3016
PHA03247 PHA03247
large tegument protein UL36; Provisional
1160-1380 4.45e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 4.45e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1160 APHRLHTAPVRPRIPGRPHGRP---ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVDSHATRKPGSVSG 1236
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPsepAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1237 TRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGT 1316
Cdd:PHA03247 2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568995602 1317 TTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1380
Cdd:PHA03247 2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
PHA03247 PHA03247
large tegument protein UL36; Provisional
1029-1379 1.16e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 1.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1029 MPPKLK--TPHSRMPAKEPVPKEP-LHTTSKPKMPPSPEVADTTSVPKDERlslkPDPEVTHSETVLPPVTFRVEPPKTT 1105
Cdd:PHA03247 2555 LPPAAPpaAPDRSVPPPRPAPRPSePAVTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1106 IAPLETRGIPLIPVISPRPSQEELQTAMEETDQSTQelfTTKIPRTTELAKTTQAPHRLHTAP--------VRPRIPGR- 1176
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR---ARRLGRAAQASSPPQRPRRRAARPtvgsltslADPPPPPPt 2707
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1177 PHGRP---------------ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPP 1241
Cdd:PHA03247 2708 PEPAPhalvsatplppgpaaARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1242 IPHRHSSTRPVSPERRPLPPNNVTgkPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGttTDFS 1321
Cdd:PHA03247 2788 VASLSESRESLPSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG--GDVR 2863
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995602 1322 SSPTKETDPLGKPRFIGPHVRYIPKPENKPCSIT-----DSVRRFPTEEATEGNATSPPQNPP 1379
Cdd:PHA03247 2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalppDQPERPPQPQAPPPPQPQPQPPPP 2926
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1362-1473 1.26e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.53  E-value: 1.26e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1362 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1441
Cdd:COG3401   220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                          90       100       110
                  ....*....|....*....|....*....|...
gi 568995602 1442 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1473
Cdd:COG3401   295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1362-1516 1.72e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.15  E-value: 1.72e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1362 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1439
Cdd:COG3401   314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568995602 1440 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1516
Cdd:COG3401   388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
fn3 pfam00041
Fibronectin type III domain;
116-195 1.91e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 568995602   193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.76e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 2.76e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602    114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 568995602    190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 3.03e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.33  E-value: 3.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063     1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                  ....
gi 568995602  192 FGVK 195
Cdd:cd00063    72 FRVR 75
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
490-601 5.00e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.80  E-value: 5.00e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  490 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 569
Cdd:PRK14950  361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
                          90       100       110
                  ....*....|....*....|....*....|..
gi 568995602  570 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 601
Cdd:PRK14950  434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
466-678 6.06e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 6.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  466 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 542
Cdd:NF033839  286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  543 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 622
Cdd:NF033839  365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995602  623 THRQRTKYKTTQSPKIPHSKPAdlgpITSEPPLASTTKKVRRPRPKPQTTPHPEVP 678
Cdd:NF033839  431 VKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKPEVKPQPEKPKPEVKPQPEKP 482
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
528-750 9.62e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 43.76  E-value: 9.62e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  528 PKTTRSPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATRRPEVP-----QVKPAHEPVTFGSEAPALAIVTTTDIEP 602
Cdd:PLN03209  330 PKESDAADGPKPVP---TKPVTPEAPSPPIEEEPPQPKAVVPRPLSPytayeDLKPPTSPIPTPPSSSPASSKSVDAVAK 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  603 VITRTKASVTTLAPKPPRPRTHRQRTKyktTQSPKIPHSKPADLGPITSepplasttkkvrrPRPKPQTTPHPEVPHTIL 682
Cdd:PLN03209  407 PAEPDVVPSPGSASNVPEVEPAQVEAK---KTRPLSPYARYEDLKPPTS-------------PSPTAPTGVSPSVSSTSS 470
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568995602  683 VPATSLEP----FIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIV 750
Cdd:PLN03209  471 VPAVPDTApataATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
PHA03378 PHA03378
EBNA-3B; Provisional
466-680 1.19e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.52  E-value: 1.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  466 PEVRPTTAapQQTTSIPSTPKRqSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGH----HRLRRPKTTRSPEV---PK 538
Cdd:PHA03378  576 PLTSPTTS--QLASSAPSYAQT-PWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRpipmRPLRMQPITFNVLVfptPH 652
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  539 SKPALEPATVTPEILVPKIVP-----------KPPQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRT 607
Cdd:PHA03378  653 QPPQVEITPYKPTWTQIGHIPyqpsptgantmLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPG 732
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995602  608 KASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHT 680
Cdd:PHA03378  733 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPT 805
PRK10263 PRK10263
DNA translocase FtsK; Provisional
395-722 1.61e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.54  E-value: 1.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  395 SRVVQPQTATYDVI---------SSSTTSDETEIEIHTATRDPILDSVPPKTSRTA-EQPRATLAPIEALFESRNVeIFT 464
Cdd:PRK10263  297 NRATQPEYDEYDPLlngapitepVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPpAQPTVAWQPVPGPQTGEPV-IAP 375
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  465 SPEVRPTTAAPQQTTSIPSTPKRQSTP--KPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 542
Cdd:PRK10263  376 APEGYPQQSQYAQPAVQYNEPLQQPVQpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQST 455
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  543 LEP-ATVTPEILVPKIVPKPP---------QKPKATRRPEVPQVKPAHEPVTFGSEapalaivtttdIEPVITRTKASVT 612
Cdd:PRK10263  456 FAPqSTYQTEQTYQQPAAQEPlyqqpqpveQQPVVEPEPVVEETKPARPPLYYFEE-----------VEEKRAREREQLA 524
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  613 TLAPKPPRPrthrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRrprpkpQTTPHPEVPHTILVPATSLepfi 692
Cdd:PRK10263  525 AWYQPIPEP------VKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVK------KATLATGAAATVAAPVFSL---- 588
                         330       340       350
                  ....*....|....*....|....*....|
gi 568995602  693 iteAPGTTLVPKLPQQPDYPHPKPKTTRSP 722
Cdd:PRK10263  589 ---ANSGGPRPQVKEGIGPQLPRPKRIRVP 615
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
433-746 1.64e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 43.30  E-value: 1.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  433 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 502
Cdd:PRK07003  372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  503 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 571
Cdd:PRK07003  451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  572 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTK--------ASVTTLAPKPPRPRTHRQrtkyktTQSPKIPHS 641
Cdd:PRK07003  531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAraaaaakpAAAPAAAPKPAAPRVAVQ------VPTPRARAA 604
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  642 KPADLGPITSEPPLASTTkkvRRPRPkpqttPHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRS 721
Cdd:PRK07003  605 TGDAPPNGAARAEQAAES---RGAPP-----PWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADA 673
                         330       340
                  ....*....|....*....|....*
gi 568995602  722 PAAsPTELVPTPvfePVTPLkeDPV 746
Cdd:PRK07003  674 PAP-PVDTRPLP---PAIPL--DAI 692
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
465-593 1.89e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.93  E-value: 1.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  465 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 543
Cdd:PRK07994  370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995602  544 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 593
Cdd:PRK07994  450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1017-1268 1.93e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.14  E-value: 1.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1017 PQKTIAPRQTTSMPPKLKTPhsRMPAKEPVPKEPLhTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVT 1096
Cdd:PTZ00449  569 PSKIPTLSKKPEFPKDPKHP--KDPEEPKKPKRPR-SAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSP 645
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1097 FRVEPPKT--TIAPLETRGIPLIPVISPRPSQEELQTAMEETDQST----QELFTTKIPRTTELAKTTQAPHRLHTAPVR 1170
Cdd:PTZ00449  646 ERPEGPKIikSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTtvvlDESFESILKETLPETPGTPFTTPRPLPPKL 725
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1171 PRIPGRPHGRPAlnktttRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHRhsstR 1250
Cdd:PTZ00449  726 PRDEEFPFEPIG------DPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMK----R 795
                         250
                  ....*....|....*....
gi 568995602 1251 PVSP-ERRPLPPNNVTGKP 1268
Cdd:PTZ00449  796 PDSPsEHEDKPPGDHPSLP 814
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
455-603 2.30e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.49  E-value: 2.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  455 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 534
Cdd:PRK14950  351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995602  535 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 603
Cdd:PRK14950  427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
dnaA PRK14086
chromosomal replication initiator protein DnaA;
490-693 2.64e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.51  E-value: 2.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  490 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKP----GHHRL--RRPKTTRSPEVPKSKPALEPATVTPE--ILVPKIVPKP 561
Cdd:PRK14086   87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRAddRPPGLPRQDQLPTARPAYPAYQQRPEpgAWPRAADDYG 166
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  562 PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHS 641
Cdd:PRK14086  167 WQQQRLGFPPRAPYASPASYAPEQERDREPYD-----AGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRG 241
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 568995602  642 KPADLGPItSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFII 693
Cdd:PRK14086  242 GPGPPERD-DAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKYTFDTFVI 292
PHA03369 PHA03369
capsid maturational protease; Provisional
466-754 2.86e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 42.29  E-value: 2.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  466 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 545
Cdd:PHA03369  362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPT 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  546 ATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaivtTTDIEPVITRTKASVTTLAPKPPRPRTHR 625
Cdd:PHA03369  442 NPYVMPISMANMVYPGHPQEHGHERKRKRGGELKEELIETLKLVK------KLKEEQESLAKELEATAHKSEIKKIAESE 515
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  626 QRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT------ 699
Cdd:PHA03369  516 FKNAGAKTAAANIEPNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTaealag 595
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 568995602  700 ---TLVPKLPQQPDYPHpkpktTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPITD 754
Cdd:PHA03369  596 aieTLLTQASAQPAGLS-----LPAPAVPVNASTPASTPPPLAPQEPPQPGTSAPSLE 648
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
466-741 2.87e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 2.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   466 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 545
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   546 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPvtfgseapalaivtttdiepvitrtkasvttlAPKPPRPRT 623
Cdd:pfam03154  252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQ--------------------------------HPVPPQPFP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602   624 hrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPkPQTTPHPEVPhtilVPATSLEPfiiteaPGTTLVP 703
Cdd:pfam03154  300 ----LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-PREQPLPPAP----LSMPHIKP------PPTTPIP 364
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 568995602   704 KLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 741
Cdd:pfam03154  365 QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSL 402
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
444-575 3.80e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 3.80e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  444 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 519
Cdd:PRK07764  371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995602  520 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 575
Cdd:PRK07764  449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
403-745 4.14e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.96  E-value: 4.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  403 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 469
Cdd:COG5665   208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  470 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 539
Cdd:COG5665   288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  540 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 618
Cdd:COG5665   364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  619 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 690
Cdd:COG5665   416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  691 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 745
Cdd:COG5665   496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
490-587 4.30e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.85  E-value: 4.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  490 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 569
Cdd:PRK14954  385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
                          90
                  ....*....|....*...
gi 568995602  570 RPEVPQVKPAhepVTFGS 587
Cdd:PRK14954  451 PRNVASGKPG---VDLGS 465
PHA03247 PHA03247
large tegument protein UL36; Provisional
506-777 5.22e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 5.22e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  506 PSAQTTKAPRKTKKPGHHRlrrpKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPqvkPAHEPVTF 585
Cdd:PHA03247  255 PAPPPVVGEGADRAPETAR----GATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA---PAGDAEEE 327
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  586 GSEAPALAIVTttdiePVitrtkasvttlapkpPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRp 665
Cdd:PHA03247  328 DDEDGAMEVVS-----PL---------------PRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRR- 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  666 rpkpqTTPHPEVPHTiLVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDP 745
Cdd:PHA03247  387 -----SARHAATPFA-RGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATE 460
                         250       260       270
                  ....*....|....*....|....*....|...
gi 568995602  746 VTTivPITDLERVTDLETPVAFRT-EAPGTTLA 777
Cdd:PHA03247  461 PAP--DDPDDATRKALDALRERRPpEPPGADLA 491
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
466-648 5.69e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.40  E-value: 5.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  466 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 545
Cdd:PRK12323  383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602  546 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 615
Cdd:PRK12323  463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
                         170       180       190
                  ....*....|....*....|....*....|...
gi 568995602  616 PKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGP 648
Cdd:PRK12323  543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
COG3979 COG3979
Chitodextrinase [Carbohydrate transport and metabolism];
1374-1473 7.81e-03

Chitodextrinase [Carbohydrate transport and metabolism];


Pssm-ID: 443178 [Multi-domain]  Cd Length: 369  Bit Score: 40.53  E-value: 7.81e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995602 1374 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1452
Cdd:COG3979     2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
                          90       100
                  ....*....|....*....|.
gi 568995602 1453 nplgeGPASNTVAFSTESADP 1473
Cdd:COG3979    72 -----DAAGNVSAASGTSTAM 87
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH