Conserved domains on  [gi|568995596|ref|XP_006522318|]
target of Nesh-SH3 isoform X6 [Mus musculus]

Protein Classification

fibronectin type III domain-containing protein (domain architecture ID 10440918)

fibronectin type III (FN3) domain-containing protein similar to human Target of Nesh-SH3 (Tarsh) and Drosophila melanogaster cytokine receptor (protein domeless)

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
507-892 8.14e-15

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 8.14e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  507 PSTPKRQSTPKPPRVKPAPEPETRPS--AQTTKAPRKTKKPGHHRLRRPKTTR-SPEVPKSKPALEPATVTPEILVPKIV 583
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSepAVTSRARRPDAPPQSARPRAPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  584 PKPPQ----KPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRTKA------SVTTLAPKPPRPRTHRQRT 653
Cdd:PHA03247 2633 PAANEpdphPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAP 2712
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  654 KYKTTQSPKIP-------HSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPfIITEAPGTTL 726
Cdd:PHA03247 2713 HALVSATPLPPgpaaarqASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASL 2791
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  727 VPKLPQQPD----YPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPitdLERVTDLETPVA----FRTEAPG 798
Cdd:PHA03247 2792 SESRESLPSpwdpADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP---PPPSLPLGGSVApggdVRRRPPS 2868
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  799 TTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVP 878
Cdd:PHA03247 2869 RSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD 2948
                         410
                  ....*....|....
gi 568995596  879 SPEVTESKPAPKTP 892
Cdd:PHA03247 2949 PAGAGEPSGAVPQP 2962
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1382-1473 2.57e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 58.66  E-value: 2.57e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1382 NPPTNLTVVTVEgcPSFVILDWEKPLNDT--VTEYEVISRENGSFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPKNP 1459
Cdd:cd00063     2 SPPTNLRVTDVT--STSVTLSWTPPEDDGgpITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNG 79
                          90
                  ....*....|....
gi 568995596 1460 LGEGPASNTVAFST 1473
Cdd:cd00063    80 GGESPPSESVTVTT 93
PHA03247 super family cl33720
large tegument protein UL36; Provisional
937-1328 3.52e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  937 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 1016
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1017 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEPVPKEPLHTTSK-------PKMPPSPEVADTTSVPKDE 1080
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSATPLPPGPAAArqaspalPAAPAPPAVPAGPATPGGP 2754
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1081 RLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPletrGIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELA 1160
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA----VASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP 2830
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1161 KTTQAPhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSH 1231
Cdd:PHA03247 2831 PTSAQP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPE 2906
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1232 ATRKPGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPT 1311
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
                         410
                  ....*....|....*..
gi 568995596 1312 APASEEEFGTTTDFSSS 1328
Cdd:PHA03247 2986 REAPASSTPPLTGHSLS 3002
fn3 pfam00041
Fibronectin type III domain;
116-195 1.82e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 568995596   193 GVK 195
Cdd:pfam00041   72 RVQ 74
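Each hit in this listing carries an interval line (for example 507-892 8.14e-15) and a summary line giving the Pssm-ID, Cd Length, Bit Score and E-value. To filter or sort hits, these fields can be extracted from the text with regular expressions. The following is a minimal sketch in Python that assumes the line layout shown in this report; the names parse_summary and parse_interval are illustrative and do not belong to any NCBI tool.

```python
import re

# A number such as "80.75" or "8.14e-15".
_NUM = r"\d+(?:\.\d+)?(?:[eE][+-]?\d+)?"

# Matches lines like:
# "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 8.14e-15"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>" + _NUM + r")\s*"
    r"E-value:\s*(?P<e_value>" + _NUM + r")"
)

# Matches lines like "507-892 8.14e-15" (query interval, then E-value).
INTERVAL_RE = re.compile(
    r"^\s*(?P<start>\d+)-(?P<end>\d+)\s+(?P<e_value>" + _NUM + r")\s*$"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),  # e.g. "Multi-domain"; None if absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def parse_interval(line):
    """Return (start, end, e_value) for an interval line, or None if no match."""
    m = INTERVAL_RE.match(line)
    if m is None:
        return None
    return int(m.group("start")), int(m.group("end")), float(m.group("e_value"))
```

Lines that are not in either layout (descriptions, alignment rows, rulers) return None, so both functions can be applied to every line of the report and the non-matches discarded.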
 
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1383-1463 8.16e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 8.16e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   1383 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1460
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 568995596   1461 GEG 1463
Cdd:smart00060   81 GEG 83
fn3 pfam00041
Fibronectin type III domain;
1383-1466 1.13e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.18  E-value: 1.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  1383 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1459
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 568995596  1460 LGEGPAS 1466
Cdd:pfam00041   79 GGEGPPS 85
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
495-758 4.97e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 4.97e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   495 PTTAAPQQTTSIPSTPKRQS---TPKPPRVKPAPE-----PETRPSAQT----TKAPRKTKKPGHhrLRRPKTTRSPEVP 562
Cdd:pfam03154  313 PSPAAPGQSQQRIHTPPSQSqlqSQQPPREQPLPPaplsmPHIKPPPTTpipqLPNPQSHKHPPH--LSGPSPFQMNSNL 390
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   563 KSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPahepvtfgseapalaivtttdiePVITRTKASVTTLAPK 642
Cdd:pfam03154  391 PPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP-----------------------PVLTQSQSLPPPAASH 447
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   643 PPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAP 722
Cdd:pfam03154  448 PPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA 523
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 568995596   723 gttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154  524 -----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1367-1478 1.04e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 1.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1367 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1446
Cdd:COG3401   220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                          90       100       110
                  ....*....|....*....|....*....|...
gi 568995596 1447 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1478
Cdd:COG3401   295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.44e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.45  E-value: 2.44e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596    114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 568995596    190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 2.57e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.71  E-value: 2.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063     1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                  ....
gi 568995596  192 FGVK 195
Cdd:cd00063    72 FRVR 75
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-703 6.08e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 6.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839  286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839  365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995596  648 THRQRTKYKTTQSPKIPHSKPAdlgpITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:NF033839  431 VKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKPEVKPQPEKPKPEVKPQPEKP 482
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-770 4.76e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.57  E-value: 4.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665   208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665   288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665   364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665   416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665   496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
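The pairwise alignments in this listing place the query row (gi 568995596) above the domain-model row (Cdd:...), with start and end coordinates on either side of the sequence. One rough way to compare hits is to count identical residues over the aligned columns. The sketch below is in Python and rests on two assumptions: lowercase letters mark residues that are not part of the aligned columns, and '-' marks a gap. The helper names split_row and count_identities are hypothetical.

```python
def split_row(line):
    """Split one alignment row into (start, sequence, end).

    Works for rows such as
    "gi 568995596 1460 LGEGPASNTVAFST 1473" and
    "Cdd:cd00063    80 GGESPPSESVTVTT 93":
    the last three whitespace-separated tokens are start, sequence, end.
    """
    tokens = line.split()
    start, seq, end = tokens[-3:]
    return int(start), seq, int(end)


def count_identities(query_seq, subject_seq):
    """Return (identical, aligned) over columns where both rows are uppercase.

    Columns holding a gap ('-') or a lowercase (assumed unaligned) residue
    in either row are skipped.
    """
    if len(query_seq) != len(subject_seq):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = 0
    for q, s in zip(query_seq, subject_seq):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned
```

An alignment that spans several row pairs can be handled by concatenating the sequence strings of the query rows and of the model rows before calling count_identities. Identity computed this way is only a crude summary and is not the statistic behind the reported bit scores and E-values.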
 
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
509-776 2.22e-12

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 72.41  E-value: 2.22e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  509 TPKRQSTPKPPRV-----KPAPEPETRPSA--QTTKAPRKTKKPGHHRlrRPKTTRSPEVPKS--KPALEPATVTPEILV 579
Cdd:PTZ00449  542 EPKEGGKPGETKEgevgkKPGPAKEHKPSKipTLSKKPEFPKDPKHPK--DPEEPKKPKRPRSaqRPTRPKSPKLPELLD 619
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  580 PKIVPKPPQKPKATRRPevpqvkpahepvtfgseapalaivtttdiepvitrtkasvttlaPKPPRPRTHRQRTKYKTTQ 659
Cdd:PTZ00449  620 IPKSPKRPESPKSPKRP--------------------------------------------PPPQRPSSPERPEGPKIIK 655
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  660 SPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPgTTLVPKLPQQPDYPHP 739
Cdd:PTZ00449  656 SPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-RPLPPKLPRDEEFPFE 734
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 568995596  740 KPKTTRSPAASPTELVPTPVfEPVTPLKEDPVTTIVP 776
Cdd:PTZ00449  735 PIGDPDAEQPDDIEFFTPPE-EERTFFHETPADTPLP 770
PHA03247 PHA03247
large tegument protein UL36; Provisional
448-743 3.07e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 65.73  E-value: 3.07e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  448 TATRDPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEP 527
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  528 ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKP-ALEPATVTPEILVP---KIVPKPPQKPKATRRPEVPQVKP 603
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPaAALPPAASPAGPLPpptSAQPTAPPPPPGPPPPSLPLGGS 2855
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  604 AHEPVTFGSEAPALAIVTTtdiepVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAST 683
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAK-----PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995596  684 TKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKL---PQQPDYPHPKPKT 743
Cdd:PHA03247 2931 PPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFrvpQPAPSREAPASST 2993
PHA03247 PHA03247
large tegument protein UL36; Provisional
545-1187 8.85e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 8.85e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  545 PGHHRLRRPKTTRSPEVPKSKPalEPATVTPEilvPKIVPKPPQKPKATRRPEVPQVKPAHEPV-------------TFG 611
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAAP--DPGGGGPP---DPDAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelasdDAG 2549
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  612 SEAPALAivttTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPitsepplasttkkvrrPR 691
Cdd:PHA03247 2550 DPPPPLP----PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD----------------PR 2609
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  692 PKPQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPV 771
Cdd:PHA03247 2610 GPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR 2689
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  772 TTIVPITDLERVTDLE-TPVAFRTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTL 850
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPpTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAP 2769
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  851 RPEVQVTTLAPqktqkkhRPSPKPKPVPSPEVTESKPAPKTPkrtrrprpkpqTTPTPETPLTKPVAATDLEPSALSTEV 930
Cdd:PHA03247 2770 APPAAPAAGPP-------RRLTRPAVASLSESRESLPSPWDP-----------ADPPAAVLAPAAALPPAASPAGPLPPP 2831
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  931 PATVVLATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTgvsesksAPTElqslvlkpvtsPSLEIIQSQSVSD 1010
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPA-------APAR-----------PPVRRLARPAVSR 2893
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1011 DLELVAFSTESPQKtiaPRQTTSMPPKLKTPhsrmpaKEPVPKEPLHTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEV 1090
Cdd:PHA03247 2894 STESFALPPDQPER---PPQPQAPPPPQPQP------QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWL 2964
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1091 THSETVLPPVT-FRVEPPKTTIAPLETRGIPLIPVISPRPSQEELQTAM-EETDQSTQELFTT-KIPRTTELAKTTQA-- 1165
Cdd:PHA03247 2965 GALVPGRVAVPrFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALhEETDPPPVSLKQTlWPPDDTEDSDADSLfd 3044
                         650       660
                  ....*....|....*....|....
gi 568995596 1166 --PHRLHTAPVRPrIPGRPHGRPA 1187
Cdd:PHA03247 3045 sdSERSDLEALDP-LPPEPHDPFA 3067
PHA03247 PHA03247
large tegument protein UL36; Provisional
524-1032 1.08e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.34  E-value: 1.08e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  524 APEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKskPALEPATVTPEILVPKIV-------------------P 584
Cdd:PHA03247 2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLA--PAILPDEPVGEPVHPRMLtwirgleelasddagdpppP 2554
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  585 KPPQKPKATRRPEVPQVKPAHEPvtfgSEAPALAIVTTTDIEPVITRTKASV-------TTLAPKPPRPRTHR------- 650
Cdd:PHA03247 2555 LPPAAPPAAPDRSVPPPRPAPRP----SEPAVTSRARRPDAPPQSARPRAPVddrgdprGPAPPSPLPPDTHApdpppps 2630
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  651 QRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSlepfiiteAPGTTLVpKL 730
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV--------GSLTSLA-DP 2701
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  731 PQQPDYPHPKPKTTRSpaASPTELVPTPVFEPVTPLKEDPVTTIVPITDLERVTDLETPVAFRTEAPGTTLASKISQRTH 810
Cdd:PHA03247 2702 PPPPPTPEPAPHALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  811 RPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVVLEPVTLRPEVQVTTLAPQKTQKKHRPSPKPKPVPSPEVTESKPAPK 890
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  891 TPKRTRRPRPKPQTTPTPETPLTKPVAAT-----DLEPSALSTEVPATVVLATALTPVTLRTKAPKTTTLAPNVQRTRRP 965
Cdd:PHA03247 2860 GDVRRRPPSRSPAAKPAAPARPPVRRLARpavsrSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568995596  966 HPRPKTTAST-GVSESKSAPTELQSLVLKPVTSPSLEIIQSQSvSDDLELVAFSTESPQKTIAPRQTT 1032
Cdd:PHA03247 2940 QPPLAPTTDPaGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP-APSREAPASSTPPLTGHSLSRVSS 3006
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1383-1463 8.16e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 8.16e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   1383 PPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV-ISRENGSFSGKNKSIQITNQTFS-TVENLKPDTSYEFQVKPKNPL 1460
Cdd:smart00060    3 PPSNLRVTDVT--STSVTLSWEPPPDDGITGYIVgYRVEYREEGSEWKEVNVTPSSTSyTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 568995596   1461 GEG 1463
Cdd:smart00060   81 GEG 83
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
494-844 8.68e-08

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 57.39  E-value: 8.68e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  494 RPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEpetrpSAQTTKAPRKTKKPGhhRLRRPKTTRSPEVPKSKPALEPATV 573
Cdd:PTZ00449  560 KPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPE-----EPKKPKRPRSAQRPT--RPKSPKLPELLDIPKSPKRPESPKS 632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  574 tpeilvPKiVPKPPQKPKATRRPEVPQV----KPAHEP-VTFGseaPALAIVTTTDIEPVITRTKASVTTLAPKpprpRT 648
Cdd:PTZ00449  633 ------PK-RPPPPQRPSSPERPEGPKIikspKPPKSPkPPFD---PKFKEKFYDDYLDAAAKSKETKTTVVLD----ES 698
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  649 HRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTT--KKVRRPRPKPQTTPHPEVPHTILV---PATSLEPFIITEAPG 723
Cdd:PTZ00449  699 FESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEpiGDPDAEQPDDIEFFTPPEEERTFFhetPADTPLPDILAEEFK 778
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  724 TTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVTPLK----------------------EDPVTTIVPITDL 780
Cdd:PTZ00449  779 EEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSLPKKrhrldglalsttdlesdagriaKDASGKIVKLKRS 851
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  781 ERVTDLET--------PVAFR-------TEAPGT-TLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPETKTVPAVV 844
Cdd:PTZ00449  852 KSFDDLTTveeaeemgAEARKivvdddgTEADDEdTHPPEEKHKSEVRRRRPPKKPSKPKKPSKPKKPKKPDSAFIPSII 931
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
381-767 1.91e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 56.24  E-value: 1.91e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  381 KRFPEFPEAKTAFPLEKPRGSWASSEEPwVVPGAKTSEdSRVVQPQTATYDVISSSTTSDETEIEI---------HTATR 451
Cdd:PTZ00449  494 KKLAPIEEEDSDKHDEPPEGPEASGLPP-KAPGDKEGE-EGEHEDSKESDEPKEGGKPGETKEGEVgkkpgpakeHKPSK 571
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  452 DPILDSVP-----PKTSRTAEQPRATLAPIEAlfesrnveiftSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRvkpAPE 526
Cdd:PTZ00449  572 IPTLSKKPefpkdPKHPKDPEEPKKPKRPRSA-----------QRPTRPKSPKLPELLDIPKSPKRPESPKSPK---RPP 637
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  527 PETRPSAQttkaprktkkpghhrlRRPKTTRSPEVPKSKPAlepatvtpeilvpkivPKPPQKPKATRRPEVPQVKPAHE 606
Cdd:PTZ00449  638 PPQRPSSP----------------ERPEGPKIIKSPKPPKS----------------PKPPFDPKFKEKFYDDYLDAAAK 685
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  607 PVTFGSEAPALAIVTTTDIEPVITRTKASVTTLAPKPP-RPRThrqrtkykttqsPKIPHSKPADlgpitsePPLASTTK 685
Cdd:PTZ00449  686 SKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPkLPRD------------EEFPFEPIGD-------PDAEQPDD 746
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  686 KVRRPRPKPQTTPHPEvphtilVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKPKttrspaaSPTELVPTPVFE-PVT 764
Cdd:PTZ00449  747 IEFFTPPEEERTFFHE------TPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPD-------SPSEHEDKPPGDhPSL 813

                  ...
gi 568995596  765 PLK 767
Cdd:PTZ00449  814 PKK 816
fn3 pfam00041
Fibronectin type III domain;
1383-1466 1.13e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.18  E-value: 1.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  1383 PPTNLTVVTVEgcPSFVILDWEKP--LNDTVTEYEVISRENGSFSGKNkSIQITNQTFS-TVENLKPDTSYEFQVKPKNP 1459
Cdd:pfam00041    2 APSNLTVTDVT--STSLTVSWTPPpdGNGPITGYEVEYRPKNSGEPWN-EITVPGTTTSvTLTGLKPGTEYEVRVQAVNG 78

                   ....*..
gi 568995596  1460 LGEGPAS 1466
Cdd:pfam00041   79 GGEGPPS 85
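
Each alignment block pairs a query row (gi ...) with a domain-model row (Cdd:...), where "-" marks a gap. A rough identity count can be taken column by column. The sketch below is a minimal Python illustration; the function name column_identity is hypothetical, and comparing letters case-insensitively (so that lowercase residues are still counted) is an assumption, not a rule stated on this page.

```python
def column_identity(query_row, model_row):
    """Return (matches, compared) over columns where both rows hold a residue.

    Columns with a gap ("-") in either row are skipped. Letters are compared
    case-insensitively, which is an assumption made for this sketch.
    """
    if len(query_row) != len(model_row):
        raise ValueError("rows must have equal length")
    matches = 0
    compared = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue  # gapped column: no residue pair to compare
        compared += 1
        if q.upper() == m.upper():
            matches += 1
    return matches, compared


# First 13 columns of the pfam00041 alignment above (query row, then model row).
matches, compared = column_identity("PPTNLTVVTVEgc", "APSNLTVTDVT--")
```

For this excerpt, 6 of the 11 compared columns are identical; the two gapped columns are left out of the denominator.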
PHA03247 PHA03247
large tegument protein UL36; Provisional
937-1328 3.52e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  937 ATALTPVTLRTKAPKTTTLAPNVQRTRRPHPRPKTTASTGvSESKSAPTELQSLVLKPVTSPSLEIIQSQSVSDDLELVA 1016
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA-NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAA 2674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1017 FSTESPQK--------TIAPRQTTSMPP-KLKTPHSRMPAKEPVPKEPLHTTSK-------PKMPPSPEVADTTSVPKDE 1080
Cdd:PHA03247 2675 QASSPPQRprrraarpTVGSLTSLADPPpPPPTPEPAPHALVSATPLPPGPAAArqaspalPAAPAPPAVPAGPATPGGP 2754
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1081 RLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPletrGIPLIPVISPRPSQEELQTAMEETDQSTQELFTTKIPRTTELA 1160
Cdd:PHA03247 2755 ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA----VASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP 2830
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1161 KTTQAPhrlhTAPVRPRIPGRPH---------GRPALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSH 1231
Cdd:PHA03247 2831 PTSAQP----TAPPPPPGPPPPSlplggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPE 2906
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1232 ATRKPGSVSGTRRPPIPHRHSSTRPvSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPT 1311
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQP-PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
                         410
                  ....*....|....*..
gi 568995596 1312 APASEEEFGTTTDFSSS 1328
Cdd:PHA03247 2986 REAPASSTPPLTGHSLS 3002
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
495-758 4.97e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 4.97e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   495 PTTAAPQQTTSIPSTPKRQS---TPKPPRVKPAPE-----PETRPSAQT----TKAPRKTKKPGHhrLRRPKTTRSPEVP 562
Cdd:pfam03154  313 PSPAAPGQSQQRIHTPPSQSqlqSQQPPREQPLPPaplsmPHIKPPPTTpipqLPNPQSHKHPPH--LSGPSPFQMNSNL 390
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   563 KSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPahepvtfgseapalaivtttdiePVITRTKASVTTLAPK 642
Cdd:pfam03154  391 PPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQP-----------------------PVLTQSQSLPPPAASH 447
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   643 PPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPeVPHTilvPATSLEPFIITEAP 722
Cdd:pfam03154  448 PPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGP-VPAA---VSCPLPPVQIKEEA 523
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 568995596   723 gttlvPKLPQQPDYPHPKPkttRSPAASPTeLVPTP 758
Cdd:pfam03154  524 -----LDEAEEPESPPPPP---RSPSPEPT-VVNTP 550
PHA03377 PHA03377
EBNA-3C; Provisional
517-703 5.96e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 51.21  E-value: 5.96e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  517 KPPRVKPAPEPETRPSAQT---TKAPRKTKKPGHHRLRRPKTTRSPEVPkskpaLEPATVTPEILVPKIVPKPPQKPKAT 593
Cdd:PHA03377  414 RKPRTLPWPTPKTHPVKRTlvkTSGRSDEAEQAQSTPERPGPSDQPSVP-----VEPAHLTPVEHTTVILHQPPQSPPTV 488
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  594 rrpevpQVKPAHEPVTFGSEApalAIVTTTDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPK---IPHSKPAD 670
Cdd:PHA03377  489 ------AIKPAPPPSRRRRGA---CVVYDDDIIEVIDVETTEEEESVTQPAKPHRKVQDGFQRSGRRQKratPPKVSPSD 559
                         170       180       190
                  ....*....|....*....|....*....|...
gi 568995596  671 LGPITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:PHA03377  560 RGPPKASPPVMAPPSTGPRVMATPSTGPRDMAP 592
PHA03247 PHA03247
large tegument protein UL36; Provisional
1165-1385 4.35e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 4.35e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1165 APHRLHTAPVRPRIPGRPHGRP---ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPgrnASVDSHATRKPGSVSG 1241
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPsepAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP---LPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1242 TRRPPIPHRHSSTRPVSPERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGT 1321
Cdd:PHA03247 2633 PAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568995596 1322 TTDFSSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEATEG-NATSPPQNPPT 1385
Cdd:PHA03247 2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGpPAPAPPAAPAA 2777
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1367-1478 1.04e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.92  E-value: 1.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1367 PTEEATEGNATSPPqNPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEvISRENGSfSGKNKSIQITNQTFSTVENLKP 1446
Cdd:COG3401   220 PSNEVSVTTPTTPP-SAPTGLTATADT--PGSVTLSWDPVTESDATGYR-VYRSNSG-DGPFTKVATVTTTSYTDTGLTN 294
                          90       100       110
                  ....*....|....*....|....*....|...
gi 568995596 1447 DTSYEFQVKPKNPLG-EGPASNTVAFSTESADP 1478
Cdd:COG3401   295 GTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP 327
PHA03247 PHA03247
large tegument protein UL36; Provisional
1034-1384 1.15e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 1.15e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1034 MPPKLK--TPHSRMPAKEPVPKEP-LHTTSKPKMPPSPEVADTTSVPKDERlslkPDPEVTHSETVLPPVTFRVEPPKTT 1110
Cdd:PHA03247 2555 LPPAAPpaAPDRSVPPPRPAPRPSePAVTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPDPPPPS 2630
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1111 IAPLETRGIPLIPVISPRPSQEELQTAMEETDQSTQelfTTKIPRTTELAKTTQAPHRLHTAP--------VRPRIPGR- 1181
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR---ARRLGRAAQASSPPQRPRRRAARPtvgsltslADPPPPPPt 2707
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1182 PHGRP---------------ALNKTTTRPDKTKPRGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPP 1246
Cdd:PHA03247 2708 PEPAPhalvsatplppgpaaARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1247 IPHRHSSTRPVSPERRPLPPNNVTgkPGRAGIVSSSRVTSPPLKATLHPIGTATARPGAEQKEPTAPASEEEFGttTDFS 1326
Cdd:PHA03247 2788 VASLSESRESLPSPWDPADPPAAV--LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG--GDVR 2863
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995596 1327 SSPTKETDPLGKPRFIGPHVRYIPKPENKPCSIT-----DSVRRFPTEEATEGNATSPPQNPP 1384
Cdd:PHA03247 2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESfalppDQPERPPQPQAPPPPQPQPQPPPP 2926
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1367-1521 1.40e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 46.53  E-value: 1.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1367 PTEEATEGNATSPPQnPPTNLTVVTVEgcPSFVILDWEKPLNDTVTEYEV--ISRENGSFSGKNKSIqitNQTFSTVENL 1444
Cdd:COG3401   314 PSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTASSDADVTGYNVyrSTSGGGTYTKIAETV---TTTSYTDTGL 387
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568995596 1445 KPDTSYEFQVKPKNPLG-EGPASNTVAFSTESADPRVSEPISAGRDAIWTERPFNSDSYSECKGKQYVKRTWYKKFVG 1521
Cdd:COG3401   388 TPGTTYYYKVTAVDAAGnESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTG 465
fn3 pfam00041
Fibronectin type III domain;
116-195 1.82e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 41.63  E-value: 1.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   116 KPLQLVVGTLTPSSVFLSWgflinphhdwTLPSHCPSD-RFYTIRYREKDKEKKWIFQLCPATET--IVENLKPNTVYEF 192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSW----------TPPPDGNGPiTGYEVEYRPKNSGEPWNEITVPGTTTsvTLTGLKPGTEYEV 71

                   ...
gi 568995596   193 GVK 195
Cdd:pfam00041   72 RVQ 74
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
114-195 2.44e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein fibronectin. The tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.45  E-value: 2.44e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596    114 PRKPLQLVVGTLTPSSVFLSWgflinphhdwtLPSHCPSDRFYTIRYREKDKEKKWIFQLCPA----TETIVENLKPNTV 189
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSW-----------EPPPDDGITGYIVGYRVEYREEGSEWKEVNVtpssTSYTLTGLKPGTE 69

                    ....*.
gi 568995596    190 YEFGVK 195
Cdd:smart00060   70 YEFRVR 75
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
114-195 2.57e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane-spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 41.71  E-value: 2.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  114 PRKPLQLVVGTLTPSSVFLSWgfliNPHHDWTLPSHcpsdrFYTIRYREKDKE--KKWIFQLCPATETIVENLKPNTVYE 191
Cdd:cd00063     1 PSPPTNLRVTDVTSTSVTLSW----TPPEDDGGPIT-----GYVVEYREKGSGdwKEVEVTPGSETSYTLTGLKPGTEYE 71

                  ....
gi 568995596  192 FGVK 195
Cdd:cd00063    72 FRVR 75
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
515-626 5.02e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.80  E-value: 5.02e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKttrsPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATR 594
Cdd:PRK14950  361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK----EPVRETAT---PPPVPPRPVAPPVPHTPESAPKLTR 433
                          90       100       110
                  ....*....|....*....|....*....|..
gi 568995596  595 RPEVPQVKPAHEPVTFGSEAPALAIVTTTDIE 626
Cdd:PRK14950  434 AAIPVDEKPKYTPPAPPKEEEKALIADGDVLE 465
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1090-1372 5.30e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 44.68  E-value: 5.30e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1090 VTHSETVLPPVTfrVEPPKTTIAPLETRGIPLIPVISPRPSQEElqTAMEETDQSTQELFTTKIPRTTELAKTTQ--APH 1167
Cdd:PTZ00449  489 IKKSKKKLAPIE--EEDSDKHDEPPEGPEASGLPPKAPGDKEGE--EGEHEDSKESDEPKEGGKPGETKEGEVGKkpGPA 564
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1168 RLHTAPVRPRIPGRPHGrPALNKTTTRPDKTKprgtSHKNGVgtgTKQAP-KPPSPGRNASVD-SHATRKPGSVSGTRRP 1245
Cdd:PTZ00449  565 KEHKPSKIPTLSKKPEF-PKDPKHPKDPEEPK----KPKRPR---SAQRPtRPKSPKLPELLDiPKSPKRPESPKSPKRP 636
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1246 PIPhrhssTRPVSPERRPLPPNNVTGKPGRagivSSSRVTSPPLKATLHpigTATARPGAEQKEPTAPASEEEFGTTTDF 1325
Cdd:PTZ00449  637 PPP-----QRPSSPERPEGPKIIKSPKPPK----SPKPPFDPKFKEKFY---DDYLDAAAKSKETKTTVVLDESFESILK 704
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 568995596 1326 SSSPTKETDPLGKPRFIGPHVRYIPKPENKPCSITDSVRRFPTEEAT 1372
Cdd:PTZ00449  705 ETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFT 751
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
491-703 6.08e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 6.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  491 PEVRPTTAAPQ---QTTSIPSTPKRQSTPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:NF033839  286 EPGNKKPSAPKpgmQPSPQPEKKEVKPEPETPKPEVKPQLEK-PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPE 364
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  568 LEPATVTPEilvpKIVPKPPQKPKATRRPEVPQVKPAHEPvtfGSEAPalaivtTTDIEPVITRTKASVTTlAPKPPRPR 647
Cdd:NF033839  365 VKPQPEKPK----PEVKPQPETPKPEVKPQPEKPKPEVKP---QPEKP------KPEVKPQPEKPKPEVKP-QPEKPKPE 430
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995596  648 THRQRTKYKTTQSPKIPHSKPAdlgpITSEPPLASTTKKVRRPRPKPQTTPHPEVP 703
Cdd:NF033839  431 VKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKPEVKPQPEKPKPEVKPQPEKP 482
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1030-1331 9.19e-04


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.91  E-value: 9.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1030 QTTSMPPKlKTPHSRMPAKEpvpKEPLHTTSKPKMPPSPEVADTTSVPKderlslKPDPEVTHSETVLPPVTFRVEPPKT 1109
Cdd:PTZ00449  515 EASGLPPK-APGDKEGEEGE---HEDSKESDEPKEGGKPGETKEGEVGK------KPGPAKEHKPSKIPTLSKKPEFPKD 584
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1110 TIAPLETRgiplipviSPRPSQEELQTAMEETDQSTQELFTTKIPRTTELAKTTQAPHRlHTAPVRPRIPGRPHGrPALN 1189
Cdd:PTZ00449  585 PKHPKDPE--------EPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKR-PPPPQRPSSPERPEG-PKII 654
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1190 KTTTRPDKTKP----------RGTSHKNGVGTGTKQAPKPPSPGRNASVDSHATRKPGSVSGTRRPPIPHR-HSSTRPVS 1258
Cdd:PTZ00449  655 KSPKPPKSPKPpfdpkfkekfYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLpRDEEFPFE 734
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1259 PERRPLPPNNVTGKPGRAGIVSSSRVTSPPLKATLHPIGT----------ATARPGAEQKEPTAPaSEEEFGTTTDFSSS 1328
Cdd:PTZ00449  735 PIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAeefkeedihaETGEPDEAMKRPDSP-SEHEDKPPGDHPSL 813

                  ...
gi 568995596 1329 PTK 1331
Cdd:PTZ00449  814 PKK 816
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
553-775 9.57e-04


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 43.76  E-value: 9.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  553 PKTTRSPEVPKSKPalePATVTPEILVPKIVPKPPQKPKATRRPEVP-----QVKPAHEPVTFGSEAPALAIVTTTDIEP 627
Cdd:PLN03209  330 PKESDAADGPKPVP---TKPVTPEAPSPPIEEEPPQPKAVVPRPLSPytayeDLKPPTSPIPTPPSSSPASSKSVDAVAK 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  628 VITRTKASVTTLAPKPPRPRTHRQRTKyktTQSPKIPHSKPADLGPITSepplasttkkvrrPRPKPQTTPHPEVPHTIL 707
Cdd:PLN03209  407 PAEPDVVPSPGSASNVPEVEPAQVEAK---KTRPLSPYARYEDLKPPTS-------------PSPTAPTGVSPSVSSTSS 470
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568995596  708 VPATSLEP----FIITEAPGTTLVPKLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPLKEDPVTTIV 775
Cdd:PLN03209  471 VPAVPDTApataATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
PHA03378 PHA03378
EBNA-3B; Provisional
491-705 1.11e-03


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 1.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  491 PEVRPTTAapQQTTSIPSTPKRqSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGH----HRLRRPKTTRSPEV---PK 563
Cdd:PHA03378  576 PLTSPTTS--QLASSAPSYAQT-PWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRpipmRPLRMQPITFNVLVfptPH 652
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  564 SKPALEPATVTPEILVPKIVP-----------KPPQKPKATRRPEVPQVKPAHEPVTFGSEAPALAIVTTTDIEPVITRT 632
Cdd:PHA03378  653 QPPQVEITPYKPTWTQIGHIPyqpsptgantmLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPG 732
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995596  633 KASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHT 705
Cdd:PHA03378  733 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPT 805
PRK10263 PRK10263
DNA translocase FtsK; Provisional
420-747 1.59e-03


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.54  E-value: 1.59e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  420 SRVVQPQTATYDVI---------SSSTTSDETEIEIHTATRDPILDSVPPKTSRTA-EQPRATLAPIEALFESRNVeIFT 489
Cdd:PRK10263  297 NRATQPEYDEYDPLlngapitepVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPpAQPTVAWQPVPGPQTGEPV-IAP 375
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  490 SPEVRPTTAAPQQTTSIPSTPKRQSTP--KPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPA 567
Cdd:PRK10263  376 APEGYPQQSQYAQPAVQYNEPLQQPVQpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQST 455
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  568 LEP-ATVTPEILVPKIVPKPP---------QKPKATRRPEVPQVKPAHEPVTFGSEapalaivtttdIEPVITRTKASVT 637
Cdd:PRK10263  456 FAPqSTYQTEQTYQQPAAQEPlyqqpqpveQQPVVEPEPVVEETKPARPPLYYFEE-----------VEEKRAREREQLA 524
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  638 TLAPKPPRPrthrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRrprpkpQTTPHPEVPHTILVPATSLepfi 717
Cdd:PRK10263  525 AWYQPIPEP------VKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVK------KATLATGAAATVAAPVFSL---- 588
                         330       340       350
                  ....*....|....*....|....*....|
gi 568995596  718 iteAPGTTLVPKLPQQPDYPHPKPKTTRSP 747
Cdd:PRK10263  589 ---ANSGGPRPQVKEGIGPQLPRPKRIRVP 615
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
458-771 1.66e-03


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 43.30  E-value: 1.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  458 VPPKTSRTAEQPRATLAP---IEALFESRNVEIFTSPEVRPTTAAPQQTTsIPSTPKRQSTPKP-------PRVKPAPEP 527
Cdd:PRK07003  372 VPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAAT-RAEAPPAAPAPPAtadrgddAADGDAPVP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  528 ---ETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQ--------KPKATRRP 596
Cdd:PRK07003  451 akaNARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAaasredapAAAAPPAP 530
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  597 EVPQVKPA--HEPVTFGSEAPALAIVTTTDIEPVITRTK--------ASVTTLAPKPPRPRTHRQrtkyktTQSPKIPHS 666
Cdd:PRK07003  531 EARPPTPAaaAPAARAGGAAAALDVLRNAGMRVSSDRGAraaaaakpAAAPAAAPKPAAPRVAVQ------VPTPRARAA 604
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  667 KPADLGPITSEPPLASTTkkvRRPRPkpqttPHPEVPHTILVPATSLEPFIiteAPGTTLVPKLPQQPDYPHPKPKTTRS 746
Cdd:PRK07003  605 TGDAPPNGAARAEQAAES---RGAPP-----PWEDIPPDDYVPLSADEGFG---GPDDGFVPVFDSGPDDVRVAPKPADA 673
                         330       340
                  ....*....|....*....|....*
gi 568995596  747 PAAsPTELVPTPvfePVTPLkeDPV 771
Cdd:PRK07003  674 PAP-PVDTRPLP---PAIPL--DAI 692
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
490-618 1.95e-03


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.93  E-value: 1.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  490 SPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEV-PKSKPAL 568
Cdd:PRK07994  370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSePAAASRA 449
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995596  569 EPATVTPEIL-----VPKIVPKPPQKPKATR-RPEVPQVKPAHEPVTFGSEAPALA 618
Cdd:PRK07994  450 RPVNSALERLasvrpAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALE 505
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
480-628 2.21e-03


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.49  E-value: 2.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  480 FESRNVEIFTSPEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPghhrLRRPKTTRSP 559
Cdd:PRK14950  351 LELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP----VAPPVPHTPE 426
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568995596  560 EVPKSKPALEPATVTPEILVPkivPKPPQKPKATRRPEV--PQVKPAHEPVT--FGSEAPALAIVTTTDIEPV 628
Cdd:PRK14950  427 SAPKLTRAAIPVDEKPKYTPP---APPKEEEKALIADGDvlEQLEAIWKQILrdVPPRSPAVQALLSSGVRPV 496
dnaA PRK14086
chromosomal replication initiator protein DnaA;
515-718 2.56e-03


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.51  E-value: 2.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  515 TPKPPRVKPAPEPETRPSAQTTKAPRKTKKP----GHHRL--RRPKTTRSPEVPKSKPALEPATVTPE--ILVPKIVPKP 586
Cdd:PRK14086   87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRAddRPPGLPRQDQLPTARPAYPAYQQRPEpgAWPRAADDYG 166
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  587 PQKPKATRRPEVPQVKPAHEPVTFGSEAPALAivtttDIEPVITRTKASVTTLAPKPPRPRTHRQRTKYKTTQSPKIPHS 666
Cdd:PRK14086  167 WQQQRLGFPPRAPYASPASYAPEQERDREPYD-----AGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRG 241
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 568995596  667 KPADLGPItSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFII 718
Cdd:PRK14086  242 GPGPPERD-DAPVVPIRPSAPGPLAAQPAPAPGPGEPTARLNPKYTFDTFVI 292
PHA03369 PHA03369
capsid maturational protease; Provisional
491-779 2.85e-03


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 42.29  E-value: 2.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PHA03369  362 AAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPT 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  571 ATVTPEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPVTFGSEAPalaivtTTDIEPVITRTKASVTTLAPKPPRPRTHR 650
Cdd:PHA03369  442 NPYVMPISMANMVYPGHPQEHGHERKRKRGGELKEELIETLKLVK------KLKEEQESLAKELEATAHKSEIKKIAESE 515
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  651 QRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSLEPFIITEAPGT------ 724
Cdd:PHA03369  516 FKNAGAKTAAANIEPNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTaealag 595
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 568995596  725 ---TLVPKLPQQPDYPHpkpktTRSPAASPTELVPTPVFEPVTPLKEDPVTTIVPITD 779
Cdd:PHA03369  596 aieTLLTQASAQPAGLS-----LPAPAVPVNASTPASTPPPLAPQEPPQPGTSAPSLE 648
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
491-766 2.95e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 2.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQP 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   571 ATVT--PEILVPKIVPKPPQKPKATRRPEVPQVKPAHEPvtfgseapalaivtttdiepvitrtkasvttlAPKPPRPRT 648
Cdd:pfam03154  252 MTQPppPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQ--------------------------------HPVPPQPFP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596   649 hrqrTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRPRPkPQTTPHPEVPhtilVPATSLEPfiiteaPGTTLVP 728
Cdd:pfam03154  300 ----LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP-PREQPLPPAP----LSMPHIKP------PPTTPIP 364
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 568995596   729 KLPQQPDYPHPKPKTTRSPAASPTELVPTPVFEPVTPL 766
Cdd:pfam03154  365 QLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSL 402
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
469-600 3.69e-03


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 3.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  469 PRATLAPIEALfESRNVEIFTSPEVRPT----TAAPQQTTSIPSTPKRQSTPKPPRVkPAPEPETRPSAQTTKAPRKTKK 544
Cdd:PRK07764  371 ERGLLARLERL-ERRLGVAGGAGAPAAAapsaAAAAPAAAPAPAAAAPAAAAAPAPA-AAPQPAPAPAPAPAPPSPAGNA 448
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995596  545 PGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPQ 600
Cdd:PRK07764  449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA 504
PHA03378 PHA03378
EBNA-3B; Provisional
467-967 4.24e-03


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.98  E-value: 4.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  467 EQPRATLAPIEALFESRNVEIFTSPEVRPTTAAPQQTTSIPstpkrQSTPKPPRVKPAPEP-ETRPSAQTTKAPR----- 540
Cdd:PHA03378  345 EAVRLPDDPIIVEDDDESEEIESECDPDEDKSGAEALASIP-----QTLPDPPTVYGRPKVfARKADLKSTKKCRaivtd 419
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  541 ---------KTKKPGHHRLRRPKTTRSPEVPKSKPALEPATVTPEILVPKIVPKP--PQKPKATrrpevPQVKPA--HEP 607
Cdd:PHA03378  420 psvikaieeEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQPLEGPTGPLSVQAPlePWQPLPH-----PQVTPVilHQP 494
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  608 VTFGSEAP-ALAIVTTTDIEPVITRTKAsvTTLAPKPPRPRTHRQ-----------RTKYKTTQSPKIPHSKPAD-LGPI 674
Cdd:PHA03378  495 PAQGVQAHgSMLDLLEKDDEDMEQRVMA--TLLPPSPPQPRAGRRapcvytedldiESDEPASTEPVHDQLLPAPgLGPL 572
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  675 TSEPPLASTTKKVR--------RPRPKPQTTPHPEVPHT-ILVPATS-------------LEPFIITEAPGTTLVPKLPQ 732
Cdd:PHA03378  573 QIQPLTSPTTSQLAssapsyaqTPWPVPHPSQTPEPPTTqSHIPETSaprqwpmplrpipMRPLRMQPITFNVLVFPTPH 652
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  733 QPDYPHPK------------------------------PKTTRSPAASPTELVPTPVfePVTPLKEDPVTTIVPITDLER 782
Cdd:PHA03378  653 QPPQVEITpykptwtqighipyqpsptgantmlpiqwaPGTMQPPPRAPTPMRPPAA--PPGRAQRPAAATGRARPPAAA 730
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  783 VTDLETPVAFRTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKATLSPQAPET-----KTVPAVVLEPVTLRPEVQVt 857
Cdd:PHA03378  731 PGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPApqqrpRGAPTPQPPPQAGPTSMQL- 809
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  858 tLAPQKTQKKHRPSPKPKPVPSPEVTESKPAPKTPKRTRRPRPKPQTTPTPETPLTKPVAATDLEPSALS-TEVPATVVL 936
Cdd:PHA03378  810 -MPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPVFYPPVLQpIQVMRQLGS 888
                         570       580       590
                  ....*....|....*....|....*....|.
gi 568995596  937 ATALTPVTlrtkAPKTTTLAPNVQRTRRPHP 967
Cdd:PHA03378  889 VRAAAAST----VTQAPTEYTGERRGVGPMH 915
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
515-612 4.54e-03


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.47  E-value: 4.54e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  515 TPKPPRVKPAPEPETrPSAQTTKAPRKTKKPGhhrlRRPKTTRSPEvpkSKPAlePATVTPeilVPKIVPKPPqKPKATR 594
Cdd:PRK14954  385 AGSPDVKKKAPEPDL-PQPDRHPGPAKPEAPG----ARPAELPSPA---SAPT--PEQQPP---VARSAPLPP-SPQASA 450
                          90
                  ....*....|....*...
gi 568995596  595 RPEVPQVKPAhepVTFGS 612
Cdd:PRK14954  451 PRNVASGKPG---VDLGS 465
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
428-770 4.76e-03


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.57  E-value: 4.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  428 ATYDVISSSTTSDETEIEIHTATR-------DPILDSVPPKTSRTAEQPRATLAPIEALFESRNVEIFTSPEVR------ 494
Cdd:COG5665   208 STPQAFNASATSGRSQHIVQAAKRvgvewwgDPSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTsntpts 287
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  495 --------PTTAAPQQTTSIPSTPKRQSTPKPPRV--KPAPEPETRPSAQTTKAPRKTKKPGHhrlrRPKTTRSPEVPKS 564
Cdd:COG5665   288 takaqpqpPTKKQPAKEPPSDTASGNPSAPSVLINsdSPTSEDPATASVPTTEETTAFTTPSS----VPSTPAEKDTPAT 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  565 KPALEPATVTPEILV-PKIVPKPPQKPKATrrpevpqvkpAHEPVTFGSEAPalaivtttdiepvitrtkASVTTLAPKP 643
Cdd:COG5665   364 DLATPVSPTPPETSVdKKVSPDSATSSTKS----------EKEGGTASSPMP------------------PNIAIGAKDD 415
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  644 PRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLAS--------TTKKVRRPRPKPQTTPHPEVPHTILVPATSLEP 715
Cdd:COG5665   416 VDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAgsdlepenTTLRDPAPNAIPPPEDPSTIGRLSSGDKLANET 495
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  716 FIITEAPGTTLVPKLPQQ--PDYPHPKPKTT---RSPAASPTELVPTPVFEPVTPLKEDP 770
Cdd:COG5665   496 GPPVIRRDSTPSSTADQSivGVLAFGLDQRTqaeISVEAASRSNPLLNSQVKSFPLGKRS 555
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
491-673 5.66e-03


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.40  E-value: 5.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  491 PEVRPTTAAPQQTTSIPSTPKRQSTPKPPRVKPAPEPETRPSAQTTKAPRKTKKPGHHRLRRPKTTRSPEVPKSKPALEP 570
Cdd:PRK12323  383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  571 ATVTPEIL---VPKIVPKPPQKPKATRRPEVPQVKPAHE-PVTFGSEAPA------LAIVTTTDIEPVITRTKASVTTLA 640
Cdd:PRK12323  463 RPAAAGPRpvaAAAAAAPARAAPAAAPAPADDDPPPWEElPPEFASPAPAqpdaapAGWVAESIPDPATADPDDAFETLA 542
                         170       180       190
                  ....*....|....*....|....*....|...
gi 568995596  641 PKPPRPRTHRQRTKYKTTQSPKIPHSKPADLGP 673
Cdd:PRK12323  543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
COG3979 COG3979
Chitodextrinase [Carbohydrate transport and metabolism];
1379-1478 5.94e-03


Pssm-ID: 443178 [Multi-domain]  Cd Length: 369  Bit Score: 40.91  E-value: 5.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1379 PPQNPpTNLTVVTVEgcPSFVILDWEK-PLNDTVTEYEVisrengsFSGKNKSIQITNQTFSTVENLKPDTSYEFQVKPK 1457
Cdd:COG3979     2 APTAP-TGLTASNVT--SSSVSLSWDAsTDNVGVTGYDV-------YRGGDQVATVTGLTAWTVTGLTPGTEYTFTVGAC 71
                          90       100
                  ....*....|....*....|.
gi 568995596 1458 nplgeGPASNTVAFSTESADP 1478
Cdd:COG3979    72 -----DAAGNVSAASGTSTAM 87
PRK10263 PRK10263
DNA translocase FtsK; Provisional
669-1186 8.29e-03


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.84  E-value: 8.29e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  669 ADLGPITSEPPLASTTKKVRRPRPKPQTTPHPEVPHTILVPATSlepfiiteapgttlvpKLPQQPDYPHPKPKtTRSPA 748
Cdd:PRK10263  335 APVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPE----------------GYPQQSQYAQPAVQ-YNEPL 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  749 ASPTELVPTPVFEPVTPLKEDPVTTIVPitdlervtdlETPVAFRTEAPGTTLASKISQRTHRPRPRPRPRPRPRPRPKA 828
Cdd:PRK10263  398 QQPVQPQQPYYAPAAEQPAQQPYYAPAP----------EQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQT 467
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  829 TLSPQAPETKTV-PAVVLEPVTLRPEVQVTTLAP--------QKTQKKHRPSPKPKPVPSPEVTESKPAPKTPKRTRRPR 899
Cdd:PRK10263  468 YQQPAAQEPLYQqPQPVEQQPVVEPEPVVEETKParpplyyfEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAP 547
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  900 PKPQTTPTPETPLTKPVAATdLEPSALSTEVPATVVlATALTPVTLRTKAPKTTT-LAPNVQRTRRPH-PRPKTTASTGV 977
Cdd:PRK10263  548 SVAAVPPVEAAAAVSPLASG-VKKATLATGAAATVA-APVFSLANSGGPRPQVKEgIGPQLPRPKRIRvPTRRELASYGI 625
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  978 S-ESKSAPTElqslvlKPVTSPSLEIIQSQSVSDDlELVAFSTESPQKTIAPRQTTSMPPKLKTPHSRMPAKEPVPKEP- 1055
Cdd:PRK10263  626 KlPSQRAAEE------KAREAQRNQYDSGDQYNDD-EIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAe 698
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596 1056 -----LHTTSKPKMPPSPEVADTTSVPKDERLSLKPDPEVTHSETVLPPVTFRVEPPKTTIAPLETRGIPLIPViSPRPS 1130
Cdd:PRK10263  699 larqfAQTQQQRYSGEQPAGANPFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPV-APQPQ 777
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568995596 1131 QEELQTAMEETDQSTQELFTTKIPRTTELAKTTQAPHRLHTAPVRPRIPGRPHGRP 1186
Cdd:PRK10263  778 YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 833
PHA03247 PHA03247
large tegument protein UL36; Provisional
531-756 9.35e-03


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.08  E-value: 9.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  531 PSAQTTKAPRKTKKPGHHRlrrpKTTRSPEVPKSKPALEPATVTPEILVPKIVPKPPQKPKATRRPEVPqvkPAHEPVTF 610
Cdd:PHA03247  255 PAPPPVVGEGADRAPETAR----GATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPA---PAGDAEEE 327
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568995596  611 GSEAPALAIVTttdiePVitrtkasvttlapkpPRPRTHRQRTKYKTTQSPKIPHSKPADLGPITSEPPLASTTKKVRRP 690
Cdd:PHA03247  328 DDEDGAMEVVS-----PL---------------PRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRS 387
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568995596  691 RPK------------PQTTPHPEVPHTILVPATSLEPFIITEAPGTTLVPKLPQQPDYPHPKP-KTTRSPAASPTELVP 756
Cdd:PHA03247  388 ARHaatpfargpggdDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPeRQPPAPATEPAPDDP 466
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
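
Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search reports an E-value threshold of 0.01. The following is a minimal Python sketch of how such a line could be parsed and checked against that threshold. The names `HIT_RE`, `parse_hit`, and `passes_threshold` are hypothetical helpers, not part of any NCBI tool, and the inclusive comparison (`<=`) is an assumption about how the cutoff is applied.

```python
import re

# Pattern for the per-hit summary line as it appears in this text dump, e.g.
# "Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.91  E-value: 9.19e-04"
# (assumption: the field order and labels are always as shown on this page).
HIT_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = HIT_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],  # e.g. "Multi-domain"; None if the bracket is absent
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold (inclusive by assumption)."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.91  E-value: 9.19e-04"
    hit = parse_hit(line)
    print(hit, passes_threshold(hit))
```

All hits listed on this page have E-values below 0.01, so each of them would pass this check.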

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D)200-3.