NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|81875363|sp|Q8BV57|]
View 

RecName: Full=Soluble scavenger receptor cysteine-rich domain-containing protein SSC5D; AltName: Full=Scavenger receptor cysteine-rich domain-containing protein LOC284297 homolog; Flags: Precursor

Protein Classification

scavenger receptor cysteine-rich domain-containing protein( domain architecture ID 10640959)

scavenger receptor cysteine-rich (SRCR) domain-containing protein os a member of the group B scavenger receptor cysteine-rich family (SRCR-SF), composed of tandem-repeats of the SRCR domain

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
464-565 2.80e-46

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


:

Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 161.36  E-value: 2.80e-46
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     464 LRLVAGPSRCSGRLEVWHDGRWGTVCDDSWDMRDSAVVCRELGCGRPRQpDPAAGRFGWGAGPIWLDDVGCMGTEASLSE 543
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVS-ASGSAYFGPGSGPIWLDNVRCSGTEASLSD 79
                            90       100
                    ....*....|....*....|..
gi 81875363     544 CPAASWGKHNCAHNEDVGVTCT 565
Cdd:smart00202   80 CPHSGWGSHNCSHGEDAGVVCS 101
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
758-858 8.12e-45

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


:

Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 157.12  E-value: 8.12e-45
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     758 VRLADGPNRCAGRLEVWHAGLWGTVCDDSWDIRDATVACWELGCGKVRPRVGKTHYGPGTGPIWLDDMGCKGSEMSLSDC 837
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|.
gi 81875363     838 PSGAWGKHNCDHEEDVVLTCT 858
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVCS 101
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
199-299 3.65e-41

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


:

Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 146.72  E-value: 3.65e-41
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     199 LRLVSGPHGCAGRLEVWHGGRWGTVCDDGWDLRDAAVACRELGCGGALAAPGGARFGPGEGPVWMDDVGCGGGEEALRDC 278
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|.
gi 81875363     279 PRSPWGRSNCDHTEDAGLVCT 299
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVCS 101
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
305-404 5.46e-39

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


:

Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 140.55  E-value: 5.46e-39
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     305 IRLADGPHGCAGRLEVWHGGRWGSVCDDAWDLRDAAVACKELGCGGALAAPGGAFFGEGTGPIILDDLRCRGNETALRFC 384
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|
gi 81875363     385 PARPWGQHDCHHREDAGAVC 404
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVC 100
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
20-119 5.90e-39

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


:

Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 140.17  E-value: 5.90e-39
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363      20 LRLADGPHGCAGRLEVWHSGRWGTVCDDGWDLRDAEVACRVLGCGGALAAPGGAFFGEGTGPVWLSELNCRGNEGQLGIC 99
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|
gi 81875363     100 PHRGWKAHICSHEEDAGVVC 119
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVC 100
PHA03247 super family cl33720
large tegument protein UL36; Provisional
929-1304 1.40e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.50  E-value: 1.40e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   929 PSGRGLFKGTPTTTKPGSTVTTSTSKSPGHPFPAPRARAGSPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWP 1008
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVA 2789
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1009 QLIPDSKQEGTSSSPKPSLLTPGLPSPATFALSTPNTSLLPTRSPELSGSPTPTSP----EGLTSASSMLSEVSRLSPTS 1084
Cdd:PHA03247 2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPpppsLPLGGSVAPGGDVRRRPPSR 2869
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1085 ELTPGPDTTPAPEI-------IPESSDSSDLPMNT-RTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPAT 1156
Cdd:PHA03247 2870 SPAAKPAAPARPPVrrlarpaVSRSTESFALPPDQpERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1157 SPQP---------------------PTNTHPSSTPATPTESLPSSRKTELSSPTKPRLNSELTFEEA------------- 1202
Cdd:PHA03247 2950 AGAGepsgavpqpwlgalvpgrvavPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEEtdpppvslkqtlw 3029
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1203 PSTDASQTQNLELFLASESGPSSPSpasnLDPLPTDAFKPPRSQTLHSASDHLTQGPTPNHnpdpFGPcvsplPPVR--- 1279
Cdd:PHA03247 3030 PPDDTEDSDADSLFDSDSERSDLEA----LDPLPPEPHDPFAHEPDPATPEAGARESPSSQ----FGP-----PPLSana 3096
                         410       420       430
                  ....*....|....*....|....*....|.
gi 81875363  1280 ------VMACEPPALVELVGAVREVGDQLQR 1304
Cdd:PHA03247 3097 alsrryVRSTGRSALAVLIEACRRIRRQLRR 3127
Metaviral_G super family cl26626
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
598-723 7.02e-04

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


The actual alignment was detected with superfamily member pfam09595:

Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 42.25  E-value: 7.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    598 TTKPSASLTSSVPQkptkvPGKAPKSTKKWVTKNArrpTTQPPGMPTTKHSRAPGTPTSLhPTARTSELPKRLTTEAPHR 677
Cdd:pfam09595   69 PLNEAAKEAPSESE-----DAPDIDPNNQHPSQDR---SEAPPLEPAAKTKPSEHEPANP-PDASNRLSPPDASTAAIRE 139
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 81875363    678 QTSHTTVRLTPRvpwewtSEPVVSQSTQGPQEVTSEATTTENPQTS 723
Cdd:pfam09595  140 ARTFRKPSTGKR------NNPSSAQSDQSPPRANHEAIGRANPFAM 179
 
Name Accession Description Interval E-value
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
464-565 2.80e-46

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 161.36  E-value: 2.80e-46
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     464 LRLVAGPSRCSGRLEVWHDGRWGTVCDDSWDMRDSAVVCRELGCGRPRQpDPAAGRFGWGAGPIWLDDVGCMGTEASLSE 543
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVS-ASGSAYFGPGSGPIWLDNVRCSGTEASLSD 79
                            90       100
                    ....*....|....*....|..
gi 81875363     544 CPAASWGKHNCAHNEDVGVTCT 565
Cdd:smart00202   80 CPHSGWGSHNCSHGEDAGVVCS 101
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
758-858 8.12e-45

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 157.12  E-value: 8.12e-45
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     758 VRLADGPNRCAGRLEVWHAGLWGTVCDDSWDIRDATVACWELGCGKVRPRVGKTHYGPGTGPIWLDDMGCKGSEMSLSDC 837
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|.
gi 81875363     838 PSGAWGKHNCDHEEDVVLTCT 858
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVCS 101
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
469-565 1.06e-41

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 147.91  E-value: 1.06e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    469 GPSRCSGRLEVWHDGRWGTVCDDSWDMRDSAVVCRELGCGRPRQPDPAAGRFGWG-AGPIWLDDVGCMGTEASLSECPAA 547
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLGCGGAVSAPSGCSYFGPGsTGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*...
gi 81875363    548 SWGKHNCAHNEDVGVTCT 565
Cdd:pfam00530   81 PWGNHNCSHSEDAGVICS 98
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
199-299 3.65e-41

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 146.72  E-value: 3.65e-41
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     199 LRLVSGPHGCAGRLEVWHGGRWGTVCDDGWDLRDAAVACRELGCGGALAAPGGARFGPGEGPVWMDDVGCGGGEEALRDC 278
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|.
gi 81875363     279 PRSPWGRSNCDHTEDAGLVCT 299
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVCS 101
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
305-404 5.46e-39

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 140.55  E-value: 5.46e-39
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     305 IRLADGPHGCAGRLEVWHGGRWGSVCDDAWDLRDAAVACKELGCGGALAAPGGAFFGEGTGPIILDDLRCRGNETALRFC 384
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|
gi 81875363     385 PARPWGQHDCHHREDAGAVC 404
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVC 100
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
20-119 5.90e-39

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 140.17  E-value: 5.90e-39
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363      20 LRLADGPHGCAGRLEVWHSGRWGTVCDDGWDLRDAEVACRVLGCGGALAAPGGAFFGEGTGPVWLSELNCRGNEGQLGIC 99
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|
gi 81875363     100 PHRGWKAHICSHEEDAGVVC 119
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVC 100
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
763-858 2.22e-36

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 132.89  E-value: 2.22e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    763 GPNRCAGRLEVWHAGLWGTVCDDSWDIRDATVACWELGCGK-VRPRVGKTHYGPG-TGPIWLDDMGCKGSEMSLSDCPSG 840
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLGCGGaVSAPSGCSYFGPGsTGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*...
gi 81875363    841 AWGKHNCDHEEDVVLTCT 858
Cdd:pfam00530   81 PWGNHNCSHSEDAGVICS 98
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
204-299 1.19e-34

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 127.88  E-value: 1.19e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    204 GPHGCAGRLEVWHGGRWGTVCDDGWDLRDAAVACREL-GCGGALAAPGGARFGPGE-GPVWMDDVGCGGGEEALRDCPRS 281
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLgCGGAVSAPSGCSYFGPGStGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*...
gi 81875363    282 PWGRSNCDHTEDAGLVCT 299
Cdd:pfam00530   81 PWGNHNCSHSEDAGVICS 98
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
25-119 3.39e-33

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 123.64  E-value: 3.39e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     25 GPHGCAGRLEVWHSGRWGTVCDDGWDLRDAEVACRVL--GCGGALAAPGGAFFGEGTGPVWLSELNCRGNEGQLGICPHR 102
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLgcGGAVSAPSGCSYFGPGSTGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*..
gi 81875363    103 GWKAHICSHEEDAGVVC 119
Cdd:pfam00530   81 PWGNHNCSHSEDAGVIC 97
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
310-404 4.81e-33

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 123.26  E-value: 4.81e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    310 GPHGCAGRLEVWHGGRWGSVCDDAWDLRDAAVACKEL--GCGGALAAPGGAFFGEGTGPIILDDLRCRGNETALRFCPAR 387
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLgcGGAVSAPSGCSYFGPGSTGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*..
gi 81875363    388 PWGQHDCHHREDAGAVC 404
Cdd:pfam00530   81 PWGNHNCSHSEDAGVIC 97
PHA03247 PHA03247
large tegument protein UL36; Provisional
929-1304 1.40e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.50  E-value: 1.40e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   929 PSGRGLFKGTPTTTKPGSTVTTSTSKSPGHPFPAPRARAGSPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWP 1008
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVA 2789
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1009 QLIPDSKQEGTSSSPKPSLLTPGLPSPATFALSTPNTSLLPTRSPELSGSPTPTSP----EGLTSASSMLSEVSRLSPTS 1084
Cdd:PHA03247 2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPpppsLPLGGSVAPGGDVRRRPPSR 2869
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1085 ELTPGPDTTPAPEI-------IPESSDSSDLPMNT-RTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPAT 1156
Cdd:PHA03247 2870 SPAAKPAAPARPPVrrlarpaVSRSTESFALPPDQpERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1157 SPQP---------------------PTNTHPSSTPATPTESLPSSRKTELSSPTKPRLNSELTFEEA------------- 1202
Cdd:PHA03247 2950 AGAGepsgavpqpwlgalvpgrvavPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEEtdpppvslkqtlw 3029
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1203 PSTDASQTQNLELFLASESGPSSPSpasnLDPLPTDAFKPPRSQTLHSASDHLTQGPTPNHnpdpFGPcvsplPPVR--- 1279
Cdd:PHA03247 3030 PPDDTEDSDADSLFDSDSERSDLEA----LDPLPPEPHDPFAHEPDPATPEAGARESPSSQ----FGP-----PPLSana 3096
                         410       420       430
                  ....*....|....*....|....*....|.
gi 81875363  1280 ------VMACEPPALVELVGAVREVGDQLQR 1304
Cdd:PHA03247 3097 alsrryVRSTGRSALAVLIEACRRIRRQLRR 3127
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
873-1208 1.92e-08

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 58.82  E-value: 1.92e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    873 PTSGEDLTKGTtvaarpghtlsWATTTN-TEVPSPATQNLPDTDDQGGYESswTWDTPSGRGLfkGTPTTTKPGSTVTTS 951
Cdd:pfam17823   66 APAPVTLTKGT-----------SAAHLNsTEVTAEHTPHGTDLSEPATREG--AADGAASRAL--AAAASSSPSSAAQSL 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    952 TSKSPGHP---FPAPRARAgsPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWPQLIPDSKQEGTSSSPkpSLL 1028
Cdd:pfam17823  131 PAAIAALPseaFSAPRAAA--CRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAP--ATL 206
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1029 TP-----------GLPSPATFALSTPNTSLLPTRSPELSGSPTPTSPEGLTSASSMLSEVSRLSPTSE-----LTPG--- 1089
Cdd:pfam17823  207 TPargistaatatGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDpharrLSPAkhm 286
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1090 PDTTPAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSYPTIAPQP---------TTNPQQPRSPHPATSPQP 1160
Cdd:pfam17823  287 PSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSvastnlavvTTTKAQAKEPSASPVPVL 366
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 81875363   1161 PTNTHP---SSTPATPTESLPSSRKTelSSPTKPRLNSELTFEEAPSTDAS 1208
Cdd:pfam17823  367 HTSMIPeveATSPTTQPSPLLPTQGA--AGPGILLAPEQVATEATAGTASA 415
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
956-1212 2.01e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.53  E-value: 2.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   956 PGHPFP-APRARAG-SPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWPQLIPDSKQEGTSSSPKPSLLTPGL- 1032
Cdd:NF033839  287 PGNKKPsAPKPGMQpSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVk 366
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1033 PSPatfalSTPNTSLLP---TRSPELSGSPTPTSPEgltsassmlSEVSRLSPTSELTPGPDtTPAPEIIPESsdssdlp 1109
Cdd:NF033839  367 PQP-----EKPKPEVKPqpeTPKPEVKPQPEKPKPE---------VKPQPEKPKPEVKPQPE-KPKPEVKPQP------- 424
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1110 mntRTPTqpftashPTSIPQLNTTSyPTIAPQPTTN----PQQPRSPHPATSPQPptnthpsSTPATPTESLPSSRKTEL 1185
Cdd:NF033839  425 ---EKPK-------PEVKPQPEKPK-PEVKPQPEKPkpevKPQPETPKPEVKPQP-------EKPKPEVKPQPEKPKPDN 486
                         250       260       270
                  ....*....|....*....|....*....|.
gi 81875363  1186 SSP----TKPRLNSELTFEEAPSTDASQTQN 1212
Cdd:NF033839  487 SKPqaddKKPSTPNNLSKDKQPSNQASTNEK 517
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
598-723 7.02e-04

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 42.25  E-value: 7.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    598 TTKPSASLTSSVPQkptkvPGKAPKSTKKWVTKNArrpTTQPPGMPTTKHSRAPGTPTSLhPTARTSELPKRLTTEAPHR 677
Cdd:pfam09595   69 PLNEAAKEAPSESE-----DAPDIDPNNQHPSQDR---SEAPPLEPAAKTKPSEHEPANP-PDASNRLSPPDASTAAIRE 139
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 81875363    678 QTSHTTVRLTPRvpwewtSEPVVSQSTQGPQEVTSEATTTENPQTS 723
Cdd:pfam09595  140 ARTFRKPSTGKR------NNPSSAQSDQSPPRANHEAIGRANPFAM 179
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1094-1183 7.15e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 39.00  E-value: 7.15e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    1094 PAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSIPQL-NTTSYPTIAPQPTTNPQQPRSPHPATSPQPPTNTHPSSTPAT 1172
Cdd:smart00818   69 PQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLqPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLPDL 148
                            90
                    ....*....|.
gi 81875363    1173 PTESLPSSRKT 1183
Cdd:smart00818  149 PLEAWPATDKT 159
 
Name Accession Description Interval E-value
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
464-565 2.80e-46

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 161.36  E-value: 2.80e-46
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     464 LRLVAGPSRCSGRLEVWHDGRWGTVCDDSWDMRDSAVVCRELGCGRPRQpDPAAGRFGWGAGPIWLDDVGCMGTEASLSE 543
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVS-ASGSAYFGPGSGPIWLDNVRCSGTEASLSD 79
                            90       100
                    ....*....|....*....|..
gi 81875363     544 CPAASWGKHNCAHNEDVGVTCT 565
Cdd:smart00202   80 CPHSGWGSHNCSHGEDAGVVCS 101
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
758-858 8.12e-45

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 157.12  E-value: 8.12e-45
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     758 VRLADGPNRCAGRLEVWHAGLWGTVCDDSWDIRDATVACWELGCGKVRPRVGKTHYGPGTGPIWLDDMGCKGSEMSLSDC 837
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|.
gi 81875363     838 PSGAWGKHNCDHEEDVVLTCT 858
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVCS 101
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
469-565 1.06e-41

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 147.91  E-value: 1.06e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    469 GPSRCSGRLEVWHDGRWGTVCDDSWDMRDSAVVCRELGCGRPRQPDPAAGRFGWG-AGPIWLDDVGCMGTEASLSECPAA 547
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLGCGGAVSAPSGCSYFGPGsTGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*...
gi 81875363    548 SWGKHNCAHNEDVGVTCT 565
Cdd:pfam00530   81 PWGNHNCSHSEDAGVICS 98
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
199-299 3.65e-41

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 146.72  E-value: 3.65e-41
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     199 LRLVSGPHGCAGRLEVWHGGRWGTVCDDGWDLRDAAVACRELGCGGALAAPGGARFGPGEGPVWMDDVGCGGGEEALRDC 278
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|.
gi 81875363     279 PRSPWGRSNCDHTEDAGLVCT 299
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVCS 101
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
305-404 5.46e-39

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 140.55  E-value: 5.46e-39
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     305 IRLADGPHGCAGRLEVWHGGRWGSVCDDAWDLRDAAVACKELGCGGALAAPGGAFFGEGTGPIILDDLRCRGNETALRFC 384
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|
gi 81875363     385 PARPWGQHDCHHREDAGAVC 404
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVC 100
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
20-119 5.90e-39

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 140.17  E-value: 5.90e-39
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363      20 LRLADGPHGCAGRLEVWHSGRWGTVCDDGWDLRDAEVACRVLGCGGALAAPGGAFFGEGTGPVWLSELNCRGNEGQLGIC 99
Cdd:smart00202    1 VRLVGGGSPCEGRVEVYHNGQWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAYFGPGSGPIWLDNVRCSGTEASLSDC 80
                            90       100
                    ....*....|....*....|
gi 81875363     100 PHRGWKAHICSHEEDAGVVC 119
Cdd:smart00202   81 PHSGWGSHNCSHGEDAGVVC 100
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
763-858 2.22e-36

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 132.89  E-value: 2.22e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    763 GPNRCAGRLEVWHAGLWGTVCDDSWDIRDATVACWELGCGK-VRPRVGKTHYGPG-TGPIWLDDMGCKGSEMSLSDCPSG 840
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLGCGGaVSAPSGCSYFGPGsTGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*...
gi 81875363    841 AWGKHNCDHEEDVVLTCT 858
Cdd:pfam00530   81 PWGNHNCSHSEDAGVICS 98
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
204-299 1.19e-34

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 127.88  E-value: 1.19e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    204 GPHGCAGRLEVWHGGRWGTVCDDGWDLRDAAVACREL-GCGGALAAPGGARFGPGE-GPVWMDDVGCGGGEEALRDCPRS 281
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLgCGGAVSAPSGCSYFGPGStGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*...
gi 81875363    282 PWGRSNCDHTEDAGLVCT 299
Cdd:pfam00530   81 PWGNHNCSHSEDAGVICS 98
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
25-119 3.39e-33

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 123.64  E-value: 3.39e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363     25 GPHGCAGRLEVWHSGRWGTVCDDGWDLRDAEVACRVL--GCGGALAAPGGAFFGEGTGPVWLSELNCRGNEGQLGICPHR 102
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLgcGGAVSAPSGCSYFGPGSTGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*..
gi 81875363    103 GWKAHICSHEEDAGVVC 119
Cdd:pfam00530   81 PWGNHNCSHSEDAGVIC 97
SRCR pfam00530
Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular ...
310-404 4.81e-33

Scavenger receptor cysteine-rich domain; These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions.


Pssm-ID: 459844  Cd Length: 98  Bit Score: 123.26  E-value: 4.81e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    310 GPHGCAGRLEVWHGGRWGSVCDDAWDLRDAAVACKEL--GCGGALAAPGGAFFGEGTGPIILDDLRCRGNETALRFCPAR 387
Cdd:pfam00530    1 GSSPCEGRVEVYHNGSWGTVCDDGWDLRDAHVVCRQLgcGGAVSAPSGCSYFGPGSTGPIWLDDVRCSGNETSLWQCPHR 80
                           90
                   ....*....|....*..
gi 81875363    388 PWGQHDCHHREDAGAVC 404
Cdd:pfam00530   81 PWGNHNCSHSEDAGVIC 97
PHA03247 PHA03247
large tegument protein UL36; Provisional
929-1304 1.40e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.50  E-value: 1.40e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   929 PSGRGLFKGTPTTTKPGSTVTTSTSKSPGHPFPAPRARAGSPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWP 1008
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVA 2789
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1009 QLIPDSKQEGTSSSPKPSLLTPGLPSPATFALSTPNTSLLPTRSPELSGSPTPTSP----EGLTSASSMLSEVSRLSPTS 1084
Cdd:PHA03247 2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPpppsLPLGGSVAPGGDVRRRPPSR 2869
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1085 ELTPGPDTTPAPEI-------IPESSDSSDLPMNT-RTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPAT 1156
Cdd:PHA03247 2870 SPAAKPAAPARPPVrrlarpaVSRSTESFALPPDQpERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1157 SPQP---------------------PTNTHPSSTPATPTESLPSSRKTELSSPTKPRLNSELTFEEA------------- 1202
Cdd:PHA03247 2950 AGAGepsgavpqpwlgalvpgrvavPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEEtdpppvslkqtlw 3029
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1203 PSTDASQTQNLELFLASESGPSSPSpasnLDPLPTDAFKPPRSQTLHSASDHLTQGPTPNHnpdpFGPcvsplPPVR--- 1279
Cdd:PHA03247 3030 PPDDTEDSDADSLFDSDSERSDLEA----LDPLPPEPHDPFAHEPDPATPEAGARESPSSQ----FGP-----PPLSana 3096
                         410       420       430
                  ....*....|....*....|....*....|.
gi 81875363  1280 ------VMACEPPALVELVGAVREVGDQLQR 1304
Cdd:PHA03247 3097 alsrryVRSTGRSALAVLIEACRRIRRQLRR 3127
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1049-1297 5.11e-09

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 60.86  E-value: 5.11e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1049 PTRSPELSGSPtPTSPeGLTSASSMLSEVSRLSPTS------------ELT--PGPDTTPAPEIIP---------ESSDS 1105
Cdd:PTZ00449  510 PPEGPEASGLP-PKAP-GDKEGEEGEHEDSKESDEPkeggkpgetkegEVGkkPGPAKEHKPSKIPtlskkpefpKDPKH 587
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1106 SDLPMNTRTPTQPFTASHPTS-----IPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQPPTNTHPSSTPATPTESLPss 1180
Cdd:PTZ00449  588 PKDPEEPKKPKRPRSAQRPTRpkspkLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKP-- 665
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1181 rktelssPTKPRLNSEL--TFEEAPSTDASQTQNLELFLASESGPSSPSPASNLDPLPTDAFKPPRsqtlhsasdhLTQG 1258
Cdd:PTZ00449  666 -------PFDPKFKEKFydDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPK----------LPRD 728
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 81875363  1259 PTPNHNP--DPFGPCVSPL----PPV--RVMACEPPALVELVGAVRE 1297
Cdd:PTZ00449  729 EEFPFEPigDPDAEQPDDIefftPPEeeRTFFHETPADTPLPDILAE 775
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
873-1208 1.92e-08

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 58.82  E-value: 1.92e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    873 PTSGEDLTKGTtvaarpghtlsWATTTN-TEVPSPATQNLPDTDDQGGYESswTWDTPSGRGLfkGTPTTTKPGSTVTTS 951
Cdd:pfam17823   66 APAPVTLTKGT-----------SAAHLNsTEVTAEHTPHGTDLSEPATREG--AADGAASRAL--AAAASSSPSSAAQSL 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    952 TSKSPGHP---FPAPRARAgsPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWPQLIPDSKQEGTSSSPkpSLL 1028
Cdd:pfam17823  131 PAAIAALPseaFSAPRAAA--CRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAP--ATL 206
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1029 TP-----------GLPSPATFALSTPNTSLLPTRSPELSGSPTPTSPEGLTSASSMLSEVSRLSPTSE-----LTPG--- 1089
Cdd:pfam17823  207 TPargistaatatGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDpharrLSPAkhm 286
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1090 PDTTPAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSYPTIAPQP---------TTNPQQPRSPHPATSPQP 1160
Cdd:pfam17823  287 PSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSvastnlavvTTTKAQAKEPSASPVPVL 366
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 81875363   1161 PTNTHP---SSTPATPTESLPSSRKTelSSPTKPRLNSELTFEEAPSTDAS 1208
Cdd:pfam17823  367 HTSMIPeveATSPTTQPSPLLPTQGA--AGPGILLAPEQVATEATAGTASA 415
PHA03247 PHA03247
large tegument protein UL36; Provisional
942-1288 3.01e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.80  E-value: 3.01e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   942 TKPGSTVTTSTSKSPGHP---FPAPRARAGSPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWP---QLIPDSK 1015
Cdd:PHA03247 2587 RRPDAPPQSARPRAPVDDrgdPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPapgRVSRPRR 2666
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1016 QEGTSSSPKPSLLTPGLPSPATFALSTPNTSLLPTRSPELSGSPTPTSPEGLTSASSMLSEVSRLSPTSELTPGPDTTPA 1095
Cdd:PHA03247 2667 ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA 2746
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1096 PEIIPESSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSyPTIAPQPTTNPQQPRSPHPATSPQP--------PTNTHPS 1167
Cdd:PHA03247 2747 GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR-PAVASLSESRESLPSPWDPADPPAAvlapaaalPPAASPA 2825
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1168 STPATPTESLPSSrktelSSPTKPRLNSELTFEE--APSTDASQTQNLELFLASESGPSSPSPASNLDPLPtdafkPPRS 1245
Cdd:PHA03247 2826 GPLPPPTSAQPTA-----PPPPPGPPPPSLPLGGsvAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAV-----SRST 2895
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 81875363  1246 QTLHSASDHLTQGPTPNHNPDPFGPCVSPLPPVRVMACEPPAL 1288
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR 2938
PHA03247 PHA03247
large tegument protein UL36; Provisional
929-1270 6.87e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 6.87e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   929 PSGRGLFKGTPTTTKPGSTVTTSTSKSPGHPFPAPRARAGS--------PRKPTPERRPLPTSATTSSPASSSSPEPSGS 1000
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1001 RQTSGSWPQLIPDSKQEGTSSSPKPSLlTPGLPSPATfALSTPNTSLLPTRSPELSGSPTPTSPEGltsassmlsevsrl 1080
Cdd:PHA03247 2711 APHALVSATPLPPGPAAARQASPALPA-APAPPAVPA-GPATPGGPARPARPPTTAGPPAPAPPAA-------------- 2774
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1081 sptsELTPGPDTTPAPEIIPESSDSSDLPMNTRTPTQPFTASHPTsiPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQP 1160
Cdd:PHA03247 2775 ----PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA--AALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1161 PTNTHPSSTPATPTESLPSSRK--TELSSPTKPRLNSELTFEEAPSTDASQTQNLELFLASESGPSSPSPASNLDPLPTD 1238
Cdd:PHA03247 2849 SLPLGGSVAPGGDVRRRPPSRSpaAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
                         330       340       350
                  ....*....|....*....|....*....|..
gi 81875363  1239 AFKPPRSQTLHSASDHLTQGPTPNHNPDPFGP 1270
Cdd:PHA03247 2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP 2960
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1014-1250 1.32e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 56.24  E-value: 1.32e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1014 SKQEGTSSSPKPSLLtPGLPSPATFalstPNTSLLPTRSPELSGSPTPTSPEGLTSASS-MLSEVSRL--------SPTS 1084
Cdd:PTZ00449  558 GKKPGPAKEHKPSKI-PTLSKKPEF----PKDPKHPKDPEEPKKPKRPRSAQRPTRPKSpKLPELLDIpkspkrpeSPKS 632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1085 ELTPGPDTTPAPEIIPESSDSSDLPMNTRTPTQPF----------------TASHPTSIPQLNTTSYPTIAPQptTNPQQ 1148
Cdd:PTZ00449  633 PKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFdpkfkekfyddyldaaAKSKETKTTVVLDESFESILKE--TLPET 710
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1149 PRSPHPATSPQPPTNTHPSSTPATPTESlPSSRKTELSSPTKPRLNSELTFEEAPSTDASQTQNLELFLASESGPSSPSp 1228
Cdd:PTZ00449  711 PGTPFTTPRPLPPKLPRDEEFPFEPIGD-PDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGE- 788
                         250       260
                  ....*....|....*....|..
gi 81875363  1229 asnldplPTDAFKPPRSQTLHS 1250
Cdd:PTZ00449  789 -------PDEAMKRPDSPSEHE 803
PHA03247 PHA03247
large tegument protein UL36; Provisional
961-1304 2.93e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 2.93e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   961 PAPRARAGSPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWPQLIPDSKQEGTSSSPKPSLLTPGLPSPATFAL 1040
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1041 STPNTSLLPTRSPElsgSPTPTSPEGLTSASSMLSEVSRLSPTSELTPGPDTTPAPEIIPESSDSSDLPMNTRTPTQPFT 1120
Cdd:PHA03247 2637 EPDPHPPPTVPPPE---RPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPH 2713
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1121 ASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQPPTNTHPSSTPATPTESLPSSRKtelSSPTKPRLNSELTFE 1200
Cdd:PHA03247 2714 ALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP---AAGPPRRLTRPAVAS 2790
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1201 EAPSTDASqtqnlelflasesgpsspspasnldPLPTDAFKPPRSQTLHSASDHLTQGPTPnhnPDPFGPCVSPLPPVRV 1280
Cdd:PHA03247 2791 LSESRESL-------------------------PSPWDPADPPAAVLAPAAALPPAASPAG---PLPPPTSAQPTAPPPP 2842
                         330       340
                  ....*....|....*....|....
gi 81875363  1281 MACEPPALvELVGAVREVGDQLQR 1304
Cdd:PHA03247 2843 PGPPPPSL-PLGGSVAPGGDVRRR 2865
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
1014-1195 3.16e-06

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 49.18  E-value: 3.16e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1014 SKQEGTSSSPKPSLLTPGLpspatfalSTPNTSLLPTRSPELSgsPTPTSPEGLTSASSMLSEVSRLSPTsELTPGPDTT 1093
Cdd:pfam09595   20 NIQARSKCFEHASLILIGE--------SNKEAALIITDIIDIN--INKQHPEQEHHENPPLNEAAKEAPS-ESEDAPDID 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1094 PAPEIiPESSDSSDLPMNTRTPTQPfTASHPTSIPQLNTTSYPtiaPQPTTnpQQPRspHPATSPQPPTNTHPSSTPATP 1173
Cdd:pfam09595   89 PNNQH-PSQDRSEAPPLEPAAKTKP-SEHEPANPPDASNRLSP---PDAST--AAIR--EARTFRKPSTGKRNNPSSAQS 159
                          170       180
                   ....*....|....*....|..
gi 81875363   1174 TESLPSSRKTELSSPTKPRLNS 1195
Cdd:pfam09595  160 DQSPPRANHEAIGRANPFAMSS 181
SOG2 pfam10428
RAM signalling pathway protein; SOG2 proteins in Saccharomyces cerevisiae are involved in cell ...
1004-1201 1.41e-05

RAM signalling pathway protein; SOG2 proteins in Saccharomyces cerevisiae are involved in cell separation and cytokinesis.


Pssm-ID: 431280  Cd Length: 476  Bit Score: 49.33  E-value: 1.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1004 SGSWPQLIPDsKQEGTSSSPKPSLLTPGLPSPATFALSTP-NTSLLPTRSPELSGSPTP-TSPEGLTSAssmLSEVSRLS 1081
Cdd:pfam10428  146 RNAWASLGPL-LEAVRPPSPKKRAGRTKQPSPSITSGGSPsSPAESSTRPSSSSVTPTRrRRHAGSFSS---KLPPLRSD 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1082 PTSELTPGPDTTPAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSipQLNTTSYPTIAPQPTTNpQQPRSPHPATSPQPP 1161
Cdd:pfam10428  222 TTIPHPGGNLSSPAPNGAQTPTPPRSATSPGVPSSAPTLGTGSTG--AISRSNHSTSGSQSSLT-SSSRSRSSSRSNTLL 298
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 81875363   1162 TNTHPSSTPATPtesLPSSRKTELSSPTKPRLNSELTFEE 1201
Cdd:pfam10428  299 STSGPSSLATTP---RPSSGESFAPTSTGSRINPLTGLDE 335
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1020-1208 1.61e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 49.53  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1020 SSSPKPSLLTPGLPSPATfALSTPNTSLLPTRSPelsgSPTPTSPEGLTSASSMLSEVSRLSPTSELTPGPD------TT 1093
Cdd:pfam05109  427 STTTSPTLNTTGFAAPNT-TTGLPSSTHVPTNLT----APASTGPTVSTADVTSPTPAGTTSGASPVTPSPSprdngtES 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1094 PAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSI-PQLNTTSYPTIAPQPTTNPQqprSPHPATSPQPPTNTHPSSTPAT 1172
Cdd:pfam05109  502 KAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATsPTLGKTSPTSAVTTPTPNAT---SPTPAVTTPTPNATIPTLGKTS 578
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 81875363   1173 PTESlpssrkteLSSPTkPRLNSELTFEEAPSTDAS 1208
Cdd:pfam05109  579 PTSA--------VTTPT-PNATSPTVGETSPQANTT 605
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
927-1191 5.72e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.86  E-value: 5.72e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   927 DTPSGRGLFKGTPTTTKPGSTVTTSTSKSPGHPFPAPRARAGSPRKPTPerrplpTSATTSSPASSSSPEPSGSRQTSGS 1006
Cdd:PHA03307   80 PANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPP------SPAPDLSEMLRPVGSPGPPPAASPP 153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1007 WPQLIPDSKQEGTSSSPKPSLLTPGLPSPATfALSTPNTSLlPTRSPELSGSPTPtSPEGLTSASSMLSEVSRLSPTSEL 1086
Cdd:PHA03307  154 AAGASPAAVASDAASSRQAALPLSSPEETAR-APSSPPAEP-PPSTPPAAASPRP-PRRSSPISASASSPAPAPGRSAAD 230
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1087 TPGPDTTPAPEiiPESSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSP-QPPTNTH 1165
Cdd:PHA03307  231 DAGASSSDSSS--SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPsSPGSGPA 308
                         250       260
                  ....*....|....*....|....*.
gi 81875363  1166 PSSTPATPTESLPSSRKTELSSPTKP 1191
Cdd:PHA03307  309 PSSPRASSSSSSSRESSSSSTSSSSE 334
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
882-1188 1.16e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.68  E-value: 1.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    882 GTTVAARPGHTLSwaTTTNTEVPSPATQNLPDTDDQGGYESSWTWDTPSG------------RGLFKGTPTTTKPGSTVT 949
Cdd:pfam03154  190 GTTQAATAGPTPS--APSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLhpqrlpsphpplQPMTQPPPPSQVSPQPLP 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    950 TSTSKSPGHPFPAPrARAGSPRKPTP-ERRPLPTSATTSSPASSSSPEPSGSRQTSgSWPQLIPDSKQEGTSSSPKPSLL 1028
Cdd:pfam03154  268 QPSLHGQMPPMPHS-LQTGPSHMQHPvPPQPFPLTPQSSQSQVPPGPSPAAPGQSQ-QRIHTPPSQSQLQSQQPPREQPL 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1029 TPGlPSPATFALSTPNTSLLPTRSPELSGSPTPTSPEGLTSASSMLSEVSRLSPTSELTPG--PDTTPAP-EIIPESSDS 1105
Cdd:pfam03154  346 PPA-PLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHhpPSAHPPPlQLMPQSQQL 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1106 SDLPMNTRTPTQpfTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQPPTNTHPSSTPATPTESLPSSRKTEL 1185
Cdd:pfam03154  425 PPPPAQPPVLTQ--SQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSS 502

                   ...
gi 81875363   1186 SSP 1188
Cdd:pfam03154  503 SGP 505
Neisseria_TspB pfam05616
Neisseria meningitidis TspB protein; This family consists of several Neisseria meningitidis ...
1082-1152 1.55e-04

Neisseria meningitidis TspB protein; This family consists of several Neisseria meningitidis TspB virulence factor proteins.


Pssm-ID: 283306 [Multi-domain]  Cd Length: 517  Bit Score: 45.86  E-value: 1.55e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 81875363   1082 PTSELTPGPDTTPAPEIIPESSDSSDlPMNTRTPTQ-PFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSP 1152
Cdd:pfam05616  326 PRPDLTPASAEAPHAQPLPEVSPAEN-PANNPDPDEnPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVP 396
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
956-1212 2.01e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.53  E-value: 2.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   956 PGHPFP-APRARAG-SPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWPQLIPDSKQEGTSSSPKPSLLTPGL- 1032
Cdd:NF033839  287 PGNKKPsAPKPGMQpSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVk 366
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1033 PSPatfalSTPNTSLLP---TRSPELSGSPTPTSPEgltsassmlSEVSRLSPTSELTPGPDtTPAPEIIPESsdssdlp 1109
Cdd:NF033839  367 PQP-----EKPKPEVKPqpeTPKPEVKPQPEKPKPE---------VKPQPEKPKPEVKPQPE-KPKPEVKPQP------- 424
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1110 mntRTPTqpftashPTSIPQLNTTSyPTIAPQPTTN----PQQPRSPHPATSPQPptnthpsSTPATPTESLPSSRKTEL 1185
Cdd:NF033839  425 ---EKPK-------PEVKPQPEKPK-PEVKPQPEKPkpevKPQPETPKPEVKPQP-------EKPKPEVKPQPEKPKPDN 486
                         250       260       270
                  ....*....|....*....|....*....|.
gi 81875363  1186 SSP----TKPRLNSELTFEEAPSTDASQTQN 1212
Cdd:NF033839  487 SKPqaddKKPSTPNNLSKDKQPSNQASTNEK 517
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
960-1189 2.47e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 45.68  E-value: 2.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    960 FPAPRARAGSPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWPQLIPDSKQEGTSSSPKPSLLTPglpspaTFA 1039
Cdd:pfam05109  439 FAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSP------TSA 512
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1040 LSTPNTSllptrspelSGSPTPTSPEGLTSASSmlSEVSRLSPTSEL-TPGPD-TTPAPEIIPESSDSSDLPMNTRTPTQ 1117
Cdd:pfam05109  513 VTTPTPN---------ATSPTPAVTTPTPNATS--PTLGKTSPTSAVtTPTPNaTSPTPAVTTPTPNATIPTLGKTSPTS 581
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1118 PFTASHPTSI--------PQLNTTSYpTIAPQPTTnPQQPRSPHPATSPQpPTNTHPSSTPATPTESLPSSRKTELSSPT 1189
Cdd:pfam05109  582 AVTTPTPNATsptvgetsPQANTTNH-TLGGTSST-PVVTSPPKNATSAV-TTGQHNITSSSTSSMSLRPSSISETLSPS 658
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1003-1288 2.67e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.53  E-value: 2.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1003 TSGSWPQLIPDSKQEGTSSSPKPSLLTPGLPsPATFALSTP--NTSLLPTRSPELSGSPTPTSPEGLTSASsmlsevsrL 1080
Cdd:pfam03154  196 TAGPTPSAPSVPPQGSPATSQPPNQTQSTAA-PHTLIQQTPtlHPQRLPSPHPPLQPMTQPPPPSQVSPQP--------L 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1081 SPTSELTPGPDttpapeiIPESSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQP 1160
Cdd:pfam03154  267 PQPSLHGQMPP-------MPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP 339
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1161 PTNT----------HPSSTPATPTESLPSSRK----TELSSPTKPRLN------------SELTFEEAPSTDASQTQNLE 1214
Cdd:pfam03154  340 PREQplppaplsmpHIKPPPTTPIPQLPNPQShkhpPHLSGPSPFQMNsnlppppalkplSSLSTHHPPSAHPPPLQLMP 419
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 81875363   1215 LFLASESGPSSPSPASNLDPLPTDAFKPPRSQTLHSASdhlTQGPTPNHnpdPFGPCVSP--LPPVRVMACEPPAL 1288
Cdd:pfam03154  420 QSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVP---SQSPFPQH---PFVPGGPPpiTPPSGPPTSTSSAM 489
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1115-1212 2.96e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.15  E-value: 2.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1115 PTQPFTASHPTSIPQLNTTSYPTIAPQPTTNPQ--QPRSPHPATSPQP--PTNTHPSSTPATPTESLPSSRKTELSSPTK 1190
Cdd:PRK14971  389 APQPSAAAAASPSPSQSSAAAQPSAPQSATQPAgtPPTVSVDPPAAVPvnPPSTAPQAVRPAQFKEEKKIPVSKVSSLGP 468
                          90       100
                  ....*....|....*....|..
gi 81875363  1191 PRLNSELTFEEAPSTDASQTQN 1212
Cdd:PRK14971  469 STLRPIQEKAEQATGNIKEAPT 490
PHA03247 PHA03247
large tegument protein UL36; Provisional
576-1160 3.07e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 3.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   576 PFSWSWLPGLGR--DQDAWLPGEL--TTKPSASLTSSVPQkptkvPGKAPKSTKKWVTKNARRPTTQP-PGMPTTK---- 646
Cdd:PHA03247 2531 PRMLTWIRGLEElaSDDAGDPPPPlpPAAPPAAPDRSVPP-----PRPAPRPSEPAVTSRARRPDAPPqSARPRAPvddr 2605
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   647 -HSRAPGTPTSLHPTARTSELPKRLTTEAPHRQTSHTTVRLTPRVPWEWTSEPVVSQSTQGPQEVTSEATTTENPQTSLE 725
Cdd:PHA03247 2606 gDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR 2685
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   726 PSGENTEGSLESSQDPATTP----------TAGVPVPSGPFRVRLADGPNRCAgrlEVWHAGLWGTVCDDSwdirDATVA 795
Cdd:PHA03247 2686 RAARPTVGSLTSLADPPPPPptpepaphalVSATPLPPGPAAARQASPALPAA---PAPPAVPAGPATPGG----PARPA 2758
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   796 CWELGCGKVRPRVGKthyGPGTGPIWLDDMGCKGSEMSLSDCPSGAWGKhncdheedvvltctgytgdDDYPSWTWDPTS 875
Cdd:PHA03247 2759 RPPTTAGPPAPAPPA---APAAGPPRRLTRPAVASLSESRESLPSPWDP-------------------ADPPAAVLAPAA 2816
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   876 GEdltkgtTVAARPGHTLSWATTTNTEVPSPATQNLPDTDDQGGyesswtWDTPSG----RGLFKGTPTTTK-----PGS 946
Cdd:PHA03247 2817 AL------PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG------SVAPGGdvrrRPPSRSPAAKPAaparpPVR 2884
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   947 TVTTSTSKSPGHPFPAPRARAGSPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQtsgswPQLIPDSKQEGtSSSPKPS 1026
Cdd:PHA03247 2885 RLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ-----PPLAPTTDPAG-AGEPSGA 2958
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1027 LLTPGLPSPATFALSTPNTsLLPTRSPELSGSPTPTSPEGLTSASSMLSEVSRLSPTSELTPGPDTTPAPEIIPESSDSS 1106
Cdd:PHA03247 2959 VPQPWLGALVPGRVAVPRF-RVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDS 3037
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 81875363  1107 DLPMNTRTPTQPFTASHPTSIPQlNTTSYPTIAPQPTTNPQQPR-SPHPATSPQP 1160
Cdd:PHA03247 3038 DADSLFDSDSERSDLEALDPLPP-EPHDPFAHEPDPATPEAGAReSPSSQFGPPP 3091
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1086-1176 3.48e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.80  E-value: 3.48e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1086 LTPGPDTTPAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNP--QQPRSPHPATSPQPPTN 1163
Cdd:PRK14950  360 LVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPvpHTPESAPKLTRAAIPVD 439
                          90
                  ....*....|...
gi 81875363  1164 THPSSTPATPTES 1176
Cdd:PRK14950  440 EKPKYTPPAPPKE 452
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
1086-1198 3.86e-04

superantigen-like protein SSL3; Reviewed;


Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 44.35  E-value: 3.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1086 LTPGPDTTPAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSYP--TIAPQPTTNPQQPRSPHPATSPQPPTN 1163
Cdd:PRK13335   54 ITAGANSATTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKslNISATPAPKQEQSQTTTESTTPKTKVT 133
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 81875363  1164 THPSSTPATPTESLPSSRKTelsSPTKPRLNSELT 1198
Cdd:PRK13335  134 TPPSTNTPQPMQSTKSDTPQ---SPTIKQAQTDMT 165
motB PRK12799
flagellar motor protein MotB; Reviewed
1060-1189 6.26e-04

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 43.94  E-value: 6.26e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1060 TPTSPEGLTSASSMLSEVSRLSPTSELTPGPDTTPAPEIIPESsdssdlpmntrTPTQPFTASHPTSIPQLNTTSYPTIA 1139
Cdd:PRK12799  296 HGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSA-----------TTTQASAVALSSAGVLPSDVTLPGTV 364
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 81875363  1140 PQPTTNPQQPRSPhPATSPQPPTNTHPSSTPAT--PTESLPSSRKTELS-SPT 1189
Cdd:PRK12799  365 ALPAAEPVNMQPQ-PMSTTETQQSSTGNITSTAngPTTSLPAAPASNIPvSPT 416
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
943-1192 6.39e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 44.30  E-value: 6.39e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   943 KPGSTVTTSTSKSPG-------HPFPAPRARAGSPRKPTPERRPlptsaTTSSPASSSSPEPSGSRQTSGSWPQLIPDSK 1015
Cdd:PTZ00449  548 KPGETKEGEVGKKPGpakehkpSKIPTLSKKPEFPKDPKHPKDP-----EEPKKPKRPRSAQRPTRPKSPKLPELLDIPK 622
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1016 QEGTSSSPKpsllTPGLPSPATFALST--PNTSLLPtRSPELSGSPTP-------------------TSPEGLTSASSML 1074
Cdd:PTZ00449  623 SPKRPESPK----SPKRPPPPQRPSSPerPEGPKII-KSPKPPKSPKPpfdpkfkekfyddyldaaaKSKETKTTVVLDE 697
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1075 SEVSRLSPTSELTPG-PDTTPA--PEIIPESSDSSDLPMNTRTPTQPfTASHPTSIPQLNTTSY---PTIAPQPTTNPQQ 1148
Cdd:PTZ00449  698 SFESILKETLPETPGtPFTTPRplPPKLPRDEEFPFEPIGDPDAEQP-DDIEFFTPPEEERTFFhetPADTPLPDILAEE 776
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....
gi 81875363  1149 PRSPHPATSPQPPTntHPSSTPATPTESLPSSRKTELSSPTKPR 1192
Cdd:PTZ00449  777 FKEEDIHAETGEPD--EAMKRPDSPSEHEDKPPGDHPSLPKKRH 818
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
598-723 7.02e-04

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 42.25  E-value: 7.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    598 TTKPSASLTSSVPQkptkvPGKAPKSTKKWVTKNArrpTTQPPGMPTTKHSRAPGTPTSLhPTARTSELPKRLTTEAPHR 677
Cdd:pfam09595   69 PLNEAAKEAPSESE-----DAPDIDPNNQHPSQDR---SEAPPLEPAAKTKPSEHEPANP-PDASNRLSPPDASTAAIRE 139
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 81875363    678 QTSHTTVRLTPRvpwewtSEPVVSQSTQGPQEVTSEATTTENPQTS 723
Cdd:pfam09595  140 ARTFRKPSTGKR------NNPSSAQSDQSPPRANHEAIGRANPFAM 179
PHA03247 PHA03247
large tegument protein UL36; Provisional
956-1294 7.69e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 7.69e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   956 PGHPFPAPRARAGSPRKPTPERRPlptsattsspasssspepsgsrqtsGSWPQLIPDskqegtsSSPKPSLLTPGLPSP 1035
Cdd:PHA03247 2475 PGAPVYRRPAEARFPFAAGAAPDP-------------------------GGGGPPDPD-------APPAPSRLAPAILPD 2522
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1036 ATFALSTPNTSLLPTRSPEL-----SGSPTPTSPEGLTSASSmlsevSRLSPTSELTPGPD--TTPAPEIIPESSDSSDL 1108
Cdd:PHA03247 2523 EPVGEPVHPRMLTWIRGLEElasddAGDPPPPLPPAAPPAAP-----DRSVPPPRPAPRPSepAVTSRARRPDAPPQSAR 2597
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1109 PMNTRTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQPPtntHPSSTPATPTESLP-----SSRKT 1183
Cdd:PHA03247 2598 PRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPE---RPRDDPAPGRVSRPrrarrLGRAA 2674
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1184 ELSSPTK--------PRLNSELTFEEAPSTDASQTQNLELFLASESGPSSPSPASNLDPLPTDAFKPPRSQTLHSASDHL 1255
Cdd:PHA03247 2675 QASSPPQrprrraarPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 81875363  1256 TQGPTPnhnPDPFGPcVSPLPPVRVMACEPPALVELVGA 1294
Cdd:PHA03247 2755 ARPARP---PTTAGP-PAPAPPAAPAAGPPRRLTRPAVA 2789
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
1067-1161 8.38e-04

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 43.69  E-value: 8.38e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1067 LTSASSMLSEVSRLSPTSELTPGPDTTPAPEiipeSSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNP 1146
Cdd:PRK11907   18 LTASNPKLAQAEEIVTTTPATSTEAEQTTPV----ESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTS 93
                          90
                  ....*....|....*
gi 81875363  1147 QQPRSPHPATSPQPP 1161
Cdd:PRK11907   94 EARTVTPAATETSKP 108
PHA03269 PHA03269
envelope glycoprotein C; Provisional
1059-1180 1.03e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 43.56  E-value: 1.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1059 PTPTS-PEGLTSASSMLSEVSrLSPTSELTPGPDTTPAPeiipeSSDSSDLPMNTRTPTqPFTASHPTSIPQlnTTSYPT 1137
Cdd:PHA03269   23 NTNIPiPELHTSAATQKPDPA-PAPHQAASRAPDPAVAP-----TSAASRKPDLAQAPT-PAASEKFDPAPA--PHQAAS 93
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 81875363  1138 IAPQPTTNPQ--QPRSPHPATSPQPPTNTHP------------SSTPATPTESLPSS 1180
Cdd:PHA03269   94 RAPDPAVAPQlaAAPKPDAAEAFTSAAQAHEapadagtsaaskKPDPAAHTQHSPPP 150
PHA03269 PHA03269
envelope glycoprotein C; Provisional
1003-1139 1.98e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 42.41  E-value: 1.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1003 TSGSWPQLIPDSKQEGTSSSPKPSLLTPGLPSPATFALSTpntsllPTRSPELSGSPTPTSPEGLTSASSMLSEVSR--- 1079
Cdd:PHA03269   24 TNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSA------ASRKPDLAQAPTPAASEKFDPAPAPHQAASRapd 97
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 81875363  1080 --LSPTSELTPGPDTTPAPEIIPES-SDSSDLPMNT--RTPTQPFTASHpTSIPQLNTTSYPTIA 1139
Cdd:PHA03269   98 paVAPQLAAAPKPDAAEAFTSAAQAhEAPADAGTSAasKKPDPAAHTQH-SPPPFAYTRSMEHIA 161
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1099-1191 2.12e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 42.46  E-value: 2.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1099 IPESSDSSDLPMNTRTPTQPFTaSHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQ--PPTNTH--PSSTPATPT 1174
Cdd:PRK14971  362 LTQKGDDASGGRGPKQHIKPVF-TQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAgtPPTVSVdpPAAVPVNPP 440
                          90
                  ....*....|....*..
gi 81875363  1175 ESLPSSRKTELSSPTKP 1191
Cdd:PRK14971  441 STAPQAVRPAQFKEEKK 457
PHA03378 PHA03378
EBNA-3B; Provisional
941-1173 2.21e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.75  E-value: 2.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   941 TTKPGSTVTTSTSKSPGHPFPAPRARAGSPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWPQLIPDSKQEGTS 1020
Cdd:PHA03378  587 SSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTW 666
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1021 SSPKPsllTPGLPSPATFALSTPnTSLLPTRSPELSGSPTPTSP-EGLTSASSMLSEVSRLSPTSELTPGPDTTPAPEII 1099
Cdd:PHA03378  667 TQIGH---IPYQPSPTGANTMLP-IQWAPGTMQPPPRAPTPMRPpAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPG 742
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 81875363  1100 PESSdssdlPMNTRTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQPPTNTHPSSTPATP 1173
Cdd:PHA03378  743 RARP-----PAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMP 811
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
1012-1191 2.38e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 41.96  E-value: 2.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1012 PDSKQEGTSSSPKPS-LLTPGLPSPATFALSTPNTSLLPTRSPELSGSPTPTSPEGLTSASSmlSEVSRLSPTSELTP-G 1089
Cdd:pfam05539  168 PKTAVTTSKTTSWPTeVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSN--PEPQTEPPPSQRGPsG 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1090 PDTTPAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSIPQLNTtsyptiaPQPTTNPQQPRSPHPATSPQPPTNTHPSST 1169
Cdd:pfam05539  246 SPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTAT-------PPPTTKRQETGRPTPRPTATTQSGSSPPHS 318
                          170       180
                   ....*....|....*....|....*
gi 81875363   1170 PATPTESLPSSRKT---ELSSPTKP 1191
Cdd:pfam05539  319 SPPGVQANPTTQNLvdcKELDPPKP 343
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1001-1291 2.39e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 2.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1001 RQTSGSWPQliPDSKQEGTSSSPKPSLLTPGLPspatfALSTPNTSLLPTRSPELSGS--PTPTSPEGLTSASSMLSevs 1078
Cdd:pfam03154  142 RSTSPSIPS--PQDNESDSDSSAQQQILQTQPP-----VLQAQSGAASPPSPPPPGTTqaATAGPTPSAPSVPPQGS--- 211
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1079 rlSPTSELTPGPDTTPAPEIIPESSDSSDLPmntRTPTqPFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSp 1158
Cdd:pfam03154  212 --PATSQPPNQTQSTAAPHTLIQQTPTLHPQ---RLPS-PHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQT- 284
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1159 QPPTNTHPSSTPATPTESLPSSRKTELSSPTKPRLNSELTFEEAPSTDASQTQNLElflASESGPSSPSPASNLDPLPTD 1238
Cdd:pfam03154  285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPP---REQPLPPAPLSMPHIKPPPTT 361
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 81875363   1239 AFKPPRSQTLHSASDHLTqGPTPNHNPdpfgpcvSPLPPvrvmacePPALVEL 1291
Cdd:pfam03154  362 PIPQLPNPQSHKHPPHLS-GPSPFQMN-------SNLPP-------PPALKPL 399
PRK11901 PRK11901
hypothetical protein; Reviewed
1032-1191 2.61e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 41.59  E-value: 2.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1032 LPSPATFALSTPNTSLLPTRSPELSGSPT-------PTSPEGLTSASSMLSEVSRLSPTSELTP----GPDTTPAPEIIP 1100
Cdd:PRK11901   58 LKSPTEHESQQSSNNAGAEKNIDLSGSSSlssgnqsSPSAANNTSDGHDASGVKNTAPPQDISAppisPTPTQAAPPQTP 137
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1101 ESSDSSDLP--------------------MNTRTPTQPfTAshPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQP 1160
Cdd:PRK11901  138 NGQQRIELPgnisdalsqqqgqvnaasqnAQGNTSTLP-TA--PATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHK 214
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 81875363  1161 -PTNTHPSSTPATPTE---------SLPSSRKT-ELSSPTKP 1191
Cdd:PRK11901  215 tATVAVPPATSGKPKSgaasaralsSAPASHYTlQLSSASRS 256
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1094-1191 2.77e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.10  E-value: 2.77e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1094 PAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSIPQlnttsyptIAPQPTTNPQQPRSPHPATsPQPPTNthPSSTPATP 1173
Cdd:PRK14950  364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPP--------KEPVRETATPPPVPPRPVA-PPVPHT--PESAPKLT 432
                          90
                  ....*....|....*...
gi 81875363  1174 TESLPSSRKTELSSPTKP 1191
Cdd:PRK14950  433 RAAIPVDEKPKYTPPAPP 450
TALPID3 pfam15324
Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for ...
1018-1201 2.78e-03

Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for Hedgehog signalling. Mutations in this gene noticed first in chickens lead to multiple abnormalities of development.


Pssm-ID: 434634 [Multi-domain]  Cd Length: 1288  Bit Score: 42.18  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1018 GTSSSPKPSLLTPGLPSPATfalstPNTSLLPTRSPelsgSPTPTSPEgltSASSMLSEVSRLSpTSELTPGP---DTTP 1094
Cdd:pfam15324  959 GDREAQREPPVAASVPGDLP-----TKETLLPTPVP----TPQPTPPC---SPPSPLKEPSPVK-TPDSSPCVsehDFFP 1025
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   1095 APEIIPEssdssdlpmntrTPTQPFTASHPtsipqLNTtsyPTIAPQPTtnpqqprsPHPATSPqpptnthpsstpaTPT 1174
Cdd:pfam15324 1026 VKEIPPE------------KGADTGPAVSL-----VIT---PTVTPIAT--------PPPAATP-------------TPP 1064
                          170       180
                   ....*....|....*....|....*....
gi 81875363   1175 ESLPSSRKTELSSPTKPRL--NSELTFEE 1201
Cdd:pfam15324 1065 LSENSIDKLKSPSPELPKPweDSDLPLEE 1093
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1020-1213 2.89e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.38  E-value: 2.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1020 SSSPKPSLLTPGLPSPATfALSTPNTSLLPTRSPELSGSPTPTSPEGLTSASSMLS-EVSRLSPTSELTPGPDTTPAPEI 1098
Cdd:PRK10263  334 AAPVEPVTQTPPVASVDV-PPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQpAVQYNEPLQQPVQPQQPYYAPAA 412
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1099 IPESSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSYP----TIAPQPTtnpQQPRSPHPATSPQPPTNTHPSSTPATP- 1173
Cdd:PRK10263  413 EQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEeqqsTFAPQST---YQTEQTYQQPAAQEPLYQQPQPVEQQPv 489
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 81875363  1174 TESLPSSRKTElssPTKPRLnseLTFEEAPSTDASQTQNL 1213
Cdd:PRK10263  490 VEPEPVVEETK---PARPPL---YYFEEVEEKRAREREQL 523
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1031-1183 5.50e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.37  E-value: 5.50e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1031 GLPSPATFALSTPNTSLLPTRSPELSGSPTPTSPEGLTSASSMLSE--VSRLSP------TSELTPGPDTTPAPEIIPES 1102
Cdd:PRK07003  379 AVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEapPAAPAPpatadrGDDAADGDAPVPAKANARAS 458
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1103 SDSSDLPMNTRTPTQPFTASHPTSI--------PQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQPPTNTHPSSTPATPT 1174
Cdd:PRK07003  459 ADSRCDERDAQPPADSGSASAPASDappdaafePAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPA 538

                  ....*....
gi 81875363  1175 ESLPSSRKT 1183
Cdd:PRK07003  539 AAAPAARAG 547
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1110-1191 6.66e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 40.95  E-value: 6.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1110 MNTRTPTQPFTASHPTSIPQLNTTSYPTIAPQPTTNPQQPRSPHPATSPQPPTNTHPSSTPATPT-ESLPSSRKTELSSP 1188
Cdd:PRK14950  360 LVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTpESAPKLTRAAIPVD 439

                  ...
gi 81875363  1189 TKP 1191
Cdd:PRK14950  440 EKP 442
motB PRK12799
flagellar motor protein MotB; Reviewed
1015-1148 7.04e-03

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 40.47  E-value: 7.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1015 KQEGTSSSpKPSLLTPGLPSPATFALSTPNTSLLPtrSPELSGSPTPTSPEGLTSASSMLSEVSRLSPTSELTPGPDTTP 1094
Cdd:PRK12799  291 KQIDTHGT-VPVAAVTPSSAVTQSSAITPSSAAIP--SPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALP 367
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 81875363  1095 APEiipesSDSSDLPMNTRTPTQPFTASHPTSIPQLNTTSYPT-----IAPQPTTNPQQ 1148
Cdd:PRK12799  368 AAE-----PVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAapasnIPVSPTSRDAQ 421
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1094-1183 7.15e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 39.00  E-value: 7.15e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363    1094 PAPEIIPESSDSSDLPMNTRTPTQPFTASHPTSIPQL-NTTSYPTIAPQPTTNPQQPRSPHPATSPQPPTNTHPSSTPAT 1172
Cdd:smart00818   69 PQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLqPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLPDL 148
                            90
                    ....*....|.
gi 81875363    1173 PTESLPSSRKT 1183
Cdd:smart00818  149 PLEAWPATDKT 159
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
956-1192 8.55e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.54  E-value: 8.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363   956 PGHPFPAPRARAGSPRKPTPERRPLPTSATTSSPASSSSPEPSGSRQTSGSWPQLIPDSKQEGTSSSPKPSlltpglpsp 1035
Cdd:PHA03307  189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPR--------- 259
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 81875363  1036 atfalstPNTSLLPTRSPELSGSPTPTSPEGLTSASSMLSEVS-RLSPTSELTPGPDTTPAPEIIPESSDSSDLPMNTRT 1114
Cdd:PHA03307  260 -------PAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSpSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS 332
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 81875363  1115 PTQPFTASHPTSipqlnttsyptiapqpttnpqQPRSPHPATSPQPPTNTHPSSTPATPTESLPSSRKTELSSPTKPR 1192
Cdd:PHA03307  333 SESSRGAAVSPG---------------------PSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRR 389
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH