Conserved domains on  [gi|613254167|gb|EZY60209|]
serine-aspartate repeat-containing protein E, partial [Staphylococcus aureus R0353]

List of domain hits

Name Accession Description Interval E-value
Name: MSCRAMM_SdrD super family  Accession: cl37929  Interval: 1-935  E-value: 5.28e-150

MSCRAMM family adhesin SdrD; Features of this protein family include a YSIRK-type signal peptide at the N-terminus and a variable-length C-terminal region of Ser-Asp (SD) repeats followed by an LPXTG motif for surface immobilization by sortase.


The actual alignment was detected with superfamily member NF012181:

Pssm-ID: 467951 [Multi-domain]  Cd Length: 1379  Bit Score: 479.69  E-value: 5.28e-150
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167    1 MINRDNKKAITKKGMISNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTEnAKQDEASASDNKEvvseten 80
Cdd:NF012181    1 MLNRENKTAITRKGMVSNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAESTNKE-LNEATTSASDNQS------- 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167   81 nsttentstnpikkETNTDSQPETKEESTTSSTQKQ--QNNVTATTETKPQNIEKENVKPSTDKTAtEDTSVILEEKKAP 158
Cdd:NF012181   73 --------------SSKVDNQQLNQEDNTKNDNQKEmvSSQGNETTSNGNKSIEKESVQSTTGNKV-EVSTAKSDEQASP 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  159 NNTNNDVTTKPSTSEIQTKPTTPQESTNIENSQPqptpSKVDNQVTDATNPKEPVNVSKEELKNNPEKLkelVRNDSNTD 238
Cdd:NF012181  138 KSTNEDLNTKQTISNQEALQPDLQENKSVVNVQP----TNEENKKVDAKTESTTLNVKSDAIKSNAETL---VDNNSNSN 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  239 RSTKPVATAPTSVAPKRLNAKMRFAVAQPAAVASNNVNDLI-KVTKQTIKVGDGKDDVVAAHDgeEIEYDSEFTIDNKVK 317
Cdd:NF012181  211 NENNADIILPKSTAPKRLNTRMRMAAVQPSSTDSKNVNDLItSNTTLTVVDADKNNKIVPAQD--YLSLKSQITVDDKVK 288
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  318 AGDTMTINYDKNVIPSDLT--DKNDPIDITDPS-GEVIAKGTFDKATKQITYTFTDYVDKYEDIKSRLTLYSYIDKKTVP 394
Cdd:NF012181  289 SGDYFTIKYSDTVQVYGLNpeDIKNIGDIKDPNnGETIATAKHDTANNLITYTFTDYVDRFNSVQMGINYSIYMDADTIP 368
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  395 ---NETSLNLTFATAGKETSQNVtvDYQDPMVHGDSNIQSIFTKL------DEDKQNIEQQIYVNPLKKTATNTKVDIAG 465
Cdd:NF012181  369 vskNDVEFNVTIGNTTTKTTANI--QYPDYVSRDNNSIGSAFTETvshagnAEDPGYYKQTVYVNPSEKSLTNAKLKVEA 446
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  466 SQVD----------------------------------------------------------DYGNI------------- 474
Cdd:NF012181  447 YHKDypdnvgqinkdvtkikiyqapkdyvlnkgydvntnqlidvteqfkdkitygandsvnvDFGSInnsyvvmvdtkfe 526
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  475 --------------------------------------------KLGN----------------------GSTIIDQNTE 488
Cdd:NF012181  527 yttsesptlvqmatlssdgnksvstgnalgftnnqsggagqevyKIGNyvwedtnkngvqelgevgvkgvTVVVYDNKTN 606
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  489 IKVYKVNSNQQ-------LPQSNRIYDFSQ----YEDVTS-QFDNKKSFSNNVATL-----------DFGNIDSAYII-- 543
Cdd:NF012181  607 KEVGRTITDEKggylipnLPNGDYRVEFSNlpqgYEVTPSkQGNNEELDSNGVSSVitvngkdnlsaDLGIYKPKYNLgd 686
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  544 ----------------KVVSKYTPTSDDELDiaQGASMRTTDKNNYYNYAGYSN------------FIVTSTDTGGGD-- 593
Cdd:NF012181  687 yvwedtnkngiqdqdeKGISGVTVTLKDENG--NVLKTVTTDADGKYKFTDLDNgnykvefttpegYTPTTVTSGSDIek 764
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  594 --------GTVKPEEKL-----------YKIGDYVWEDVDKDGVQgtDSKEKPMANVLVTLTYPDGTT-KSVRTDANGHY 653
Cdd:NF012181  765 dsngltttGVINGADNMtldsgfyktpkYNLGNYVWEDTNKDGKQ--DSTEKGISGVTVTLKNENGEVlQTTKTDKDGKY 842
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  654 EFGGLKDGeTYTVKFETPAGYLPTKENGTTDGEKDSNGSSVTVKINGKDDMSLDTGFYKePKYNLGDYVWEDTNKDGIQD 733
Cdd:NF012181  843 QFTGLENG-TYKVEFETPSGYTPTQVGSGTDEGIDSNGTSTTGVIKDKDNDTIDSGFYK-PTYNLGDYVWEDTNKNGVQD 920
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  734 ANEPGIKDVKVTLKDSTGKIIGTTTTDASGKYKFTDLDNGNYTVEFETPAGYTPT-LKNTTAEDKDSNGLTTTGVIKDAD 812
Cdd:NF012181  921 KDEKGISGVTVTLKDENDKVLKTVTTDENGKYQFTDLNNGTYKVEFETPSGYTPTsVTSGNDTEKDSNGLTTTGVIKDAD 1000
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  813 NMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEKGEVIGTTKTDENGKYRFDNLDSGKYKVIFEKP 892
Cdd:NF012181 1001 NMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVKVTLLNEKGEVIGTTKTDENGKYRFDNLDSGKYKVIFEKP 1080
                        1130      1140      1150      1160
                  ....*....|....*....|....*....|....*....|...
gi 613254167  893 AGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTIDNGYFEEDT 935
Cdd:NF012181 1081 AGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTLDNGYFEEET 1123
 
Name: SdrD_B  Accession: pfam17210  Interval: 826-930  E-value: 1.50e-34

SdrD B-like domain; This family corresponds to the B-like domain from the SdrD protein. This domain has three calcium-binding sites within a Greek key beta-sandwich fold.


Pssm-ID: 435789 [Multi-domain]  Cd Length: 112  Bit Score: 127.72  E-value: 1.50e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  826 YSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEKGEVIGTTKTDENGKYRFDNLDSGKYKVIFEKPAGLTQTGTNTTED 905
Cdd:pfam17210   1 ASIGDFVWEDANKNGIQDAGEPGISGVTVTLYDANGTVVGTTTTDANGKYLFTNLAPGTYYVEFTAPAGYTFTPQNQGSD 80
                          90       100       110
                  ....*....|....*....|....*....|..
gi 613254167  906 D-KDADGGEVDVTITD------HDDFTIDNGY 930
Cdd:pfam17210  81 DaLDSDADPATGLTATvtlasgESDLTIDAGL 112
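
The "Cd Length" field and the first and last coordinates on the Cdd rows together indicate how much of each domain model the hit spans. The following is a minimal sketch of that arithmetic; the function name `model_coverage` and the `hits` dictionary are illustrative and not part of any NCBI tool, and the coordinates are copied from the alignments on this page.

```python
def model_coverage(model_from, model_to, model_length):
    """Fraction of the domain model spanned by the aligned region (1-based, inclusive)."""
    if not 1 <= model_from <= model_to <= model_length:
        raise ValueError("model coordinates must satisfy 1 <= from <= to <= length")
    return (model_to - model_from + 1) / model_length


# (first Cdd coordinate, last Cdd coordinate, Cd Length), as read from this page.
hits = {
    "NF012181": (1, 1123, 1379),
    "pfam17210": (1, 112, 112),
}

for accession, (model_from, model_to, model_length) in hits.items():
    coverage = model_coverage(model_from, model_to, model_length)
    print(f"{accession}: {coverage:.1%} of the model spanned")
```

Under these numbers the query spans the whole pfam17210 model but only about four fifths of NF012181, which is at least consistent with the record being annotated as partial.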
Name: MSCRAMM_ClfA  Accession: NF033609  Interval: 22-598  E-value: 1.32e-30

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 130.41  E-value: 1.32e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  22 KFSIRKYTVGTASILVGTTLIFGL-GNQEAKAAENTSTENAKQDEASASDNKEVVSET----ENNSTTENTSTNPIKKET 96
Cdd:NF033609   8 KHAIRKKSIGVASVLVGTLIGFGLlSSKEADASENSVTQSDSASNESKSNDSSSVSAApktdDTNVSDTKTSSNTNNGET 87
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  97 NTDSQPETKEESTTSSTQKQQNNVTATTETKPQNIEKENVKPSTDKTATEDTSVIleekkapNNTNNDVTTKPSTSeiQT 176
Cdd:NF033609  88 SVAQNPAQQETTQSASTNATTEETPVTGEATTTATNQANTPATTQSSNTNAEELV-------NQTSNETTSNDTNT--VS 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 177 KPTTPQESTNIENSQpqpTPSKVDNQVTDATNPKEPvnvskeelknnpeklkelvrndSNTDRSTKPVATAPTSVAPKRL 256
Cdd:NF033609 159 SVNSPQNSTNAENVS---TTQDTSTEATPSNNESAP----------------------QSTDASNKDVVNQAVNTSAPRM 213
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 257 NAKMRFAVAQPAAVASNNVNDLIKVTKQTIKVGDgkddVVAAHDGEEIEYDSEFTIDNKVKAGDTMTINYDK--NVIPSD 334
Cdd:NF033609 214 RAFSLAAVAADAPAAGTDITNQLTNVTVGIDSGT----TVYPHQAGYVKLNYGFSVPNSAVKGDTFKITVPKelNLNGVT 289
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 335 LTDKNDPIDITDpsgEVIAKGTFDkATKQITYTFTDYVDKYEDIKSRLTLYSYIDKKTVPNETSLNLTFATAGKETSQNV 414
Cdd:NF033609 290 STAKVPPIMAGD---QVLANGVID-SDGNVIYTFTDYVDTKEDVKATLTMPAYIDPENVTKTGNVTLTTGIGSTTANKTV 365
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 415 TVDYQDPMVHGDSNIQSIFTKLDEDKQNIEQQIYVNPLKKTATNTKVDiagsqvddyGNIKLGNGST--IIDQNTEIKVY 492
Cdd:NF033609 366 LVDYEKYGKFYNLSIKGTIDQIDKTNNTYRQTIYVNPSGDNVIAPVLT---------GNLKPNTDSNalIDQQNTSIKVY 436
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 493 KVNSNQQLPQSNRIyDFSQYEDVTSQ----FDNKKSFSNNVATLDfGNIDSAYIIKVVSKYTPTSDDelDIAQGASMRTT 568
Cdd:NF033609 437 KVDNAADLSESYFV-NPENFEDVTNSvnitFPNPNQYKVEFNTPD-DQITTPYIVVVNGHIDPNSKG--DLALRSTLYGY 512
                        570       580       590
                 ....*....|....*....|....*....|
gi 613254167 569 DKNNYYNYAGYSNFIVTSTDTGGGDGTVKP 598
Cdd:NF033609 513 NSNIIWRSMSWDNEVAFNNGSGSGDGIDKP 542
Name: YSIRK_signal  Accession: TIGR01168  Interval: 18-55  E-value: 1.98e-07

Gram-positive signal peptide, YSIRK family; Many surface proteins found in Streptococcus, Staphylococcus, and related lineages share apparently homologous signal sequences. A motif resembling [YF]SIRKxxxGxxS[VIA] appears at the start of the transmembrane domain. The GxxS motif appears perfectly conserved, suggesting a specific function and not just homology. There is a strong correlation between proteins carrying this region at the N-terminus and those carrying the Gram-positive anchor domain with the LPXTG sortase processing site at the C-terminus.
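
The motif quoted in this description can be written as a regular expression and checked against the query. This is a minimal sketch, assuming each "x" in [YF]SIRKxxxGxxS[VIA] stands for any single residue; the names `QUERY_N_TERMINUS`, `YSIRK_PATTERN`, and `find_ysirk` are illustrative, and the sequence is the first 60 residues of gi 613254167 as shown in the alignments on this page, upper-cased.

```python
import re

# First 60 residues of the query (gi 613254167), upper-cased.
QUERY_N_TERMINUS = "MINRDNKKAITKKGMISNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTEN"

# [YF]SIRKxxxGxxS[VIA], reading each "x" as any single residue (an assumption).
YSIRK_PATTERN = re.compile(r"[YF]SIRK.{3}G.{2}S[VIA]")


def find_ysirk(sequence):
    """Return (start, end, text) of the first match, 1-based inclusive, or None."""
    match = YSIRK_PATTERN.search(sequence.upper())
    if match is None:
        return None
    return match.start() + 1, match.end(), match.group()


hit = find_ysirk(QUERY_N_TERMINUS)
print(hit)  # (23, 35, 'FSIRKYTVGTASI')
```

The match at residues 23-35 falls inside the 18-55 interval reported for this hit.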


Pssm-ID: 273479 [Multi-domain]  Cd Length: 39  Bit Score: 47.86  E-value: 1.98e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 613254167   18 NRLNKFSIRKYTVGTASILVGtTLIFGLGnqeAKAAEN 55
Cdd:TIGR01168   6 EKQQKYSIRKLSVGVASVLVA-SLFFGGG---VAAAES 39
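
A pairwise block like the one above can be summarized as identical, aligned, and gapped columns. The sketch below assumes the usual CD-Search display convention that "-" marks a gap and lowercase letters mark residues left unaligned to the model; `alignment_stats` is an illustrative helper, not an NCBI function, and the two rows are copied from the block above.

```python
QUERY_ROW = "NRLNKFSIRKYTVGTASILVGtTLIFGLGnqeAKAAEN"
MODEL_ROW = "EKQQKYSIRKLSVGVASVLVA-SLFFGGG---VAAAES"


def alignment_stats(query_row, model_row):
    """Return (identical, aligned, gapped) column counts for two alignment rows."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = gapped = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            gapped += 1
            continue
        # Assumption: lowercase residues are unaligned and are not scored.
        if q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned, gapped


identical, aligned, gapped = alignment_stats(QUERY_ROW, MODEL_ROW)
print(f"{identical}/{aligned} identical ({identical / aligned:.1%}), {gapped} gapped columns")
```

For this block the counts are 18 identical residues over 34 aligned columns, with 4 gapped columns.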
Name: ClfA  Accession: COG4932  Interval: 626-935  E-value: 2.18e-06

Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing adhesive matrix molecules) family, DEv-IgG fold [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 443959 [Multi-domain]  Cd Length: 689  Bit Score: 51.51  E-value: 2.18e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 626 KPMANVLVTLTYPDG----TTKSVRTDANGHYEFGGLKDGETYTVKFETPAGYLPtkengttdgekDSNGSSVTVKINGK 701
Cdd:COG4932  275 EPLAGATFTLTDADGntvvTTTVTVTDADGSYTFTDLPPGTYTVTETKAPAGYDL-----------DGEAVKVTITAGQT 343
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 702 DDMSLDTGFYKEPKYNLgdyVWEDTNKDGiqdaNEPGIKDVKVTLKDSTGKIIGTTTTDASGKYKFTDLDNGNYTVEF-E 780
Cdd:COG4932  344 TTVTVTNGNNEVKTGSV---TLTKVDADD----GEAPLAGAEFTLTDADGTVVATITTDADGTASFKGLAPGTYTLTEtK 416
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 781 TPAGYTPTLKNTTAEDKDsNGLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEK 860
Cdd:COG4932  417 APEGYTLDSTPITVTVTD-GGTGAIDTITNERKKGSVQVTKVDAPLAGATFTLTDADGTVVTLTTDADLAGATFEADGKV 495
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 613254167 861 gevigTTKTDENGKYRFDNLDSGKYKVIFEKPAGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTIDNGYFEEDT 935
Cdd:COG4932  496 -----VTTTDASGKYTFKNLPPGTYTDAGGSATVITDDTDGTVGDEATGTDPEVTVTGKSTTTTPDVALLTNLGT 565
Name: PTZ00441  Accession: PTZ00441  Interval: 117-243  E-value: 1.34e-04

sporozoite surface protein 2 (SSP2); Provisional


Pssm-ID: 240420 [Multi-domain]  Cd Length: 576  Bit Score: 45.73  E-value: 1.34e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 117 QNNVTATTETKPQNIEKENVKPSTDKTATEDTSVILEEKKAPNNTNNDVTTKPSTSEIQTKPTTPQESTNIENSQPQPTP 196
Cdd:PTZ00441 327 QDPVPPPNEGKDGNPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPEDSNKEVP 406
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 613254167 197 SKV--------DNQVTDATNPKE-----PVNVSKEELKN-----NPEKLKELVRNDSNTDRSTKP 243
Cdd:PTZ00441 407 EDVpmepeddrDNNFNEPKKPENkgdgqNEPVIPKPLDNerdqsNKNKQVNPGNRHNSEDRYTRP 471
Name: PspC_subgroup_2  Accession: NF033839  Interval: 17-228  E-value: 2.88e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.76  E-value: 2.88e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  17 SNRLNKFSIRKYTVGTASILVGtTLIFGlgnqeakaAENTSTENAKQDEASASDNKEVVSETENNSTTeNTSTNPIKKET 96
Cdd:NF033839   6 HERKMRYSIRKFSIGVASVAVA-SLFMG--------SVVHATEKEGSTQAATSSNRGNESQAEQRKEL-DLERDKAKKAV 75
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  97 nTDSQPETKEESTTSSTQKQQNNvTATTETKPQNIEKENVKPSTDKTATEDTSVILEEK-----KAPNNTNNDVTTKpST 171
Cdd:NF033839  76 -SEYKEKKVKEIYKKSTKERHKN-TVDLVNKLQNIKNEYLNKIVESTSKSQLQKLMMESqskvdEAVSKFEKDSSSS-SS 152
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*...
gi 613254167 172 SEIQTKPTTPQEstniENSQPQ-PTPSKVDNQVTDATNPKEPvnvSKEELKNNPEKLK 228
Cdd:NF033839 153 SGSSTKPETPQP----ENPEHQkPTTPAPDTKPSPQPEGKKP---SVPDINQEKEKAK 203
Name: PspC_subgroup_1  Accession: NF033838  Interval: 17-243  E-value: 2.75e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.54  E-value: 2.75e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  17 SNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTENAKQDEASASDNKEV-------VSETENNST----TE 85
Cdd:NF033838   6 SERKVHYSIRKFSIGVASVVVASLFLGGVVHAEEVRGGNNPTVTSSGNESQKEHAKEVeshlekiLSEIQKSLDkrkhTQ 85
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  86 NTSTNPIKKETNTD---SQPETKEESTTSSTQKQQNNVTATTEtkpqNIEKENVKPsTDKTATEDTSVILEEKKA----- 157
Cdd:NF033838  86 NVALNKKLSDIKTEylyELNVLKEKSEAELTSKTKKELDAAFE----QFKKDTLEP-GKKVAEATKKVEEAEKKAkdqke 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 158 ------PNNT---------NNDVTTKPSTSE-IQTKPTTPQESTNIENSQpqptpSKVDNQVTDAT---NPKEPVNVSKE 218
Cdd:NF033838 161 edrrnyPTNTyktleleiaESDVEVKKAELElVKEEAKEPRDEEKIKQAK-----AKVESKKAEATrleKIKTDREKAEE 235
                        250       260
                 ....*....|....*....|....*.
gi 613254167 219 ELKNNPE-KLKELVRNDSNTDRSTKP 243
Cdd:NF033838 236 EAKRRADaKLKEAVEKNVATSEQDKP 261
 
Name Accession Description Interval E-value
MSCRAMM_SdrD NF012181
MSCRAMM family adhesin SdrD; Features of this protein family include a YSIRK-type signal ...
1-935 5.28e-150

MSCRAMM family adhesin SdrD; Features of this protein family include a YSIRK-type signal peptide at the N-terminus and a variable-length C-terminal region of Ser-Asp (SD) repeats followed by an LPXTG motif for surface immobilization by sortase.


Pssm-ID: 467951 [Multi-domain]  Cd Length: 1379  Bit Score: 479.69  E-value: 5.28e-150
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167    1 MINRDNKKAITKKGMISNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTEnAKQDEASASDNKEvvseten 80
Cdd:NF012181    1 MLNRENKTAITRKGMVSNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAESTNKE-LNEATTSASDNQS------- 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167   81 nsttentstnpikkETNTDSQPETKEESTTSSTQKQ--QNNVTATTETKPQNIEKENVKPSTDKTAtEDTSVILEEKKAP 158
Cdd:NF012181   73 --------------SSKVDNQQLNQEDNTKNDNQKEmvSSQGNETTSNGNKSIEKESVQSTTGNKV-EVSTAKSDEQASP 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  159 NNTNNDVTTKPSTSEIQTKPTTPQESTNIENSQPqptpSKVDNQVTDATNPKEPVNVSKEELKNNPEKLkelVRNDSNTD 238
Cdd:NF012181  138 KSTNEDLNTKQTISNQEALQPDLQENKSVVNVQP----TNEENKKVDAKTESTTLNVKSDAIKSNAETL---VDNNSNSN 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  239 RSTKPVATAPTSVAPKRLNAKMRFAVAQPAAVASNNVNDLI-KVTKQTIKVGDGKDDVVAAHDgeEIEYDSEFTIDNKVK 317
Cdd:NF012181  211 NENNADIILPKSTAPKRLNTRMRMAAVQPSSTDSKNVNDLItSNTTLTVVDADKNNKIVPAQD--YLSLKSQITVDDKVK 288
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  318 AGDTMTINYDKNVIPSDLT--DKNDPIDITDPS-GEVIAKGTFDKATKQITYTFTDYVDKYEDIKSRLTLYSYIDKKTVP 394
Cdd:NF012181  289 SGDYFTIKYSDTVQVYGLNpeDIKNIGDIKDPNnGETIATAKHDTANNLITYTFTDYVDRFNSVQMGINYSIYMDADTIP 368
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  395 ---NETSLNLTFATAGKETSQNVtvDYQDPMVHGDSNIQSIFTKL------DEDKQNIEQQIYVNPLKKTATNTKVDIAG 465
Cdd:NF012181  369 vskNDVEFNVTIGNTTTKTTANI--QYPDYVSRDNNSIGSAFTETvshagnAEDPGYYKQTVYVNPSEKSLTNAKLKVEA 446
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  466 SQVD----------------------------------------------------------DYGNI------------- 474
Cdd:NF012181  447 YHKDypdnvgqinkdvtkikiyqapkdyvlnkgydvntnqlidvteqfkdkitygandsvnvDFGSInnsyvvmvdtkfe 526
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  475 --------------------------------------------KLGN----------------------GSTIIDQNTE 488
Cdd:NF012181  527 yttsesptlvqmatlssdgnksvstgnalgftnnqsggagqevyKIGNyvwedtnkngvqelgevgvkgvTVVVYDNKTN 606
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  489 IKVYKVNSNQQ-------LPQSNRIYDFSQ----YEDVTS-QFDNKKSFSNNVATL-----------DFGNIDSAYII-- 543
Cdd:NF012181  607 KEVGRTITDEKggylipnLPNGDYRVEFSNlpqgYEVTPSkQGNNEELDSNGVSSVitvngkdnlsaDLGIYKPKYNLgd 686
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  544 ----------------KVVSKYTPTSDDELDiaQGASMRTTDKNNYYNYAGYSN------------FIVTSTDTGGGD-- 593
Cdd:NF012181  687 yvwedtnkngiqdqdeKGISGVTVTLKDENG--NVLKTVTTDADGKYKFTDLDNgnykvefttpegYTPTTVTSGSDIek 764
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  594 --------GTVKPEEKL-----------YKIGDYVWEDVDKDGVQgtDSKEKPMANVLVTLTYPDGTT-KSVRTDANGHY 653
Cdd:NF012181  765 dsngltttGVINGADNMtldsgfyktpkYNLGNYVWEDTNKDGKQ--DSTEKGISGVTVTLKNENGEVlQTTKTDKDGKY 842
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  654 EFGGLKDGeTYTVKFETPAGYLPTKENGTTDGEKDSNGSSVTVKINGKDDMSLDTGFYKePKYNLGDYVWEDTNKDGIQD 733
Cdd:NF012181  843 QFTGLENG-TYKVEFETPSGYTPTQVGSGTDEGIDSNGTSTTGVIKDKDNDTIDSGFYK-PTYNLGDYVWEDTNKNGVQD 920
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  734 ANEPGIKDVKVTLKDSTGKIIGTTTTDASGKYKFTDLDNGNYTVEFETPAGYTPT-LKNTTAEDKDSNGLTTTGVIKDAD 812
Cdd:NF012181  921 KDEKGISGVTVTLKDENDKVLKTVTTDENGKYQFTDLNNGTYKVEFETPSGYTPTsVTSGNDTEKDSNGLTTTGVIKDAD 1000
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  813 NMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEKGEVIGTTKTDENGKYRFDNLDSGKYKVIFEKP 892
Cdd:NF012181 1001 NMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVKVTLLNEKGEVIGTTKTDENGKYRFDNLDSGKYKVIFEKP 1080
                        1130      1140      1150      1160
                  ....*....|....*....|....*....|....*....|...
gi 613254167  893 AGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTIDNGYFEEDT 935
Cdd:NF012181 1081 AGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTLDNGYFEEET 1123
SdrD_B pfam17210
SdrD B-like domain; This family corresponds to the B-like domain from the SdrD protein. This ...
826-930 1.50e-34

SdrD B-like domain; This family corresponds to the B-like domain from the SdrD protein. This domain has three calcium-binding sites within a Greek key beta-sandwich fold.


Pssm-ID: 435789 [Multi-domain]  Cd Length: 112  Bit Score: 127.72  E-value: 1.50e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  826 YSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEKGEVIGTTKTDENGKYRFDNLDSGKYKVIFEKPAGLTQTGTNTTED 905
Cdd:pfam17210   1 ASIGDFVWEDANKNGIQDAGEPGISGVTVTLYDANGTVVGTTTTDANGKYLFTNLAPGTYYVEFTAPAGYTFTPQNQGSD 80
                          90       100       110
                  ....*....|....*....|....*....|..
gi 613254167  906 D-KDADGGEVDVTITD------HDDFTIDNGY 930
Cdd:pfam17210  81 DaLDSDADPATGLTATvtlasgESDLTIDAGL 112
SdrD_B pfam17210
SdrD B-like domain; This family corresponds to the B-like domain from the SdrD protein. This ...
716-820 2.62e-31

SdrD B-like domain; This family corresponds to the B-like domain from the SdrD protein. This domain has three calcium-binding sites within a Greek key beta-sandwich fold.


Pssm-ID: 435789 [Multi-domain]  Cd Length: 112  Bit Score: 118.47  E-value: 2.62e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  716 YNLGDYVWEDTNKDGIQDANEPGIKDVKVTLKDSTGKIIGTTTTDASGKYKFTDLDNGNYTVEFETPAGYTPTLKNTTAE 795
Cdd:pfam17210   1 ASIGDFVWEDANKNGIQDAGEPGISGVTVTLYDANGTVVGTTTTDANGKYLFTNLAPGTYYVEFTAPAGYTFTPQNQGSD 80
                          90       100       110
                  ....*....|....*....|....*....|..
gi 613254167  796 D-KDSNGLTTTGVIKD------ADNMTLDSGF 820
Cdd:pfam17210  81 DaLDSDADPATGLTATvtlasgESDLTIDAGL 112
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
22-598 1.32e-30

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of this sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall-anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 130.41  E-value: 1.32e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  22 KFSIRKYTVGTASILVGTTLIFGL-GNQEAKAAENTSTENAKQDEASASDNKEVVSET----ENNSTTENTSTNPIKKET 96
Cdd:NF033609   8 KHAIRKKSIGVASVLVGTLIGFGLlSSKEADASENSVTQSDSASNESKSNDSSSVSAApktdDTNVSDTKTSSNTNNGET 87
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  97 NTDSQPETKEESTTSSTQKQQNNVTATTETKPQNIEKENVKPSTDKTATEDTSVIleekkapNNTNNDVTTKPSTSeiQT 176
Cdd:NF033609  88 SVAQNPAQQETTQSASTNATTEETPVTGEATTTATNQANTPATTQSSNTNAEELV-------NQTSNETTSNDTNT--VS 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 177 KPTTPQESTNIENSQpqpTPSKVDNQVTDATNPKEPvnvskeelknnpeklkelvrndSNTDRSTKPVATAPTSVAPKRL 256
Cdd:NF033609 159 SVNSPQNSTNAENVS---TTQDTSTEATPSNNESAP----------------------QSTDASNKDVVNQAVNTSAPRM 213
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 257 NAKMRFAVAQPAAVASNNVNDLIKVTKQTIKVGDgkddVVAAHDGEEIEYDSEFTIDNKVKAGDTMTINYDK--NVIPSD 334
Cdd:NF033609 214 RAFSLAAVAADAPAAGTDITNQLTNVTVGIDSGT----TVYPHQAGYVKLNYGFSVPNSAVKGDTFKITVPKelNLNGVT 289
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 335 LTDKNDPIDITDpsgEVIAKGTFDkATKQITYTFTDYVDKYEDIKSRLTLYSYIDKKTVPNETSLNLTFATAGKETSQNV 414
Cdd:NF033609 290 STAKVPPIMAGD---QVLANGVID-SDGNVIYTFTDYVDTKEDVKATLTMPAYIDPENVTKTGNVTLTTGIGSTTANKTV 365
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 415 TVDYQDPMVHGDSNIQSIFTKLDEDKQNIEQQIYVNPLKKTATNTKVDiagsqvddyGNIKLGNGST--IIDQNTEIKVY 492
Cdd:NF033609 366 LVDYEKYGKFYNLSIKGTIDQIDKTNNTYRQTIYVNPSGDNVIAPVLT---------GNLKPNTDSNalIDQQNTSIKVY 436
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 493 KVNSNQQLPQSNRIyDFSQYEDVTSQ----FDNKKSFSNNVATLDfGNIDSAYIIKVVSKYTPTSDDelDIAQGASMRTT 568
Cdd:NF033609 437 KVDNAADLSESYFV-NPENFEDVTNSvnitFPNPNQYKVEFNTPD-DQITTPYIVVVNGHIDPNSKG--DLALRSTLYGY 512
                        570       580       590
                 ....*....|....*....|....*....|
gi 613254167 569 DKNNYYNYAGYSNFIVTSTDTGGGDGTVKP 598
Cdd:NF033609 513 NSNIIWRSMSWDNEVAFNNGSGSGDGIDKP 542
SdrD_B pfam17210
SdrD B-like domain; This family corresponds to the B-like domain from the SdrD protein. This ...
603-710 1.34e-23

SdrD B-like domain; This family corresponds to the B-like domain from the SdrD protein. This domain has three calcium-binding sites within a Greek key beta-sandwich fold.


Pssm-ID: 435789 [Multi-domain]  Cd Length: 112  Bit Score: 96.52  E-value: 1.34e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  603 YKIGDYVWEDVDKDGVQgtDSKEKPMANVLVTLTYPDGTT-KSVRTDANGHYEFGGLKDGeTYTVKFETPAGYLPTKENG 681
Cdd:pfam17210   1 ASIGDFVWEDANKNGIQ--DAGEPGISGVTVTLYDANGTVvGTTTTDANGKYLFTNLAPG-TYYVEFTAPAGYTFTPQNQ 77
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 613254167  682 TTDGEKDSNGSSVTVKIN------GKDDMSLDTGF 710
Cdd:pfam17210  78 GSDDALDSDADPATGLTAtvtlasGESDLTIDAGL 112
Big_8 pfam17961
Bacterial Ig domain; This entry represents a bacterial Ig-fold domain that is found in a wide ...
299-397 1.19e-22

Bacterial Ig domain; This entry represents a bacterial Ig-fold domain that is found in a wide range of bacterial cell surface adherence proteins.


Pssm-ID: 465589 [Multi-domain]  Cd Length: 102  Bit Score: 93.44  E-value: 1.19e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  299 HDGEEIEYDSEFTIDNKVKAGDTMTINYDKNVIPSDLTDKNDPIDITDPSGEVIAKGTFDKATKQITYTFTDYVDKYEDI 378
Cdd:pfam17961   3 DQGESLKLKADFSLGDSVKEGDYFTIKLPDNLKFYGINTSDKSFDIKDDNGEVIAKGTYDPGTGTITYTFTDYVENKSNI 82
                          90
                  ....*....|....*....
gi 613254167  379 KSRLTLYSYIDKKTVPNET 397
Cdd:pfam17961  83 KGSLYLPAYIDKKKVKENG 101
SdrG_C_C pfam10425
C-terminus of bacterial fibrinogen-binding adhesin; This is the C-terminal half of a bacterial ...
428-581 6.99e-21

C-terminus of bacterial fibrinogen-binding adhesin; This is the C-terminal half of the bacterial fibrinogen-binding adhesin SdrG. SdrG is a Gram-positive cell-wall-anchored adhesin that allows attachment of the bacterium to host tissues via specific binding to the beta-chain of human fibrinogen (Fg). SdrG binds to its ligand with a dynamic "dock, lock, and latch" mechanism, which represents a general mode of ligand binding for structurally related cell-wall-anchored proteins in most Gram-positive bacteria. The C-terminal part of SdrG (276-596) is integral to the folding of the immunoglobulin-like whole to create the docking grooves necessary for Fg binding. The domain is associated with families of Cna_B, pfam05738.


Pssm-ID: 431277 [Multi-domain]  Cd Length: 156  Bit Score: 90.17  E-value: 6.99e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  428 NIQSIFTKLDEDKQNIEQQIYVNPLKKTATNTKVDiagsqvddyGNIKLGN--GSTIIDQNTEIKVYKVNSNQQLPQSNR 505
Cdd:pfam10425   7 NISSRIMHFDKENGTFEQTIYVNPNKKSLTSATVT---------GNLSGYIdsGSKVNPNNTNVKIYKVNDGQDLPDSYY 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  506 I-YDFSQYEDVTSQFDNKKSF-SNNVATLDFGNI--DSAYIIKVVSKYTPTSDDELDIAQGASMRTTDkNNYYNYAGYSN 581
Cdd:pfam10425  78 VnEDTSELEDVTNQFDGYISLgNNNSASINFGNLqsDKSYIVKVVGKYDNNNDDSVDLRTTLYGYNTQ-YVTSYSYGWTN 156
CarboxypepD_reg pfam13620
Carboxypeptidase regulatory-like domain;
843-919 1.08e-08

Carboxypeptidase regulatory-like domain;


Pssm-ID: 433354 [Multi-domain]  Cd Length: 81  Bit Score: 53.05  E-value: 1.08e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 613254167  843 DSTEKGIKDVTVTLQNEKGEVIGTTKTDENGKYRFDNLDSGKYKVIFEKPAGLTQTGTNTTEDdkdaDGGEVDVTIT 919
Cdd:pfam13620   8 DPSGAPVPGATVTVTNTDTGTVRTTTTDADGRYRFPGLPPGTYTVTVSAPGFKTATRTGVTVT----AGQTTTLDVT 80
YSIRK_signal pfam04650
YSIRK type signal peptide; Many surface proteins found in Streptococcus, Staphylococcus, and ...
18-43 1.11e-08

YSIRK type signal peptide; Many surface proteins found in Streptococcus, Staphylococcus, and related lineages share apparently homologous signal sequences. A motif resembling [YF]SIRKxxxGxxS[VIA] appears at the start of the transmembrane domain. The GxxS motif appears perfectly conserved, suggesting a specific function and not just homology. There is a strong correlation between proteins carrying this region at the N-terminus and those carrying the Gram-positive anchor domain with the LPXTG sortase processing site at the C-terminus.


Pssm-ID: 428049 [Multi-domain]  Cd Length: 26  Bit Score: 51.23  E-value: 1.11e-08
                          10        20
                  ....*....|....*....|....*.
gi 613254167   18 NRLNKFSIRKYTVGTASILVGTTLIF 43
Cdd:pfam04650   1 EKKQRYSIRKLSVGVASVLIGTLLFL 26
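
The motif quoted in the YSIRK_signal description, [YF]SIRKxxxGxxS[VIA], can be checked against the two rows of the alignment above. The following is a minimal sketch, assuming each "x" in the motif stands for any single residue; the function name find_ysirk and its offset parameter are hypothetical helpers introduced here for illustration and are not part of any NCBI tool.

```python
import re

# Pattern transcribed from the family description: [YF]SIRKxxxGxxS[VIA].
# Assumption: each "x" matches exactly one residue of any type.
YSIRK_PATTERN = re.compile(r"[YF]SIRK.{3}G.{2}S[VIA]")


def find_ysirk(seq, offset=1):
    """Return (position, matched_text) pairs for every motif occurrence.

    `offset` is the residue number of the first character of `seq`,
    so the reported positions use the same numbering as the alignment.
    """
    return [(m.start() + offset, m.group()) for m in YSIRK_PATTERN.finditer(seq)]


# Query residues 18-43 and the pfam04650 row, copied from the alignment above.
query = "NRLNKFSIRKYTVGTASILVGTTLIF"
consensus = "EKKQRYSIRKLSVGVASVLIGTLLFL"

print(find_ysirk(query, offset=18))
print(find_ysirk(consensus))
```

Passing offset=18 reports the hit in the numbering of gi 613254167 rather than as an index into the excerpt. The same pattern could be run over a full protein sequence, but a regular-expression match is only a quick screen and does not replace the profile-based score reported for this hit.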
YSIRK_signal TIGR01168
Gram-positive signal peptide, YSIRK family; Many surface proteins found in Streptococcus, ...
18-55 1.98e-07

Gram-positive signal peptide, YSIRK family; Many surface proteins found in Streptococcus, Staphylococcus, and related lineages share apparently homologous signal sequences. A motif resembling [YF]SIRKxxxGxxS[VIA] appears at the start of the transmembrane domain. The GxxS motif appears perfectly conserved, suggesting a specific function and not just homology. There is a strong correlation between proteins carrying this region at the N-terminus and those carrying the Gram-positive anchor domain with the LPXTG sortase processing site at the C-terminus.


Pssm-ID: 273479 [Multi-domain]  Cd Length: 39  Bit Score: 47.86  E-value: 1.98e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 613254167   18 NRLNKFSIRKYTVGTASILVGtTLIFGLGnqeAKAAEN 55
Cdd:TIGR01168   6 EKQQKYSIRKLSVGVASVLVA-SLFFGGG---VAAAES 39
ClfA COG4932
Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing ...
626-935 2.18e-06

Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing adhesive matrix molecules) family, DEv-IgG fold [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 443959 [Multi-domain]  Cd Length: 689  Bit Score: 51.51  E-value: 2.18e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 626 KPMANVLVTLTYPDG----TTKSVRTDANGHYEFGGLKDGETYTVKFETPAGYLPtkengttdgekDSNGSSVTVKINGK 701
Cdd:COG4932  275 EPLAGATFTLTDADGntvvTTTVTVTDADGSYTFTDLPPGTYTVTETKAPAGYDL-----------DGEAVKVTITAGQT 343
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 702 DDMSLDTGFYKEPKYNLgdyVWEDTNKDGiqdaNEPGIKDVKVTLKDSTGKIIGTTTTDASGKYKFTDLDNGNYTVEF-E 780
Cdd:COG4932  344 TTVTVTNGNNEVKTGSV---TLTKVDADD----GEAPLAGAEFTLTDADGTVVATITTDADGTASFKGLAPGTYTLTEtK 416
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 781 TPAGYTPTLKNTTAEDKDsNGLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEK 860
Cdd:COG4932  417 APEGYTLDSTPITVTVTD-GGTGAIDTITNERKKGSVQVTKVDAPLAGATFTLTDADGTVVTLTTDADLAGATFEADGKV 495
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 613254167 861 gevigTTKTDENGKYRFDNLDSGKYKVIFEKPAGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTIDNGYFEEDT 935
Cdd:COG4932  496 -----VTTTDASGKYTFKNLPPGTYTDAGGSATVITDDTDGTVGDEATGTDPEVTVTGKSTTTTPDVALLTNLGT 565
CarboxypepD_reg pfam13620
Carboxypeptidase regulatory-like domain;
617-683 3.46e-06

Carboxypeptidase regulatory-like domain;


Pssm-ID: 433354 [Multi-domain]  Cd Length: 81  Bit Score: 45.73  E-value: 3.46e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  617 GVQGT--DSKEKPMANVLVTLTYPD-GTTKSVRTDANGHYEFGGLKDGeTYTVKFETPaGYLPTKENGTT 683
Cdd:pfam13620   1 TISGTvtDPSGAPVPGATVTVTNTDtGTVRTTTTDADGRYRFPGLPPG-TYTVTVSAP-GFKTATRTGVT 68
PTZ00441 PTZ00441
sporozoite surface protein 2 (SSP2); Provisional
117-243 1.34e-04

sporozoite surface protein 2 (SSP2); Provisional


Pssm-ID: 240420 [Multi-domain]  Cd Length: 576  Bit Score: 45.73  E-value: 1.34e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 117 QNNVTATTETKPQNIEKENVKPSTDKTATEDTSVILEEKKAPNNTNNDVTTKPSTSEIQTKPTTPQESTNIENSQPQPTP 196
Cdd:PTZ00441 327 QDPVPPPNEGKDGNPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPEDSNKEVP 406
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 613254167 197 SKV--------DNQVTDATNPKE-----PVNVSKEELKN-----NPEKLKELVRNDSNTDRSTKP 243
Cdd:PTZ00441 407 EDVpmepeddrDNNFNEPKKPENkgdgqNEPVIPKPLDNerdqsNKNKQVNPGNRHNSEDRYTRP 471
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
17-228 2.88e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.76  E-value: 2.88e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  17 SNRLNKFSIRKYTVGTASILVGtTLIFGlgnqeakaAENTSTENAKQDEASASDNKEVVSETENNSTTeNTSTNPIKKET 96
Cdd:NF033839   6 HERKMRYSIRKFSIGVASVAVA-SLFMG--------SVVHATEKEGSTQAATSSNRGNESQAEQRKEL-DLERDKAKKAV 75
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  97 nTDSQPETKEESTTSSTQKQQNNvTATTETKPQNIEKENVKPSTDKTATEDTSVILEEK-----KAPNNTNNDVTTKpST 171
Cdd:NF033839  76 -SEYKEKKVKEIYKKSTKERHKN-TVDLVNKLQNIKNEYLNKIVESTSKSQLQKLMMESqskvdEAVSKFEKDSSSS-SS 152
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*...
gi 613254167 172 SEIQTKPTTPQEstniENSQPQ-PTPSKVDNQVTDATNPKEPvnvSKEELKNNPEKLK 228
Cdd:NF033839 153 SGSSTKPETPQP----ENPEHQkPTTPAPDTKPSPQPEGKKP---SVPDINQEKEKAK 203
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
71-225 4.03e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 43.88  E-value: 4.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167   71 NKEVVSETENNSTTENTSTNPIKKETNTDSQPETKEESTTSSTQKQQNNVTATTE--TKPQNIEKENVKPSTDKTATEDT 148
Cdd:pfam05539 168 PKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQgtTTSSNPEPQTEPPPSQRGPSGSP 247
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  149 SVILEEKKAPNNTNND-------VTTKPSTSEIQT--KPTTPQESTNIENSQpQPTPSkvdNQVTDATNPKEPvNVSKEE 219
Cdd:pfam05539 248 QHPPSTTSQDQSTTGDgqehtqrRKTPPATSNRRSphSTATPPPTTKRQETG-RPTPR---PTATTQSGSSPP-HSSPPG 322

                  ....*.
gi 613254167  220 LKNNPE 225
Cdd:pfam05539 323 VQANPT 328
Caldesmon pfam02029
Caldesmon;
44-229 4.53e-04

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 44.09  E-value: 4.53e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167   44 GLGNQEAKAAENT------STENAKQDEASASDNKEVVSETENNSTTEnTSTNPIKKETNTDSQPETKEESTTSSTQKQQ 117
Cdd:pfam02029  63 AFLDRTAKREERRqkrlqeALERQKEFDPTIADEKESVAERKENNEEE-ENSSWEKEEKRDSRLGRYKEEETEIREKEYQ 141
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  118 NN-------------------VTATTETKPQNIEKENVK---PSTDKTATEDTSVILEEKKAPNNtnndvtTKPSTSEIQ 175
Cdd:pfam02029 142 ENkwstevrqaeeegeeeedkSEEAEEVPTENFAKEEVKdekIKKEKKVKYESKVFLDQKRGHPE------VKSQNGEEE 215
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 613254167  176 TKPTTPQESTNIENSQPQPTPSKVDNQVTDATNPKEPVNVSKEELKNNP-EKLKE 229
Cdd:pfam02029 216 VTKLKVTTKRRQGGLSQSQEREEEAEVFLEAEQKLEELRRRRQEKESEEfEKLRQ 270
ClfA COG4932
Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing ...
584-925 7.09e-04

Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing adhesive matrix molecules) family, DEv-IgG fold [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 443959 [Multi-domain]  Cd Length: 689  Bit Score: 43.42  E-value: 7.09e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 584 VTSTDTGGGDGTVKPEEKLYKIGDYVWEDVDKDgvqgtdSKEKPMANVLVTLTYPDGTT-KSVRTDANGHYEFGGLKDGe 662
Cdd:COG4932  336 VTITAGQTTTVTVTNGNNEVKTGSVTLTKVDAD------DGEAPLAGAEFTLTDADGTVvATITTDADGTASFKGLAPG- 408
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 663 TYTVKF-ETPAGYLPTKENGTTDGEKDSNGSSVTVKINGKDDMslDTGFYKEPKYNLGDYVWEDTNKDGIQDANEPGIKD 741
Cdd:COG4932  409 TYTLTEtKAPEGYTLDSTPITVTVTDGGTGAIDTITNERKKGS--VQVTKVDAPLAGATFTLTDADGTVVTLTTDADLAG 486
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 742 VKVTLKDSTGKIIgttttDASGKYKFTDLDNGNYTVEFETPAGYTPTLKNTTAEDKDSNGLTTTGVIKD---ADNMTLDS 818
Cdd:COG4932  487 ATFEADGKVVTTT-----DASGKYTFKNLPPGTYTDAGGSATVITDDTDGTVGDEATGTDPEVTVTGKStttTPDVALLT 561
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 819 GFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEKGEVIGTTKTDENGKYRFDNLDSGKYKVIFEKPAGLTQT 898
Cdd:COG4932  562 NLGTTEDALTSLAKTGDEVGKGLTLTTTTTVDTLDTNATEKTETVTVTAQLIGVKTTKLTDTTDPKGGTVEEATTTGGTA 641
                        330       340
                 ....*....|....*....|....*..
gi 613254167 899 GTNTTEDDKDADGGEVDVTITDHDDFT 925
Cdd:COG4932  642 NTGKTGTDLTDDTTVTSTTNTATSVED 668
Herpes_U47 pfam05467
Herpesvirus glycoprotein U47;
78-284 1.48e-03

Herpesvirus glycoprotein U47;


Pssm-ID: 283192 [Multi-domain]  Cd Length: 677  Bit Score: 42.57  E-value: 1.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167   78 TENNSTTENTSTNPIKKETNTDSQPETKEEST--TSSTQKQQNNVTATT----------ETKPQN---IEKENVKPSTDK 142
Cdd:pfam05467 335 TENPTENPKSPPKPTNFENTTIRIPETFESTTvaTNTTQKLESTTFATTigieeisdniYSSPKNsiyLKSKSQQSTTKF 414
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  143 TATEDTSVILE----EKKAPNNTNNDVTTKPSTSEI------QTKPTTPQESTNiensqpqPTpskvdnQVTDATNPKEP 212
Cdd:pfam05467 415 TDTEHTTPILKfttwQDAARTYMSHNTEVQNMTENFikislgETMGITPKEPTN-------PT------QLLNVKNQTEY 481
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 613254167  213 VNVSKEELKNNPEKLKELVRNDSNTDRSTKPVATAPTSVAPKR-----LNAKMRFAVAQPAAVASNNVNDLIKVTKQ 284
Cdd:pfam05467 482 ANETHSTEVQTVKTFKEDRFQRTTLKSSSEPPTVQTLSVTPKKklpsnVTAKTEVQVTNNALPSSNSSHSITKVTEE 558
CarboxypepD_reg pfam13620
Carboxypeptidase regulatory-like domain;
731-788 2.52e-03

Carboxypeptidase regulatory-like domain;


Pssm-ID: 433354 [Multi-domain]  Cd Length: 81  Bit Score: 37.64  E-value: 2.52e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 613254167  731 IQDANEPGIKDVKVTLKDSTGKIIGTTTTDASGKYKFTDLDNGNYTVEFETPaGYTPT 788
Cdd:pfam13620   6 VTDPSGAPVPGATVTVTNTDTGTVRTTTTDADGRYRFPGLPPGTYTVTVSAP-GFKTA 62
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
849-884 2.57e-03

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 41.99  E-value: 2.57e-03
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 613254167  849 IKDVTVTLQNEKGEVIGTTKTDENGKYRFDNLDSGK 884
Cdd:COG2373   291 VAGAEVELYDRNGQVLATATTDADGLARFPAGDRGE 326
PRK08581 PRK08581
amidase domain-containing protein;
22-247 2.60e-03

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 41.70  E-value: 2.60e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  22 KFSIRKYTVGTASILvgTTLIFGLGNQEAKAAENTSTENAKQDEASASDNKEVVSETENNSTTENTSTNpiKKETNTDSQ 101
Cdd:PRK08581   3 KNKILIYLLSTTLVL--PTLTSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTS--NQDNNDKKF 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 102 PETKEESTTSSTQKQQ--NNVTATTETKPQNIEKENVKPSTDKTATE--DTSVILEEKKAPNNTNNDVTTKPSTSEIQTK 177
Cdd:PRK08581  79 STIDSSTSDSNNIIDFiyKNLPQTNINQLLTKNKYDDNYSLTTLIQNlfNLNSDISDYEQPRNSEKSTNDSNKNSDSSIK 158
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 613254167 178 PTTPQESTN---IENSQPQPTPSKVDNQVTDATNPKEPVNVSKEELKNNPEKLKELVRNDSNTDRSTKPVATA 247
Cdd:PRK08581 159 NDTDTQSSKqdkADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDS 231
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
17-243 2.75e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.54  E-value: 2.75e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  17 SNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTENAKQDEASASDNKEV-------VSETENNST----TE 85
Cdd:NF033838   6 SERKVHYSIRKFSIGVASVVVASLFLGGVVHAEEVRGGNNPTVTSSGNESQKEHAKEVeshlekiLSEIQKSLDkrkhTQ 85
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  86 NTSTNPIKKETNTD---SQPETKEESTTSSTQKQQNNVTATTEtkpqNIEKENVKPsTDKTATEDTSVILEEKKA----- 157
Cdd:NF033838  86 NVALNKKLSDIKTEylyELNVLKEKSEAELTSKTKKELDAAFE----QFKKDTLEP-GKKVAEATKKVEEAEKKAkdqke 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 158 ------PNNT---------NNDVTTKPSTSE-IQTKPTTPQESTNIENSQpqptpSKVDNQVTDAT---NPKEPVNVSKE 218
Cdd:NF033838 161 edrrnyPTNTyktleleiaESDVEVKKAELElVKEEAKEPRDEEKIKQAK-----AKVESKKAEATrleKIKTDREKAEE 235
                        250       260
                 ....*....|....*....|....*.
gi 613254167 219 ELKNNPE-KLKELVRNDSNTDRSTKP 243
Cdd:NF033838 236 EAKRRADaKLKEAVEKNVATSEQDKP 261
bMG3 pfam11974
Bacterial alpha-2-macroglobulin MG3 domain; This is the MG3 domain from bacterial ...
847-878 2.79e-03

Bacterial alpha-2-macroglobulin MG3 domain; This is the MG3 domain from bacterial alpha2-macroglobulins.


Pssm-ID: 432232 [Multi-domain]  Cd Length: 102  Bit Score: 38.35  E-value: 2.79e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 613254167  847 KGIKDVTVTLQNEKGEVIGTTKTDENGKYRFD 878
Cdd:pfam11974  26 KPVAGVEVRLLDCNGQVLATGTTDAQGHARFE 57
PRK06347 PRK06347
1,4-beta-N-acetylmuramoylhydrolase;
59-131 3.29e-03

1,4-beta-N-acetylmuramoylhydrolase;


Pssm-ID: 180536 [Multi-domain]  Cd Length: 592  Bit Score: 41.22  E-value: 3.29e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 613254167  59 ENAKQDEASASDNKEVVSETENNSTTENTSTNPI-KKETNTDSQPETKEESTTSSTQKQQNNVTATTETKPQNI 131
Cdd:PRK06347  54 ETAPADEASKSAEANTTKEAPATATPENTTEPTVePKQTETKEQTKTPEEKQPAAKQVEKAPAEPATVSNPDNA 127
MetallophosN pfam16371
N terminal of Calcineurin-like phosphoesterase; This is the N-terminal of Calcineurin-like ...
624-685 3.73e-03

N terminal of Calcineurin-like phosphoesterase; This is the N-terminal region of calcineurin-like phosphoesterases. It is around 150 residues in length and is found in various Bacteroides species. The function of this family is unknown.


Pssm-ID: 435307  Cd Length: 73  Bit Score: 37.15  E-value: 3.73e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 613254167  624 KEKPMANVLVTltypDGTTkSVRTDANGHYEFggLKDGETYTVKFETPAGYLPTKENGTTDG 685
Cdd:pfam16371   1 NGKGLAGVVVS----DGYN-FTKTDANGRYTL--PDDKKAKFVYISTPAGYEVPTDDGITPR 55
PHA03255 PHA03255
BDLF3; Provisional
46-212 6.30e-03

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 39.50  E-value: 6.30e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  46 GNQEAKAAENTSTENAKQDEASASdnkevvSETENNSTTENTSTNPIKKE----TNTDSQPETKEESTTSSTQKQQNNVT 121
Cdd:PHA03255  28 GSSTASAGNVTGTTAVTTPSPSAS------GPSTNQSTTLTTTSAPITTTailsTNTTTVTSTGTTVTPVPTTSNASTIN 101
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 122 ATTETKPQNIEKENVKPSTdktatedtsvileekkAPNNTNNDVTTKPSTSEIQTKPTTPQESTniensqpqPTPSKVDN 201
Cdd:PHA03255 102 VTTKVTAQNITATEAGTGT----------------STGVTSNVTTRSSSTTSATTRITNATTLA--------PTLSSKGT 157
                        170
                 ....*....|.
gi 613254167 202 QVTDATNPKEP 212
Cdd:PHA03255 158 SNATKTTAELP 168
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
55-218 6.57e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 40.03  E-value: 6.57e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167   55 NTSTENAKQDEASASDNKEVVSETENNSTTENTSTNPikkETNTDSQPETKEESTT----SSTQKQQNNVTATTETKPQN 130
Cdd:pfam05539 194 TPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNP---EPQTEPPPSQRGPSGSpqhpPSTTSQDQSTTGDGQEHTQR 270
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  131 IE----KENVKPSTDKTATEDTSVILEEKKA---PNNTNNDVTTKPSTSE--IQTKPTTpQESTNIENSQPqPTPSKVDN 201
Cdd:pfam05539 271 RKtppaTSNRRSPHSTATPPPTTKRQETGRPtprPTATTQSGSSPPHSSPpgVQANPTT-QNLVDCKELDP-PKPNSICY 348
                         170
                  ....*....|....*..
gi 613254167  202 QVtDATNPKEPVNVSKE 218
Cdd:pfam05539 349 GV-GIYNEALPRGCDIV 364
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
29-187 7.81e-03

Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 39.72  E-value: 7.81e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  29 TVGTASILVG--TTLIFGLGNQEAKAAENTSTENAKQDEASASDNKEVVSETENNSTTENTSTNPIKKETNTDSQPETKE 106
Cdd:PRK13335   5 TIAKTSLALGllTTGAITVTTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSATTQAANTRQERTPKLEKAPNTNE 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 107 ESTTSSTqkqqnnVTATTETKPQNIEKENVkPSTDKTATEDTSVILEEKK------APNNTNNDVTTKPSTSEIQTKPTT 180
Cdd:PRK13335  85 EKTSASK------IEKISQPKQEEQKSLNI-SATPAPKQEQSQTTTESTTpktkvtTPPSTNTPQPMQSTKSDTPQSPTI 157

                 ....*..
gi 613254167 181 PQESTNI 187
Cdd:PRK13335 158 KQAQTDM 164
PRK08581 PRK08581
amidase domain-containing protein;
48-223 8.60e-03

Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 39.77  E-value: 8.60e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167  48 QEAKAAENTSTENAKQDEASASDNKEVVSETENNSTTENTSTNPIKK---ETNTDSQPETKEESTTSSTQKQQNNVTATT 124
Cdd:PRK08581  82 DSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNlnsDISDYEQPRNSEKSTNDSNKNSDSSIKNDT 161
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 613254167 125 ETKPQNIEKENVKPSTDKTATEDTSVILEEKKAPNNTNNDVTtkpSTSEIQTKPTTPQESTNIENSQPQPTPSKVDNQVT 204
Cdd:PRK08581 162 DTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSN---SQPASDDTANQKSSSKDNQSMSDSALDSILDQYSE 238
                        170
                 ....*....|....*....
gi 613254167 205 DATNPKEPVNVSKEELKNN 223
Cdd:PRK08581 239 DAKKTQKDYASQSKKDKTE 257
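
Each alignment block above pairs a query row (`gi 613254167`) with a domain-model row (`Cdd:...`), and each row carries its start and end coordinates. As a rough illustration of reading one such pair programmatically, the Python sketch below parses two rows and counts identical columns. The helper names `parse_row` and `count_identities` are hypothetical and are not part of CD-Search. The sketch also assumes that lowercase letters mark unaligned residues and that `-` marks a gap, so only uppercase matches are counted.

```python
import re

# Matches rows such as "gi 613254167 205 DATNPKEPVNVSKEELKNN 223"
# and "Cdd:PRK08581 239 DAKKTQKDYASQSKKDKTE 257".
ROW = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")


def parse_row(line):
    """Split one alignment row into (label, start, residues, end)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, start, residues, end = match.groups()
    return label, int(start), residues, int(end)


def count_identities(query, subject):
    """Count columns where both rows show the same uppercase residue.

    Assumption: lowercase letters are unaligned and '-' is a gap,
    so neither can contribute an identity.
    """
    if len(query) != len(subject):
        raise ValueError("rows of one block must have equal length")
    return sum(1 for q, s in zip(query, subject) if q.isupper() and q == s)


# Last block of the PRK08581 alignment shown above.
q_label, q_start, q_seq, q_end = parse_row("gi 613254167 205 DATNPKEPVNVSKEELKNN 223")
s_label, s_start, s_seq, s_end = parse_row("Cdd:PRK08581 239 DAKKTQKDYASQSKKDKTE 257")
identities = count_identities(q_seq, s_seq)
print(q_label, q_start, q_end, s_label, s_start, s_end, identities)
```

Summing the per-block counts over all blocks of a hit gives a total identity count for that alignment. Few identities over a long interval are consistent with the weak E-values reported for these hits.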
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
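
The E-value threshold of 0.01 listed above explains why every hit on this page reports an E-value below that figure. The Python sketch below shows one way to pull the numbers out of a `Pssm-ID: ...` summary line and compare the E-value with the threshold. The names `parse_summary` and `passes_threshold` are hypothetical, and treating the cutoff as inclusive is an assumption, not documented CD-Search behavior.

```python
import re

# Pattern for summary lines such as:
# "Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 39.72  E-value: 7.81e-03"
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one hit summary line as a dict."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(e_value, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the cutoff."""
    return e_value <= threshold


hit = parse_summary(
    "Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 39.72  E-value: 7.81e-03"
)
print(hit, passes_threshold(hit["e_value"]))
```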
