Conserved domains on [gi|161077523|ref|NP_001096864|]

futsch, isoform C [Drosophila melanogaster]

List of domain hits

Name Accession Description Interval E-value
PTZ00121 super family  cl31754  MAEBL; Provisional  3352-4195  1.42e-35

The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 151.06  E-value: 1.42e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3352 KDSADESKEQRPESLPQSKAGSIKDEKSPLASKDEAEKSKEESRRESVAEQfplvSKEVSRPASVAESVKDEAEKSKEES 3431
Cdd:PTZ00121 1045 KDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEE----AFGKAEEAKKTETGKAEEARKAEEA 1120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3432 PLMSKEASRpasvAGSVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKe 3511
Cdd:PTZ00121 1121 KKKAEDARK----AEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRK- 1195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3512 ASRPASVAESIKDEAEKSKEESRRESVAEKSplpskEASRptSVAESVKDEAEKSKEESRRDSVAEKSPLASKEASRPAS 3591
Cdd:PTZ00121 1196 AEDARKAEAARKAEEERKAEEARKAEDAKKA-----EAVK--KAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3592 VAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRR-ESVAEKSPLASKEASRPTSVAESV 3670
Cdd:PTZ00121 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKaEEAKKKADAAKKKAEEAKKAAEAA 1348
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3671 KDEAEKSKEEssrdsvAEKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVK---DD 3747
Cdd:PTZ00121 1349 KAEAEAAADE------AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKkkaDE 1422
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3748 AEKSKEESRRESVAEKSPLASKEASRPASVAESVK--DEAEKSKEESRRESVAEKSPLPSKEASRPTSVAEsvkdEAEKS 3825
Cdd:PTZ00121 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKkaEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE----EAKKK 1498
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3826 KEESRRESVAEKSSLASKKASRPASVAESVKDEAEKSKEESRR-ESVAEKSPLASKEASRPasvAESVKDEAEKSKEESR 3904
Cdd:PTZ00121 1499 ADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKaEEKKKADELKKAEELKK---AEEKKKAEEAKKAEED 1575
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3905 RESVAEKSPLPSK-EASRPTSVAESVKDEADKSKEESRRESGA--EKSPLASMEASRPTSVAESVKDETEKSKEESRR-- 3979
Cdd:PTZ00121 1576 KNMALRKAEEAKKaEEARIEEVMKLYEEEKKMKAEEAKKAEEAkiKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKka 1655
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3980 --ESVTEKSPLPSKEASRPTSVAESVKDEAEKSKEE---SRRESVAEKSPLASKESSRPASVAESIKDEAEGTK---QES 4051
Cdd:PTZ00121 1656 eeENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAealKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKikaEEA 1735
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4052 RRESMPESGKAESIKGDQSSlaSKETSRPDSVVESVKDETEKPEGSAID---KSQVASRPESVAVSAKDEKSPLHSRPES 4128
Cdd:PTZ00121 1736 KKEAEEDKKKAEEAKKDEEE--KKKIAHLKKEEEKKAEEIRKEKEAVIEeelDEEDEKRRMEVDKKIKDIFDNFANIIEG 1813
                         810       820       830       840       850       860       870
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 161077523 4129 VADKSP--DASKEA--SRSLSVAETASSPIEEGPRSIADLSLPLNLTGEAKGKLPTLSSPIDVAEGDFLEV 4195
Cdd:PTZ00121 1814 GKEGNLviNDSKEMedSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEI 1884
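
The paired rows in an alignment block like the one above can be checked programmatically. The following is a minimal Python sketch and is not part of the CD-Search output: `parse_row` and `count_identities` are hypothetical helper names, and the sketch assumes that lowercase letters mark residues outside the aligned blocks and that `-` marks a gap, so only columns where both rows carry an uppercase letter are counted.

```python
def parse_row(line):
    """Split one alignment row into (label, start, residues, end).

    Assumes the row ends with: start coordinate, residue string, end coordinate.
    The label is whatever precedes them, e.g. "gi 161077523" or "Cdd:PTZ00121".
    """
    parts = line.split()
    start, residues, end = int(parts[-3]), parts[-2], int(parts[-1])
    label = " ".join(parts[:-3])
    return label, start, residues, end


def count_identities(query, subject):
    """Return (aligned_columns, identical_columns) for two equal-length rows.

    Assumption: a column counts as aligned only when both characters are
    uppercase letters; gaps ('-') and lowercase residues are skipped.
    """
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return aligned, identical


if __name__ == "__main__":
    # Last row pair of the first hit, copied from the listing above.
    q_line = "gi 161077523 4129 VADKSP--DASKEA--SRSLSVAETASSPIEEGPRSIADLSLPLNLTGEAKGKLPTLSSPIDVAEGDFLEV 4195"
    s_line = "Cdd:PTZ00121 1814 GKEGNLviNDSKEMedSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEI 1884"
    _, _, q_res, _ = parse_row(q_line)
    _, _, s_res, _ = parse_row(s_line)
    print(count_identities(q_res, s_res))
```

Summing the two counts over every row pair of a hit gives a rough identity fraction for that interval; it is only a sanity check and does not reproduce the bit score or E-value reported by CD-Search.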
PTZ00121 super family  cl31754  MAEBL; Provisional  1890-2644  2.64e-24

The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 114.08  E-value: 2.64e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1890 ESRRESSTEIVLPCHAEDSKEPSRPESKVECLKDESEVLKGSTRREsvAESDKSSQPFKETSRPESAVGSMKDESMSK-E 1968
Cdd:PTZ00121 1102 EAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARK--AEDAKRVEIARKAEDARKAEEARKAEDAKKaE 1179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1969 PSRR--------------ESVKDGAAQSRETSRPASVAESAKDgADDLKELSRPESTTQSKEAGSIKDEKSPLASEEASR 2034
Cdd:PTZ00121 1180 AARKaeevrkaeelrkaeDARKAEAARKAEEERKAEEARKAED-AKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFE 1258
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2035 PASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRR-ESVAEKSPLPSKEASRPASVA 2113
Cdd:PTZ00121 1259 EARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKaDEAKKKAEEAKKKADAAKKKA 1338
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2114 ESIKDEAE-KSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIK- 2191
Cdd:PTZ00121 1339 EEAKKAAEaAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKk 1418
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2192 --DEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIK--DEAEKSKEESRRESVAEKsplpSKEASRPASVAESIKDE 2267
Cdd:PTZ00121 1419 kaDEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKkaEEAKKKAEEAKKADEAKK----KAEEAKKADEAKKKAEE 1494
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2268 AEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEETRR-ESVAEKSPLPSKEASRPasvAESIKDEAEKSK 2346
Cdd:PTZ00121 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKaEEKKKADELKKAEELKK---AEEKKKAEEAKK 1571
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2347 EESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGKAQSIKGDQSPLK-----------EVSRPES 2415
Cdd:PTZ00121 1572 AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKkveqlkkkeaeEKKKAEE 1651
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2416 VAESVKDDPVKSKEPSRR--------ESVAGSVTADSARDDQSPLESKGASRPESVVDSVKDEAEKQESRRESKTESvip 2487
Cdd:PTZ00121 1652 LKKAEEENKIKAAEEAKKaeedkkkaEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN--- 1728
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2488 pKAKDDKSPKEVLQPVSMTETIREDADQPMKPSQAESRRESIAESIKASSPRDEKSPLASKEASRPGSVAESIKYDLDKP 2567
Cdd:PTZ00121 1729 -KIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNF 1807
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2568 QII-----KDDKSTEHSRRESLEDKSAVTSEKSVSRPLSVASDHEAAVAIEDDAKSSISPKDKSRPGFVAETVSSPIEEA 2642
Cdd:PTZ00121 1808 ANIieggkEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEA 1887

                  ..
gi 161077523 2643 TM 2644
Cdd:PTZ00121 1888 DE 1889
PTZ00121 super family  cl31754  MAEBL; Provisional  2878-3754  1.31e-15

The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 85.19  E-value: 1.31e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2878 DEAPKSLIGCPAEERPESPAESAKDAAESVEKSKDASRPPSVVESTKADSTKgdispspeSVLEGPKDDVEKSKESSRPP 2957
Cdd:PTZ00121 1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAK--------RVEIARKAEDARKAEEARKA 1172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2958 SVSASITGDSTKDVSRPASVVESVKDEHDKAESRResiAKVESVIDEAGKSDSKSSSQDSQKDEKSTLASKEASRRESvV 3037
Cdd:PTZ00121 1173 EDAKKAEAARKAEEVRKAEELRKAEDARKAEAARK---AEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE-E 1248
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3038 ESSKDDAEKSESRPESVIASGEPVPRESKSPLDSKDTSRPGSMVESVTAEDEKSEQQSRRESVAESVKADTKKDgKSQEA 3117
Cdd:PTZ00121 1249 RNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK-KAEEA 1327
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3118 SRPSsvDELLKDDDEKQESRRQSITGSHKAmstmgdespMDKADKSKEPSRPESVAESikHENTKDEESPLGSRRDSVAE 3197
Cdd:PTZ00121 1328 KKKA--DAAKKKAEEAKKAAEAAKAEAEAA---------ADEAEAAEEKAEAAEKKKE--EAKKKADAAKKKAEEKKKAD 1394
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3198 SIKSditKGEKSPLPSKEVSRPESVVGSIKD--EKAESRRESVAESVKPESSKDATSAPPSKEHSRPESVLGSLKDEGDK 3275
Cdd:PTZ00121 1395 EAKK---KAEEDKKKADELKKAAAAKKKADEakKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3276 TTSRRVSVADSIKDEKSLLVSQEASRPESEAESLKDAAAPSQETSRPEsvtesvkdgkspvaSKEASRPASVAENAKdSA 3355
Cdd:PTZ00121 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAE--------------EAKKADEAKKAEEAK-KA 1536
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3356 DESKEQRpeslPQSKAGSIKdeKSPLASKDEAEKSKEESRRESVAEQFPLVSKEVSRPASvaESVKDEAEKSKEESPLMS 3435
Cdd:PTZ00121 1537 DEAKKAE----EKKKADELK--KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE--EARIEEVMKLYEEEKKMK 1608
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3436 KEASRPAsvagsvKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEASRP 3515
Cdd:PTZ00121 1609 AEEAKKA------EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3516 ASVAESIKDEAEKSKEESRR--ESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDSvaEKSPLASKEASRPASVA 3593
Cdd:PTZ00121 1683 AEEDEKKAAEALKKEAEEAKkaEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK--KKAEEAKKDEEEKKKIA 1760
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3594 ESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDeaEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDE 3673
Cdd:PTZ00121 1761 HLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKD--IFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADS 1838
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3674 AEKSKEES--------SRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEE--------SRRESVAEKSPLASKEASRP 3737
Cdd:PTZ00121 1839 KNMQLEEAdafekhkfNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIekidkddiEREIPNNNMAGKNNDIIDDK 1918
                         890
                  ....*....|....*..
gi 161077523 3738 ASVAESVKDDAEKSKEE 3754
Cdd:PTZ00121 1919 LDKDEYIKRDAEETREE 1935
PTZ00121 super family  cl31754  MAEBL; Provisional  1464-2163  1.55e-09

The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 65.16  E-value: 1.55e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1464 KSAKDREDTGSIESPPTIEEAIEVEVQAKQEAQKPVPA--PEEAIKTEKSPLA-----SKETSRPESATGSVKEDTEQTK 1536
Cdd:PTZ00121 1240 EEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEArkADELKKAEEKKKAdeakkAEEKKKADEAKKKAEEAKKADE 1319
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1537 SKKSPVPSRPESEAKDKKSPFASGEASRPESVAESVKDEAGKAESRRESIAKTHKDESSLDKAKEQESRRESLAESIKPE 1616
Cdd:PTZ00121 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1617 SGIDEKSALASKEASRPESVTDKSKEPSRRESIAESLKAESTKDEKSAPPSKEASRPGSVVESVKDETEKSKepsrresi 1696
Cdd:PTZ00121 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK-------- 1471
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1697 AESAKPPIEfrEVSRPESVIDGIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIKSTEKSRRESvaESFKADSTKD 1776
Cdd:PTZ00121 1472 ADEAKKKAE--EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKAD--EAKKAEEKKK 1547
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1777 EKSPLTSKDISRPESavenvmdaVGSAERSQPESVTASRDVSRPESVAESEKddtDKPESVVESVIPASDVVEIEKGAAD 1856
Cdd:PTZ00121 1548 ADELKKAEELKKAEE--------KKKAEEAKKAEEDKNMALRKAEEAKKAEE---ARIEEVMKLYEEEKKMKAEEAKKAE 1616
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1857 KEKGVFVSLEIGKPDSPSEVISRPGPVVESVKPESRRESSTEIVLPCHAEDSKEPSRPESKVECLKDESEVLKGSTRRES 1936
Cdd:PTZ00121 1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1937 VAESDKSSQPFKETSRPESAVGSMKDESMSKEPSRRESVKDGAAQSRETSRPASVAESAKDGADDLKELSRPESTTQSKE 2016
Cdd:PTZ00121 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2017 AGSIKDEKspLASEEASRPASVAESVKDEAEKSK--EESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKE----ES 2090
Cdd:PTZ00121 1777 KEAVIEEE--LDEEDEKRRMEVDKKIKDIFDNFAniIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAfekhKF 1854
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2091 RRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEAS--------RPASVAESIKDEAEKSKE 2162
Cdd:PTZ00121 1855 NKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMAgknndiidDKLDKDEYIKRDAEETRE 1934

                  .
gi 161077523 2163 E 2163
Cdd:PTZ00121 1935 E 1935
PTZ00121 super family  cl31754  MAEBL; Provisional  580-1228  2.04e-09

The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 64.78  E-value: 2.04e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  580 APAIQTVTSTRKSLKSAIEATPAPPSASYKTTKFSPVASAALAVQHPQQQDNKAKEAAAAAAAAAAAAASAATIARAKAD 659
Cdd:PTZ00121 1226 AEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  660 SMDTDAEPEHEAD-----PEPADTGDEAAPTEQEPEAETEPEPEHEPEAEQDKDVGEEKKVEVLIMKPQQAtpaviaasg 734
Cdd:PTZ00121 1306 EAKKKAEEAKKADeakkkAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA--------- 1376
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  735 KDGVDAASADATPTGKLSKASAKGKADKPRA-EVKPVVRSRIDTKPPKSMDRKLAKRDEKKSSptttpAARAPVAQNAKP 813
Cdd:PTZ00121 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKAdELKKAAAAKKKADEAKKKAEEKKKADEAKKK-----AEEAKKADEAKK 1451
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  814 KV-LSRPATKSSPSSTPAKSAKEANNRKVLESKQQAARVQATSTVSRRVTSTASERRVQQQAEAKTAATGATQATQRKPI 892
Cdd:PTZ00121 1452 KAeEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  893 SRRPRGVSPSKRAPAPGSPVKQAKP-KAADLKKTRLDKGGTTDSSLVSTPSADEATAAKKLQDLTASQELDAEKQRELDD 971
Cdd:PTZ00121 1532 EAKKADEAKKAEEKKKADELKKAEElKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEE 1611
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  972 LKEEQEVVREIEAVFSRDEMKRQQHQQIKAELREM-PAEGTGDGENEPDEEEEYLIIEKEEVEQYTEDSIVEQESSMTKE 1050
Cdd:PTZ00121 1612 AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKkKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAA 1691
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1051 EEIQKHQRDSQESEKKRKKSAEEEIEAAIAKVEAAERKARLEGASARQDESELDVEpEQSKIKAEVQDIIATAKDIAKSR 1130
Cdd:PTZ00121 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE-EAKKDEEEKKKIAHLKKEEEKKA 1770
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1131 TEEQLAKPA--EEELSSPTPEEKLS--KKTSDTKDDqigapvdvlpvnlqeslpeekFSATIESGATTAPTLPEDERIPL 1206
Cdd:PTZ00121 1771 EEIRKEKEAviEEELDEEDEKRRMEvdKKIKDIFDN---------------------FANIIEGGKEGNLVINDSKEMED 1829
                         650       660
                  ....*....|....*....|..
gi 161077523 1207 DQIKEdLVIEEKYVKEETKEAE 1228
Cdd:PTZ00121 1830 SAIKE-VADSKNMQLEEADAFE 1850
LGT super family  cl00478  Prolipoprotein diacylglyceryl transferase;  2780-2945  3.41e-03

The actual alignment was detected with superfamily member PRK13108:

Pssm-ID: 469786 [Multi-domain]  Cd Length: 460  Bit Score: 43.81  E-value: 3.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2780 RSPVASTEISRPASAGETASSPIEEAPKDFAEFEQAEKAVLPLTIELKGNLptlSSPVDVAHGDFPQTSTPTSSPTVASV 2859
Cdd:PRK13108  298 REPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAES---VVQVADRDGESTPAVEETSEADIERE 374
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2860 QPAELSKvdiEKTASSPIDEAPKSLI-GCPAEERPESPAESAKDAAESVEKSKDASRP-PSVVESTKADSTKGDISPSPE 2937
Cdd:PRK13108  375 QPGDLAG---QAPAAHQVDAEAASAApEEPAALASEAHDETEPEVPEKAAPIPDPAKPdELAVAGPGDDPAEPDGIRRQD 451

                  ....*...
gi 161077523 2938 SVLEGPKD 2945
Cdd:PRK13108  452 DFSSRRRR 459
 
Name Accession Description Interval E-value
PTZ00121 PTZ00121
MAEBL; Provisional
3352-4195 1.42e-35

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 151.06  E-value: 1.42e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3352 KDSADESKEQRPESLPQSKAGSIKDEKSPLASKDEAEKSKEESRRESVAEQfplvSKEVSRPASVAESVKDEAEKSKEES 3431
Cdd:PTZ00121 1045 KDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEE----AFGKAEEAKKTETGKAEEARKAEEA 1120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3432 PLMSKEASRpasvAGSVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKe 3511
Cdd:PTZ00121 1121 KKKAEDARK----AEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRK- 1195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3512 ASRPASVAESIKDEAEKSKEESRRESVAEKSplpskEASRptSVAESVKDEAEKSKEESRRDSVAEKSPLASKEASRPAS 3591
Cdd:PTZ00121 1196 AEDARKAEAARKAEEERKAEEARKAEDAKKA-----EAVK--KAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3592 VAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRR-ESVAEKSPLASKEASRPTSVAESV 3670
Cdd:PTZ00121 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKaEEAKKKADAAKKKAEEAKKAAEAA 1348
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3671 KDEAEKSKEEssrdsvAEKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVK---DD 3747
Cdd:PTZ00121 1349 KAEAEAAADE------AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKkkaDE 1422
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3748 AEKSKEESRRESVAEKSPLASKEASRPASVAESVK--DEAEKSKEESRRESVAEKSPLPSKEASRPTSVAEsvkdEAEKS 3825
Cdd:PTZ00121 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKkaEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE----EAKKK 1498
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3826 KEESRRESVAEKSSLASKKASRPASVAESVKDEAEKSKEESRR-ESVAEKSPLASKEASRPasvAESVKDEAEKSKEESR 3904
Cdd:PTZ00121 1499 ADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKaEEKKKADELKKAEELKK---AEEKKKAEEAKKAEED 1575
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3905 RESVAEKSPLPSK-EASRPTSVAESVKDEADKSKEESRRESGA--EKSPLASMEASRPTSVAESVKDETEKSKEESRR-- 3979
Cdd:PTZ00121 1576 KNMALRKAEEAKKaEEARIEEVMKLYEEEKKMKAEEAKKAEEAkiKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKka 1655
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3980 --ESVTEKSPLPSKEASRPTSVAESVKDEAEKSKEE---SRRESVAEKSPLASKESSRPASVAESIKDEAEGTK---QES 4051
Cdd:PTZ00121 1656 eeENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAealKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKikaEEA 1735
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4052 RRESMPESGKAESIKGDQSSlaSKETSRPDSVVESVKDETEKPEGSAID---KSQVASRPESVAVSAKDEKSPLHSRPES 4128
Cdd:PTZ00121 1736 KKEAEEDKKKAEEAKKDEEE--KKKIAHLKKEEEKKAEEIRKEKEAVIEeelDEEDEKRRMEVDKKIKDIFDNFANIIEG 1813
                         810       820       830       840       850       860       870
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 161077523 4129 VADKSP--DASKEA--SRSLSVAETASSPIEEGPRSIADLSLPLNLTGEAKGKLPTLSSPIDVAEGDFLEV 4195
Cdd:PTZ00121 1814 GKEGNLviNDSKEMedSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEI 1884
PTZ00121 PTZ00121
MAEBL; Provisional
1890-2644 2.64e-24

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 114.08  E-value: 2.64e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1890 ESRRESSTEIVLPCHAEDSKEPSRPESKVECLKDESEVLKGSTRREsvAESDKSSQPFKETSRPESAVGSMKDESMSK-E 1968
Cdd:PTZ00121 1102 EAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARK--AEDAKRVEIARKAEDARKAEEARKAEDAKKaE 1179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1969 PSRR--------------ESVKDGAAQSRETSRPASVAESAKDgADDLKELSRPESTTQSKEAGSIKDEKSPLASEEASR 2034
Cdd:PTZ00121 1180 AARKaeevrkaeelrkaeDARKAEAARKAEEERKAEEARKAED-AKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFE 1258
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2035 PASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRR-ESVAEKSPLPSKEASRPASVA 2113
Cdd:PTZ00121 1259 EARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKaDEAKKKAEEAKKKADAAKKKA 1338
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2114 ESIKDEAE-KSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIK- 2191
Cdd:PTZ00121 1339 EEAKKAAEaAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKk 1418
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2192 --DEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIK--DEAEKSKEESRRESVAEKsplpSKEASRPASVAESIKDE 2267
Cdd:PTZ00121 1419 kaDEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKkaEEAKKKAEEAKKADEAKK----KAEEAKKADEAKKKAEE 1494
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2268 AEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEETRR-ESVAEKSPLPSKEASRPasvAESIKDEAEKSK 2346
Cdd:PTZ00121 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKaEEKKKADELKKAEELKK---AEEKKKAEEAKK 1571
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2347 EESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGKAQSIKGDQSPLK-----------EVSRPES 2415
Cdd:PTZ00121 1572 AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKkveqlkkkeaeEKKKAEE 1651
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2416 VAESVKDDPVKSKEPSRR--------ESVAGSVTADSARDDQSPLESKGASRPESVVDSVKDEAEKQESRRESKTESvip 2487
Cdd:PTZ00121 1652 LKKAEEENKIKAAEEAKKaeedkkkaEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN--- 1728
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2488 pKAKDDKSPKEVLQPVSMTETIREDADQPMKPSQAESRRESIAESIKASSPRDEKSPLASKEASRPGSVAESIKYDLDKP 2567
Cdd:PTZ00121 1729 -KIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNF 1807
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2568 QII-----KDDKSTEHSRRESLEDKSAVTSEKSVSRPLSVASDHEAAVAIEDDAKSSISPKDKSRPGFVAETVSSPIEEA 2642
Cdd:PTZ00121 1808 ANIieggkEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEA 1887

                  ..
gi 161077523 2643 TM 2644
Cdd:PTZ00121 1888 DE 1889
PTZ00121 PTZ00121
MAEBL; Provisional
2878-3754 1.31e-15

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 85.19  E-value: 1.31e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2878 DEAPKSLIGCPAEERPESPAESAKDAAESVEKSKDASRPPSVVESTKADSTKgdispspeSVLEGPKDDVEKSKESSRPP 2957
Cdd:PTZ00121 1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAK--------RVEIARKAEDARKAEEARKA 1172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2958 SVSASITGDSTKDVSRPASVVESVKDEHDKAESRResiAKVESVIDEAGKSDSKSSSQDSQKDEKSTLASKEASRRESvV 3037
Cdd:PTZ00121 1173 EDAKKAEAARKAEEVRKAEELRKAEDARKAEAARK---AEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE-E 1248
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3038 ESSKDDAEKSESRPESVIASGEPVPRESKSPLDSKDTSRPGSMVESVTAEDEKSEQQSRRESVAESVKADTKKDgKSQEA 3117
Cdd:PTZ00121 1249 RNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK-KAEEA 1327
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3118 SRPSsvDELLKDDDEKQESRRQSITGSHKAmstmgdespMDKADKSKEPSRPESVAESikHENTKDEESPLGSRRDSVAE 3197
Cdd:PTZ00121 1328 KKKA--DAAKKKAEEAKKAAEAAKAEAEAA---------ADEAEAAEEKAEAAEKKKE--EAKKKADAAKKKAEEKKKAD 1394
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3198 SIKSditKGEKSPLPSKEVSRPESVVGSIKD--EKAESRRESVAESVKPESSKDATSAPPSKEHSRPESVLGSLKDEGDK 3275
Cdd:PTZ00121 1395 EAKK---KAEEDKKKADELKKAAAAKKKADEakKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3276 TTSRRVSVADSIKDEKSLLVSQEASRPESEAESLKDAAAPSQETSRPEsvtesvkdgkspvaSKEASRPASVAENAKdSA 3355
Cdd:PTZ00121 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAE--------------EAKKADEAKKAEEAK-KA 1536
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3356 DESKEQRpeslPQSKAGSIKdeKSPLASKDEAEKSKEESRRESVAEQFPLVSKEVSRPASvaESVKDEAEKSKEESPLMS 3435
Cdd:PTZ00121 1537 DEAKKAE----EKKKADELK--KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE--EARIEEVMKLYEEEKKMK 1608
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3436 KEASRPAsvagsvKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEASRP 3515
Cdd:PTZ00121 1609 AEEAKKA------EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3516 ASVAESIKDEAEKSKEESRR--ESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDSvaEKSPLASKEASRPASVA 3593
Cdd:PTZ00121 1683 AEEDEKKAAEALKKEAEEAKkaEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK--KKAEEAKKDEEEKKKIA 1760
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3594 ESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDeaEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDE 3673
Cdd:PTZ00121 1761 HLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKD--IFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADS 1838
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3674 AEKSKEES--------SRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEE--------SRRESVAEKSPLASKEASRP 3737
Cdd:PTZ00121 1839 KNMQLEEAdafekhkfNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIekidkddiEREIPNNNMAGKNNDIIDDK 1918
                         890
                  ....*....|....*..
gi 161077523 3738 ASVAESVKDDAEKSKEE 3754
Cdd:PTZ00121 1919 LDKDEYIKRDAEETREE 1935
growth_prot_Scy NF041483
polarized growth protein Scy;
3229-3983 1.63e-11

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 71.40  E-value: 1.63e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3229 EKAESRRESVAESVKPESSKDATSAppskehsrpESVLGSLKDEGDKTTSRRVSVADSIKDEKSLLVSQEASRPESEAES 3308
Cdd:NF041483  349 EAAEKARTVAAEDTAAQLAKAARTA---------EEVLTKASEDAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEQ 419
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3309 LKDAAAPSQETSRPESVtESVKDGKSPVASKEASRPASVAENAKDSAdeskEQRPESLPQSKAGSIKDEKSPLASKDEAE 3388
Cdd:NF041483  420 LKGAAKDDTKEYRAKTV-ELQEEARRLRGEAEQLRAEAVAEGERIRG----EARREAVQQIEEAARTAEELLTKAKADAD 494
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3389 kskeESRRESVAEQFPLVSKEVSRPASVAESVKDEAEKSKEESPLMSKEASRPA--------SVAGSVKDEAEKSKEESR 3460
Cdd:NF041483  495 ----ELRSTATAESERVRTEAIERATTLRRQAEETLERTRAEAERLRAEAEEQAeevraaaeRAARELREETERAIAARQ 570
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3461 RESVAEKSPLPSKEASRPASvAESVKDEADKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAEKSKEESRRE---S 3537
Cdd:NF041483  571 AEAAEELTRLHTEAEERLTA-AEEALADARAEAERIRREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADasaA 649
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3538 VAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRrdsvAEKSPLASKEASRPASVAESVQDEAEKskeesRRESVAEKSP 3617
Cdd:NF041483  650 RAEGENVAVRLRSEAAAEAERLKSEAQESADRVR----AEAAAAAERVGTEAAEALAAAQEEAAR-----RRREAEETLG 720
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3618 LASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDEAekskeESSRDSVAEKSPLASKEA 3697
Cdd:NF041483  721 SARAEADQERERAREQSEELLASARKRVEEAQAEAQRLVEEADRRATELVSAAEQTA-----QQVRDSVAGLQEQAEEEI 795
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3698 SRPASVAESVQD----EAEKSKEESRRESVAEKSPlASKEASRPASVAESVKDDAEKSKEESRRESVAEKSPL---ASKE 3770
Cdd:NF041483  796 AGLRSAAEHAAErtrtEAQEEADRVRSDAYAERER-ASEDANRLRREAQEETEAAKALAERTVSEAIAEAERLrsdASEY 874
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3771 ASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRES---VAEKSSLASKKASR 3847
Cdd:NF041483  875 AQRVRTEASDTLASAEQDAARTRADAREDANRIRSDAAAQADRLIGEATSEAERLTAEARAEAerlRDEARAEAERVRAD 954
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3848 PASVAESVKDEAEKSKEESRRESvAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEksplpskeasrptsvAE 3927
Cdd:NF041483  955 AAAQAEQLIAEATGEAERLRAEA-AETVGSAQQHAERIRTEAERVKAEAAAEAERLRTEAREE---------------AD 1018
                         730       740       750       760       770
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 3928 SVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAESVKDETEKSKEESRRESVT 3983
Cdd:NF041483 1019 RTLDEARKDANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADT 1074
growth_prot_Scy NF041483
polarized growth protein Scy;
3513-4055 2.06e-11

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 71.40  E-value: 2.06e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3513 SRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDsvaeksplaskeASRPASV 3592
Cdd:NF041483  159 ARTESQARRLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAILRRARKD------------AERLLNA 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3593 AeSVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRREsvAEKSPLASKE-ASRPTSVAESVK 3671
Cdd:NF041483  227 A-STQAQEATDHAEQLRSSTAAESDQARRQAAELSRAAEQRMQEAEEALREARAE--AEKVVAEAKEaAAKQLASAESAN 303
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3672 DEAEKSKEESSRDSVAEksplASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEA----SRPASVAESVKDD 3747
Cdd:NF041483  304 EQRTRTAKEEIARLVGE----ATKEAEALKAEAEQALADARAEAEKLVAEAAEKARTVAAEDTaaqlAKAARTAEEVLTK 379
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3748 AEKSKEESRRESVAEKSPL---ASKEASR----PASVAESVKDEAEKSKEESRRESVAEKsplpsKEASRPTSVAESVKD 3820
Cdd:NF041483  380 ASEDAKATTRAAAEEAERIrreAEAEADRlrgeAADQAEQLKGAAKDDTKEYRAKTVELQ-----EEARRLRGEAEQLRA 454
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3821 EA----EKSKEESRRESVaeksslasKKASRPASVAESVKDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEA 3896
Cdd:NF041483  455 EAvaegERIRGEARREAV--------QQIEEAARTAEELLTKAKADADELRSTATAESERVRTEAIERATTLRRQAEETL 526
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3897 EKSKEESRRESvAEKSPLPSKEASRPTSVAESVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAESVKDETEKSkEE 3976
Cdd:NF041483  527 ERTRAEAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERLTAAEEALADARAEA-ER 604
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3977 SRRESVTEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRE---SVAEKSPLASKESSRPASVAESIKDEAEGTKQESRR 4053
Cdd:NF041483  605 IRREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADasaARAEGENVAVRLRSEAAAEAERLKSEAQESADRVRA 684

                  ..
gi 161077523 4054 ES 4055
Cdd:NF041483  685 EA 686
growth_prot_Scy NF041483
polarized growth protein Scy;
3377-4089 3.01e-11

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 70.63  E-value: 3.01e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3377 EKSPLASKDEAEKSKEESRResvaeqfplvskevsRPASVAESVKDEAEKSKEESplmSKEASRPASVAGSVKDEAEKSK 3456
Cdd:NF041483  177 EQALAAARAEAERLAEEARQ---------------RLGSEAESARAEAEAILRRA---RKDAERLLNAASTQAQEATDHA 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3457 EESRRESVAEkSPLPSKEASRPASVAESVKDEADKSKEESRREsgAEKSPLASKE-ASRPASVAESIKDEAEKSKEESRR 3535
Cdd:NF041483  239 EQLRSSTAAE-SDQARRQAAELSRAAEQRMQEAEEALREARAE--AEKVVAEAKEaAAKQLASAESANEQRTRTAKEEIA 315
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3536 ESVAEKsplpskeasrpTSVAESVKDEAEKSKEESRRDsvAEKSPLASKEASRPASVAESVqdeAEKSKEESRRESVAEK 3615
Cdd:NF041483  316 RLVGEA-----------TKEAEALKAEAEQALADARAE--AEKLVAEAAEKARTVAAEDTA---AQLAKAARTAEEVLTK 379
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3616 SPLASKEASRPASV-AESIKDEAEKSKEESRRES--VAEKSPLASK---------------EASRPTSVAESVKDEA--- 3674
Cdd:NF041483  380 ASEDAKATTRAAAEeAERIRREAEAEADRLRGEAadQAEQLKGAAKddtkeyraktvelqeEARRLRGEAEQLRAEAvae 459
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3675 -EKSKEESSRDSVaeksplasKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKE 3753
Cdd:NF041483  460 gERIRGEARREAV--------QQIEEAARTAEELLTKAKADADELRSTATAESERVRTEAIERATTLRRQAEETLERTRA 531
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3754 ESRRESvAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSvAESVKDEAEKSKEESRRES 3833
Cdd:NF041483  532 EAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERLTA-AEEALADARAEAERIRREA 609
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3834 VAEKSSLASKKASRPASVAESVKDEAEKSKEESRRE---SVAEKSPLASKEASRPASVAESVKDEAEKSKEESRRES--- 3907
Cdd:NF041483  610 AEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADasaARAEGENVAVRLRSEAAAEAERLKSEAQESADRVRAEAaaa 689
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3908 -------VAEKSPLPSKEASRPTSVAE----SVKDEADKSKEESRRES------GAEKSPLASMEASRPTSVAESVKDET 3970
Cdd:NF041483  690 aervgteAAEALAAAQEEAARRRREAEetlgSARAEADQERERAREQSeellasARKRVEEAQAEAQRLVEEADRRATEL 769
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3971 EKSKEE---SRRESVTEKSPLPSKEASRPTS----VAESVKDEAEKSKEESRRESVAEKSPlASKESSRPASVAesiKDE 4043
Cdd:NF041483  770 VSAAEQtaqQVRDSVAGLQEQAEEEIAGLRSaaehAAERTRTEAQEEADRVRSDAYAERER-ASEDANRLRREA---QEE 845
                         730       740       750       760
                  ....*....|....*....|....*....|....*....|....*..
gi 161077523 4044 AEGTKQESRRESMPESGKAESIKGDQSSLASK-ETSRPDSVVESVKD 4089
Cdd:NF041483  846 TEAAKALAERTVSEAIAEAERLRSDASEYAQRvRTEASDTLASAEQD 892
growth_prot_Scy NF041483
polarized growth protein Scy;
3439-4062 2.25e-10

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 67.93  E-value: 2.25e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3439 SRPASVAGSVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRresgaeksplasKEASRPASV 3518
Cdd:NF041483  159 ARTESQARRLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAILRRAR------------KDAERLLNA 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3519 AESIKDEAEKSKEESRRESVAEkSPLPSKEASRPTSVAESVKDEAEKSKEESRRDsvAEKSPLASKE-ASRPASVAESVQ 3597
Cdd:NF041483  227 ASTQAQEATDHAEQLRSSTAAE-SDQARRQAAELSRAAEQRMQEAEEALREARAE--AEKVVAEAKEaAAKQLASAESAN 303
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3598 DEAEKSKEESRRESVAEksplASKEasrpasvAESIKDEAEKSKEESRREsvAEKSPLASKEASRPTSVAESVkdeAEKS 3677
Cdd:NF041483  304 EQRTRTAKEEIARLVGE----ATKE-------AEALKAEAEQALADARAE--AEKLVAEAAEKARTVAAEDTA---AQLA 367
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3678 KEESSRDSVAEKSPLASKEASRPASvaesvqDEAEKskeeSRRESVAEksplASKEASRPASVAESVKDDAEKSKEESRR 3757
Cdd:NF041483  368 KAARTAEEVLTKASEDAKATTRAAA------EEAER----IRREAEAE----ADRLRGEAADQAEQLKGAAKDDTKEYRA 433
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3758 ESVAeksplASKEASRPASVAESVKDEA----EKSKEESRRESVAEksplpSKEASRptsVAESVKDEAEKSKEESRRES 3833
Cdd:NF041483  434 KTVE-----LQEEARRLRGEAEQLRAEAvaegERIRGEARREAVQQ-----IEEAAR---TAEELLTKAKADADELRSTA 500
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3834 VAEKSSLASKKASRPASVAESVKDEAEKSKEESRRESvAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEKSP 3913
Cdd:NF041483  501 TAESERVRTEAIERATTLRRQAEETLERTRAEAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTR 579
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3914 LPSKEASRPTSvAESVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAESVKDETEKSKEESRRE---SVTEKSPLPS 3990
Cdd:NF041483  580 LHTEAEERLTA-AEEALADARAEAERIRREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADasaARAEGENVAV 658
                         570       580       590       600       610       620       630
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 161077523 3991 KEASRPTSVAESVKDEAEKSKEESRRESVAEKSPLASKESSRPASVAESI---KDEAEGTKQESRRESMPESGKA 4062
Cdd:NF041483  659 RLRSEAAAEAERLKSEAQESADRVRAEAAAAAERVGTEAAEALAAAQEEAarrRREAEETLGSARAEADQERERA 733
PTZ00121 PTZ00121
MAEBL; Provisional
1464-2163 1.55e-09

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 65.16  E-value: 1.55e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1464 KSAKDREDTGSIESPPTIEEAIEVEVQAKQEAQKPVPA--PEEAIKTEKSPLA-----SKETSRPESATGSVKEDTEQTK 1536
Cdd:PTZ00121 1240 EEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEArkADELKKAEEKKKAdeakkAEEKKKADEAKKKAEEAKKADE 1319
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1537 SKKSPVPSRPESEAKDKKSPFASGEASRPESVAESVKDEAGKAESRRESIAKTHKDESSLDKAKEQESRRESLAESIKPE 1616
Cdd:PTZ00121 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1617 SGIDEKSALASKEASRPESVTDKSKEPSRRESIAESLKAESTKDEKSAPPSKEASRPGSVVESVKDETEKSKepsrresi 1696
Cdd:PTZ00121 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK-------- 1471
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1697 AESAKPPIEfrEVSRPESVIDGIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIKSTEKSRRESvaESFKADSTKD 1776
Cdd:PTZ00121 1472 ADEAKKKAE--EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKAD--EAKKAEEKKK 1547
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1777 EKSPLTSKDISRPESavenvmdaVGSAERSQPESVTASRDVSRPESVAESEKddtDKPESVVESVIPASDVVEIEKGAAD 1856
Cdd:PTZ00121 1548 ADELKKAEELKKAEE--------KKKAEEAKKAEEDKNMALRKAEEAKKAEE---ARIEEVMKLYEEEKKMKAEEAKKAE 1616
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1857 KEKGVFVSLEIGKPDSPSEVISRPGPVVESVKPESRRESSTEIVLPCHAEDSKEPSRPESKVECLKDESEVLKGSTRRES 1936
Cdd:PTZ00121 1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1937 VAESDKSSQPFKETSRPESAVGSMKDESMSKEPSRRESVKDGAAQSRETSRPASVAESAKDGADDLKELSRPESTTQSKE 2016
Cdd:PTZ00121 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2017 AGSIKDEKspLASEEASRPASVAESVKDEAEKSK--EESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKE----ES 2090
Cdd:PTZ00121 1777 KEAVIEEE--LDEEDEKRRMEVDKKIKDIFDNFAniIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAfekhKF 1854
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2091 RRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEAS--------RPASVAESIKDEAEKSKE 2162
Cdd:PTZ00121 1855 NKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMAgknndiidDKLDKDEYIKRDAEETRE 1934

                  .
gi 161077523 2163 E 2163
Cdd:PTZ00121 1935 E 1935
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
2002-2322 1.62e-09

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 64.65  E-value: 1.62e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2002 LKELSRPESTTQSKEAGSIKDEKSPLASEEASRPASVAESVKDEAEK----SKEESRR----------ESVAEKSPLPSK 2067
Cdd:NF033838  107 LKEKSEAELTSKTKKELDAAFEQFKKDTLEPGKKVAEATKKVEEAEKkakdQKEEDRRnyptntyktlELEIAESDVEVK 186
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2068 EASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----ESVAEKSPLPSKE 2142
Cdd:NF033838  187 KAE-----LELVKEEAKEPRDEEKIKQAKAKVESKKAEATR----LEKIKTDREKAEEEAKRradakLKEAVEKNVATSE 257
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2143 ASRP------ASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEESRR-------- 2203
Cdd:NF033838  258 QDKPkrrakrGVLGEPATPDKKENDAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnyptntyk 337
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2204 --ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR---- 2277
Cdd:NF033838  338 tlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRkaae 408
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 161077523 2278 -ESVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEETRRESVAEKSP 2322
Cdd:NF033838  409 eDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
PTZ00121 PTZ00121
MAEBL; Provisional
580-1228 2.04e-09

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 64.78  E-value: 2.04e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  580 APAIQTVTSTRKSLKSAIEATPAPPSASYKTTKFSPVASAALAVQHPQQQDNKAKEAAAAAAAAAAAAASAATIARAKAD 659
Cdd:PTZ00121 1226 AEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  660 SMDTDAEPEHEAD-----PEPADTGDEAAPTEQEPEAETEPEPEHEPEAEQDKDVGEEKKVEVLIMKPQQAtpaviaasg 734
Cdd:PTZ00121 1306 EAKKKAEEAKKADeakkkAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA--------- 1376
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  735 KDGVDAASADATPTGKLSKASAKGKADKPRA-EVKPVVRSRIDTKPPKSMDRKLAKRDEKKSSptttpAARAPVAQNAKP 813
Cdd:PTZ00121 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKAdELKKAAAAKKKADEAKKKAEEKKKADEAKKK-----AEEAKKADEAKK 1451
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  814 KV-LSRPATKSSPSSTPAKSAKEANNRKVLESKQQAARVQATSTVSRRVTSTASERRVQQQAEAKTAATGATQATQRKPI 892
Cdd:PTZ00121 1452 KAeEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  893 SRRPRGVSPSKRAPAPGSPVKQAKP-KAADLKKTRLDKGGTTDSSLVSTPSADEATAAKKLQDLTASQELDAEKQRELDD 971
Cdd:PTZ00121 1532 EAKKADEAKKAEEKKKADELKKAEElKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEE 1611
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  972 LKEEQEVVREIEAVFSRDEMKRQQHQQIKAELREM-PAEGTGDGENEPDEEEEYLIIEKEEVEQYTEDSIVEQESSMTKE 1050
Cdd:PTZ00121 1612 AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKkKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAA 1691
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1051 EEIQKHQRDSQESEKKRKKSAEEEIEAAIAKVEAAERKARLEGASARQDESELDVEpEQSKIKAEVQDIIATAKDIAKSR 1130
Cdd:PTZ00121 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE-EAKKDEEEKKKIAHLKKEEEKKA 1770
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1131 TEEQLAKPA--EEELSSPTPEEKLS--KKTSDTKDDqigapvdvlpvnlqeslpeekFSATIESGATTAPTLPEDERIPL 1206
Cdd:PTZ00121 1771 EEIRKEKEAviEEELDEEDEKRRMEvdKKIKDIFDN---------------------FANIIEGGKEGNLVINDSKEMED 1829
                         650       660
                  ....*....|....*....|..
gi 161077523 1207 DQIKEdLVIEEKYVKEETKEAE 1228
Cdd:PTZ00121 1830 SAIKE-VADSKNMQLEEADAFE 1850
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1948-2285 9.16e-09

Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 61.95  E-value: 9.16e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1948 KETSRPESAVGSMKDESMSKEPSRRESVKDGAAQSRETSRpasvAESAKDGADDLKELSRPESTTQSKEAGSIKDEKSPL 2027
Cdd:NF033838  108 KEKSEAELTSKTKKELDAAFEQFKKDTLEPGKKVAEATKK----VEEAEKKAKDQKEEDRRNYPTNTYKTLELEIAESDV 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2028 ASEEASrpasvAESVKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----ESVAEKSPLP 2102
Cdd:NF033838  184 EVKKAE-----LELVKEEAKEPRDEEKIKQAKAKVESKKAEATR----LEKIKTDREKAEEEAKRradakLKEAVEKNVA 254
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2103 SKEASRP------ASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEESRR----- 2166
Cdd:NF033838  255 TSEQDKPkrrakrGVLGEPATPDKKENDAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnyptn 334
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2167 -----ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR- 2240
Cdd:NF033838  335 tyktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRk 405
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 2241 ----ESVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSP 2285
Cdd:NF033838  406 aaeeDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
PspC_subgroup_1 NF033838
2028-2433 2.02e-08

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 61.18  E-value: 2.02e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2028 ASEEASRPASVAESVKDEAEKSKE---ESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSplpsk 2104
Cdd:NF033838   37 AEEVRGGNNPTVTSSGNESQKEHAkevESHLEKILSEIQKSLDKRKHTQNVALNKKLSDIKTEYLYELNVLKEKS----- 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2105 EASRPASVAESIKDEAEKSKEESRResvaeksplPSKEASRPASVAESIKDEAEKSKEESRR----------ESVAEKSP 2174
Cdd:NF033838  112 EAELTSKTKKELDAAFEQFKKDTLE---------PGKKVAEATKKVEEAEKKAKDQKEEDRRnyptntyktlELEIAESD 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2175 LPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----ESVAEKSPL 2249
Cdd:NF033838  183 VEVKKAE-----LELVKEEAKEPRDEEKIKQAKAKVESKKAEATR----LEKIKTDREKAEEEAKRradakLKEAVEKNV 253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2250 PSKEASRP------ASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEETRR---- 2314
Cdd:NF033838  254 ATSEQDKPkrrakrGVLGEPATPDKKENDAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnypt 333
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2315 ------ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESAAEKSPLPSKEASRpasvAESVKDEADKSKEESRR 2388
Cdd:NF033838  334 ntyktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKR 404
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 2389 ESmAESGKAQSIKGDQ-------SPLKEVSRPESVAESVK----DDPVKSKEPSRR 2433
Cdd:NF033838  405 KA-AEEDKVKEKPAEQpqpapapQPEKPAPKPEKPAEQPKaekpADQQAEEDYARR 459
PspC_subgroup_1 NF033838
3638-4024 3.85e-08

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 60.03  E-value: 3.85e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3638 EKSKEESRRESVAEKSPlASKEASRPTSVAESVKDEAEKSKEESSRDSVAEKSP----------------LASKEASRPA 3701
Cdd:NF033838  109 EKSEAELTSKTKKELDA-AFEQFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEdrrnyptntyktleleIAESDVEVKK 187
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3702 SVAESVQDEAEKSKEESRRESVAEKSPLASKEASRpasvAESVKDDAEKSKEESRRESVAEKSPLASKEASrpasvaesv 3781
Cdd:NF033838  188 AELELVKEEAKEPRDEEKIKQAKAKVESKKAEATR----LEKIKTDREKAEEEAKRRADAKLKEAVEKNVA--------- 254
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3782 KDEAEKSKEESRRESVAEKSPlPSKEASRPTSVAESVKDEAEKSKEESRRESVAEksslASKKAsrpasvaESVKDEAEK 3861
Cdd:NF033838  255 TSEQDKPKRRAKRGVLGEPAT-PDKKENDAKSSDSSVGEETLPSPSLKPEKKVAE----AEKKV-------EEAKKKAKD 322
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3862 SKEESRR----------ESVAEKSPLASKEASrpasvAESVKDEAEKSKEESRRESVAEKSPLPSKEASRptsvAESVKD 3931
Cdd:NF033838  323 QKEEDRRnyptntyktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKT 393
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3932 EADKSKEESRRESGAEksplasmeasrptsvaESVKDETEKSKEesrresvteKSPLPSKEASRPTSVAESVKDEAEKSK 4011
Cdd:NF033838  394 DRKKAEEEAKRKAAEE----------------DKVKEKPAEQPQ---------PAPAPQPEKPAPKPEKPAEQPKAEKPA 448
                         410
                  ....*....|...
gi 161077523 4012 EESRRESVAEKSP 4024
Cdd:NF033838  449 DQQAEEDYARRSE 461
PspC_subgroup_1 NF033838
1917-2248 5.79e-08

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 59.64  E-value: 5.79e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1917 KVECLKDESEVLKGStrrESVAESDKSSQPF-KETSRPESAVGSMK---DESMSKEPSRRESVKDGAAQSRETSRPASVA 1992
Cdd:NF033838  103 ELNVLKEKSEAELTS---KTKKELDAAFEQFkKDTLEPGKKVAEATkkvEEAEKKAKDQKEEDRRNYPTNTYKTLELEIA 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1993 ESAKDGADDLKELSRpESTTQSKEAGSIKDEKSPLASEEASrpASVAESVKDEAEKSKEESRR-----ESVAEKSPLPSK 2067
Cdd:NF033838  180 ESDVEVKKAELELVK-EEAKEPRDEEKIKQAKAKVESKKAE--ATRLEKIKTDREKAEEEAKRradakLKEAVEKNVATS 256
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2068 EASRP------ASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEESRR------- 2129
Cdd:NF033838  257 EQDKPkrrakrGVLGEPATPDKKENDAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnyptnty 336
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2130 ---ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR--- 2203
Cdd:NF033838  337 ktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRkaa 407
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 161077523 2204 --ESVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSP 2248
Cdd:NF033838  408 eeDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
growth_prot_Scy NF041483
3612-4135 4.71e-07

polarized growth protein Scy


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 56.76  E-value: 4.71e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3612 VAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDEAEK--------------- 3676
Cdd:NF041483  147 VNENVAWAEQLRARTESQARRLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAilrrarkdaerllna 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3677 ---------SKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRREsvAEKSPLASKE-ASRPASVAESVKD 3746
Cdd:NF041483  227 astqaqeatDHAEQLRSSTAAESDQARRQAAELSRAAEQRMQEAEEALREARAE--AEKVVAEAKEaAAKQLASAESANE 304
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3747 DAEKSKEESRRESVAEksplASKEasrpasvAESVKDEAEKSKEESRREsvAEKSPLPSKEASRPTSVAESVkdeAEKSK 3826
Cdd:NF041483  305 QRTRTAKEEIARLVGE----ATKE-------AEALKAEAEQALADARAE--AEKLVAEAAEKARTVAAEDTA---AQLAK 368
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3827 EESRRESVAEKSSLASKKASRPASV-AESVKDEAEKSKEESRRES--VAEKSPLASK---------------EASRPASV 3888
Cdd:NF041483  369 AARTAEEVLTKASEDAKATTRAAAEeAERIRREAEAEADRLRGEAadQAEQLKGAAKddtkeyraktvelqeEARRLRGE 448
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3889 AESVKDEA----EKSKEESRRESVAEksplpSKEASRptsVAESVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAE 3964
Cdd:NF041483  449 AEQLRAEAvaegERIRGEARREAVQQ-----IEEAAR---TAEELLTKAKADADELRSTATAESERVRTEAIERATTLRR 520
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3965 SVKDETEKSKEESRRESvTEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRESVAEKSPLASKESSRPASVAESIKDeA 4044
Cdd:NF041483  521 QAEETLERTRAEAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERLTAAEEALAD-A 598
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4045 EGTKQESRRESMPESgkaESIKGDQSSLASKETSRPDSVVESVKDEtekpegSAIDKSQVASRPESVAVSAKDEKSPLHS 4124
Cdd:NF041483  599 RAEAERIRREAAEET---ERLRTEAAERIRTLQAQAEQEAERLRTE------AAADASAARAEGENVAVRLRSEAAAEAE 669
                         570
                  ....*....|.
gi 161077523 4125 RPESVADKSPD 4135
Cdd:NF041483  670 RLKSEAQESAD 680
PspC_subgroup_1 NF033838
1909-2211 1.26e-06

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 55.02  E-value: 1.26e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1909 KEPSRPESKVEclKDESEVLKGSTRRESVAESDKSSQPFK-----ETSRPESAVGSMKDEsmskepsrRESVKDGAAQSR 1983
Cdd:NF033838  132 KDTLEPGKKVA--EATKKVEEAEKKAKDQKEEDRRNYPTNtyktlELEIAESDVEVKKAE--------LELVKEEAKEPR 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1984 ETSRPASV---AESAKDGADDLKELS--RPESTTQSKEAGSIKDEKS---PLASEEASRP------ASVAESVKDEAEKS 2049
Cdd:NF033838  202 DEEKIKQAkakVESKKAEATRLEKIKtdREKAEEEAKRRADAKLKEAvekNVATSEQDKPkrrakrGVLGEPATPDKKEN 281
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2050 KEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEESRR----------ESVAEKSPLPSKEASrpasvAE 2114
Cdd:NF033838  282 DAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnyptntyktlELEIAESDVKVKEAE-----LE 356
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2115 SIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----ESVAEK-------SPLPSKEASR 2182
Cdd:NF033838  357 LVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRkaaeeDKVKEKpaeqpqpAPAPQPEKPA 432
                         330       340
                  ....*....|....*....|....*....
gi 161077523 2183 PASVAESIKDEAEKSKEESRRESVAEKSP 2211
Cdd:NF033838  433 PKPEKPAEQPKAEKPADQQAEEDYARRSE 461
MSCRAMM_ClfA NF033609
3029-3390 1.71e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 54.92  E-value: 1.71e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3029 EASRRESVVESSKDDAEKSESRPESVIASGEPVPRESKSPLDSKDTSRPGSMVESVTAEDEKSEQQSRRESVAESvKADT 3108
Cdd:NF033609  550 EPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDS-ASDS 628
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3109 KKDGKSQEASRPSSVDELLKDDDEKQESRRQSITGSHKAMSTMGDeSPMDKADKSKEPSRPESVAESikhENTKDEESPL 3188
Cdd:NF033609  629 DSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDS---DSDSDSDSDS 704
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3189 GSRRDSVAESiKSDITKGEKSPLPSKEVSRPESVVGSIKDEKAESRRESVAESvkpESSKDATSAPPSKEHSRPESVLGS 3268
Cdd:NF033609  705 DSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS---DSDSDSDSDSDSDSDSDSDSDSDS 780
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3269 LKDEGDKTTSRRVSVADSIKDEKSLLVSQEASRPESEAESLKDAAAPSQETSRPESVTESVKDGKSPVASKEASRPASVA 3348
Cdd:NF033609  781 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDS 860
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*....
gi 161077523 3349 ENAKDSADESKEQRPESLPQ----SKAGSIKDEKSPL---ASKDEAEKS 3390
Cdd:NF033609  861 NSDSESGSNNNVVPPNSPKNgtnaSNKNEAKDSKEPLpdtGSEDEANTS 909
growth_prot_Scy NF041483
1943-2400 4.60e-06

polarized growth protein Scy


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 53.68  E-value: 4.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1943 SSQPFKETSRPESAVGSMKDESmskEPSRRESVKDGAAQSRETSRPASVAESAKDGADDLKELSRPESTTQSKEAGSIKD 2022
Cdd:NF041483  228 STQAQEATDHAEQLRSSTAAES---DQARRQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAKEAAAKQLASAESANE 304
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2023 EKSPLASEEASR----PASVAESVKDEAEKSKEESRREsvAEKSPLPSKEASRPASVAESikdEAEKSKEESRRESVAEK 2098
Cdd:NF041483  305 QRTRTAKEEIARlvgeATKEAEALKAEAEQALADARAE--AEKLVAEAAEKARTVAAEDT---AAQLAKAARTAEEVLTK 379
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2099 SPLPSKEASRPASV-AESIKDEAEKSKEESRRESvaeksplpskeasrpASVAESIKDEAEKSKEESRRESVAEKsplps 2177
Cdd:NF041483  380 ASEDAKATTRAAAEeAERIRREAEAEADRLRGEA---------------ADQAEQLKGAAKDDTKEYRAKTVELQ----- 439
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2178 KEASRPASVAESIKDEA----EKSKEESRRESVaeksplpsKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKE 2253
Cdd:NF041483  440 EEARRLRGEAEQLRAEAvaegERIRGEARREAV--------QQIEEAARTAEELLTKAKADADELRSTATAESERVRTEA 511
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2254 ASRPASVAESIKDEAEKSKEESRRESvAEKSPLPSKEASRPASVAESIKDEAEKSKEETRRESVAEKSPLPSKEASRPAS 2333
Cdd:NF041483  512 IERATTLRRQAEETLERTRAEAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERLTA 590
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 2334 VAESIKDeAEKSKEESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGKAQSI 2400
Cdd:NF041483  591 AEEALAD-ARAEAERIRREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADASAARAEGENV 656
rad2 TIGR00600
3893-4433 4.63e-06

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 53.36  E-value: 4.63e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3893 KDEAEKSKE-ESRRESVAEKS-PLPSKEASRPTSVAESVKDEADkskeesrreSGAEKSPLASMEASRPTSVAESVKDET 3970
Cdd:TIGR00600  269 RDEGGFLKEvELRRVVSEDTShYILIKGIQGKTAVKAVDSDDES---------LPSLSSQLDSNSEDLKSSPWEKLKPES 339
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3971 EKSkeeSRRESVTEKSpLPSKEASRPTSVAESvKDEAEKSKEESRRESVAEKSPLASKESSRPAsvAESIKDEAEGTKQE 4050
Cdd:TIGR00600  340 ESI---VEAEPPSPRT-LLAKQAAMSESSSED-SDESEWERQELKRNNVAFVDDGSLSPRTLQA--IGQALDDDEDKKVS 412
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  4051 SRRESMPESGKAESIkgdqssLASKETSRPDSVVESVKDETEKPEGSAIDKSQVASRPESVAVSAKdeksplhsrPESVA 4130
Cdd:TIGR00600  413 ASSDDQASPSKKTKM------LLISRIEVEDDDLDYLDQGEGIPLMAALQLSSVNSKPEAVASTKI---------AREVT 477
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  4131 DKSPDASKEASRSLSVAETASSPIeegPRSIADLSLPLNLTGEAKGKLPTLSSPIDVAEGDFLEVKAESSPRPAVLSKPA 4210
Cdd:TIGR00600  478 SSGHEAVPKAVQSLLLGATNDSPI---PSEFTILDRKSELSIERTVKPVSSEFGLPSQREDKLAIPTEGTQNLQGISDHP 554
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  4211 EFSQPDTGHTASTPVDEASPVLEE-----IEVVEQHTTSGVgatgaTAETDLLDLTETksetvtkqsettlfetltskve 4285
Cdd:TIGR00600  555 EQFEFQNELSPLETKNNESNLSSDaetegSPNPEMPSWSSV-----TVPSEALDNYET---------------------- 607
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  4286 skveVLESSVKQVEEKVQTSVKQAETTVTDSLEQLTKKSSEQLTEI---KSVLDTNFEEVAKIVADVAKVLKSDKDITDI 4362
Cdd:TIGR00600  608 ----TNPSNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESekeESESDGSFIEVDSVSSTLELQVPSKSQPTDE 683
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 161077523  4363 IPDFDERQLEEKLKSTADTEEESDKSTRDEKSLEISVKVEIESEKSSPDQKSgpISIEEKDKIEQSEKAQL 4433
Cdd:TIGR00600  684 SEENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQD--ISLEELEALEANLLAEQ 752
PspC_subgroup_1 NF033838
3298-3580 2.14e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 51.17  E-value: 2.14e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3298 EASRPESEAESLKDAAAPSQETSRPESVTESVKDGKSPVASK--EASRPASVAENAKDSADESKEQRPESLpqskagsiK 3375
Cdd:NF033838  175 ELEIAESDVEVKKAELELVKEEAKEPRDEEKIKQAKAKVESKkaEATRLEKIKTDREKAEEEAKRRADAKL--------K 246
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3376 DEKSPLASKDEAEKSKEESRRESVAEQFPLVSKEvSRPASVAESVKDEAEKSKEESPlmSKEASRPASVAGSVKDEAEKS 3455
Cdd:NF033838  247 EAVEKNVATSEQDKPKRRAKRGVLGEPATPDKKE-NDAKSSDSSVGEETLPSPSLKP--EKKVAEAEKKVEEAKKKAKDQ 323
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3456 KEESRR----------ESVAEKSPLPSKEASrpasvAESVKDEADKSKEESRRESGAEKSPLASKEASRpasvAESIKDE 3525
Cdd:NF033838  324 KEEDRRnyptntyktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTD 394
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 3526 AEKSKEESRR-----ESVAEK-------SPLPSKEASRPTSVAESVKDEAEKSKEESRRDSVAEKSP 3580
Cdd:NF033838  395 RKKAEEEAKRkaaeeDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
MSCRAMM_ClfA NF033609
3466-3827 2.87e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.68  E-value: 2.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3466 EKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEkSPLP 3545
Cdd:NF033609  553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD-SDSA 631
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3546 SKEASRPTSVAESVKDEAEKSKEESRRDSVAEkSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSplaSKEASR 3625
Cdd:NF033609  632 SDSDSASDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSD 707
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3626 PASVAESIKDEAEKSKEESRRESVAEkSPLASKEASRPTSVAESVKDEAEKSKEESSRDSVAEkSPLASKEASRPASVAE 3705
Cdd:NF033609  708 SDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSD 785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3706 SVQDEAEKSKEESRRESVAEkSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKSplaSKEASRPASVAESVKDEA 3785
Cdd:NF033609  786 SDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSESDSN 861
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 161077523 3786 EKSKEESRRESVAEKSPLPSKEASRptsvaesvKDEAEKSKE 3827
Cdd:NF033609  862 SDSESGSNNNVVPPNSPKNGTNASN--------KNEAKDSKE 895
Smc COG1196
942-1476 6.70e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 49.55  E-value: 6.70e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  942 SADEATAAKKLQDLTASQELDAEKQRELDDLKEEQEvvrEIEAVFSRDEMKRQQHQQIKAELREMPAEGTGDGENEPDEE 1021
Cdd:COG1196   263 AELEAELEELRLELEELELELEEAQAEEYELLAELA---RLEQDIARLEERRRELEERLEELEEELAELEEELEELEEEL 339
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1022 EEYL--IIEKEEVEQYTEDSIVEQESSMTKEEEIQKHQRDSQESEKKRKKSAEEEIEAAIAKVEAAERkaRLEGASARQD 1099
Cdd:COG1196   340 EELEeeLEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEE--AEEALLERLE 417
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1100 ESELDVEPEQSKIKAEVQDIIATAKDIAKSRTEEQLAKPAEEELSSPTPEEKLSKKTSDTKDDQIGAPVDVLpVNLQESL 1179
Cdd:COG1196   418 RLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEA-AARLLLL 496
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1180 PEEKFSATIESGATTAPTLPEDERIPLDQIKEDLVIEEKYVKEETKEAEAIVVATVQTLPEAAPLAIDTIlasatkdapK 1259
Cdd:COG1196   497 LEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYL---------K 567
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1260 DANAEALGELPDSGERVLPMKMTFEAQQNLLRDViktpDEVADLPVheEADLGLYEKDSQDAGAKSISHKEESAKEEKET 1339
Cdd:COG1196   568 AAKAGRATFLPLDKIRARAALAAALARGAIGAAV----DLVASDLR--EADARYYVLGDTLLGRTLVAARLEAALRRAVT 641
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1340 DDEKENKVGEIELGDEPNKVDISHvllkeSVQEVAEKVVVIETTVEKKQEEIVEATTVITQENQEDLMEQVKDKEEHEQK 1419
Cdd:COG1196   642 LAGRLREVTLEGEGGSAGGSLTGG-----SRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEER 716
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 1420 IESGIITEKEAKKSASTPEEKETSDITSDDELPAQLADPTTVPPKSAKDREDTGSIE 1476
Cdd:COG1196   717 LEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLE 773
PspC_subgroup_1 NF033838
1907-2174 1.39e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 48.47  E-value: 1.39e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1907 DSKEPSRPESKVECLKDESEVLKG-STRRESVAESDKSSQPFKEtsrpESAVGSMKDESMSKEPSRResVKDGAAQsrET 1985
Cdd:NF033838  202 DEEKIKQAKAKVESKKAEATRLEKiKTDREKAEEEAKRRADAKL----KEAVEKNVATSEQDKPKRR--AKRGVLG--EP 273
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1986 SRPASVAESAKDGADDLKELSRPESttqskeagSIKDEKSPLASEEAsrpasvAESVKDEAEKSKEESRR---------- 2055
Cdd:NF033838  274 ATPDKKENDAKSSDSSVGEETLPSP--------SLKPEKKVAEAEKK------VEEAKKKAKDQKEEDRRnyptntyktl 339
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2056 ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----E 2130
Cdd:NF033838  340 ELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRkaaeeD 410
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 161077523 2131 SVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSP 2174
Cdd:NF033838  411 KVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1777-2137 1.52e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 48.47  E-value: 1.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1777 EKSPLTSKDISRPESAVENVMDAVGSAERSQPEsvtASRDVSRPESVAES--EKDDTDKPESVVESVipasdvvEIEKGA 1854
Cdd:NF033838  111 SEAELTSKTKKELDAAFEQFKKDTLEPGKKVAE---ATKKVEEAEKKAKDqkEEDRRNYPTNTYKTL-------ELEIAE 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1855 AD---KEKGVFVSLEIGKPDSPSEVISRPGPVVESVKPESRRESSTEIVLPCHAEDSKEPSRPESKVECLKD----ESEV 1927
Cdd:NF033838  181 SDvevKKAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNvatsEQDK 260
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1928 LKGSTRRESVAESDKSSQPFKETSRPESAVGsmkdESMSKEPSRRESVKDGAAQSRetsrpasvAESAKDGADDLKELSR 2007
Cdd:NF033838  261 PKRRAKRGVLGEPATPDKKENDAKSSDSSVG----EETLPSPSLKPEKKVAEAEKK--------VEEAKKKAKDQKEEDR 328
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2008 PESTTQSKEAGSIKDEKSPLASEEASrpasvAESVKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSK 2087
Cdd:NF033838  329 RNYPTNTYKTLELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAE 399
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 161077523 2088 EESRR-----ESVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSP 2137
Cdd:NF033838  400 EEAKRkaaeeDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1393-2320 2.25e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 48.04  E-value: 2.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1393 EATTVITQENQEDLMEQVKDKEEHEQKIESGIITEKEAKKSASTPEEKETSDITSddelpaqladpttvppksaKDREDT 1472
Cdd:pfam02463  133 EAYNFLVQGGKIEIIAMMKPERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAE-------------------LIIDLE 193
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1473 GSIESPPTIEEAIEVEVQAKQEAQKPVPAPEEAIKTEKsplaSKETSRPESATGSVKEDTEQTKSKKSPVPSRPESEAKD 1552
Cdd:pfam02463  194 ELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDY----LKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQ 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1553 KKSPFASGEASRPESVAESVKDEAGKAESRRESIAKTHKDESSLDKAKEQESRRESLAESIKPESGIDEKSALASKEASR 1632
Cdd:pfam02463  270 VLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEI 349
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1633 PESVTDKSKEPSRRESIAESLKAESTKDEKSappSKEASRPGSVVESVKDETEKSKEPSRRESIAESAkppiefrevsrp 1712
Cdd:pfam02463  350 KREAEEEEEEELEKLQEKLEQLEEELLAKKK---LESERLSSAAKLKEEELELKSEEEKEAQLLLELA------------ 414
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1713 ESVIDGIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIKSTEKSRRESVAESFKAdSTKDEKSPLTSKDISRPESA 1792
Cdd:pfam02463  415 RQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDL-LKETQLVKLQEQLELLLSRQ 493
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1793 VENVMDAVGSAERSqPESVTASRDVSRPESVAESEKDDTDKPESVVESVIPA-SDVVEIEKGAADKEKGVFVSLEIGKPD 1871
Cdd:pfam02463  494 KLEERSQKESKARS-GLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAiSTAVIVEVSATADEVEERQKLVRALTE 572
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1872 SPSEVISRPGPVVESVKPESR-RESSTEIVLPCHAEDSKEPS-RPESKVECLKDESEVLKGSTRRESVAESDKSSQPFKE 1949
Cdd:pfam02463  573 LPLGARKLRLLIPKLKLPLKSiAVLEIDPILNLAQLDKATLEaDEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGV 652
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1950 TSRPESAVGSMKDESMSKEPSRRESVKDGAAQSRETSRPASVAESAKDGAddLKELSRPESTTQSKEAGS-IKDEKSPLA 2028
Cdd:pfam02463  653 SLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIK--KKEQREKEELKKLKLEAEeLLADRVQEA 730
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  2029 SEEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASR 2108
Cdd:pfam02463  731 QDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEEL 810
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  2109 PASVAESIKDEAEKSKEESRRESVAEKSPLPSKeasrpasvaESIKDEAEKSKEESRRESVAEKsplpsKEASRPASVAE 2188
Cdd:pfam02463  811 KEEAELLEEEQLLIEQEEKIKEEELEELALELK---------EEQKLEKLAEEELERLEEEITK-----EELLQELLLKE 876
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  2189 SIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvaESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEA 2268
Cdd:pfam02463  877 EELEEQKLKDELESKEEKEKEEKKELEEESQ-----KLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKE 951
                          890       900       910       920       930
                   ....*....|....*....|....*....|....*....|....*....|..
gi 161077523  2269 EKSKEESRRESVAEKsplpSKEASRPASVAESIKDEAEKSKEETRRESVAEK 2320
Cdd:pfam02463  952 ENNKEEEEERNKRLL----LAKEELGKVNLMAIEEFEEKEERYNKDELEKER 999
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
948-1691 4.19e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 47.27  E-value: 4.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523   948 AAKKLQDLTASQELDAEKQRELddLKEEQEVVREIEAVFSRDEMKRQQHQQIKAELREMPAEGTGDGENEPDEEEEYLII 1027
Cdd:pfam02463  206 AKKALEYYQLKEKLELEEEYLL--YLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKL 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1028 EKEEVEQYTEDSIVEQESSMTKEEEIQKHQRDSQESEKKRKKSAEEEIEAAIAKVEAAERKARLEGASARQDESELDVEP 1107
Cdd:pfam02463  284 QEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEK 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1108 EQSKIKAEVQDIIATAKD------IAKSRTEEQLAKPAEEELSSPTPEEKLSKKTSDTKDDQIGAPVDVLpvNLQESLPE 1181
Cdd:pfam02463  364 LQEKLEQLEEELLAKKKLeserlsSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILE--EEEESIEL 441
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1182 EKFSATIESGATTAPTLPEDERIPLDQIKEDLVIEEKYVKEETKEAEAIVVATVQTL------------------PEAAP 1243
Cdd:pfam02463  442 KQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERsqkeskarsglkvllaliKDGVG 521
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1244 LAIDTILASATKDAPKDANAEALGELPDSGERVLpMKMTFEAQQNLLRDVIKTPDEVADLpvheeadlgLYEKDSQDAGA 1323
Cdd:pfam02463  522 GRIISAHGRLGDLGVAVENYKVAISTAVIVEVSA-TADEVEERQKLVRALTELPLGARKL---------RLLIPKLKLPL 591
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1324 KSISHKEESAKEEKETDDEKENKVGEIELGDEPNKVDISHVLLKESVQEVAEKVVVIETTVEKKQEEIVEATTVITQENQ 1403
Cdd:pfam02463  592 KSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSEL 671
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1404 ED---LMEQVKDKEEHEQKIESGIITEKEAKKSASTPEEKETSDITSDDELPA----QLADPTTVPPKSAKDREDTGSIE 1476
Cdd:pfam02463  672 TKellEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLAdrvqEAQDKINEELKLLKQKIDEEEEE 751
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1477 SPPTIEEAIEVEVQAKQEAQKPVPAPEEAIKTEKSPLASKETSRPESATGSVKE------------DTEQTKSKKSPVPS 1544
Cdd:pfam02463  752 EEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRAleeelkeeaellEEEQLLIEQEEKIK 831
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1545 RPESEAKDKKSPFASGEASRPESVAESVKDEAGKAESRRESIAKTHKDESSLDKAKEQESRRESLAESIkpesGIDEKSA 1624
Cdd:pfam02463  832 EEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKK----ELEEESQ 907
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523  1625 LASKEASRPESVTDKSKEPSRRESIAESLKAESTKDEKSAPPSKEASRPGSVVESVKDETEKSKEPS 1691
Cdd:pfam02463  908 KLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGK 974
growth_prot_Scy NF041483
polarized growth protein Scy;
1890-2399 5.32e-04

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 46.74  E-value: 5.32e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1890 ESRRESSTEIVLPCHAEDSKEPSRPESKVECLKDESEVLkgstRRESVAESDKSSQPFKETSRPESAVGSMKDESMSKEP 1969
Cdd:NF041483  567 AARQAEAAEELTRLHTEAEERLTAAEEALADARAEAERI----RREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEA 642
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1970 SRRESVKDGAAQS---RETSRPASVAESAKDGADDLKELSRPESTTQSKEAGSIKDEKSPLASEEASRPASVAE----SV 2042
Cdd:NF041483  643 AADASAARAEGENvavRLRSEAAAEAERLKSEAQESADRVRAEAAAAAERVGTEAAEALAAAQEEAARRRREAEetlgSA 722
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2043 KDEAEKSKEESRRES-----VAEKSPLPSK-EASRPASVAESIKDE---AEKSKEESRRESVAEKSPLPSKEAS--RPAS 2111
Cdd:NF041483  723 RAEADQERERAREQSeellaSARKRVEEAQaEAQRLVEEADRRATElvsAAEQTAQQVRDSVAGLQEQAEEEIAglRSAA 802
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2112 --VAESIKDEAEKSKEESRRESVAEKSPlPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPL---PSKEASRPASV 2186
Cdd:NF041483  803 ehAAERTRTEAQEEADRVRSDAYAERER-ASEDANRLRREAQEETEAAKALAERTVSEAIAEAERLrsdASEYAQRVRTE 881
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2187 AESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIK 2265
Cdd:NF041483  882 ASDTLASAEQDAARTRADAREDANRIRSDAAAQADRLIGEATSEAERLTAEARAEAERLRDeARAEAERVRADAAAQAEQ 961
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2266 DEAEKSKEESR-RESVAEKSPLPSKEASRPASVAESIKDEAEKSKEETRRESVAEksplpskeasrpasvAESIKDEAEK 2344
Cdd:NF041483  962 LIAEATGEAERlRAEAAETVGSAQQHAERIRTEAERVKAEAAAEAERLRTEAREE---------------ADRTLDEARK 1026
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 161077523 2345 SKEESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGKAQS 2399
Cdd:NF041483 1027 DANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVGAARK 1081
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
673-917 1.61e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.91  E-value: 1.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523   673 PEPADTGDEAAPTEQEPEAetepepehepeaeqdKDVGEEKKVEVLiMKPQQA--TPAVIAASGKDGVDAASADAT-PT- 748
Cdd:pfam05109  476 PTPAGTTSGASPVTPSPSP---------------RDNGTESKAPDM-TSPTSAvtTPTPNATSPTPAVTTPTPNATsPTl 539
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523   749 GKLSKASAKGKADKPRAEVKPVVrsriDTKPPKSMDRKLAKRDEKKSSPTTTPAARAPVAQNAKPKVLSRPATKSSPSST 828
Cdd:pfam05109  540 GKTSPTSAVTTPTPNATSPTPAV----TTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSST 615
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523   829 PAKSAKEANNRKVLESKQQAARVQATSTVSRRVT--------STASERRVQQQAEAKTAATGATQATQRKPISRRPRGVS 900
Cdd:pfam05109  616 PVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSsisetlspSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVS 695
                          250
                   ....*....|....*..
gi 161077523   901 PSKRAPAPGSPVKQAKP 917
Cdd:pfam05109  696 TSSPAPRPGTTSQASGP 712
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within ...
2409-2436 2.20e-03

Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 38.74  E-value: 2.20e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  2409 EVSRPESVAESVKDDPVK----SKEPSRRESV 2436
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
rad2 TIGR00600
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
1489-1954 2.64e-03

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 44.50  E-value: 2.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1489 VQAKQEaQKPVPAPEEAIKTEKSPLASKETSRPESATGSVKEDTEQTKSKKSPVPSRPE------SEAKDKKSPFASGEA 1562
Cdd:TIGR00600  297 IQGKTA-VKAVDSDDESLPSLSSQLDSNSEDLKSSPWEKLKPESESIVEAEPPSPRTLLakqaamSESSSEDSDESEWER 375
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1563 SRPESVAESVKDEAGKAESRRESIAKTHKDESSLD-KAKEQESRRESLAESIKPESGIDEK----SALASKEASRPESVT 1637
Cdd:TIGR00600  376 QELKRNNVAFVDDGSLSPRTLQAIGQALDDDEDKKvSASSDDQASPSKKTKMLLISRIEVEdddlDYLDQGEGIPLMAAL 455
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1638 DKSKEPSRRESIAeslkaeSTKDEKSAPPSKEASRPGSVVESVKDETEKSKEPSrrESIAESAKPPIEFREVSRPESvid 1717
Cdd:TIGR00600  456 QLSSVNSKPEAVA------STKIAREVTSSGHEAVPKAVQSLLLGATNDSPIPS--EFTILDRKSELSIERTVKPVS--- 524
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1718 gikDESAKPESRRDSPLASKEASRPESVLESVKDEPIKSTEKSRRESVAESFKADStkDEKSPLTSKDISRPESAVENVM 1797
Cdd:TIGR00600  525 ---SEFGLPSQREDKLAIPTEGTQNLQGISDHPEQFEFQNELSPLETKNNESNLSS--DAETEGSPNPEMPSWSSVTVPS 599
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1798 DAVGSAERSQPESVTASRDvsrpesVAESEKDDTDKPESVVESVIPASDVV---EIEKgAADKEKGVFVSLEIGKPDSPS 1874
Cdd:TIGR00600  600 EALDNYETTNPSNAKEVRN------FAETGIQTTNVGESADLLLISNPMEVepmESEK-EESESDGSFIEVDSVSSTLEL 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1875 EVisrpgPVVESVKPESrRESSTEIVlpchaeDSKEPSRPESKVECLKDESEVLKGSTRRESVAESDKSSQPFKETSRPE 1954
Cdd:TIGR00600  673 QV-----PSKSQPTDES-EENAENKV------ASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEE 740
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
2171-2490 3.09e-03

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall-anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 44.13  E-value: 3.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2171 EKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEkSPLP 2250
Cdd:NF033609  553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD-SDSA 631
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2251 SKEASRPASVAESIKDEAEKSKEESRRESVAEkSPLPSKEASRPASVAESIKDEAEKSKEETRRESVAEK-SPLPSKEAS 2329
Cdd:NF033609  632 SDSDSASDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDS 710
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2330 RPASVAESIKDEAEKSKEESRRESAAEkSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGK-AQSIKGDQSPLK 2408
Cdd:NF033609  711 DSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSD 789
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2409 EVSRPESVAESVKDDPVKSKEPSRRESVAGSVT-ADSARDDQSPLESKGASRPESVVDSVKDEAEKQESRRESKTES--- 2484
Cdd:NF033609  790 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESgsn 869

                  ....*...
gi 161077523 2485 --VIPPKA 2490
Cdd:NF033609  870 nnVVPPNS 877
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
2780-2945 3.41e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 43.81  E-value: 3.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2780 RSPVASTEISRPASAGETASSPIEEAPKDFAEFEQAEKAVLPLTIELKGNLptlSSPVDVAHGDFPQTSTPTSSPTVASV 2859
Cdd:PRK13108  298 REPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAES---VVQVADRDGESTPAVEETSEADIERE 374
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2860 QPAELSKvdiEKTASSPIDEAPKSLI-GCPAEERPESPAESAKDAAESVEKSKDASRP-PSVVESTKADSTKGDISPSPE 2937
Cdd:PRK13108  375 QPGDLAG---QAPAAHQVDAEAASAApEEPAALASEAHDETEPEVPEKAAPIPDPAKPdELAVAGPGDDPAEPDGIRRQD 451

                  ....*...
gi 161077523 2938 SVLEGPKD 2945
Cdd:PRK13108  452 DFSSRRRR 459
Name Accession Description Interval E-value
PTZ00121 PTZ00121
MAEBL; Provisional
3352-4195 1.42e-35

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 151.06  E-value: 1.42e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3352 KDSADESKEQRPESLPQSKAGSIKDEKSPLASKDEAEKSKEESRRESVAEQfplvSKEVSRPASVAESVKDEAEKSKEES 3431
Cdd:PTZ00121 1045 KDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEE----AFGKAEEAKKTETGKAEEARKAEEA 1120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3432 PLMSKEASRpasvAGSVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKe 3511
Cdd:PTZ00121 1121 KKKAEDARK----AEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRK- 1195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3512 ASRPASVAESIKDEAEKSKEESRRESVAEKSplpskEASRptSVAESVKDEAEKSKEESRRDSVAEKSPLASKEASRPAS 3591
Cdd:PTZ00121 1196 AEDARKAEAARKAEEERKAEEARKAEDAKKA-----EAVK--KAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3592 VAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRR-ESVAEKSPLASKEASRPTSVAESV 3670
Cdd:PTZ00121 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKaEEAKKKADAAKKKAEEAKKAAEAA 1348
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3671 KDEAEKSKEEssrdsvAEKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVK---DD 3747
Cdd:PTZ00121 1349 KAEAEAAADE------AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKkkaDE 1422
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3748 AEKSKEESRRESVAEKSPLASKEASRPASVAESVK--DEAEKSKEESRRESVAEKSPLPSKEASRPTSVAEsvkdEAEKS 3825
Cdd:PTZ00121 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKkaEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE----EAKKK 1498
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3826 KEESRRESVAEKSSLASKKASRPASVAESVKDEAEKSKEESRR-ESVAEKSPLASKEASRPasvAESVKDEAEKSKEESR 3904
Cdd:PTZ00121 1499 ADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKaEEKKKADELKKAEELKK---AEEKKKAEEAKKAEED 1575
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3905 RESVAEKSPLPSK-EASRPTSVAESVKDEADKSKEESRRESGA--EKSPLASMEASRPTSVAESVKDETEKSKEESRR-- 3979
Cdd:PTZ00121 1576 KNMALRKAEEAKKaEEARIEEVMKLYEEEKKMKAEEAKKAEEAkiKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKka 1655
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3980 --ESVTEKSPLPSKEASRPTSVAESVKDEAEKSKEE---SRRESVAEKSPLASKESSRPASVAESIKDEAEGTK---QES 4051
Cdd:PTZ00121 1656 eeENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAealKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKikaEEA 1735
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4052 RRESMPESGKAESIKGDQSSlaSKETSRPDSVVESVKDETEKPEGSAID---KSQVASRPESVAVSAKDEKSPLHSRPES 4128
Cdd:PTZ00121 1736 KKEAEEDKKKAEEAKKDEEE--KKKIAHLKKEEEKKAEEIRKEKEAVIEeelDEEDEKRRMEVDKKIKDIFDNFANIIEG 1813
                         810       820       830       840       850       860       870
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 161077523 4129 VADKSP--DASKEA--SRSLSVAETASSPIEEGPRSIADLSLPLNLTGEAKGKLPTLSSPIDVAEGDFLEV 4195
Cdd:PTZ00121 1814 GKEGNLviNDSKEMedSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEI 1884
PTZ00121 PTZ00121
MAEBL; Provisional
1890-2644 2.64e-24

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 114.08  E-value: 2.64e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1890 ESRRESSTEIVLPCHAEDSKEPSRPESKVECLKDESEVLKGSTRREsvAESDKSSQPFKETSRPESAVGSMKDESMSK-E 1968
Cdd:PTZ00121 1102 EAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARK--AEDAKRVEIARKAEDARKAEEARKAEDAKKaE 1179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1969 PSRR--------------ESVKDGAAQSRETSRPASVAESAKDgADDLKELSRPESTTQSKEAGSIKDEKSPLASEEASR 2034
Cdd:PTZ00121 1180 AARKaeevrkaeelrkaeDARKAEAARKAEEERKAEEARKAED-AKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFE 1258
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2035 PASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRR-ESVAEKSPLPSKEASRPASVA 2113
Cdd:PTZ00121 1259 EARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKaDEAKKKAEEAKKKADAAKKKA 1338
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2114 ESIKDEAE-KSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIK- 2191
Cdd:PTZ00121 1339 EEAKKAAEaAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKk 1418
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2192 --DEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIK--DEAEKSKEESRRESVAEKsplpSKEASRPASVAESIKDE 2267
Cdd:PTZ00121 1419 kaDEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKkaEEAKKKAEEAKKADEAKK----KAEEAKKADEAKKKAEE 1494
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2268 AEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEETRR-ESVAEKSPLPSKEASRPasvAESIKDEAEKSK 2346
Cdd:PTZ00121 1495 AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKaEEKKKADELKKAEELKK---AEEKKKAEEAKK 1571
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2347 EESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGKAQSIKGDQSPLK-----------EVSRPES 2415
Cdd:PTZ00121 1572 AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKkveqlkkkeaeEKKKAEE 1651
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2416 VAESVKDDPVKSKEPSRR--------ESVAGSVTADSARDDQSPLESKGASRPESVVDSVKDEAEKQESRRESKTESvip 2487
Cdd:PTZ00121 1652 LKKAEEENKIKAAEEAKKaeedkkkaEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN--- 1728
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2488 pKAKDDKSPKEVLQPVSMTETIREDADQPMKPSQAESRRESIAESIKASSPRDEKSPLASKEASRPGSVAESIKYDLDKP 2567
Cdd:PTZ00121 1729 -KIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNF 1807
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2568 QII-----KDDKSTEHSRRESLEDKSAVTSEKSVSRPLSVASDHEAAVAIEDDAKSSISPKDKSRPGFVAETVSSPIEEA 2642
Cdd:PTZ00121 1808 ANIieggkEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEA 1887

                  ..
gi 161077523 2643 TM 2644
Cdd:PTZ00121 1888 DE 1889
PTZ00121 PTZ00121
MAEBL; Provisional
1708-2499 3.79e-24

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 113.31  E-value: 3.79e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1708 EVSRPESVIDGIKDESAKPESRRDSPLASKEASRPesvleSVKDEPIKSTEKSRR-ESVAESFKADSTKDEKSPLTSKDI 1786
Cdd:PTZ00121 1040 DVLKEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKP-----SYKDFDFDAKEDNRAdEATEEAFGKAEEAKKTETGKAEEA 1114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1787 SRPESAVENVMDAVGSAERSQPESVTASRDVSRPESVAESEKDDTDKPESVVESVIPASDVVEIEkgAADKEKGVFVSLE 1866
Cdd:PTZ00121 1115 RKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAE--AARKAEEVRKAEE 1192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1867 IGKPDSpsevisrpgpvVESVKPESRRESSTEIVLPCHAEDSKEPSRPESKVECLKDESEVLKGSTRRESVAESDKSSQP 1946
Cdd:PTZ00121 1193 LRKAED-----------ARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEAR 1261
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1947 FKETSRPESAVgsmKDESMSKEPSRRESVKDGAAQSRETSRPASVAESAKDGADDLKelsRPESTTQSKEAGSIKDEKSP 2026
Cdd:PTZ00121 1262 MAHFARRQAAI---KAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAK---KADEAKKKAEEAKKKADAAK 1335
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2027 LASEEASRpasvaesvKDEAEKSKEESRRESvAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEA 2106
Cdd:PTZ00121 1336 KKAEEAKK--------AAEAAKAEAEAAADE-AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2107 SRPASVAESIK---DEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIK--DEAEKSKEESRRESVAEKsplpSKEAS 2181
Cdd:PTZ00121 1407 ADELKKAAAAKkkaDEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKkaEEAKKKAEEAKKADEAKK----KAEEA 1482
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2182 RPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRR-ESVAEKSPLPSKEASRPasv 2260
Cdd:PTZ00121 1483 KKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKaEEKKKADELKKAEELKK--- 1559
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2261 AESIKDEAEKSKEESRRESVAEKSPLPSK-EASRPASVAESIKDEAEKSKEETRRESVAE-KSPLPSKEASRPASVAESI 2338
Cdd:PTZ00121 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKKaEEARIEEVMKLYEEEKKMKAEEAKKAEEAKiKAEELKKAEEEKKKVEQLK 1639
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2339 KDEAEKSKEESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGKAQSIKGDQSPLKEVSRPESVAE 2418
Cdd:PTZ00121 1640 KKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAE 1719
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2419 SVKddpvksKEPSRRESVAGSVTADSARDDQSPLESKGASRPESVVDSVKDEAEKQESRRESKTESVIPP--KAKDDKSP 2496
Cdd:PTZ00121 1720 ELK------KAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEelDEEDEKRR 1793

                  ...
gi 161077523 2497 KEV 2499
Cdd:PTZ00121 1794 MEV 1796
PTZ00121 PTZ00121
MAEBL; Provisional
3027-3837 8.20e-24

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 112.16  E-value: 8.20e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3027 SKEASRRESVVESSKDDAEKSESRPESVIASGEPVPRESKSPLDSKDTSRPGSMVESVTAEDEKSEQQSRRESVAESVKa 3106
Cdd:PTZ00121 1083 AKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAE- 1161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3107 DTKKDGKSQEASRPSSVDELLKDDDEK--QESRRQSITGSHKAMSTMGDESPMDKADKSKEPSRPESVAESikHENTKDE 3184
Cdd:PTZ00121 1162 DARKAEEARKAEDAKKAEAARKAEEVRkaEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKA--EEAKKDA 1239
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3185 ESPLGSRRDSVAESIKSDITKGEKSPLPSKEVSRPESVVGSIKDEKAESRRESvAESVKPESSKDATSAPPSKEHSRPES 3264
Cdd:PTZ00121 1240 EEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKA-DEAKKAEEKKKADEAKKKAEEAKKAD 1318
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3265 VLGSLKDEGDKTTSRRVSVADSIKDEKSLLVSQEASRPESEAESLKDAAAPSQETSRPESVTESVKdgKSPVASKEASRP 3344
Cdd:PTZ00121 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK--KKAEEKKKADEA 1396
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3345 ASVAENAKDSADESKEQRPEslpQSKAGSIKDEKSPLASKDEAEKSKEESRRESVAeqfplvsKEVSRPASVAESVKDEA 3424
Cdd:PTZ00121 1397 KKKAEEDKKKADELKKAAAA---KKKADEAKKKAEEKKKADEAKKKAEEAKKADEA-------KKKAEEAKKAEEAKKKA 1466
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3425 EKSKEESPLMSK-EASRPASVAGSVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAE 3503
Cdd:PTZ00121 1467 EEAKKADEAKKKaEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKK 1546
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3504 KSPLASKeaSRPASVAESIKDEAEKSKEESRRESVAEKSPLPSK-EASRPTSVAESVKDEAEKSKEESRRDSVAE-KSPL 3581
Cdd:PTZ00121 1547 KADELKK--AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKaEEARIEEVMKLYEEEKKMKAEEAKKAEEAKiKAEE 1624
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3582 ASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRR--ESVAEKSPLASKE 3659
Cdd:PTZ00121 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKaaEALKKEAEEAKKA 1704
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3660 ASRPTSVAESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASvaESVQDEAEKSKEESRRESVAEKSPLASKEasRPAS 3739
Cdd:PTZ00121 1705 EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE--EAKKDEEEKKKIAHLKKEEEKKAEEIRKE--KEAV 1780
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3740 VAESVKDDAEKSKEESRRESVAEKSPLA-----SKEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSV 3814
Cdd:PTZ00121 1781 IEEELDEEDEKRRMEVDKKIKDIFDNFAniiegGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNEN 1860
                         810       820
                  ....*....|....*....|...
gi 161077523 3815 AESVKDEAEKSKEESRRESVAEK 3837
Cdd:PTZ00121 1861 GEDGNKEADFNKEKDLKEDDEEE 1883
PTZ00121 PTZ00121
MAEBL; Provisional
2878-3754 1.31e-15

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 85.19  E-value: 1.31e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2878 DEAPKSLIGCPAEERPESPAESAKDAAESVEKSKDASRPPSVVESTKADSTKgdispspeSVLEGPKDDVEKSKESSRPP 2957
Cdd:PTZ00121 1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAK--------RVEIARKAEDARKAEEARKA 1172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2958 SVSASITGDSTKDVSRPASVVESVKDEHDKAESRResiAKVESVIDEAGKSDSKSSSQDSQKDEKSTLASKEASRRESvV 3037
Cdd:PTZ00121 1173 EDAKKAEAARKAEEVRKAEELRKAEDARKAEAARK---AEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE-E 1248
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3038 ESSKDDAEKSESRPESVIASGEPVPRESKSPLDSKDTSRPGSMVESVTAEDEKSEQQSRRESVAESVKADTKKDgKSQEA 3117
Cdd:PTZ00121 1249 RNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK-KAEEA 1327
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3118 SRPSsvDELLKDDDEKQESRRQSITGSHKAmstmgdespMDKADKSKEPSRPESVAESikHENTKDEESPLGSRRDSVAE 3197
Cdd:PTZ00121 1328 KKKA--DAAKKKAEEAKKAAEAAKAEAEAA---------ADEAEAAEEKAEAAEKKKE--EAKKKADAAKKKAEEKKKAD 1394
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3198 SIKSditKGEKSPLPSKEVSRPESVVGSIKD--EKAESRRESVAESVKPESSKDATSAPPSKEHSRPESVLGSLKDEGDK 3275
Cdd:PTZ00121 1395 EAKK---KAEEDKKKADELKKAAAAKKKADEakKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3276 TTSRRVSVADSIKDEKSLLVSQEASRPESEAESLKDAAAPSQETSRPEsvtesvkdgkspvaSKEASRPASVAENAKdSA 3355
Cdd:PTZ00121 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAE--------------EAKKADEAKKAEEAK-KA 1536
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3356 DESKEQRpeslPQSKAGSIKdeKSPLASKDEAEKSKEESRRESVAEQFPLVSKEVSRPASvaESVKDEAEKSKEESPLMS 3435
Cdd:PTZ00121 1537 DEAKKAE----EKKKADELK--KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE--EARIEEVMKLYEEEKKMK 1608
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3436 KEASRPAsvagsvKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEASRP 3515
Cdd:PTZ00121 1609 AEEAKKA------EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK 1682
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3516 ASVAESIKDEAEKSKEESRR--ESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDSvaEKSPLASKEASRPASVA 3593
Cdd:PTZ00121 1683 AEEDEKKAAEALKKEAEEAKkaEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK--KKAEEAKKDEEEKKKIA 1760
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3594 ESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDeaEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDE 3673
Cdd:PTZ00121 1761 HLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKD--IFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADS 1838
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3674 AEKSKEES--------SRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEE--------SRRESVAEKSPLASKEASRP 3737
Cdd:PTZ00121 1839 KNMQLEEAdafekhkfNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIekidkddiEREIPNNNMAGKNNDIIDDK 1918
                         890
                  ....*....|....*..
gi 161077523 3738 ASVAESVKDDAEKSKEE 3754
Cdd:PTZ00121 1919 LDKDEYIKRDAEETREE 1935
growth_prot_Scy NF041483
polarized growth protein Scy;
3229-3983 1.63e-11

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 71.40  E-value: 1.63e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3229 EKAESRRESVAESVKPESSKDATSAppskehsrpESVLGSLKDEGDKTTSRRVSVADSIKDEKSLLVSQEASRPESEAES 3308
Cdd:NF041483  349 EAAEKARTVAAEDTAAQLAKAARTA---------EEVLTKASEDAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEQ 419
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3309 LKDAAAPSQETSRPESVtESVKDGKSPVASKEASRPASVAENAKDSAdeskEQRPESLPQSKAGSIKDEKSPLASKDEAE 3388
Cdd:NF041483  420 LKGAAKDDTKEYRAKTV-ELQEEARRLRGEAEQLRAEAVAEGERIRG----EARREAVQQIEEAARTAEELLTKAKADAD 494
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3389 kskeESRRESVAEQFPLVSKEVSRPASVAESVKDEAEKSKEESPLMSKEASRPA--------SVAGSVKDEAEKSKEESR 3460
Cdd:NF041483  495 ----ELRSTATAESERVRTEAIERATTLRRQAEETLERTRAEAERLRAEAEEQAeevraaaeRAARELREETERAIAARQ 570
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3461 RESVAEKSPLPSKEASRPASvAESVKDEADKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAEKSKEESRRE---S 3537
Cdd:NF041483  571 AEAAEELTRLHTEAEERLTA-AEEALADARAEAERIRREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADasaA 649
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3538 VAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRrdsvAEKSPLASKEASRPASVAESVQDEAEKskeesRRESVAEKSP 3617
Cdd:NF041483  650 RAEGENVAVRLRSEAAAEAERLKSEAQESADRVR----AEAAAAAERVGTEAAEALAAAQEEAAR-----RRREAEETLG 720
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3618 LASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDEAekskeESSRDSVAEKSPLASKEA 3697
Cdd:NF041483  721 SARAEADQERERAREQSEELLASARKRVEEAQAEAQRLVEEADRRATELVSAAEQTA-----QQVRDSVAGLQEQAEEEI 795
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3698 SRPASVAESVQD----EAEKSKEESRRESVAEKSPlASKEASRPASVAESVKDDAEKSKEESRRESVAEKSPL---ASKE 3770
Cdd:NF041483  796 AGLRSAAEHAAErtrtEAQEEADRVRSDAYAERER-ASEDANRLRREAQEETEAAKALAERTVSEAIAEAERLrsdASEY 874
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3771 ASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRES---VAEKSSLASKKASR 3847
Cdd:NF041483  875 AQRVRTEASDTLASAEQDAARTRADAREDANRIRSDAAAQADRLIGEATSEAERLTAEARAEAerlRDEARAEAERVRAD 954
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3848 PASVAESVKDEAEKSKEESRRESvAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEksplpskeasrptsvAE 3927
Cdd:NF041483  955 AAAQAEQLIAEATGEAERLRAEA-AETVGSAQQHAERIRTEAERVKAEAAAEAERLRTEAREE---------------AD 1018
                         730       740       750       760       770
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 3928 SVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAESVKDETEKSKEESRRESVT 3983
Cdd:NF041483 1019 RTLDEARKDANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADT 1074
growth_prot_Scy NF041483
polarized growth protein Scy;
3513-4055 2.06e-11

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 71.40  E-value: 2.06e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3513 SRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDsvaeksplaskeASRPASV 3592
Cdd:NF041483  159 ARTESQARRLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAILRRARKD------------AERLLNA 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3593 AeSVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRREsvAEKSPLASKE-ASRPTSVAESVK 3671
Cdd:NF041483  227 A-STQAQEATDHAEQLRSSTAAESDQARRQAAELSRAAEQRMQEAEEALREARAE--AEKVVAEAKEaAAKQLASAESAN 303
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3672 DEAEKSKEESSRDSVAEksplASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEA----SRPASVAESVKDD 3747
Cdd:NF041483  304 EQRTRTAKEEIARLVGE----ATKEAEALKAEAEQALADARAEAEKLVAEAAEKARTVAAEDTaaqlAKAARTAEEVLTK 379
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3748 AEKSKEESRRESVAEKSPL---ASKEASR----PASVAESVKDEAEKSKEESRRESVAEKsplpsKEASRPTSVAESVKD 3820
Cdd:NF041483  380 ASEDAKATTRAAAEEAERIrreAEAEADRlrgeAADQAEQLKGAAKDDTKEYRAKTVELQ-----EEARRLRGEAEQLRA 454
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3821 EA----EKSKEESRRESVaeksslasKKASRPASVAESVKDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEA 3896
Cdd:NF041483  455 EAvaegERIRGEARREAV--------QQIEEAARTAEELLTKAKADADELRSTATAESERVRTEAIERATTLRRQAEETL 526
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3897 EKSKEESRRESvAEKSPLPSKEASRPTSVAESVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAESVKDETEKSkEE 3976
Cdd:NF041483  527 ERTRAEAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERLTAAEEALADARAEA-ER 604
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3977 SRRESVTEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRE---SVAEKSPLASKESSRPASVAESIKDEAEGTKQESRR 4053
Cdd:NF041483  605 IRREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADasaARAEGENVAVRLRSEAAAEAERLKSEAQESADRVRA 684

                  ..
gi 161077523 4054 ES 4055
Cdd:NF041483  685 EA 686
growth_prot_Scy NF041483
polarized growth protein Scy;
3377-4089 3.01e-11

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 70.63  E-value: 3.01e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3377 EKSPLASKDEAEKSKEESRResvaeqfplvskevsRPASVAESVKDEAEKSKEESplmSKEASRPASVAGSVKDEAEKSK 3456
Cdd:NF041483  177 EQALAAARAEAERLAEEARQ---------------RLGSEAESARAEAEAILRRA---RKDAERLLNAASTQAQEATDHA 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3457 EESRRESVAEkSPLPSKEASRPASVAESVKDEADKSKEESRREsgAEKSPLASKE-ASRPASVAESIKDEAEKSKEESRR 3535
Cdd:NF041483  239 EQLRSSTAAE-SDQARRQAAELSRAAEQRMQEAEEALREARAE--AEKVVAEAKEaAAKQLASAESANEQRTRTAKEEIA 315
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3536 ESVAEKsplpskeasrpTSVAESVKDEAEKSKEESRRDsvAEKSPLASKEASRPASVAESVqdeAEKSKEESRRESVAEK 3615
Cdd:NF041483  316 RLVGEA-----------TKEAEALKAEAEQALADARAE--AEKLVAEAAEKARTVAAEDTA---AQLAKAARTAEEVLTK 379
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3616 SPLASKEASRPASV-AESIKDEAEKSKEESRRES--VAEKSPLASK---------------EASRPTSVAESVKDEA--- 3674
Cdd:NF041483  380 ASEDAKATTRAAAEeAERIRREAEAEADRLRGEAadQAEQLKGAAKddtkeyraktvelqeEARRLRGEAEQLRAEAvae 459
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3675 -EKSKEESSRDSVaeksplasKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKE 3753
Cdd:NF041483  460 gERIRGEARREAV--------QQIEEAARTAEELLTKAKADADELRSTATAESERVRTEAIERATTLRRQAEETLERTRA 531
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3754 ESRRESvAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSvAESVKDEAEKSKEESRRES 3833
Cdd:NF041483  532 EAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERLTA-AEEALADARAEAERIRREA 609
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3834 VAEKSSLASKKASRPASVAESVKDEAEKSKEESRRE---SVAEKSPLASKEASRPASVAESVKDEAEKSKEESRRES--- 3907
Cdd:NF041483  610 AEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADasaARAEGENVAVRLRSEAAAEAERLKSEAQESADRVRAEAaaa 689
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3908 -------VAEKSPLPSKEASRPTSVAE----SVKDEADKSKEESRRES------GAEKSPLASMEASRPTSVAESVKDET 3970
Cdd:NF041483  690 aervgteAAEALAAAQEEAARRRREAEetlgSARAEADQERERAREQSeellasARKRVEEAQAEAQRLVEEADRRATEL 769
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3971 EKSKEE---SRRESVTEKSPLPSKEASRPTS----VAESVKDEAEKSKEESRRESVAEKSPlASKESSRPASVAesiKDE 4043
Cdd:NF041483  770 VSAAEQtaqQVRDSVAGLQEQAEEEIAGLRSaaehAAERTRTEAQEEADRVRSDAYAERER-ASEDANRLRREA---QEE 845
                         730       740       750       760
                  ....*....|....*....|....*....|....*....|....*..
gi 161077523 4044 AEGTKQESRRESMPESGKAESIKGDQSSLASK-ETSRPDSVVESVKD 4089
Cdd:NF041483  846 TEAAKALAERTVSEAIAEAERLRSDASEYAQRvRTEASDTLASAEQD 892
growth_prot_Scy NF041483
polarized growth protein Scy;
3439-4062 2.25e-10

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 67.93  E-value: 2.25e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3439 SRPASVAGSVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRresgaeksplasKEASRPASV 3518
Cdd:NF041483  159 ARTESQARRLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAILRRAR------------KDAERLLNA 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3519 AESIKDEAEKSKEESRRESVAEkSPLPSKEASRPTSVAESVKDEAEKSKEESRRDsvAEKSPLASKE-ASRPASVAESVQ 3597
Cdd:NF041483  227 ASTQAQEATDHAEQLRSSTAAE-SDQARRQAAELSRAAEQRMQEAEEALREARAE--AEKVVAEAKEaAAKQLASAESAN 303
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3598 DEAEKSKEESRRESVAEksplASKEasrpasvAESIKDEAEKSKEESRREsvAEKSPLASKEASRPTSVAESVkdeAEKS 3677
Cdd:NF041483  304 EQRTRTAKEEIARLVGE----ATKE-------AEALKAEAEQALADARAE--AEKLVAEAAEKARTVAAEDTA---AQLA 367
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3678 KEESSRDSVAEKSPLASKEASRPASvaesvqDEAEKskeeSRRESVAEksplASKEASRPASVAESVKDDAEKSKEESRR 3757
Cdd:NF041483  368 KAARTAEEVLTKASEDAKATTRAAA------EEAER----IRREAEAE----ADRLRGEAADQAEQLKGAAKDDTKEYRA 433
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3758 ESVAeksplASKEASRPASVAESVKDEA----EKSKEESRRESVAEksplpSKEASRptsVAESVKDEAEKSKEESRRES 3833
Cdd:NF041483  434 KTVE-----LQEEARRLRGEAEQLRAEAvaegERIRGEARREAVQQ-----IEEAAR---TAEELLTKAKADADELRSTA 500
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3834 VAEKSSLASKKASRPASVAESVKDEAEKSKEESRRESvAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEKSP 3913
Cdd:NF041483  501 TAESERVRTEAIERATTLRRQAEETLERTRAEAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTR 579
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3914 LPSKEASRPTSvAESVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAESVKDETEKSKEESRRE---SVTEKSPLPS 3990
Cdd:NF041483  580 LHTEAEERLTA-AEEALADARAEAERIRREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADasaARAEGENVAV 658
                         570       580       590       600       610       620       630
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 161077523 3991 KEASRPTSVAESVKDEAEKSKEESRRESVAEKSPLASKESSRPASVAESI---KDEAEGTKQESRRESMPESGKA 4062
Cdd:NF041483  659 RLRSEAAAEAERLKSEAQESADRVRAEAAAAAERVGTEAAEALAAAQEEAarrRREAEETLGSARAEADQERERA 733
PTZ00121 PTZ00121
MAEBL; Provisional
1464-2163 1.55e-09

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 65.16  E-value: 1.55e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1464 KSAKDREDTGSIESPPTIEEAIEVEVQAKQEAQKPVPA--PEEAIKTEKSPLA-----SKETSRPESATGSVKEDTEQTK 1536
Cdd:PTZ00121 1240 EEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEArkADELKKAEEKKKAdeakkAEEKKKADEAKKKAEEAKKADE 1319
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1537 SKKSPVPSRPESEAKDKKSPFASGEASRPESVAESVKDEAGKAESRRESIAKTHKDESSLDKAKEQESRRESLAESIKPE 1616
Cdd:PTZ00121 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1617 SGIDEKSALASKEASRPESVTDKSKEPSRRESIAESLKAESTKDEKSAPPSKEASRPGSVVESVKDETEKSKepsrresi 1696
Cdd:PTZ00121 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK-------- 1471
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1697 AESAKPPIEfrEVSRPESVIDGIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIKSTEKSRRESvaESFKADSTKD 1776
Cdd:PTZ00121 1472 ADEAKKKAE--EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKAD--EAKKAEEKKK 1547
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1777 EKSPLTSKDISRPESavenvmdaVGSAERSQPESVTASRDVSRPESVAESEKddtDKPESVVESVIPASDVVEIEKGAAD 1856
Cdd:PTZ00121 1548 ADELKKAEELKKAEE--------KKKAEEAKKAEEDKNMALRKAEEAKKAEE---ARIEEVMKLYEEEKKMKAEEAKKAE 1616
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1857 KEKGVFVSLEIGKPDSPSEVISRPGPVVESVKPESRRESSTEIVLPCHAEDSKEPSRPESKVECLKDESEVLKGSTRRES 1936
Cdd:PTZ00121 1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK 1696
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1937 VAESDKSSQPFKETSRPESAVGSMKDESMSKEPSRRESVKDGAAQSRETSRPASVAESAKDGADDLKELSRPESTTQSKE 2016
Cdd:PTZ00121 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2017 AGSIKDEKspLASEEASRPASVAESVKDEAEKSK--EESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKE----ES 2090
Cdd:PTZ00121 1777 KEAVIEEE--LDEEDEKRRMEVDKKIKDIFDNFAniIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAfekhKF 1854
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2091 RRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEAS--------RPASVAESIKDEAEKSKE 2162
Cdd:PTZ00121 1855 NKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMAgknndiidDKLDKDEYIKRDAEETRE 1934

                  .
gi 161077523 2163 E 2163
Cdd:PTZ00121 1935 E 1935
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
2002-2322 1.62e-09


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 64.65  E-value: 1.62e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2002 LKELSRPESTTQSKEAGSIKDEKSPLASEEASRPASVAESVKDEAEK----SKEESRR----------ESVAEKSPLPSK 2067
Cdd:NF033838  107 LKEKSEAELTSKTKKELDAAFEQFKKDTLEPGKKVAEATKKVEEAEKkakdQKEEDRRnyptntyktlELEIAESDVEVK 186
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2068 EASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----ESVAEKSPLPSKE 2142
Cdd:NF033838  187 KAE-----LELVKEEAKEPRDEEKIKQAKAKVESKKAEATR----LEKIKTDREKAEEEAKRradakLKEAVEKNVATSE 257
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2143 ASRP------ASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEESRR-------- 2203
Cdd:NF033838  258 QDKPkrrakrGVLGEPATPDKKENDAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnyptntyk 337
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2204 --ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR---- 2277
Cdd:NF033838  338 tlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRkaae 408
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 161077523 2278 -ESVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEETRRESVAEKSP 2322
Cdd:NF033838  409 eDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
PTZ00121 PTZ00121
MAEBL; Provisional
580-1228 2.04e-09


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 64.78  E-value: 2.04e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  580 APAIQTVTSTRKSLKSAIEATPAPPSASYKTTKFSPVASAALAVQHPQQQDNKAKEAAAAAAAAAAAAASAATIARAKAD 659
Cdd:PTZ00121 1226 AEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD 1305
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  660 SMDTDAEPEHEAD-----PEPADTGDEAAPTEQEPEAETEPEPEHEPEAEQDKDVGEEKKVEVLIMKPQQAtpaviaasg 734
Cdd:PTZ00121 1306 EAKKKAEEAKKADeakkkAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA--------- 1376
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  735 KDGVDAASADATPTGKLSKASAKGKADKPRA-EVKPVVRSRIDTKPPKSMDRKLAKRDEKKSSptttpAARAPVAQNAKP 813
Cdd:PTZ00121 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKAdELKKAAAAKKKADEAKKKAEEKKKADEAKKK-----AEEAKKADEAKK 1451
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  814 KV-LSRPATKSSPSSTPAKSAKEANNRKVLESKQQAARVQATSTVSRRVTSTASERRVQQQAEAKTAATGATQATQRKPI 892
Cdd:PTZ00121 1452 KAeEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  893 SRRPRGVSPSKRAPAPGSPVKQAKP-KAADLKKTRLDKGGTTDSSLVSTPSADEATAAKKLQDLTASQELDAEKQRELDD 971
Cdd:PTZ00121 1532 EAKKADEAKKAEEKKKADELKKAEElKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEE 1611
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  972 LKEEQEVVREIEAVFSRDEMKRQQHQQIKAELREM-PAEGTGDGENEPDEEEEYLIIEKEEVEQYTEDSIVEQESSMTKE 1050
Cdd:PTZ00121 1612 AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKkKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAA 1691
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1051 EEIQKHQRDSQESEKKRKKSAEEEIEAAIAKVEAAERKARLEGASARQDESELDVEpEQSKIKAEVQDIIATAKDIAKSR 1130
Cdd:PTZ00121 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE-EAKKDEEEKKKIAHLKKEEEKKA 1770
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1131 TEEQLAKPA--EEELSSPTPEEKLS--KKTSDTKDDqigapvdvlpvnlqeslpeekFSATIESGATTAPTLPEDERIPL 1206
Cdd:PTZ00121 1771 EEIRKEKEAviEEELDEEDEKRRMEvdKKIKDIFDN---------------------FANIIEGGKEGNLVINDSKEMED 1829
                         650       660
                  ....*....|....*....|..
gi 161077523 1207 DQIKEdLVIEEKYVKEETKEAE 1228
Cdd:PTZ00121 1830 SAIKE-VADSKNMQLEEADAFE 1850
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
1948-2285 9.16e-09


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 61.95  E-value: 9.16e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1948 KETSRPESAVGSMKDESMSKEPSRRESVKDGAAQSRETSRpasvAESAKDGADDLKELSRPESTTQSKEAGSIKDEKSPL 2027
Cdd:NF033838  108 KEKSEAELTSKTKKELDAAFEQFKKDTLEPGKKVAEATKK----VEEAEKKAKDQKEEDRRNYPTNTYKTLELEIAESDV 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2028 ASEEASrpasvAESVKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----ESVAEKSPLP 2102
Cdd:NF033838  184 EVKKAE-----LELVKEEAKEPRDEEKIKQAKAKVESKKAEATR----LEKIKTDREKAEEEAKRradakLKEAVEKNVA 254
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2103 SKEASRP------ASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEESRR----- 2166
Cdd:NF033838  255 TSEQDKPkrrakrGVLGEPATPDKKENDAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnyptn 334
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2167 -----ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR- 2240
Cdd:NF033838  335 tyktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRk 405
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 2241 ----ESVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSP 2285
Cdd:NF033838  406 aaeeDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
3585-3799 1.61e-08


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 61.73  E-value: 1.61e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3585 EASRPASVAESVQDEAEKSKEESRRESVAEksplaskeasrpaSVAESIKDEAEKSKEESRRESVAEksplaSKEASRPT 3664
Cdd:PTZ00341  944 EANIEEDAEENVEEDAEENVEENVEENVEE-------------NVEENVEENVEENVEENVEENVEE-----NVEENIEE 1005
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3665 SVAESVKDEAEKSKEESSRDSVAEkspLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEasrpasVAESV 3744
Cdd:PTZ00341 1006 NVEENVEENIEENVEEYDEENVEE---VEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEE------IEENI 1076
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 161077523 3745 KDDAEKSKEESRRESVAEKSplASKEASRPASVAESVKDEAEKSKEESRRESVAE 3799
Cdd:PTZ00341 1077 EENIEENVEENVEENVEEIE--ENVEENVEENAEENAEENAEENAEEYDDENPEE 1129
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
2028-2433 2.02e-08


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 61.18  E-value: 2.02e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2028 ASEEASRPASVAESVKDEAEKSKE---ESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSplpsk 2104
Cdd:NF033838   37 AEEVRGGNNPTVTSSGNESQKEHAkevESHLEKILSEIQKSLDKRKHTQNVALNKKLSDIKTEYLYELNVLKEKS----- 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2105 EASRPASVAESIKDEAEKSKEESRResvaeksplPSKEASRPASVAESIKDEAEKSKEESRR----------ESVAEKSP 2174
Cdd:NF033838  112 EAELTSKTKKELDAAFEQFKKDTLE---------PGKKVAEATKKVEEAEKKAKDQKEEDRRnyptntyktlELEIAESD 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2175 LPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----ESVAEKSPL 2249
Cdd:NF033838  183 VEVKKAE-----LELVKEEAKEPRDEEKIKQAKAKVESKKAEATR----LEKIKTDREKAEEEAKRradakLKEAVEKNV 253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2250 PSKEASRP------ASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEETRR---- 2314
Cdd:NF033838  254 ATSEQDKPkrrakrGVLGEPATPDKKENDAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnypt 333
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2315 ------ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESAAEKSPLPSKEASRpasvAESVKDEADKSKEESRR 2388
Cdd:NF033838  334 ntyktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKR 404
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 2389 ESmAESGKAQSIKGDQ-------SPLKEVSRPESVAESVK----DDPVKSKEPSRR 2433
Cdd:NF033838  405 KA-AEEDKVKEKPAEQpqpapapQPEKPAPKPEKPAEQPKaekpADQQAEEDYARR 459
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
2017-2245 2.19e-08


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 61.34  E-value: 2.19e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2017 AGSIKDEKSPLASEEASRPASVAESVKDEAEKSKEESRRESVAEKSplpskEASRPASVAESIKDEAEKSKEESRRESVA 2096
Cdd:PTZ00341  914 SGNIAHEINLINKELKNQNENVPEHLKEHAEANIEEDAEENVEEDA-----EENVEENVEENVEENVEENVEENVEENVE 988
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2097 EksplpSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASV-----AESIKDEAEKSKEESRRESVAE 2171
Cdd:PTZ00341  989 E-----NVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEydeenVEEIEENAEENVEENIEENIEE 1063
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 161077523 2172 KSPLPSKEasrpasVAESIKDEAEKSKEESRRESVAEKSplPSKEASRPASVAESIKDEAEKSKEESRRESVAE 2245
Cdd:PTZ00341 1064 YDEENVEE------IEENIEENIEENVEENVEENVEEIE--ENVEENVEENAEENAEENAEENAEEYDDENPEE 1129
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
3638-4024 3.85e-08


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 60.03  E-value: 3.85e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3638 EKSKEESRRESVAEKSPlASKEASRPTSVAESVKDEAEKSKEESSRDSVAEKSP----------------LASKEASRPA 3701
Cdd:NF033838  109 EKSEAELTSKTKKELDA-AFEQFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEdrrnyptntyktleleIAESDVEVKK 187
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3702 SVAESVQDEAEKSKEESRRESVAEKSPLASKEASRpasvAESVKDDAEKSKEESRRESVAEKSPLASKEASrpasvaesv 3781
Cdd:NF033838  188 AELELVKEEAKEPRDEEKIKQAKAKVESKKAEATR----LEKIKTDREKAEEEAKRRADAKLKEAVEKNVA--------- 254
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3782 KDEAEKSKEESRRESVAEKSPlPSKEASRPTSVAESVKDEAEKSKEESRRESVAEksslASKKAsrpasvaESVKDEAEK 3861
Cdd:NF033838  255 TSEQDKPKRRAKRGVLGEPAT-PDKKENDAKSSDSSVGEETLPSPSLKPEKKVAE----AEKKV-------EEAKKKAKD 322
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3862 SKEESRR----------ESVAEKSPLASKEASrpasvAESVKDEAEKSKEESRRESVAEKSPLPSKEASRptsvAESVKD 3931
Cdd:NF033838  323 QKEEDRRnyptntyktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKT 393
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3932 EADKSKEESRRESGAEksplasmeasrptsvaESVKDETEKSKEesrresvteKSPLPSKEASRPTSVAESVKDEAEKSK 4011
Cdd:NF033838  394 DRKKAEEEAKRKAAEE----------------DKVKEKPAEQPQ---------PAPAPQPEKPAPKPEKPAEQPKAEKPA 448
                         410
                  ....*....|...
gi 161077523 4012 EESRRESVAEKSP 4024
Cdd:NF033838  449 DQQAEEDYARRSE 461
PTZ00121 PTZ00121
MAEBL; Provisional
3819-4495 4.95e-08


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 60.15  E-value: 4.95e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3819 KDEAEKSKEESRRESVAEKSSLASKKASRPaSVAESVKDEAEKSKEESRRESVAEKSPLASKEASRPASvAESVKDEAEK 3898
Cdd:PTZ00121 1045 KDIIDEDIDGNHEGKAEAKAHVGQDEGLKP-SYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAE-EARKAEEAKK 1122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3899 SKEESRRESVAEKSplpsKEASRptsVAESVKDEADKSKEESRRESGAEKsplasmeasrptsVAESVKDETEKSKEESR 3978
Cdd:PTZ00121 1123 KAEDARKAEEARKA----EDARK---AEEARKAEDAKRVEIARKAEDARK-------------AEEARKAEDAKKAEAAR 1182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3979 R-ESVTEKSPLPSKEASRptSVAESVKDEAEKSKEESRRESVAEKSPLASKessrpasVAESIKDEAEGTKQESRRESMP 4057
Cdd:PTZ00121 1183 KaEEVRKAEELRKAEDAR--KAEAARKAEEERKAEEARKAEDAKKAEAVKK-------AEEAKKDAEEAKKAEEERNNEE 1253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4058 ESGKAESIKGDQSSLASKETSRPDSVVESVKDETEKPEGSAIDKSQVASRPESVAVSAKDEKSP--LHSRPESVADKSPD 4135
Cdd:PTZ00121 1254 IRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKAdeAKKKAEEAKKKADA 1333
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4136 ASKEASRSLSVAETASSPIEEGPRSIAdlslplnlTGEAKGKLPTLSSPIDVAEGDFLEVKAESSPRPAVLSKPAEfsqp 4215
Cdd:PTZ00121 1334 AKKKAEEAKKAAEAAKAEAEAAADEAE--------AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE---- 1401
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4216 dtghtastpvdEASPVLEEIEVVEQHTTSGVGATGATAETDLLDLTETKSETVTKQSEttlfetLTSKVESKVEVlESSV 4295
Cdd:PTZ00121 1402 -----------EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADE------AKKKAEEAKKA-EEAK 1463
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4296 KQVEEKVQTSVKQAETTVTDSLEQLTKKSSEqlteiksvldtnfeevAKIVADVAKVLKSDKDITDIIPDFDERQLEEKL 4375
Cdd:PTZ00121 1464 KKAEEAKKADEAKKKAEEAKKADEAKKKAEE----------------AKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4376 KSTADTEEESDKSTRDEKSLEISVKVEIESEKSSPDQKSGPISIEEKDKIEQSEKAQLRQGILTSSRPESVASQPESVPS 4455
Cdd:PTZ00121 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKM 1607
                         650       660       670       680
                  ....*....|....*....|....*....|....*....|
gi 161077523 4456 PSQSAASHEHKEVELSESHKAEKSSRPESVASQVSEKDMK 4495
Cdd:PTZ00121 1608 KAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKK 1647
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
1917-2248 5.79e-08


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 59.64  E-value: 5.79e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1917 KVECLKDESEVLKGStrrESVAESDKSSQPF-KETSRPESAVGSMK---DESMSKEPSRRESVKDGAAQSRETSRPASVA 1992
Cdd:NF033838  103 ELNVLKEKSEAELTS---KTKKELDAAFEQFkKDTLEPGKKVAEATkkvEEAEKKAKDQKEEDRRNYPTNTYKTLELEIA 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1993 ESAKDGADDLKELSRpESTTQSKEAGSIKDEKSPLASEEASrpASVAESVKDEAEKSKEESRR-----ESVAEKSPLPSK 2067
Cdd:NF033838  180 ESDVEVKKAELELVK-EEAKEPRDEEKIKQAKAKVESKKAE--ATRLEKIKTDREKAEEEAKRradakLKEAVEKNVATS 256
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2068 EASRP------ASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEESRR------- 2129
Cdd:NF033838  257 EQDKPkrrakrGVLGEPATPDKKENDAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnyptnty 336
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2130 ---ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR--- 2203
Cdd:NF033838  337 ktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRkaa 407
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 161077523 2204 --ESVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSP 2248
Cdd:NF033838  408 eeDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
3713-3947 6.23e-08


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 59.80  E-value: 6.23e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3713 KSKEESRRESVAEKSplaskEASRPASVAESVKDDAEKSKEESRRESVAEksplaskeasrpaSVAESVKDEAEKSKEES 3792
Cdd:PTZ00341  929 KNQNENVPEHLKEHA-----EANIEEDAEENVEEDAEENVEENVEENVEE-------------NVEENVEENVEENVEEN 990
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3793 RRESVAEksplpSKEASRPTSVAESVKDEAEKSKEESRRESVAEKSSLASKKASRPasvAESVKDEAEKSKEESRRESVA 3872
Cdd:PTZ00341  991 VEENVEE-----NVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEEN---VEEIEENAEENVEENIEENIE 1062
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 161077523 3873 EKSPLASKEasrpasVAESVKDEAEKSKEESRRESVAEksplpsKEASRPTSVAESVKDEADKSKEESRRESGAE 3947
Cdd:PTZ00341 1063 EYDEENVEE------IEENIEENIEENVEENVEENVEE------IEENVEENVEENAEENAEENAEENAEEYDDE 1125
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
3459-3923 8.74e-08


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 59.32  E-value: 8.74e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3459 SRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESV 3538
Cdd:PTZ00449  477 SKIQFTQEIKKLIKKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGE 556
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3539 AEKSPLPSKEaSRPTSVAESVKDEAEKSKEESRRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRRESvAEKSPL 3618
Cdd:PTZ00449  557 VGKKPGPAKE-HKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPE-SPKSPK 634
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3619 ASKEASRPASvaesikdeaekskeESRRESvaEKSPLASKEASRPtsvaesvKDEAEKSKEESSRDSVAEKSPlASKEAS 3698
Cdd:PTZ00449  635 RPPPPQRPSS--------------PERPEG--PKIIKSPKPPKSP-------KPPFDPKFKEKFYDDYLDAAA-KSKETK 690
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3699 RPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKSPLASKEASR----P 3774
Cdd:PTZ00449  691 TTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPAdtplP 770
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3775 ASVAESVKDEAEKSKEESRREsvAEKSP-LPSKEASRPTSVAESVKDEAEKSkeESRRESVAEKSSLASKKASRPASVAE 3853
Cdd:PTZ00449  771 DILAEEFKEEDIHAETGEPDE--AMKRPdSPSEHEDKPPGDHPSLPKKRHRL--DGLALSTTDLESDAGRIAKDASGKIV 846
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3854 SVK-----------DEAEKSKEESRRESVAEKSPLASKEASRPASvaesvkdeaEKSKEESRRESvAEKSPLPSKEASRP 3922
Cdd:PTZ00449  847 KLKrsksfddlttvEEAEEMGAEARKIVVDDDGTEADDEDTHPPE---------EKHKSEVRRRR-PPKKPSKPKKPSKP 916

                  .
gi 161077523 3923 T 3923
Cdd:PTZ00449  917 K 917
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
2053-2498 9.83e-08


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 58.93  E-value: 9.83e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2053 SRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESV 2132
Cdd:PTZ00449  477 SKIQFTQEIKKLIKKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGE 556
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2133 AEKSPLPSKEaSRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESvAEKSPL 2212
Cdd:PTZ00449  557 VGKKPGPAKE-HKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPE-SPKSPK 634
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2213 PSKEASRPASVAesiKDEAEKSKEESRresvAEKSPLPSKEASRPASVAESIKDEAEKskeesrresvaeksplpSKEAS 2292
Cdd:PTZ00449  635 RPPPPQRPSSPE---RPEGPKIIKSPK----PPKSPKPPFDPKFKEKFYDDYLDAAAK-----------------SKETK 690
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2293 RPASVAESIKDEAEKSKEETRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESAAEKSPLPSKEASR----P 2368
Cdd:PTZ00449  691 TTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPAdtplP 770
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2369 ASVAESVKDE----ADKSKEESRRESMAESGKAQSIKGDQSPL-KEVSRPESVAESVKDdpvKSKEPSR--RESVAGSVT 2441
Cdd:PTZ00449  771 DILAEEFKEEdihaETGEPDEAMKRPDSPSEHEDKPPGDHPSLpKKRHRLDGLALSTTD---LESDAGRiaKDASGKIVK 847
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 161077523 2442 ADSAR--DDQSPLESKGASRPES---VVDSVKDEAEKQES------------RRESKTESVIPPKAKDDKSPKE 2498
Cdd:PTZ00449  848 LKRSKsfDDLTTVEEAEEMGAEArkiVVDDDGTEADDEDThppeekhksevrRRRPPKKPSKPKKPSKPKKPKK 921
growth_prot_Scy NF041483
polarized growth protein Scy;
3612-4135 4.71e-07


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 56.76  E-value: 4.71e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3612 VAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDEAEK--------------- 3676
Cdd:NF041483  147 VNENVAWAEQLRARTESQARRLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAilrrarkdaerllna 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3677 ---------SKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRREsvAEKSPLASKE-ASRPASVAESVKD 3746
Cdd:NF041483  227 astqaqeatDHAEQLRSSTAAESDQARRQAAELSRAAEQRMQEAEEALREARAE--AEKVVAEAKEaAAKQLASAESANE 304
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3747 DAEKSKEESRRESVAEksplASKEasrpasvAESVKDEAEKSKEESRREsvAEKSPLPSKEASRPTSVAESVkdeAEKSK 3826
Cdd:NF041483  305 QRTRTAKEEIARLVGE----ATKE-------AEALKAEAEQALADARAE--AEKLVAEAAEKARTVAAEDTA---AQLAK 368
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3827 EESRRESVAEKSSLASKKASRPASV-AESVKDEAEKSKEESRRES--VAEKSPLASK---------------EASRPASV 3888
Cdd:NF041483  369 AARTAEEVLTKASEDAKATTRAAAEeAERIRREAEAEADRLRGEAadQAEQLKGAAKddtkeyraktvelqeEARRLRGE 448
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3889 AESVKDEA----EKSKEESRRESVAEksplpSKEASRptsVAESVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAE 3964
Cdd:NF041483  449 AEQLRAEAvaegERIRGEARREAVQQ-----IEEAAR---TAEELLTKAKADADELRSTATAESERVRTEAIERATTLRR 520
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3965 SVKDETEKSKEESRRESvTEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRESVAEKSPLASKESSRPASVAESIKDeA 4044
Cdd:NF041483  521 QAEETLERTRAEAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERLTAAEEALAD-A 598
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4045 EGTKQESRRESMPESgkaESIKGDQSSLASKETSRPDSVVESVKDEtekpegSAIDKSQVASRPESVAVSAKDEKSPLHS 4124
Cdd:NF041483  599 RAEAERIRREAAEET---ERLRTEAAERIRTLQAQAEQEAERLRTE------AAADASAARAEGENVAVRLRSEAAAEAE 669
                         570
                  ....*....|.
gi 161077523 4125 RPESVADKSPD 4135
Cdd:NF041483  670 RLKSEAQESAD 680
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
3813-4021 5.80e-07


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 56.33  E-value: 5.80e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3813 SVAESVKDEAEKSKEESRRESVAEKSSlaskkasrpASVAESVKDEAEKSKEESRRESVAEksplaSKEASRPASVAESV 3892
Cdd:PTZ00341  934 NVPEHLKEHAEANIEEDAEENVEEDAE---------ENVEENVEENVEENVEENVEENVEE-----NVEENVEENVEENV 999
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3893 KDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEADKSKEESRRESGAEKSPLASMEASRPTSVaESVKDETEK 3972
Cdd:PTZ00341 1000 EENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENV-EEIEENIEE 1078
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 161077523 3973 SKEESRRESVTEKSPLPSKeasrptSVAESVKDEAEKSKEESRRESVAE 4021
Cdd:PTZ00341 1079 NIEENVEENVEENVEEIEE------NVEENVEENAEENAEENAEENAEE 1121
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1909-2211 1.26e-06

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 55.02  E-value: 1.26e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1909 KEPSRPESKVEclKDESEVLKGSTRRESVAESDKSSQPFK-----ETSRPESAVGSMKDEsmskepsrRESVKDGAAQSR 1983
Cdd:NF033838  132 KDTLEPGKKVA--EATKKVEEAEKKAKDQKEEDRRNYPTNtyktlELEIAESDVEVKKAE--------LELVKEEAKEPR 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1984 ETSRPASV---AESAKDGADDLKELS--RPESTTQSKEAGSIKDEKS---PLASEEASRP------ASVAESVKDEAEKS 2049
Cdd:NF033838  202 DEEKIKQAkakVESKKAEATRLEKIKtdREKAEEEAKRRADAKLKEAvekNVATSEQDKPkrrakrGVLGEPATPDKKEN 281
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2050 KEESRRESVAEKS-PLPSKEASRPASVAESIKDEAEKS----KEESRR----------ESVAEKSPLPSKEASrpasvAE 2114
Cdd:NF033838  282 DAKSSDSSVGEETlPSPSLKPEKKVAEAEKKVEEAKKKakdqKEEDRRnyptntyktlELEIAESDVKVKEAE-----LE 356
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2115 SIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----ESVAEK-------SPLPSKEASR 2182
Cdd:NF033838  357 LVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRkaaeeDKVKEKpaeqpqpAPAPQPEKPA 432
                         330       340
                  ....*....|....*....|....*....
gi 161077523 2183 PASVAESIKDEAEKSKEESRRESVAEKSP 2211
Cdd:NF033838  433 PKPEKPAEQPKAEKPADQQAEEDYARRSE 461
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
3029-3390 1.71e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 54.92  E-value: 1.71e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3029 EASRRESVVESSKDDAEKSESRPESVIASGEPVPRESKSPLDSKDTSRPGSMVESVTAEDEKSEQQSRRESVAESvKADT 3108
Cdd:NF033609  550 EPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDS-ASDS 628
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3109 KKDGKSQEASRPSSVDELLKDDDEKQESRRQSITGSHKAMSTMGDeSPMDKADKSKEPSRPESVAESikhENTKDEESPL 3188
Cdd:NF033609  629 DSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDS---DSDSDSDSDS 704
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3189 GSRRDSVAESiKSDITKGEKSPLPSKEVSRPESVVGSIKDEKAESRRESVAESvkpESSKDATSAPPSKEHSRPESVLGS 3268
Cdd:NF033609  705 DSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS---DSDSDSDSDSDSDSDSDSDSDSDS 780
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3269 LKDEGDKTTSRRVSVADSIKDEKSLLVSQEASRPESEAESLKDAAAPSQETSRPESVTESVKDGKSPVASKEASRPASVA 3348
Cdd:NF033609  781 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDS 860
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*....
gi 161077523 3349 ENAKDSADESKEQRPESLPQ----SKAGSIKDEKSPL---ASKDEAEKS 3390
Cdd:NF033609  861 NSDSESGSNNNVVPPNSPKNgtnaSNKNEAKDSKEPLpdtGSEDEANTS 909
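
The ClfA description above names "a long run of Ser-Asp dipeptide repeats" as a feature of MSCRAMM adhesins, and the NF033609 rows of this alignment are dominated by such a run. As a minimal sketch (not part of the CD-Search output), the following Python function locates the longest uninterrupted run of SD dipeptides in an ungapped protein string. The function name `longest_sd_run` and the toy sequence are hypothetical, chosen only for illustration.

```python
import re
from typing import Tuple


def longest_sd_run(seq: str) -> Tuple[int, int]:
    """Return (start, repeats) for the longest run of consecutive 'SD' dipeptides.

    `start` is the 0-based index of the run in `seq`; `repeats` is the number
    of SD units in it. Returns (-1, 0) when the sequence contains no 'SD'.
    Assumes an ungapped one-letter amino-acid string (no '-' alignment gaps).
    """
    best = (-1, 0)
    # (?:SD)+ matches one or more back-to-back Ser-Asp dipeptides.
    for match in re.finditer(r"(?:SD)+", seq.upper()):
        repeats = len(match.group(0)) // 2
        if repeats > best[1]:
            best = (match.start(), repeats)
    return best


# Hypothetical toy sequence, not taken from the alignment above.
print(longest_sd_run("AASDSDSDKK"))  # (2, 3)
```

Alignment rows copied from this page contain gap characters and lowercase residues, so strip the `-` characters before passing a row to the function.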
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
3636-3873 1.84e-06


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 54.79  E-value: 1.84e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3636 EAEKSKEESRRESVAEKSPLASKE-ASRPTSVAESVKDEAEKSKEESSRDSVAEKSPlaskeasrpASVAESVQDEAEKS 3714
Cdd:PTZ00341  904 KAKKKDAKDLSGNIAHEINLINKElKNQNENVPEHLKEHAEANIEEDAEENVEEDAE---------ENVEENVEENVEEN 974
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3715 KEESRRESVAEksplaSKEASRPASVAESVKDDAEKSKEESRRESVAEKSPLASKE-----ASRPASVAESVKDEAEKSK 3789
Cdd:PTZ00341  975 VEENVEENVEE-----NVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEEnveevEENVEEYDEENVEEIEENA 1049
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3790 EESRRESVAEKspLPSKEASRPTSVAESVKDEAEKSKEESRRESVAEKSSlaskkasrpaSVAESVKDEAEKSKEESRRE 3869
Cdd:PTZ00341 1050 EENVEENIEEN--IEEYDEENVEEIEENIEENIEENVEENVEENVEEIEE----------NVEENVEENAEENAEENAEE 1117

                  ....
gi 161077523 3870 SVAE 3873
Cdd:PTZ00341 1118 NAEE 1121
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
3418-3688 3.07e-06


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 54.02  E-value: 3.07e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3418 ESVKDEAEKSKEESPLMSKEASRPASVAGSVKDEAEKSKEESRRESVAEKSPLPSKE-ASRPASVAESVKDEADKSKEES 3496
Cdd:PTZ00341  871 EGLDEKKLKKRAESLKKLANAIEKYAGGGKKDKKAKKKDAKDLSGNIAHEINLINKElKNQNENVPEHLKEHAEANIEED 950
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3497 RRESgAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPlPSKEASRPTSVAESVKDEAEKSKEESRRDSVA 3576
Cdd:PTZ00341  951 AEEN-VEEDAEENVEENVEENVEENVEENVEENVEENVEENVEENVE-ENVEENIEENVEENVEENIEENVEEYDEENVE 1028
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3577 EkspLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEasrpasVAESIKDEAEKSKEESRRESVAEKSplA 3656
Cdd:PTZ00341 1029 E---VEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENVEE------IEENIEENIEENVEENVEENVEEIE--E 1097
                         250       260       270
                  ....*....|....*....|....*....|..
gi 161077523 3657 SKEASRPTSVAESVKDEAEKSKEESSRDSVAE 3688
Cdd:PTZ00341 1098 NVEENVEENAEENAEENAEENAEEYDDENPEE 1129
growth_prot_Scy NF041483
polarized growth protein Scy;
1943-2400 4.60e-06


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 53.68  E-value: 4.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1943 SSQPFKETSRPESAVGSMKDESmskEPSRRESVKDGAAQSRETSRPASVAESAKDGADDLKELSRPESTTQSKEAGSIKD 2022
Cdd:NF041483  228 STQAQEATDHAEQLRSSTAAES---DQARRQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAKEAAAKQLASAESANE 304
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2023 EKSPLASEEASR----PASVAESVKDEAEKSKEESRREsvAEKSPLPSKEASRPASVAESikdEAEKSKEESRRESVAEK 2098
Cdd:NF041483  305 QRTRTAKEEIARlvgeATKEAEALKAEAEQALADARAE--AEKLVAEAAEKARTVAAEDT---AAQLAKAARTAEEVLTK 379
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2099 SPLPSKEASRPASV-AESIKDEAEKSKEESRRESvaeksplpskeasrpASVAESIKDEAEKSKEESRRESVAEKsplps 2177
Cdd:NF041483  380 ASEDAKATTRAAAEeAERIRREAEAEADRLRGEA---------------ADQAEQLKGAAKDDTKEYRAKTVELQ----- 439
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2178 KEASRPASVAESIKDEA----EKSKEESRRESVaeksplpsKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKE 2253
Cdd:NF041483  440 EEARRLRGEAEQLRAEAvaegERIRGEARREAV--------QQIEEAARTAEELLTKAKADADELRSTATAESERVRTEA 511
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2254 ASRPASVAESIKDEAEKSKEESRRESvAEKSPLPSKEASRPASVAESIKDEAEKSKEETRRESVAEKSPLPSKEASRPAS 2333
Cdd:NF041483  512 IERATTLRRQAEETLERTRAEAERLR-AEAEEQAEEVRAAAERAARELREETERAIAARQAEAAEELTRLHTEAEERLTA 590
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 2334 VAESIKDeAEKSKEESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGKAQSI 2400
Cdd:NF041483  591 AEEALAD-ARAEAERIRREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEAAADASAARAEGENV 656
rad2 TIGR00600
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
3893-4433 4.63e-06

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 53.36  E-value: 4.63e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3893 KDEAEKSKE-ESRRESVAEKS-PLPSKEASRPTSVAESVKDEADkskeesrreSGAEKSPLASMEASRPTSVAESVKDET 3970
Cdd:TIGR00600  269 RDEGGFLKEvELRRVVSEDTShYILIKGIQGKTAVKAVDSDDES---------LPSLSSQLDSNSEDLKSSPWEKLKPES 339
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3971 EKSkeeSRRESVTEKSpLPSKEASRPTSVAESvKDEAEKSKEESRRESVAEKSPLASKESSRPAsvAESIKDEAEGTKQE 4050
Cdd:TIGR00600  340 ESI---VEAEPPSPRT-LLAKQAAMSESSSED-SDESEWERQELKRNNVAFVDDGSLSPRTLQA--IGQALDDDEDKKVS 412
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  4051 SRRESMPESGKAESIkgdqssLASKETSRPDSVVESVKDETEKPEGSAIDKSQVASRPESVAVSAKdeksplhsrPESVA 4130
Cdd:TIGR00600  413 ASSDDQASPSKKTKM------LLISRIEVEDDDLDYLDQGEGIPLMAALQLSSVNSKPEAVASTKI---------AREVT 477
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  4131 DKSPDASKEASRSLSVAETASSPIeegPRSIADLSLPLNLTGEAKGKLPTLSSPIDVAEGDFLEVKAESSPRPAVLSKPA 4210
Cdd:TIGR00600  478 SSGHEAVPKAVQSLLLGATNDSPI---PSEFTILDRKSELSIERTVKPVSSEFGLPSQREDKLAIPTEGTQNLQGISDHP 554
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  4211 EFSQPDTGHTASTPVDEASPVLEE-----IEVVEQHTTSGVgatgaTAETDLLDLTETksetvtkqsettlfetltskve 4285
Cdd:TIGR00600  555 EQFEFQNELSPLETKNNESNLSSDaetegSPNPEMPSWSSV-----TVPSEALDNYET---------------------- 607
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  4286 skveVLESSVKQVEEKVQTSVKQAETTVTDSLEQLTKKSSEQLTEI---KSVLDTNFEEVAKIVADVAKVLKSDKDITDI 4362
Cdd:TIGR00600  608 ----TNPSNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESekeESESDGSFIEVDSVSSTLELQVPSKSQPTDE 683
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 161077523  4363 IPDFDERQLEEKLKSTADTEEESDKSTRDEKSLEISVKVEIESEKSSPDQKSgpISIEEKDKIEQSEKAQL 4433
Cdd:TIGR00600  684 SEENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQD--ISLEELEALEANLLAEQ 752
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
2024-2435 6.80e-06


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 52.77  E-value: 6.80e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2024 KSPLASEEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEaSRPASVAESIKDEAEKSKEESRRESVAEKSPLPS 2103
Cdd:PTZ00449  522 KAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKE-HKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRP 600
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2104 KEASRPASVAESIKDEAEKSKEESRRESvAEKSPLPSKEASRPASVAesiKDEAEKSKEESRresvAEKSPLPSKEASRP 2183
Cdd:PTZ00449  601 RSAQRPTRPKSPKLPELLDIPKSPKRPE-SPKSPKRPPPPQRPSSPE---RPEGPKIIKSPK----PPKSPKPPFDPKFK 672
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2184 ASVAESIKDEAEKSKE--------ESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEkSPLPSKEAS 2255
Cdd:PTZ00449  673 EKFYDDYLDAAAKSKEtkttvvldESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAE-QPDDIEFFT 751
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2256 RPASVAESIKDEAEKSK------EESRRESVAEKSPLPSKEASRPASVAE----SIKDEAEKSKEETRRESVAEKSPLPS 2325
Cdd:PTZ00449  752 PPEEERTFFHETPADTPlpdilaEEFKEEDIHAETGEPDEAMKRPDSPSEhedkPPGDHPSLPKKRHRLDGLALSTTDLE 831
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2326 KEASRPAsvaesiKDEAEKSKEESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEEsrrESMAESGKAQSIKGDQS 2405
Cdd:PTZ00449  832 SDAGRIA------KDASGKIVKLKRSKSFDDLTTVEEAEEMGAEARKIVVDDDGTEADDE---DTHPPEEKHKSEVRRRR 902
                         410       420       430
                  ....*....|....*....|....*....|
gi 161077523 2406 PLKEVSRPESvaesvkddPVKSKEPSRRES 2435
Cdd:PTZ00449  903 PPKKPSKPKK--------PSKPKKPKKPDS 924
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1374-1695 1.67e-05


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 51.59  E-value: 1.67e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1374 AEKVVVIETTVEKKQEEI--VEATTVITQENqEDLMEQVKDKEEHEQKIESGIITEKEAKKSAStpeeketsditsddel 1451
Cdd:PTZ00108 1101 KEKVEKLNAELEKKEKELekLKNTTPKDMWL-EDLDKFEEALEEQEEVEEKEIAKEQRLKSKTK---------------- 1163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1452 paQLADPTTVPPKSAKDREDTGSiespptieeaiEVEVQAKQEAQKPVPAPEEAIKTEKSPLASKETSRPESATGSVKED 1531
Cdd:PTZ00108 1164 --GKASKLRKPKLKKKEKKKKKS-----------SADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEE 1230
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1532 TEQTKSKKSPVPSRPESEAKDKKSPFASGEASRPESVAESVKDEAGKAESRRESiakTHKDESSLDKAKEQESRRESlaE 1611
Cdd:PTZ00108 1231 QKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYS---PPPPSKRPDGESNGGSKPSS--P 1305
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1612 SIKPESGIDEKSALASKEASRPESVTDKSKEPSRREsiAESLKAESTKDEKSAPPSKEASRPGSVVESVKDETEKSKEPS 1691
Cdd:PTZ00108 1306 TKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRV--KQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDED 1383

                  ....
gi 161077523 1692 RRES 1695
Cdd:PTZ00108 1384 DEDD 1387
PTZ00121 PTZ00121
MAEBL; Provisional
950-1858 2.06e-05


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 51.68  E-value: 2.06e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  950 KKLQDLT-ASQELDAEKQRELDDLKEEQEVVREIEAVFSRDEMKRQQHQQIKAELREMpaEGTGDGENEPDEEEEYLIIE 1028
Cdd:PTZ00121 1027 EKIEELTeYGNNDDVLKEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAK--EDNRADEATEEAFGKAEEAK 1104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1029 KEEVEQYTEDSivEQESSMTKEEEIQKHQRDSQESEKKR----KKSAEEEIEAAIAKVEAAERkarleGASARQDESELD 1104
Cdd:PTZ00121 1105 KTETGKAEEAR--KAEEAKKKAEDARKAEEARKAEDARKaeeaRKAEDAKRVEIARKAEDARK-----AEEARKAEDAKK 1177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1105 VEPEQSKIKAEVQDIIATAKDIAK---SRTEEQLAKPAEEELSSPTPEEKLSKKTSDTKDDQIGAPVDVLPVNLQESLPE 1181
Cdd:PTZ00121 1178 AEAARKAEEVRKAEELRKAEDARKaeaARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKF 1257
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1182 EKfsATIESGATTAPTLPEDERIPLDQIKEdlvIEEKYVKEETKEAEaivvaTVQTLPEAAPLAIDTILASATKDAPKDA 1261
Cdd:PTZ00121 1258 EE--ARMAHFARRQAAIKAEEARKADELKK---AEEKKKADEAKKAE-----EKKKADEAKKKAEEAKKADEAKKKAEEA 1327
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1262 NAEAlGELPDSGERVLPMKMTFEAQQNLLRDVIKTPDEVAdlpvhEEADLGLYEKDSQDAGAKSISHKEESAKEEKETDD 1341
Cdd:PTZ00121 1328 KKKA-DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA-----EAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE 1401
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1342 EKENKVGEIELGDEPNKVDISHVLLKESVQEVAEKVVVIETT-----VEKKQEEIVEATTVITQENQEDLMEQVKDKEEH 1416
Cdd:PTZ00121 1402 EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAkkadeAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1417 EQKIESGIITEKEAKKSASTPEEKETSDITSDDELPAQladpttvppkSAKDREDTGSIESPPTIEEAIEVEVQAKQEAQ 1496
Cdd:PTZ00121 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAE----------EAKKADEAKKAEEAKKADEAKKAEEKKKADEL 1551
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1497 KPVPAPEEAIKTEKSPLASKETSRPESATGSVKEDTEQTKSKKSPVPSRPESEAKDKkspfasGEASRPESVAESVKDEA 1576
Cdd:PTZ00121 1552 KKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK------AEEAKKAEEAKIKAEEL 1625
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1577 GKAESRResiakthKDESSLDKAKEQESRReslAESIKPEsgiDEKSALASKEASRPESVTDKSKEPSRRESIAESLKAE 1656
Cdd:PTZ00121 1626 KKAEEEK-------KKVEQLKKKEAEEKKK---AEELKKA---EEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1657 STKDEKSAPPSKEASRPGSVVESVKDETEKSKEPSRRESIAESAKPPIEFREVSRPESVIDGIKDESAKPESRRDSPLAS 1736
Cdd:PTZ00121 1693 ALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEE 1772
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1737 KEASRPESVLESVKDEPIK---STEKSRRESVAESFKADSTKDEKSPLTSKDISRPESAVENVMDAVGSaERSQPESVTA 1813
Cdd:PTZ00121 1773 IRKEKEAVIEEELDEEDEKrrmEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNM-QLEEADAFEK 1851
                         890       900       910       920       930
                  ....*....|....*....|....*....|....*....|....*....|
gi 161077523 1814 SRDVSRPESVAESEKD-----DTDKPESVVESVIPASDVVEIEKGAADKE 1858
Cdd:PTZ00121 1852 HKFNKNNENGEDGNKEadfnkEKDLKEDDEEEIEEADEIEKIDKDDIERE 1901
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
3298-3580 2.14e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 51.17  E-value: 2.14e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3298 EASRPESEAESLKDAAAPSQETSRPESVTESVKDGKSPVASK--EASRPASVAENAKDSADESKEQRPESLpqskagsiK 3375
Cdd:NF033838  175 ELEIAESDVEVKKAELELVKEEAKEPRDEEKIKQAKAKVESKkaEATRLEKIKTDREKAEEEAKRRADAKL--------K 246
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3376 DEKSPLASKDEAEKSKEESRRESVAEQFPLVSKEvSRPASVAESVKDEAEKSKEESPlmSKEASRPASVAGSVKDEAEKS 3455
Cdd:NF033838  247 EAVEKNVATSEQDKPKRRAKRGVLGEPATPDKKE-NDAKSSDSSVGEETLPSPSLKP--EKKVAEAEKKVEEAKKKAKDQ 323
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3456 KEESRR----------ESVAEKSPLPSKEASrpasvAESVKDEADKSKEESRRESGAEKSPLASKEASRpasvAESIKDE 3525
Cdd:NF033838  324 KEEDRRnyptntyktlELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTD 394
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 3526 AEKSKEESRR-----ESVAEK-------SPLPSKEASRPTSVAESVKDEAEKSKEESRRDSVAEKSP 3580
Cdd:NF033838  395 RKKAEEEAKRkaaeeDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
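Each hit in this listing carries a one-line statistics record (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal sketch of reading such a line into typed fields, assuming the layout seen on this page; the function name `parse_hit_stats` and the dictionary keys are illustrative and not part of any NCBI tool.

```python
import re
from typing import Dict, Union

# Pattern for the statistics line as it appears on this page, e.g.
# "Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 51.17  E-value: 2.14e-05"
# The bracketed tag is treated as optional (an assumption).
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_stats(line: str) -> Dict[str, Union[int, float, bool]]:
    """Parse one statistics line into a dict (hypothetical helper)."""
    m = _STATS_RE.search(line)
    if m is None:
        raise ValueError("not a recognised statistics line: %r" % line)
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("kind") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    example = ("Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  "
               "Bit Score: 51.17  E-value: 2.14e-05")
    print(parse_hit_stats(example))
```

Parsing the E-value as a float makes it straightforward to filter or sort hits by significance, for instance keeping only those below a chosen threshold.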
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3406-3575 2.60e-05

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 50.36  E-value: 2.60e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3406 VSKEVSRPASVAESVKDEAEKSKEESPLMSKEASRPASVAGSVKDEAEKSKEESRRESVAEKSPLPSKEASRPASvaesv 3485
Cdd:PRK13108  291 VVDEALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEE----- 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3486 KDEADKSKEESrRESGAEKSPLASKEASRPASVAESIKDEAEKSKEESRREsVAEKSPlPSKEASRPT-SVAESVKDEAE 3564
Cdd:PRK13108  366 TSEADIEREQP-GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE-VPEKAA-PIPDPAKPDeLAVAGPGDDPA 442
                         170
                  ....*....|.
gi 161077523 3565 KSKEESRRDSV 3575
Cdd:PRK13108  443 EPDGIRRQDDF 453
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
3466-3827 2.87e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
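
The two sequence features named in this description, a long run of Ser-Asp dipeptide repeats and an LPXTG motif, can be located with simple pattern matching. The sketch below is illustrative only: the function names and the toy sequence are hypothetical, and the regular expressions are a plain reading of "SD repeated" and "L-P-any-T-G", not the model the database uses to score hits.

```python
import re
from typing import List, Tuple


def longest_sd_run(seq: str) -> Tuple[int, int]:
    """Return (0-based start, length in residues) of the longest run of
    consecutive 'SD' dipeptides, or (-1, 0) if the sequence has none."""
    best = (-1, 0)
    for m in re.finditer(r"(?:SD)+", seq):
        length = m.end() - m.start()
        if length > best[1]:
            best = (m.start(), length)
    return best


def find_lpxtg(seq: str) -> List[int]:
    """Return 0-based start positions of every LPXTG motif (X = any residue).
    A lookahead is used so that overlapping matches are not skipped."""
    return [m.start() for m in re.finditer(r"(?=LP.TG)", seq)]


if __name__ == "__main__":
    # Toy sequence invented for illustration; it is not taken from ClfA.
    toy = "MKKSDSDSDSDAASDSDLPETGEE"
    print(longest_sd_run(toy))  # start and length of the longest SD run
    print(find_lpxtg(toy))      # start positions of LPXTG motifs
```

On a real protein, the same functions would be applied to the ungapped sequence; alignment rows such as those shown here contain gap characters (`-`) that should be removed first.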


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.68  E-value: 2.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3466 EKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEkSPLP 3545
Cdd:NF033609  553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD-SDSA 631
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3546 SKEASRPTSVAESVKDEAEKSKEESRRDSVAEkSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSplaSKEASR 3625
Cdd:NF033609  632 SDSDSASDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSD 707
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3626 PASVAESIKDEAEKSKEESRRESVAEkSPLASKEASRPTSVAESVKDEAEKSKEESSRDSVAEkSPLASKEASRPASVAE 3705
Cdd:NF033609  708 SDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSD 785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3706 SVQDEAEKSKEESRRESVAEkSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKSplaSKEASRPASVAESVKDEA 3785
Cdd:NF033609  786 SDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSESDSN 861
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 161077523 3786 EKSKEESRRESVAEKSPLPSKEASRptsvaesvKDEAEKSKE 3827
Cdd:NF033609  862 SDSESGSNNNVVPPNSPKNGTNASN--------KNEAKDSKE 895
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1434-1831 2.93e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.94  E-value: 2.93e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1434 ASTPEEKETSDIT-SDDELPAQLADPTTVPPKSAKDREDTGSIESPPTIEEAieVEVQAKQEAQKPVPAPeeaikTEKSP 1512
Cdd:PHA03307   25 PATPGDAADDLLSgSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPG--TEAPANESRSTPTWSL-----STLAP 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1513 LASKETSRPESATGSVKEDTEQTKSKKSPVPSRPESEAKDKKSPFASGEASRPESVAESVK--DEAGKAESRRESIAKTH 1590
Cdd:PHA03307   98 ASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASpaAVASDAASSRQAALPLS 177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1591 KDESSLDKAKEQESRRESLAESIKPESGIDEKSALASKEASRPESVTDKSKEPSRRESIAESLKAESTKDE--------- 1661
Cdd:PHA03307  178 SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGwgpenecpl 257
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1662 KSAPPSKEASRPGSVVESVKDETEKSKEPSRrESIAESAKPPIEFREVSRP-ESVIDGIKDESAKPESRRDSPLASKEAS 1740
Cdd:PHA03307  258 PRPAPITLPTRIWEASGWNGPSSRPGPASSS-SSPRERSPSPSPSSPGSGPaPSSPRASSSSSSSRESSSSSTSSSSESS 336
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1741 RPESV--------LESVKDEPIKSTEKSRRESVAESFKADSTKDEKSPLTSKDISRPESAVENVMDAVGSAERSQPESVT 1812
Cdd:PHA03307  337 RGAAVspgpspsrSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSP 416
                         410
                  ....*....|....*....
gi 161077523 1813 ASRDVsRPESVAESEKDDT 1831
Cdd:PHA03307  417 LDAGA-ASGAFYARYPLLT 434
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
3829-4168 3.08e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 50.84  E-value: 3.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3829 SRRESVAEKSSLASKKASRPASVAESVKDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESV 3908
Cdd:PTZ00449  477 SKIQFTQEIKKLIKKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGE 556
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3909 AEKSPLPSKEaSRPTSVAESVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAESVKDETEK-SKEESRRESVTE-KS 3986
Cdd:PTZ00449  557 VGKKPGPAKE-HKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDiPKSPKRPESPKSpKR 635
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3987 PLPSKEASRPTsvaesvKDEAEKSKEESRresvAEKSPLASKESSRPASVAESIKDEAEGTKQESRRESMPESGKAESIK 4066
Cdd:PTZ00449  636 PPPPQRPSSPE------RPEGPKIIKSPK----PPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKE 705
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4067 GDQSSLASKETSR----------PDSVVESVKDetekPEGSAIDKSQVASRPESVAVSAKdEKSPLHSRPESVAD--KSP 4134
Cdd:PTZ00449  706 TLPETPGTPFTTPrplppklprdEEFPFEPIGD----PDAEQPDDIEFFTPPEEERTFFH-ETPADTPLPDILAEefKEE 780
                         330       340       350
                  ....*....|....*....|....*....|....*
gi 161077523 4135 DASKEASRSLSVAETASSPIEEGPRSIADL-SLPL 4168
Cdd:PTZ00449  781 DIHAETGEPDEAMKRPDSPSEHEDKPPGDHpSLPK 815
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
3302-3714 3.10e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 50.88  E-value: 3.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3302 PESEAE-SLKDAAAPSQETSRPEsvtESVKDGKSPVASKEASRPASVAEnakDSADESKEQRPESLPQSKagsikDEKSP 3380
Cdd:PRK14949  368 VDDPAEiSLPEGQTPSALAAAVQ---APHANEPQFVNAAPAEKKTALTE---QTTAQQQVQAANAEAVAE-----ADASA 436
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3381 LASKDEAEKSKEESRRES--VAEQFPLVSKEVSRPASVAESVKDEAEKSKEESPLMSKEASRPASVAGSVKDE-----AE 3453
Cdd:PRK14949  437 EPADTVEQALDDESELLAalNAEQAVILSQAQSQGFEASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVDDtsasnNS 516
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3454 KSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAEKSKEES 3533
Cdd:PRK14949  517 AADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSPI 596
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3534 RRESVAEKSP-----LPSKEASRPT----SVAESVKDEAEKSKEESRRdsvaEKSPLASK---EASRPASVAESVQDEAE 3601
Cdd:PRK14949  597 SAVTTAAASLadddiLDAVLAARDSllsdLDALSPKEGDGKKSSADRK----PKTPPSRAppaSLSKPASSPDASQTSAS 672
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3602 KSKE-ESRRESVAEKSPLASKEASRPASVAESiKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDEAEKSKEE 3680
Cdd:PRK14949  673 FDLDpDFELATHQSVPEAALASGSAPAPPPVP-DPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQ 751
                         410       420       430
                  ....*....|....*....|....*....|....
gi 161077523 3681 SSRDSVAEKSPLASKEASRPASVAESVQDEAEKS 3714
Cdd:PRK14949  752 ATHQPQVQAEAQSPASTTALTQTSSEVQDTELNL 785
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
2908-3136 3.70e-05

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 50.43  E-value: 3.70e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2908 EKSKDASRPPSVVESTKADSTKGDIS---PSPESVLEGPKDDVEKSKESSRPPSVSASITGDSTKDVSRPASVVESVKDE 2984
Cdd:PTZ00108 1149 EKEIAKEQRLKSKTKGKASKLRKPKLkkkEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDD 1228
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2985 HDKAESRRESIAKVESVIDEAGKSDSKSSSQDSQKDEKSTLASKEASRRESVVESSKDDAEKSESRPESVIASGEPvprE 3064
Cdd:PTZ00108 1229 EEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSS---P 1305
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3065 SKSPLDSKDTSRPGSMVESVTAEDEKSEQQSRRESVAESVKAD---------TKKDGKSQEASRPSSVDELLKDDDEKQE 3135
Cdd:PTZ00108 1306 TKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQssrllrrprKKKSDSSSEDDDDSEVDDSEDEDDEDDE 1385

                  .
gi 161077523 3136 S 3136
Cdd:PTZ00108 1386 D 1386
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C
3523-3800 3.98e-05

NADH-quinone oxidoreductase subunit C


Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 49.98  E-value: 3.98e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3523 KDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDSVAE-KSPLASKEASRPASVAESVQDEAE 3601
Cdd:PRK07735   12 KEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAaKAKAAALAKQKREGTEEVTEEEKA 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3602 KSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPlaSKEASRPTSVAESVKDEAEKSKEES 3681
Cdd:PRK07735   92 KAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAK--QKREGTEEVTEEEEETDKEKAKAKA 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3682 SRDSVAEKSPLASKEASRP-ASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESvkddaekskEESRRESV 3760
Cdd:PRK07735  170 AAAAKAKAAALAKQKAAEAgEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGD---------EDAKAKAI 240
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 161077523 3761 AEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEK 3800
Cdd:PRK07735  241 AAAKAKAAAAARAKTKGAEGKKEEEPKQEEPSVNQPYLNK 280
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1854-2229 4.10e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 4.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1854 AADKEKGVFVSLEIGKPDSPSEVISRPGPVVESVKPESRressteiVLPCHAEDSKEPSRPESKVECLKDESEVLKGSTR 1933
Cdd:PHA03307   47 SAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESR-------STPTWSLSTLAPASPAREGSPTPPGPSSPDPPPP 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1934 RESVAESDKSSQP-FKETSRPESAVGSMKDESMSKEPSRRESVKDGAAQSRETSRPASVAESAKDGADDLKElSRPESTT 2012
Cdd:PHA03307  120 TPPPASPPPSPAPdLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPA-EPPPSTP 198
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2013 QSKEAGSIKDEKSPLASEEASRPASVAESVKDEAEKS-----------KEESRRESVAEKSPLPSKEASRPASVAESIKD 2081
Cdd:PHA03307  199 PAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASssdssssessgCGWGPENECPLPRPAPITLPTRIWEASGWNGP 278
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2082 EAEKSKE-------ESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIK 2154
Cdd:PHA03307  279 SSRPGPAsssssprERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPP 358
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 2155 DEAEKSKeeSRRESVAEKSPLPSKEASRPAS-VAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKD 2229
Cdd:PHA03307  359 PADPSSP--RKRPRPSRAPSSPAASAGRPTRrRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPL 432
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
942-1476 6.70e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 49.55  E-value: 6.70e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  942 SADEATAAKKLQDLTASQELDAEKQRELDDLKEEQEvvrEIEAVFSRDEMKRQQHQQIKAELREMPAEGTGDGENEPDEE 1021
Cdd:COG1196   263 AELEAELEELRLELEELELELEEAQAEEYELLAELA---RLEQDIARLEERRRELEERLEELEEELAELEEELEELEEEL 339
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1022 EEYL--IIEKEEVEQYTEDSIVEQESSMTKEEEIQKHQRDSQESEKKRKKSAEEEIEAAIAKVEAAERkaRLEGASARQD 1099
Cdd:COG1196   340 EELEeeLEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEE--AEEALLERLE 417
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1100 ESELDVEPEQSKIKAEVQDIIATAKDIAKSRTEEQLAKPAEEELSSPTPEEKLSKKTSDTKDDQIGAPVDVLpVNLQESL 1179
Cdd:COG1196   418 RLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEA-AARLLLL 496
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1180 PEEKFSATIESGATTAPTLPEDERIPLDQIKEDLVIEEKYVKEETKEAEAIVVATVQTLPEAAPLAIDTIlasatkdapK 1259
Cdd:COG1196   497 LEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYL---------K 567
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1260 DANAEALGELPDSGERVLPMKMTFEAQQNLLRDViktpDEVADLPVheEADLGLYEKDSQDAGAKSISHKEESAKEEKET 1339
Cdd:COG1196   568 AAKAGRATFLPLDKIRARAALAAALARGAIGAAV----DLVASDLR--EADARYYVLGDTLLGRTLVAARLEAALRRAVT 641
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1340 DDEKENKVGEIELGDEPNKVDISHvllkeSVQEVAEKVVVIETTVEKKQEEIVEATTVITQENQEDLMEQVKDKEEHEQK 1419
Cdd:COG1196   642 LAGRLREVTLEGEGGSAGGSLTGG-----SRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEER 716
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 1420 IESGIITEKEAKKSASTPEEKETSDITSDDELPAQLADPTTVPPKSAKDREDTGSIE 1476
Cdd:COG1196   717 LEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLE 773
PLN03237 PLN03237
DNA topoisomerase 2; Provisional
3210-3607 7.60e-05

DNA topoisomerase 2; Provisional


Pssm-ID: 215641 [Multi-domain]  Cd Length: 1465  Bit Score: 49.48  E-value: 7.60e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3210 PLPSKEVSRPESVVGSIKDEKAESRR-ESVAESVKPESSKDATSAPP--------SKEHSRPESVLGSLKDEGDKTTSRR 3280
Cdd:PLN03237 1073 PFPKKAKSVEAAVAGATDDAAEEEEEiDVSSSSGVRGSDYDYLLSMAigtltlekVQELCADRDKLNIEVEDLKKTTPKS 1152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3281 VSVADSIKDEKSL-LVSQEASRPESEAESLKDAAAPSQETSRPESVTESVKDGKSPVASKEASRPASVAENAKDSADESK 3359
Cdd:PLN03237 1153 LWLKDLDALEKELdKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETE 1232
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3360 EQRPESLPQSKAGSIKdeKSPLASKDEAEKSKEESRRESVAEQfplvskevsRPASVAESVKDEAEKSKEESPLMSKEAS 3439
Cdd:PLN03237 1233 NVAEVVKPKGRAGAKK--KAPAAAKEKEEEDEILDLKDRLAAY---------NLDSAPAQSAKMEETVKAVPARRAAARK 1301
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3440 RPASVAGSVKDEAEKSKEESRRESVAEKSplpSKEASRPASVAesvKDEADKSKEESRRESGAEKSPLASKeasrpasVA 3519
Cdd:PLN03237 1302 KPLASVSVISDSDDDDDDFAVEVSLAERL---KKKGGRKPAAA---NKKAAKPPAAAKKRGPATVQSGQKL-------LT 1368
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3520 ESIKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAesvKDEAEKSKEESRRDSVAEKSPLASKEASRPAS-----VAE 3594
Cdd:PLN03237 1369 EMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAA---TNKETESSENVSGSSSSEKDEIDVSAKPRPQRanrkqTTY 1445
                         410
                  ....*....|...
gi 161077523 3595 SVQDEAEKSKEES 3607
Cdd:PLN03237 1446 VLSDSESESADDS 1458
PHA00430 PHA00430
tail fiber protein
3771-3935 8.66e-05

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 49.12  E-value: 8.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3771 ASRPASVAESVKD-EAEKSKEESRRESVAEKSplpSKEASRPTSVAESVKDEAEKSKEESRRESVAEKSSLASKKASRpa 3849
Cdd:PHA00430  133 GRRIVNLADAVDDgDAVPLGQIKTWNQSAWNA---RNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSN-- 207
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3850 svaesvkDEAEKSKEESrrESVAEKSPLASKEASRPASVAESVKDEAEKSKEESRR----ESVAEKSPLPSK-EASRPTS 3924
Cdd:PHA00430  208 -------SEANRFKGYA--DSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAahasEVNAANSATAAAtSANRAKQ 278
                         170
                  ....*....|.
gi 161077523 3925 VAESVKDEADK 3935
Cdd:PHA00430  279 QADRAKTEADK 289
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3992-4165 1.06e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 48.44  E-value: 1.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3992 EASRPTSVAESVKDEAEKSKEESRRESVAEK--SPLASKESSRPASVAESIKDEAegtkQESRRESMPESGKAESIKGDQ 4069
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASavGPVGPGEPNQPDDVAEAVKAEV----AEVTDEVAAESVVQVADRDGE 358
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4070 SSLASKETSRPDSVVESVKD-ETEKPEGSAIDKSQVASRPESVAVSAKDEKSplHSRPESVADKSPDASKEASRSLSVAE 4148
Cdd:PRK13108  359 STPAVEETSEADIEREQPGDlAGQAPAAHQVDAEAASAAPEEPAALASEAHD--ETEPEVPEKAAPIPDPAKPDELAVAG 436
                         170
                  ....*....|....*...
gi 161077523 4149 TASSPIE-EGPRSIADLS 4165
Cdd:PRK13108  437 PGDDPAEpDGIRRQDDFS 454
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
2266-2493 1.07e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 49.27  E-value: 1.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2266 DEAEKSKEESRRESVAEKSPLPSKEASRPASVAEsiKDEAEKSKEETRRESVAEKSPLPSKEASRPASVAESIKDEAEKS 2345
Cdd:PTZ00108 1135 DKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASK--LRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDK 1212
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2346 KEESRRESAaeKSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGKAQSIKGDQSPLKEVSRPESVAESvkdDPV 2425
Cdd:PTZ00108 1213 PDNKKSNSS--GSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYS---PPP 1287
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 2426 KSKEPSRRESVAGSVTADSARDDQSPLE--------SKGASRPESVVDSVKDEAEKQESRRESKTESVIPPKAKDD 2493
Cdd:PTZ00108 1288 PSKRPDGESNGGSKPSSPTKKKVKKRLEgslaalkkKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDS 1363
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
2216-2396 1.09e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 48.44  E-value: 1.09e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2216 EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSK--EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASR 2293
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGpgEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPA 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2294 PASVAESikdEAEKSKEEtrrESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESAAEKSPLPskEASRPASVAE 2373
Cdd:PRK13108  363 VEETSEA---DIEREQPG---DLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIP--DPAKPDELAV 434
                         170       180
                  ....*....|....*....|...
gi 161077523 2374 SVKDEADKSKEESRRESMAESGK 2396
Cdd:PRK13108  435 AGPGDDPAEPDGIRRQDDFSSRR 457
PHA00430 PHA00430
tail fiber protein
3414-3528 1.13e-04

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 48.73  E-value: 1.13e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3414 ASVAESVKDEAEKSKEESPLMSKEASRPASVAGSVKDEAEKSKEESRR-----ESVAEKSPLPSKEASRPASVAESVKDE 3488
Cdd:PHA00430  165 RNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRfkgyaDSMTSSVEAAKGQAESSSKEANTAGDY 244
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 161077523 3489 ADKSKEESRR----ESGAEKSPLASK-EASRPASVAESIKDEAEK 3528
Cdd:PHA00430  245 ATKAAASASAahasEVNAANSATAAAtSANRAKQQADRAKTEADK 289
PLN03237 PLN03237
DNA topoisomerase 2; Provisional
3045-3431 1.14e-04

DNA topoisomerase 2; Provisional


Pssm-ID: 215641 [Multi-domain]  Cd Length: 1465  Bit Score: 49.09  E-value: 1.14e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3045 EKSESRPESVIASGEPVPRESKSPLDSKDTSRPGSMVE--------SVTAE--DEKSEQQSRRESVAESVKADTKK---- 3110
Cdd:PLN03237 1076 KKAKSVEAAVAGATDDAAEEEEEIDVSSSSGVRGSDYDyllsmaigTLTLEkvQELCADRDKLNIEVEDLKKTTPKslwl 1155
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3111 -DGKSQEASrpssVDELLKDDDEKQESRRQSITGSHKAMSTMGdesPMDKADKSKEPSRPESVAESIKHENTKDEESPLG 3189
Cdd:PLN03237 1156 kDLDALEKE----LDKLDKEDAKAEEAREKLQRAAARGESGAA---KKVSRQAPKKPAPKKTTKKASESETTEETYGSSA 1228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3190 SRRDSVAESIKSDITKGEKSPLPSKEVSRPESV-VGSIKDEKAESRRESV-AESVKPESSKDATSAPPSKEHSRPESVLG 3267
Cdd:PLN03237 1229 METENVAEVVKPKGRAGAKKKAPAAAKEKEEEDeILDLKDRLAAYNLDSApAQSAKMEETVKAVPARRAAARKKPLASVS 1308
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3268 SLKDEGDKTTsrrvsvaDSIKDEKSLLVSQEASRPESEAESLKDAAAPSQETSRPESVTESVKDGKSpvaskEASRPAsv 3347
Cdd:PLN03237 1309 VISDSDDDDD-------DFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLT-----EMLKPA-- 1374
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3348 aeNAKDSADESKEQRPESLP-QSKAGSIKDEksplASKDEAEKSKEESRRESVAEQFPLVSKEVSRPAS-----VAESVK 3421
Cdd:PLN03237 1375 --EAIGISPEKKVRKMRASPfNKKSGSVLGR----AATNKETESSENVSGSSSSEKDEIDVSAKPRPQRanrkqTTYVLS 1448
                         410
                  ....*....|
gi 161077523 3422 DEAEKSKEES 3431
Cdd:PLN03237 1449 DSESESADDS 1458
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
3375-3577 1.18e-04

Ring-infected erythrocyte surface antigen; Provisional


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 49.01  E-value: 1.18e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3375 KDEKSPLASKDEAEKSKEESRRESVAEqfplvSKEVSRPASVAESVKDEAEKSKEESPLMSKEASRPASVAGSVKDEAEK 3454
Cdd:PTZ00341  931 QNENVPEHLKEHAEANIEEDAEENVEE-----DAEENVEENVEENVEENVEENVEENVEENVEENVEENVEENVEENIEE 1005
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3455 SKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEASRPASVaESIKDEAEKSKEESR 3534
Cdd:PTZ00341 1006 NVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEEIEENAEENVEENIEENIEEYDEENV-EEIEENIEENIEENV 1084
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|...
gi 161077523 3535 RESVAEKSPLPSKeasrptSVAESVKDEAEKSKEESRRDSVAE 3577
Cdd:PTZ00341 1085 EENVEENVEEIEE------NVEENVEENAEENAEENAEENAEE 1121
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
1979-2169 1.19e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 48.44  E-value: 1.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1979 AAQSRETSRPASVAESAKDGADDlkelSRPESTTQSKEAGSIKDeKSPLASEEASRPASVAESVKDEAEKSKEESRRESV 2058
Cdd:PRK13108  275 APKGREAPGALRGSEYVVDEALE----REPAELAAAAVASAASA-VGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESV 349
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2059 AEKSPLPSKEASRPASvaesiKDEAEKSKEESrRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRREsVAEKSPl 2138
Cdd:PRK13108  350 VQVADRDGESTPAVEE-----TSEADIEREQP-GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE-VPEKAA- 421
                         170       180       190
                  ....*....|....*....|....*....|..
gi 161077523 2139 PSKEASRP-ASVAESIKDEAEKSKEESRRESV 2169
Cdd:PRK13108  422 PIPDPAKPdELAVAGPGDDPAEPDGIRRQDDF 453
rne PRK10811
ribonuclease E; Reviewed
3450-3855 1.21e-04

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 48.88  E-value: 1.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3450 DEAEKSKEESRREsvaEKSPLPSKEASRPASVAESvkDEADKSKEESRRESGAEKSPLASKEaSRPASVAESIKDEAEKS 3529
Cdd:PRK10811  630 REGRENREENRRN---RRQAQQQTAETRESQQAEV--TEKARTQDEQQQAPRRERQRRRNDE-KRQAQQEAKALNVEEQS 703
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3530 -----KEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDSVAEKSPLASKEASRPA-SVAESVQDEAEKS 3603
Cdd:PRK10811  704 vqeteQEERVQQVQPRRKQRQLNQKVRIEQSVAEEAVAPVVEETVAAEPVVQEVPAPRTELVKVPLpVVAQTAPEQDEEN 783
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3604 KEESRRESVAeksPLASKEASRPASVaesikdeaekSKEESRR---ESVAEKSPL------ASKEA-------SRP-TSV 3666
Cdd:PRK10811  784 NAENRDNNGM---PRRSRRSPRHLRV----------SGQRRRRyrdERYPTQSPMpltvacASPEMasgkvwiRYPvVRP 850
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3667 AESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEkskeESRRESVAEKSPLASKEASRPASVAESVkd 3746
Cdd:PRK10811  851 QDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVE----EPVVVAEPQPEEVVVVETTHPEVIAAPV-- 924
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3747 DAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVkdeAEKSK 3826
Cdd:PRK10811  925 TEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTA---VEPEV 1001
                         410       420       430
                  ....*....|....*....|....*....|..
gi 161077523 3827 EESRRESVAEKSSLAS---KKASRPASVAESV 3855
Cdd:PRK10811 1002 APAQVPEATVEHNHATapmTRAPAPEYVPEAP 1033
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C
3634-3911 1.21e-04

Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 48.44  E-value: 1.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3634 KDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDEAEKSKEESSRDSVAE-KSPLASKEASRPASVAESVQDEAE 3712
Cdd:PRK07735   12 KEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAaKAKAAALAKQKREGTEEVTEEEKA 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3713 KSKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKSPlaSKEASRPASVAESVKDEAEKSKEES 3792
Cdd:PRK07735   92 KAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAK--QKREGTEEVTEEEEETDKEKAKAKA 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3793 RRESVAEKSPLPSKEASRPTS-VAESVKDEAEKSKEESRRESVAEKSSLASKKASRPASVAEsvkDEAEKSKeesrreSV 3871
Cdd:PRK07735  170 AAAAKAKAAALAKQKAAEAGEgTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSG---DEDAKAK------AI 240
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 161077523 3872 AEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEK 3911
Cdd:PRK07735  241 AAAKAKAAAAARAKTKGAEGKKEEEPKQEEPSVNQPYLNK 280
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1907-2174 1.39e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 48.47  E-value: 1.39e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1907 DSKEPSRPESKVECLKDESEVLKG-STRRESVAESDKSSQPFKEtsrpESAVGSMKDESMSKEPSRResVKDGAAQsrET 1985
Cdd:NF033838  202 DEEKIKQAKAKVESKKAEATRLEKiKTDREKAEEEAKRRADAKL----KEAVEKNVATSEQDKPKRR--AKRGVLG--EP 273
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1986 SRPASVAESAKDGADDLKELSRPESttqskeagSIKDEKSPLASEEAsrpasvAESVKDEAEKSKEESRR---------- 2055
Cdd:NF033838  274 ATPDKKENDAKSSDSSVGEETLPSP--------SLKPEKKVAEAEKK------VEEAKKKAKDQKEEDRRnyptntyktl 339
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2056 ESVAEKSPLPSKEASrpasvAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSKEESRR-----E 2130
Cdd:NF033838  340 ELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAEEEAKRkaaeeD 410
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 161077523 2131 SVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSP 2174
Cdd:NF033838  411 KVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1777-2137 1.52e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 48.47  E-value: 1.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1777 EKSPLTSKDISRPESAVENVMDAVGSAERSQPEsvtASRDVSRPESVAES--EKDDTDKPESVVESVipasdvvEIEKGA 1854
Cdd:NF033838  111 SEAELTSKTKKELDAAFEQFKKDTLEPGKKVAE---ATKKVEEAEKKAKDqkEEDRRNYPTNTYKTL-------ELEIAE 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1855 AD---KEKGVFVSLEIGKPDSPSEVISRPGPVVESVKPESRRESSTEIVLPCHAEDSKEPSRPESKVECLKD----ESEV 1927
Cdd:NF033838  181 SDvevKKAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNvatsEQDK 260
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1928 LKGSTRRESVAESDKSSQPFKETSRPESAVGsmkdESMSKEPSRRESVKDGAAQSRetsrpasvAESAKDGADDLKELSR 2007
Cdd:NF033838  261 PKRRAKRGVLGEPATPDKKENDAKSSDSSVG----EETLPSPSLKPEKKVAEAEKK--------VEEAKKKAKDQKEEDR 328
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2008 PESTTQSKEAGSIKDEKSPLASEEASrpasvAESVKDEAEKSKEESRRESVAEKSPLPSKEASRpasvAESIKDEAEKSK 2087
Cdd:NF033838  329 RNYPTNTYKTLELEIAESDVKVKEAE-----LELVKEEAKEPRNEEKIKQAKAKVESKKAEATR----LEKIKTDRKKAE 399
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 161077523 2088 EESRR-----ESVAEK-------SPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSP 2137
Cdd:NF033838  400 EEAKRkaaeeDKVKEKpaeqpqpAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSE 461
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
2253-2436 1.56e-04

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 48.05  E-value: 1.56e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2253 EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSK--EASRPASVAESIKDEAEKSKEETRRESVAEKSPLPSKEASR 2330
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGpgEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPA 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2331 PASvaesiKDEAEKSKEESrRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRREsmaESGKAQSIKGDQSPlkev 2410
Cdd:PRK13108  363 VEE-----TSEADIEREQP-GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE---VPEKAAPIPDPAKP---- 429
                         170       180
                  ....*....|....*....|....*.
gi 161077523 2411 srPESVAESVKDDPVKSKEPSRRESV 2436
Cdd:PRK13108  430 --DELAVAGPGDDPAEPDGIRRQDDF 453
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
3515-3897 1.77e-04

Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 48.18  E-value: 1.77e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3515 PASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDSVAEKSPLASKEASRPA---- 3590
Cdd:PRK14949  380 QTPSALAAAVQAPHANEPQFVNAAPAEKKTALTEQTTAQQQVQAANAEAVAEADASAEPADTVEQALDDESELLAAlnae 459
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3591 -----SVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPAS----VAESIKDEAEKSKEESRRESVAEKSPLASKEAS 3661
Cdd:PRK14949  460 qavilSQAQSQGFEASSSLDADNSAVPEQIDSTAEQSVVNPSVtdtqVDDTSASNNSAADNTVDDNYSAEDTLESNGLDE 539
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3662 RPTSVAESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSP---------LASK 3732
Cdd:PRK14949  540 GDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSPISAVTTAAASLadddildavLAAR 619
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3733 EASRPASVAESVKDDAEKSKEESRREsvaeKSPLASK---EASRPASVAESVKDEAEKSKE-ESRRESVAEKSPLPSKEA 3808
Cdd:PRK14949  620 DSLLSDLDALSPKEGDGKKSSADRKP----KTPPSRAppaSLSKPASSPDASQTSASFDLDpDFELATHQSVPEAALASG 695
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3809 SRPTSVAESvKDEAEKSKEEsrresvAEKSSLASKKASRPASVAESVKDEAEKSKEESRRESVAEKSPLASKEASRPASV 3888
Cdd:PRK14949  696 SAPAPPPVP-DPYDRPPWEE------APEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPAST 768

                  ....*....
gi 161077523 3889 AESVKDEAE 3897
Cdd:PRK14949  769 TALTQTSSE 777
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3585-3760 1.89e-04

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 47.67  E-value: 1.89e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3585 EASRPASVAESVQDEAEKSKEESRRESVAEK--SPLASKEASRPASVAESIKDEAEKSKEESRRESVaekSPLASKEASR 3662
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASavGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESV---VQVADRDGES 359
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3663 PTSVAESVKDEAEKskeESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRREsVAEKSPLASKEASRPASVAE 3742
Cdd:PRK13108  360 TPAVEETSEADIER---EQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE-VPEKAAPIPDPAKPDELAVA 435
                         170
                  ....*....|....*...
gi 161077523 3743 SVKDDAEKSKEESRRESV 3760
Cdd:PRK13108  436 GPGDDPAEPDGIRRQDDF 453
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1393-2320 2.25e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 48.04  E-value: 2.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1393 EATTVITQENQEDLMEQVKDKEEHEQKIESGIITEKEAKKSASTPEEKETSDITSddelpaqladpttvppksaKDREDT 1472
Cdd:pfam02463  133 EAYNFLVQGGKIEIIAMMKPERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAE-------------------LIIDLE 193
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1473 GSIESPPTIEEAIEVEVQAKQEAQKPVPAPEEAIKTEKsplaSKETSRPESATGSVKEDTEQTKSKKSPVPSRPESEAKD 1552
Cdd:pfam02463  194 ELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDY----LKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQ 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1553 KKSPFASGEASRPESVAESVKDEAGKAESRRESIAKTHKDESSLDKAKEQESRRESLAESIKPESGIDEKSALASKEASR 1632
Cdd:pfam02463  270 VLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEI 349
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1633 PESVTDKSKEPSRRESIAESLKAESTKDEKSappSKEASRPGSVVESVKDETEKSKEPSRRESIAESAkppiefrevsrp 1712
Cdd:pfam02463  350 KREAEEEEEEELEKLQEKLEQLEEELLAKKK---LESERLSSAAKLKEEELELKSEEEKEAQLLLELA------------ 414
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1713 ESVIDGIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIKSTEKSRRESVAESFKAdSTKDEKSPLTSKDISRPESA 1792
Cdd:pfam02463  415 RQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDL-LKETQLVKLQEQLELLLSRQ 493
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1793 VENVMDAVGSAERSqPESVTASRDVSRPESVAESEKDDTDKPESVVESVIPA-SDVVEIEKGAADKEKGVFVSLEIGKPD 1871
Cdd:pfam02463  494 KLEERSQKESKARS-GLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAiSTAVIVEVSATADEVEERQKLVRALTE 572
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1872 SPSEVISRPGPVVESVKPESR-RESSTEIVLPCHAEDSKEPS-RPESKVECLKDESEVLKGSTRRESVAESDKSSQPFKE 1949
Cdd:pfam02463  573 LPLGARKLRLLIPKLKLPLKSiAVLEIDPILNLAQLDKATLEaDEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGV 652
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1950 TSRPESAVGSMKDESMSKEPSRRESVKDGAAQSRETSRPASVAESAKDGAddLKELSRPESTTQSKEAGS-IKDEKSPLA 2028
Cdd:pfam02463  653 SLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIK--KKEQREKEELKKLKLEAEeLLADRVQEA 730
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  2029 SEEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASR 2108
Cdd:pfam02463  731 QDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEEL 810
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  2109 PASVAESIKDEAEKSKEESRRESVAEKSPLPSKeasrpasvaESIKDEAEKSKEESRRESVAEKsplpsKEASRPASVAE 2188
Cdd:pfam02463  811 KEEAELLEEEQLLIEQEEKIKEEELEELALELK---------EEQKLEKLAEEELERLEEEITK-----EELLQELLLKE 876
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  2189 SIKDEAEKSKEESRRESVAEKSPLPSKEASRpasvaESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEA 2268
Cdd:pfam02463  877 EELEEQKLKDELESKEEKEKEEKKELEEESQ-----KLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKE 951
                          890       900       910       920       930
                   ....*....|....*....|....*....|....*....|....*....|..
gi 161077523  2269 EKSKEESRRESVAEKsplpSKEASRPASVAESIKDEAEKSKEETRRESVAEK 2320
Cdd:pfam02463  952 ENNKEEEEERNKRLL----LAKEELGKVNLMAIEEFEEKEERYNKDELEKER 999
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3808-4165 2.52e-04

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.86  E-value: 2.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3808 ASRPTSVAESVKDEAEKSKEESRRESVAEKSSLASKKASRPASVAE--SVKDEAEKSKEESRRESVAEKSPLASKEASRP 3885
Cdd:PHA03307   44 VSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTwsLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3886 ASVAESVKDEAekskeesrresvaeKSPLPSKEASRPTSVAESVKDEADKSKEESRRESGAEKSPLASM----------- 3954
Cdd:PHA03307  124 ASPPPSPAPDL--------------SEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSpeetarapssp 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3955 EASRPTSVAESVKDETEKSKEESRRESVTEKSPLPSKEASRPTSVAESVKDEAEKS-KEESRRESVAEKSPLASKESSRP 4033
Cdd:PHA03307  190 PAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSgCGWGPENECPLPRPAPITLPTRI 269
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4034 ASVAESIKDEAEGTKQESRRESMPESGKAESIKGDQSSLASkeTSRPDSVVESVKDEtekpEGSAIDKSQVASRPESVAV 4113
Cdd:PHA03307  270 WEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPS--SPRASSSSSSSRES----SSSSTSSSSESSRGAAVSP 343
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 161077523 4114 SAKDEKSPLHSRPESVADKSPDASKEASRSLSVAETASSPIEEGPRSIADLS 4165
Cdd:PHA03307  344 GPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVA 395
valS PRK14900
valyl-tRNA synthetase; Provisional
2039-2234 2.66e-04

Pssm-ID: 237855 [Multi-domain]  Cd Length: 1052  Bit Score: 47.68  E-value: 2.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2039 AESVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEEsrresvaeKSPLPSKEASRPASVAESIKD 2118
Cdd:PRK14900  842 AETARVDKEIGKVDQDLAVLERKLQNPSFVQNAPPAVVEKDRARAEELREK--------RGKLEAHRAMLSGSEANSARR 913
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2119 EAEKSKEESRResvAEKSPLPSKEASRPASVAESIKDEAEKSKE-------------ESRRESVAEKSPLPSKEA-SRPA 2184
Cdd:PRK14900  914 DTMEIQNEQKP---TQDGPAAEAQPAQENTVVESAEKAVAAVSEaaqqaatavasgiEKVAEAVRKTVRRSVKKAaATRA 990
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 2185 SVAESIKDEAEKSKEESR----RESVAEKSP---LPSKEASRPASVAESIKDEAEKS 2234
Cdd:PRK14900  991 AMKKKVAKKAPAKKAAAKkaaaKKAAAKKKVakkAPAKKVARKPAAKKAAKKPARKA 1047
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
2031-2206 2.70e-04

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 47.28  E-value: 2.70e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2031 EASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSK--EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASR 2108
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGpgEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPA 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2109 PASvaesiKDEAEKSKEESrRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRREsVAEKSPlPSKEASRP-ASVA 2187
Cdd:PRK13108  363 VEE-----TSEADIEREQP-GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE-VPEKAA-PIPDPAKPdELAV 434
                         170
                  ....*....|....*....
gi 161077523 2188 ESIKDEAEKSKEESRRESV 2206
Cdd:PRK13108  435 AGPGDDPAEPDGIRRQDDF 453
valS PRK14900
valyl-tRNA synthetase; Provisional
2187-2382 2.99e-04

Pssm-ID: 237855 [Multi-domain]  Cd Length: 1052  Bit Score: 47.68  E-value: 2.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2187 AESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEEsrresvaeKSPLPSKEASRPASVAESIKD 2266
Cdd:PRK14900  842 AETARVDKEIGKVDQDLAVLERKLQNPSFVQNAPPAVVEKDRARAEELREK--------RGKLEAHRAMLSGSEANSARR 913
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2267 EAEKSKEESRResvAEKSPLPSKEASRPASVAESIKDEAEKSKE-------------ETRRESVAEKSPLPSKEA-SRPA 2332
Cdd:PRK14900  914 DTMEIQNEQKP---TQDGPAAEAQPAQENTVVESAEKAVAAVSEaaqqaatavasgiEKVAEAVRKTVRRSVKKAaATRA 990
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 2333 SVAESIKDEAEKSKEESR----RESAAEKSP---LPSKEASRPASVAESVKDEADKS 2382
Cdd:PRK14900  991 AMKKKVAKKAPAKKAAAKkaaaKKAAAKKKVakkAPAKKVARKPAAKKAAKKPARKA 1047
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3410-3772 3.08e-04

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.47  E-value: 3.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3410 VSRPASVAESVKDEAEKSKEESPLMSKEASRPASVAGSVKDEAEKSKEESRRESVAEkSPLPSkEASRPASVAESVKDEA 3489
Cdd:PHA03307   68 PTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPA-SPPPS-PAPDLSEMLRPVGSPG 145
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3490 DKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAEKskeesrresvaeksPLPSKEASRPTSVAESVKDEAEKSKEE 3569
Cdd:PHA03307  146 PPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETAR--------------APSSPPAEPPPSTPPAAASPRPPRRSS 211
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3570 SRRDSVAEKSPLASKEASRPASVAESVQDEAEKS-KEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEES---- 3644
Cdd:PHA03307  212 PISASASSPAPAPGRSAADDAGASSSDSSSSESSgCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSsssp 291
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3645 --RRESVAEKSPLASKEASRPTSVAESVKDeAEKSKEESSRDSVAEKSPLAS--KEASRPASVAESVQDEAEKSKEESRR 3720
Cdd:PHA03307  292 reRSPSPSPSSPGSGPAPSSPRASSSSSSS-RESSSSSTSSSSESSRGAAVSpgPSPSRSPSPSRPPPPADPSSPRKRPR 370
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 161077523 3721 ESVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRE-SVAEKSPLASKEAS 3772
Cdd:PHA03307  371 PSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPaGRPRPSPLDAGAAS 423
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3622-3797 3.50e-04

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 46.90  E-value: 3.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3622 EASRPASVAESIKDEAEKSKEESRRESVAEK--SPLASKEASRPTSVAESVKDEAEKSKEESSRDSvaeksplASKEASR 3699
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASavGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAES-------VVQVADR 355
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3700 PASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKSPLASKEASRPASVAE 3779
Cdd:PRK13108  356 DGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVA 435
                         170
                  ....*....|....*...
gi 161077523 3780 SVKDEAEKSKEESRRESV 3797
Cdd:PRK13108  436 GPGDDPAEPDGIRRQDDF 453
PHA00430 PHA00430
tail fiber protein
2252-2386 3.56e-04

Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 47.19  E-value: 3.56e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2252 KEASRPASVAESIKDEAEKSKEESRRESVAEKSplPSKEASRPASVAESIKDEAEKSKEETRREsvaeksplpSKEASRP 2331
Cdd:PHA00430  166 NEANRSRNEADRARNQAERFNNESGASATNTKQ--WRSEADGSNSEANRFKGYADSMTSSVEAA---------KGQAESS 234
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 2332 ASVAESIKDEAEKSKEEsrrESAAEKSPLPSK-EASRPASVAESVKDEADKSKEES 2386
Cdd:PHA00430  235 SKEANTAGDYATKAAAS---ASAAHASEVNAAnSATAAATSANRAKQQADRAKTEA 287
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
1633-2073 4.03e-04

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 46.99  E-value: 4.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1633 PESVTDKSKEPSRRESiaESLKAESTKDEKSappSKEASRPGSVVESV---KDETEKSKEPSRRESIAESAKPPIEFREV 1709
Cdd:PTZ00449  514 PEASGLPPKAPGDKEG--EEGEHEDSKESDE---PKEGGKPGETKEGEvgkKPGPAKEHKPSKIPTLSKKPEFPKDPKHP 588
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1710 SRPESVIDGIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIKSTEKSRRESVAEsfKADSTKDEKSPLTSKDISRP 1789
Cdd:PTZ00449  589 KDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPE--RPEGPKIIKSPKPPKSPKPP 666
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1790 --ESAVENVMDAVGSAERSQPESVTASRDVSRPESVAESEKDDTDKPESVVESVIPAsdvveieKGAADKEkgvFVSLEI 1867
Cdd:PTZ00449  667 fdPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPP-------KLPRDEE---FPFEPI 736
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1868 GKPDSPSevisrPGPVVESVKPESRR----ESSTEIVLP-CHAEDSKEP---SRPESKVECLKDEsevlKGSTRRESVAE 1939
Cdd:PTZ00449  737 GDPDAEQ-----PDDIEFFTPPEEERtffhETPADTPLPdILAEEFKEEdihAETGEPDEAMKRP----DSPSEHEDKPP 807
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1940 SDKSSQPFKetsRPESAVGSMKDESMSKEPSRreSVKDGAAQSRETSRPASVaesakdgaDDLKELSRPEStTQSKEAGS 2019
Cdd:PTZ00449  808 GDHPSLPKK---RHRLDGLALSTTDLESDAGR--IAKDASGKIVKLKRSKSF--------DDLTTVEEAEE-MGAEARKI 873
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....
gi 161077523 2020 IKDEKSPLASEEASRPASvaesvkdeaEKSKEESRRESvAEKSPLPSKEASRPA 2073
Cdd:PTZ00449  874 VVDDDGTEADDEDTHPPE---------EKHKSEVRRRR-PPKKPSKPKKPSKPK 917
PHA00430 PHA00430
tail fiber protein
3656-3787 4.04e-04

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 46.81  E-value: 4.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3656 ASKEASRPTSVAESVKDEAEKSKEESSRDSVAEKSPLASKEASRpasvaesvqDEAEKSKEESrrESVAEKSPLASKEAS 3735
Cdd:PHA00430  164 ARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSN---------SEANRFKGYA--DSMTSSVEAAKGQAE 232
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 3736 RPASVAESVKDDAEKSKEESRR----ESVAEKSPLASK-EASRPASVAESVKDEAEK 3787
Cdd:PHA00430  233 SSSKEANTAGDYATKAAASASAahasEVNAANSATAAAtSANRAKQQADRAKTEADK 289
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
948-1691 4.19e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 47.27  E-value: 4.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523   948 AAKKLQDLTASQELDAEKQRELddLKEEQEVVREIEAVFSRDEMKRQQHQQIKAELREMPAEGTGDGENEPDEEEEYLII 1027
Cdd:pfam02463  206 AKKALEYYQLKEKLELEEEYLL--YLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKL 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1028 EKEEVEQYTEDSIVEQESSMTKEEEIQKHQRDSQESEKKRKKSAEEEIEAAIAKVEAAERKARLEGASARQDESELDVEP 1107
Cdd:pfam02463  284 QEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEK 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1108 EQSKIKAEVQDIIATAKD------IAKSRTEEQLAKPAEEELSSPTPEEKLSKKTSDTKDDQIGAPVDVLpvNLQESLPE 1181
Cdd:pfam02463  364 LQEKLEQLEEELLAKKKLeserlsSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILE--EEEESIEL 441
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1182 EKFSATIESGATTAPTLPEDERIPLDQIKEDLVIEEKYVKEETKEAEAIVVATVQTL------------------PEAAP 1243
Cdd:pfam02463  442 KQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERsqkeskarsglkvllaliKDGVG 521
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1244 LAIDTILASATKDAPKDANAEALGELPDSGERVLpMKMTFEAQQNLLRDVIKTPDEVADLpvheeadlgLYEKDSQDAGA 1323
Cdd:pfam02463  522 GRIISAHGRLGDLGVAVENYKVAISTAVIVEVSA-TADEVEERQKLVRALTELPLGARKL---------RLLIPKLKLPL 591
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1324 KSISHKEESAKEEKETDDEKENKVGEIELGDEPNKVDISHVLLKESVQEVAEKVVVIETTVEKKQEEIVEATTVITQENQ 1403
Cdd:pfam02463  592 KSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSEL 671
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1404 ED---LMEQVKDKEEHEQKIESGIITEKEAKKSASTPEEKETSDITSDDELPA----QLADPTTVPPKSAKDREDTGSIE 1476
Cdd:pfam02463  672 TKellEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLAdrvqEAQDKINEELKLLKQKIDEEEEE 751
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1477 SPPTIEEAIEVEVQAKQEAQKPVPAPEEAIKTEKSPLASKETSRPESATGSVKE------------DTEQTKSKKSPVPS 1544
Cdd:pfam02463  752 EEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRAleeelkeeaellEEEQLLIEQEEKIK 831
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1545 RPESEAKDKKSPFASGEASRPESVAESVKDEAGKAESRRESIAKTHKDESSLDKAKEQESRRESLAESIkpesGIDEKSA 1624
Cdd:pfam02463  832 EEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKK----ELEEESQ 907
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523  1625 LASKEASRPESVTDKSKEPSRRESIAESLKAESTKDEKSAPPSKEASRPGSVVESVKDETEKSKEPS 1691
Cdd:pfam02463  908 KLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGK 974
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
3737-4176 4.27e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 47.03  E-value: 4.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3737 PASVAESVKDDAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAEKSPlpskEASRPTSVAE 3816
Cdd:PRK14949  371 PAEISLPEGQTPSALAAAVQAPHANEPQFVNAAPAEKKTALTEQTTAQQQVQAANAEAVAEADASA----EPADTVEQAL 446
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3817 SVKDEAEKSKEESRRESVAEKSSLASKKASRPASVAESVKDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEa 3896
Cdd:PRK14949  447 DDESELLAALNAEQAVILSQAQSQGFEASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDN- 525
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3897 ekskeesrreSVAEKSPLPSKEASRPTSVAESVKDEADKSKEESRRESGAEKSPLASMEASRPTSVAESVKDETEKSKEE 3976
Cdd:PRK14949  526 ----------YSAEDTLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSP 595
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3977 SRRESVTEKSP-----LPSKEASRPT----SVAESVKDEAEKSKEESRREsvaeKSPLASKESSRPASVAESIKDEAEGT 4047
Cdd:PRK14949  596 ISAVTTAAASLadddiLDAVLAARDSllsdLDALSPKEGDGKKSSADRKP----KTPPSRAPPASLSKPASSPDASQTSA 671
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 4048 KQESRRESMPESGKAESIKGDQSslASKETSRPDSVVESVKDETEKPEGSAIDKSQVASRPESVAVSAKDEKSPLHSRPE 4127
Cdd:PRK14949  672 SFDLDPDFELATHQSVPEAALAS--GSAPAPPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVE 749
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*....
gi 161077523 4128 SVADKSPDASKEASrslSVAETASSPIEEGPRSIADLSLPLNLTGEAKG 4176
Cdd:PRK14949  750 QQATHQPQVQAEAQ---SPASTTALTQTSSEVQDTELNLVLLSSGSITG 795
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1480-1732 4.42e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 46.96  E-value: 4.42e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1480 TIEEAIEVEVQAKQEAQK-PVPAPEEAIKTEKSPLASKETS-RPESATGSVKEDTEQTKSKKSPVPSRPESEAKDKKSPF 1557
Cdd:PTZ00108 1140 ALEEQEEVEEKEIAKEQRlKSKTKGKASKLRKPKLKKKEKKkKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSN 1219
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1558 ASGEASRPESVAESVKDEAGKAESRRESIAKTHKDESSLDKAKEQESRRESLAESIKPESGIDEKSALASKEASRPESVT 1637
Cdd:PTZ00108 1220 SSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGG 1299
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1638 DKSKEPSRR---ESIAESLKA-ESTKDEKSAppskeasrpgsVVESVKDETEKSKEPSRRESIAESAKPPIEFREVSRPE 1713
Cdd:PTZ00108 1300 SKPSSPTKKkvkKRLEGSLAAlKKKKKSEKK-----------TARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDD 1368
                         250
                  ....*....|....*....
gi 161077523 1714 SVIDGIKDESAKPESRRDS 1732
Cdd:PTZ00108 1369 DDSEVDDSEDEDDEDDEDD 1387
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2106-2490 4.50e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 4.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2106 ASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAE--SIKDEAEKSKEESRRESVAEKSPLPSKEASRP 2183
Cdd:PHA03307   44 VSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTwsLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2184 ASVAES---IKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDE-AEKSKEESRRESVAEKSPLPSKEASRPAS 2259
Cdd:PHA03307  124 ASPPPSpapDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAAlPLSSPEETARAPSSPPAEPPPSTPPAAAS 203
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2260 vaesiKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEETRRESvAEKSPLPSKEASRPASVAESIK 2339
Cdd:PHA03307  204 -----PRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEC-PLPRPAPITLPTRIWEASGWNG 277
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2340 DEAEKSKEESRR-ESAAEKSPLPSKEASRPASVAesvkdeADKSKEESRRESMAESGKAQSIKGDQSPLKEVSRPESVAE 2418
Cdd:PHA03307  278 PSSRPGPASSSSsPRERSPSPSPSSPGSGPAPSS------PRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSP 351
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 161077523 2419 SVKDDPVKSKEPSRRESvagsvtADSARDDQSPLESKGASRPESVVDSVKDEAEKQE--SRRESKTESVIPPKA 2490
Cdd:PHA03307  352 SPSRPPPPADPSSPRKR------PRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDatGRFPAGRPRPSPLDA 419
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3696-3871 4.52e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 46.51  E-value: 4.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3696 EASRPASVAESVQDEAEKSKEESRRESVAEK--SPLASKEASRPASVAESVKDDAEKSKEESRRESVaekSPLASKEASR 3773
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASavGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESV---VQVADRDGES 359
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3774 PASVAESvkDEAEKSKEESRrESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRREsVAEKSSLASKKASRPASVAE 3853
Cdd:PRK13108  360 TPAVEET--SEADIEREQPG-DLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE-VPEKAAPIPDPAKPDELAVA 435
                         170
                  ....*....|....*...
gi 161077523 3854 SVKDEAEKSKEESRRESV 3871
Cdd:PRK13108  436 GPGDDPAEPDGIRRQDDF 453
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3770-3943 4.93e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 46.51  E-value: 4.93e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3770 EASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSK--EASRPTSVAESVKDEAEKSKEESRRESVAEKSSLASKKASR 3847
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGpgEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPA 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3848 PASvaesvKDEAEKSKEESrRESVAEKSPLASKEASRPASVAESVKDEAEKSKEESRREsVAEKSPlPSKEASRPT-SVA 3926
Cdd:PRK13108  363 VEE-----TSEADIEREQP-GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE-VPEKAA-PIPDPAKPDeLAV 434
                         170
                  ....*....|....*..
gi 161077523 3927 ESVKDEADKSKEESRRE 3943
Cdd:PRK13108  435 AGPGDDPAEPDGIRRQD 451
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C;
3634-3917 5.08e-04

NADH-quinone oxidoreductase subunit C;


Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 46.13  E-value: 5.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3634 KDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVKdEAEKSKEESSRDSVAEKSPLASKEASrpASVAESVQDEAEK 3713
Cdd:PRK07735    5 KDLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENR-EKEKALPKNDDMTIEEAKRRAAAAAK--AKAAALAKQKREG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3714 SKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEAEKSKEESR 3793
Cdd:PRK07735   82 TEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETD 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3794 RESVAEKSPLPSKeaSRPTSVAESVKDEAEKSKEESRRESVAEKSSLASKKASrpASVAESVKDEAEKSKEESRRESVAE 3873
Cdd:PRK07735  162 KEKAKAKAAAAAK--AKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAK--AKAAALAKQKASQGNGDSGDEDAKA 237
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....
gi 161077523 3874 KSPLASKeASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSK 3917
Cdd:PRK07735  238 KAIAAAK-AKAAAAARAKTKGAEGKKEEEPKQEEPSVNQPYLNK 280
PHA00430 PHA00430
tail fiber protein
3734-3898 5.09e-04

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 46.42  E-value: 5.09e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3734 ASRPASVAESVKD-DAEKSKEESRRESVAEKsplASKEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRpt 3812
Cdd:PHA00430  133 GRRIVNLADAVDDgDAVPLGQIKTWNQSAWN---ARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSN-- 207
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3813 svaesvkDEAEKSKEESrrESVAEKSSLASKKASRPASVAESVKDEAEKSKEESRR----ESVAEKSPLASK-EASRPAS 3887
Cdd:PHA00430  208 -------SEANRFKGYA--DSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAahasEVNAANSATAAAtSANRAKQ 278
                         170
                  ....*....|.
gi 161077523 3888 VAESVKDEAEK 3898
Cdd:PHA00430  279 QADRAKTEADK 289
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
3626-4052 5.31e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 46.64  E-value: 5.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3626 PASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDEAEKSKEESSRDSVAEKSPlaskEASRPASVAE 3705
Cdd:PRK14949  371 PAEISLPEGQTPSALAAAVQAPHANEPQFVNAAPAEKKTALTEQTTAQQQVQAANAEAVAEADASA----EPADTVEQAL 446
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3706 SVQDEAEKSKeesrresVAEKSPLASKEASRPASVAESVKDDAEKSKEESrrESVAEKSPLASKEASRPasVAESVKDEA 3785
Cdd:PRK14949  447 DDESELLAAL-------NAEQAVILSQAQSQGFEASSSLDADNSAVPEQI--DSTAEQSVVNPSVTDTQ--VDDTSASNN 515
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3786 EKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRESVAEKSSLASKKASRPASVAESVKDEAEKSKEE 3865
Cdd:PRK14949  516 SAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSP 595
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3866 SRRESVAEKSP---------LASKEASRPASVAESVKDEAEKSKEESRREsvaeKSPLPSK---EASRPTSVAESVKDEA 3933
Cdd:PRK14949  596 ISAVTTAAASLadddildavLAARDSLLSDLDALSPKEGDGKKSSADRKP----KTPPSRAppaSLSKPASSPDASQTSA 671
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3934 DKSKEESRRESGAEKSPLASMEAsrPTSVAESVKDETEKSK--EESRRESVTEKSPLPSKEASRPTSVAESVKDeaeksk 4011
Cdd:PRK14949  672 SFDLDPDFELATHQSVPEAALAS--GSAPAPPPVPDPYDRPpwEEAPEVASANDGPNNAAEGNLSESVEDASNS------ 743
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|.
gi 161077523 4012 EESRRESVAEKSPLASKESSRPASVAESIKDEAEGTKQESR 4052
Cdd:PRK14949  744 ELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTELN 784
growth_prot_Scy NF041483
polarized growth protein Scy;
1890-2399 5.32e-04

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 46.74  E-value: 5.32e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1890 ESRRESSTEIVLPCHAEDSKEPSRPESKVECLKDESEVLkgstRRESVAESDKSSQPFKETSRPESAVGSMKDESMSKEP 1969
Cdd:NF041483  567 AARQAEAAEELTRLHTEAEERLTAAEEALADARAEAERI----RREAAEETERLRTEAAERIRTLQAQAEQEAERLRTEA 642
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1970 SRRESVKDGAAQS---RETSRPASVAESAKDGADDLKELSRPESTTQSKEAGSIKDEKSPLASEEASRPASVAE----SV 2042
Cdd:NF041483  643 AADASAARAEGENvavRLRSEAAAEAERLKSEAQESADRVRAEAAAAAERVGTEAAEALAAAQEEAARRRREAEetlgSA 722
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2043 KDEAEKSKEESRRES-----VAEKSPLPSK-EASRPASVAESIKDE---AEKSKEESRRESVAEKSPLPSKEAS--RPAS 2111
Cdd:NF041483  723 RAEADQERERAREQSeellaSARKRVEEAQaEAQRLVEEADRRATElvsAAEQTAQQVRDSVAGLQEQAEEEIAglRSAA 802
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2112 --VAESIKDEAEKSKEESRRESVAEKSPlPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPL---PSKEASRPASV 2186
Cdd:NF041483  803 ehAAERTRTEAQEEADRVRSDAYAERER-ASEDANRLRREAQEETEAAKALAERTVSEAIAEAERLrsdASEYAQRVRTE 881
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2187 AESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKS-PLPSKEASRPASVAESIK 2265
Cdd:NF041483  882 ASDTLASAEQDAARTRADAREDANRIRSDAAAQADRLIGEATSEAERLTAEARAEAERLRDeARAEAERVRADAAAQAEQ 961
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2266 DEAEKSKEESR-RESVAEKSPLPSKEASRPASVAESIKDEAEKSKEETRRESVAEksplpskeasrpasvAESIKDEAEK 2344
Cdd:NF041483  962 LIAEATGEAERlRAEAAETVGSAQQHAERIRTEAERVKAEAAAEAERLRTEAREE---------------ADRTLDEARK 1026
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 161077523 2345 SKEESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGKAQS 2399
Cdd:NF041483 1027 DANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVGAARK 1081
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1569-1840 5.41e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 46.96  E-value: 5.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1569 AESVKDEAGKAESRRESIAKTHKDESS------LDKAKE--QESRRESLAESIKPESGIDEKSALASKEASRPesvtDKS 1640
Cdd:PTZ00108 1101 KEKVEKLNAELEKKEKELEKLKNTTPKdmwledLDKFEEalEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPK----LKK 1176
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1641 KEPSRRESIAESLKAESTKDEK---SAPPSKEASRPGSVVESVKDETEKSKEPSRRESIAESAKPPIEFREVSRPESVID 1717
Cdd:PTZ00108 1177 KEKKKKKSSADKSKKASVVGNSkrvDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSED 1256
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1718 GIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIKSTEKSRRESvAESFKADSTKDEKSPLTSKDISRPESAVENVM 1797
Cdd:PTZ00108 1257 NDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPS-SPTKKKVKKRLEGSLAALKKKKKSEKKTARKK 1335
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|...
gi 161077523 1798 DAVGSAERSQPESVTASRDVSRPESVAESEKDDTDKPESVVES 1840
Cdd:PTZ00108 1336 KSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSED 1378
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
2179-2352 5.46e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 46.13  E-value: 5.46e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2179 EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSK--EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASR 2256
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGpgEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPA 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2257 PASvaesiKDEAEKSKEESrRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEETRREsVAEKSPlPSKEASRP-ASVA 2335
Cdd:PRK13108  363 VEE-----TSEADIEREQP-GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE-VPEKAA-PIPDPAKPdELAV 434
                         170
                  ....*....|....*..
gi 161077523 2336 ESIKDEAEKSKEESRRE 2352
Cdd:PRK13108  435 AGPGDDPAEPDGIRRQD 451
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C;
3711-3985 6.51e-04

NADH-quinone oxidoreductase subunit C;


Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 46.13  E-value: 6.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3711 AEKSKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAE-KSPLASKEASRPASVAESVKDEAEKSK 3789
Cdd:PRK07735   15 ARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAaKAKAAALAKQKREGTEEVTEEEKAKAK 94
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3790 EESRRESVAEKSPLPSKEASRPTSVAEsvkDEAEKSKEESRRESVAEKSSLASKKASRPASVA-ESVKDEAEKSKEESRR 3868
Cdd:PRK07735   95 AKAAAAAKAKAAALAKQKREGTEEVTE---EEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTeEEEETDKEKAKAKAAA 171
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3869 ESVAEKSPLASKEASRP-ASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAEsvkDEADKSKEESRRESGAE 3947
Cdd:PRK07735  172 AAKAKAAALAKQKAAEAgEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSG---DEDAKAKAIAAAKAKAA 248
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 161077523 3948 KSPLASMEAsrptsvAESVKDETEKSKEESRRESVTEK 3985
Cdd:PRK07735  249 AAARAKTKG------AEGKKEEEPKQEEPSVNQPYLNK 280
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3511-3644 6.54e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 46.13  E-value: 6.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3511 EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSK--EASRPTSVAESVKDEAEKSKEESRRDSVAE------KSPLA 3582
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGpgEPNQPDDVAEAVKAEVAEVTDEVAAESVVQvadrdgESTPA 362
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 161077523 3583 SKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEES 3644
Cdd:PRK13108  363 VEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIP 424
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3202-3359 6.71e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 46.13  E-value: 6.71e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3202 DITKGEKSPLPSKEVSRPESVVGSIKDEKAESRRESVAESVKPESSKDATSAPPSKEHSRPESVlgslkDEGDKTTSRRV 3281
Cdd:PRK13108  309 ASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIE-----REQPGDLAGQA 383
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 161077523 3282 SVADSIKDEKSllvSQEASRPESEAESLKDAAAPSQ-ETSRPESvtESVKDGKSPVASKEASRPASVAENAKDSADESK 3359
Cdd:PRK13108  384 PAAHQVDAEAA---SAAPEEPAALASEAHDETEPEVpEKAAPIP--DPAKPDELAVAGPGDDPAEPDGIRRQDDFSSRR 457
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3573-3957 7.21e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.32  E-value: 7.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3573 DSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEAsRPASVAESIKDEAEKSKEESRRESVAek 3652
Cdd:PHA03307   45 SDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTL-APASPAREGSPTPPGPSSPDPPPPTP-- 121
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3653 sPLASKEASRPTSVAESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEesrresvaeksPLASK 3732
Cdd:PHA03307  122 -PPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETAR-----------APSSP 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3733 EASRPASVAESVKDDAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEAEKS-KEESRRESVAEKSPLPSKEASRP 3811
Cdd:PHA03307  190 PAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSgCGWGPENECPLPRPAPITLPTRI 269
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3812 TSVAESVKDEAEKSKEES------RRESVAEKSSLASKKASRPASVAESVKD-EAEKSKEESRRESVAEKSPLASKEASR 3884
Cdd:PHA03307  270 WEASGWNGPSSRPGPASSssspreRSPSPSPSSPGSGPAPSSPRASSSSSSSrESSSSSTSSSSESSRGAAVSPGPSPSR 349
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 161077523 3885 PASVAESVKDEAEKSKeeSRRESVAEKSPLPSKEASRPTS--VAESVKDEADKSKEESRRESGAEK-SPLASMEAS 3957
Cdd:PHA03307  350 SPSPSRPPPPADPSSP--RKRPRPSRAPSSPAASAGRPTRrrARAAVAGRARRRDATGRFPAGRPRpSPLDAGAAS 423
rne PRK10811
ribonuclease E; Reviewed
3469-3887 7.70e-04

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 46.19  E-value: 7.70e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3469 PLPSKEASRPASVAESVKDEADKSKEESR----RESGAEKSPLASKEASRPASVAESIKDEAEKSKEE---SRRESVAEK 3541
Cdd:PRK10811  538 PPAPTPAEPAAPVVAAAPKAAAATPPAQPgllsRFFGALKALFSGGEETKPQEQPAPKAEAKPERQQDrrkPRQNNRRDR 617
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3542 SplpskeaSRPTSVAESVKDEAEKSKEESRRDSVAeksplASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASK 3621
Cdd:PRK10811  618 N-------ERRDTRDNRTRREGRENREENRRNRRQ-----AQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRND 685
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3622 EaSRPASVAESIKDEAEKS-----KEESRRESVAEKSPLASKEASRPTSVAESVKDEAEKSKEESSRDSVAEKSPLASKE 3696
Cdd:PRK10811  686 E-KRQAQQEAKALNVEEQSvqeteQEERVQQVQPRRKQRQLNQKVRIEQSVAEEAVAPVVEETVAAEPVVQEVPAPRTEL 764
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3697 ASRPA-SVAESVQDEAEKSKEE----------SRRE---------------------------SVAEKSP-LAS------ 3731
Cdd:PRK10811  765 VKVPLpVVAQTAPEQDEENNAEnrdnngmprrSRRSprhlrvsgqrrrryrderyptqspmplTVACASPeMASgkvwir 844
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3732 --------------KEASRPASVAESVKDDAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESV 3797
Cdd:PRK10811  845 ypvvrpqdvqveeqREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPV 924
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3798 AEKSPLPSKEAsrpTSVAESVKDEAEKSKEES-RRESVAEKSSLASKKASRPASVAESVKDEAEKSKEESRRESVAEKSP 3876
Cdd:PRK10811  925 TEQPQVITESD---VAVAQEVAEHAEPVVEPQdETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEV 1001
                         490
                  ....*....|.
gi 161077523 3877 LASKEASRPAS 3887
Cdd:PRK10811 1002 APAQVPEATVE 1012
PHA00430 PHA00430
tail fiber protein
3582-3718 7.92e-04

Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 46.04  E-value: 7.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3582 ASKEASRPASVAESVQDEAEKSKEESRRESVAEKSplASKEASRPASVAESIKDEAEKSKEESRResvaeksplASKEAS 3661
Cdd:PHA00430  164 ARNEANRSRNEADRARNQAERFNNESGASATNTKQ--WRSEADGSNSEANRFKGYADSMTSSVEA---------AKGQAE 232
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 3662 RPTSVAESVKDEAEKSKEESSRDSVAEKSplASKEASRPASVAESVQDEAEKSKEES 3718
Cdd:PHA00430  233 SSSKEANTAGDYATKAAASASAAHASEVN--AANSATAAATSANRAKQQADRAKTEA 287
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
1513-1730 8.03e-04

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 45.74  E-value: 8.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1513 LASKETSRPESATGSvkEDTEQTKSKKSPVPSRPESEAKDKK--SPFASGEASRPESVAESVKDEAGKAESrresiakth 1590
Cdd:PRK13108  274 LAPKGREAPGALRGS--EYVVDEALEREPAELAAAAVASAASavGPVGPGEPNQPDDVAEAVKAEVAEVTD--------- 342
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1591 kdessldkakeqesrrESLAESIKPESGIDEKSALASKEASRPESVTDKSKEPSRRESIAESLKAESTKDEKSAPPSKEA 1670
Cdd:PRK13108  343 ----------------EVAAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALAS 406
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1671 SRPgsvvesvkDETEKSkEPSRRESIAESAKPpiefrevsrPESVIDGIKDESAKPESRR 1730
Cdd:PRK13108  407 EAH--------DETEPE-VPEKAAPIPDPAKP---------DELAVAGPGDDPAEPDGIR 448
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2069-2460 8.90e-04

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.93  E-value: 8.90e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2069 ASRPASVAESIKDEAEKSKEESRRESVAEKSPL---PSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASR 2145
Cdd:PHA03307   44 VSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGteaPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2146 PAS-------VAESIKDEAEKSKEESRRESVAEKSPLPSKEA---SRPASVAESIKDEAEKskeesrresvaeksPLPSK 2215
Cdd:PHA03307  124 ASPppspapdLSEMLRPVGSPGPPPAASPPAAGASPAAVASDaasSRQAALPLSSPEETAR--------------APSSP 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2216 EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKS-KEESRRESVAEKSPLPSKEASRP 2294
Cdd:PHA03307  190 PAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSgCGWGPENECPLPRPAPITLPTRI 269
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2295 ASVAESIKDEAEKSKE-------ETRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESAAEKSPLPSKEASR 2367
Cdd:PHA03307  270 WEASGWNGPSSRPGPAsssssprERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSR 349
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2368 PASVAESVKDEADKSKEESRRESMAESGKAQSiKGDqsplkevSRPESVAESVKDDPVKSKEPSRREsvaGSVTADSARD 2447
Cdd:PHA03307  350 SPSPSRPPPPADPSSPRKRPRPSRAPSSPAAS-AGR-------PTRRRARAAVAGRARRRDATGRFP---AGRPRPSPLD 418
                         410
                  ....*....|...
gi 161077523 2448 DQSPLESKGASRP 2460
Cdd:PHA03307  419 AGAASGAFYARYP 431
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
2290-2477 9.28e-04

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 45.35  E-value: 9.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2290 EASRPASVAESIKDEAEKSKEETRRESVAEKSPLPSK--EASRPASVAESIKDEAeksKEESRRESAAEKSPLPSKEASR 2367
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGpgEPNQPDDVAEAVKAEV---AEVTDEVAAESVVQVADRDGES 359
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2368 PASVAESvkDEADKSKEESrresmaESGKAQSIKGDQSPLKEVSR-PESVAESVKDDPVKSkEPSRRESVAGSvtADSAR 2446
Cdd:PRK13108  360 TPAVEET--SEADIEREQP------GDLAGQAPAAHQVDAEAASAaPEEPAALASEAHDET-EPEVPEKAAPI--PDPAK 428
                         170       180       190
                  ....*....|....*....|....*....|.
gi 161077523 2447 DDQSPLESKGASRPESVVDSVKDEAEKQESR 2477
Cdd:PRK13108  429 PDELAVAGPGDDPAEPDGIRRQDDFSSRRRR 459
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
3405-3819 9.44e-04

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.84  E-value: 9.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3405 LVSKEVSRPASVAESVKDeaeksKEESPLMSKEASRPASVAGSVKDEAEKSKEESR--RESVAEKSPLPSKEASR----- 3477
Cdd:PTZ00449  488 LIKKSKKKLAPIEEEDSD-----KHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKesDEPKEGGKPGETKEGEVgkkpg 562
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3478 PASVAESVKDEADKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAEKSKE-----ESRRESVAEKSPLPSKEASRP 3552
Cdd:PTZ00449  563 PAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPElldipKSPKRPESPKSPKRPPPPQRP 642
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3553 TS--------------VAESVKDEAEKSKEESRRDSVAEKSPlASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPL 3618
Cdd:PTZ00449  643 SSperpegpkiikspkPPKSPKPPFDPKFKEKFYDDYLDAAA-KSKETKTTVVLDESFESILKETLPETPGTPFTTPRPL 721
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3619 ASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVkdeAEKSKEEssrDSVAE-KSPlaSKEA 3697
Cdd:PTZ00449  722 PPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDIL---AEEFKEE---DIHAEtGEP--DEAM 793
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3698 SRPASVAE----SVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKsPLASKEASR 3773
Cdd:PTZ00449  794 KRPDSPSEhedkPPGDHPSLPKKRHRLDGLALSTTDLESDAGRIAKDASGKIVKLKRSKSFDDLTTVEEA-EEMGAEARK 872
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*.
gi 161077523 3774 PASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVK 3819
Cdd:PTZ00449  873 IVVDDDGTEADDEDTHPPEEKHKSEVRRRRPPKKPSKPKKPSKPKK 918
PHA00430 PHA00430
tail fiber protein
3475-3639 9.96e-04

Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 45.65  E-value: 9.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3475 ASRPASVAESVKD-EADKSKEESRRESGAEKsplASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRpt 3553
Cdd:PHA00430  133 GRRIVNLADAVDDgDAVPLGQIKTWNQSAWN---ARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSN-- 207
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3554 svaesvkDEAEKSKEESrrDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRR----ESVAEKSPLASK-EASRPAS 3628
Cdd:PHA00430  208 -------SEANRFKGYA--DSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAahasEVNAANSATAAAtSANRAKQ 278
                         170
                  ....*....|.
gi 161077523 3629 VAESIKDEAEK 3639
Cdd:PHA00430  279 QADRAKTEADK 289
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
1953-2090 1.02e-03

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 45.35  E-value: 1.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1953 PESAVGSMKDESMSKEPSRRESVKDGAAQSRETSRPASVAESAK--DGADDLKELSRPESTTQSKEAGSIKDEK---SPL 2027
Cdd:PRK13108  282 PGALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGPGEPNQpdDVAEAVKAEVAEVTDEVAAESVVQVADRdgeSTP 361
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 161077523 2028 ASEEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEES 2090
Cdd:PRK13108  362 AVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIP 424
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C;
3449-3726 1.07e-03

Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 45.36  E-value: 1.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3449 KDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESV-KDEADKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAE 3527
Cdd:PRK07735   12 KEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALpKNDDMTIEEAKRRAAAAAKAKAAALAKQKREGTEEVTEEEKA 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3528 KSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDSVAEKSPlaSKEASRPASVAESVQDEAEKSKEES 3607
Cdd:PRK07735   92 KAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAK--QKREGTEEVTEEEEETDKEKAKAKA 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3608 RRESVAEKSPLASKEASRP-ASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAEsvkDEAEKSKeessrdSV 3686
Cdd:PRK07735  170 AAAAKAKAAALAKQKAAEAgEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSG---DEDAKAK------AI 240
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 161077523 3687 AEKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEK 3726
Cdd:PRK07735  241 AAAKAKAAAAARAKTKGAEGKKEEEPKQEEPSVNQPYLNK 280
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3474-3649 1.11e-03

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 45.35  E-value: 1.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3474 EASRPASVAESVKDEADKSKEESRRESGAEK--SPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASR 3551
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASavGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPA 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3552 PTSvaesvKDEAEKSKEESrRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRREsVAEKSPLASKEASRPASVAE 3631
Cdd:PRK13108  363 VEE-----TSEADIEREQP-GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE-VPEKAAPIPDPAKPDELAVA 435
                         170
                  ....*....|....*...
gi 161077523 3632 SIKDEAEKSKEESRRESV 3649
Cdd:PRK13108  436 GPGDDPAEPDGIRRQDDF 453
PHA00430 PHA00430
tail fiber protein
3844-4014 1.36e-03

Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 45.27  E-value: 1.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3844 KASRPASVAESVKD-EAEKSKEESRRESVAEKsplASKEASRPASVAESVKDEAEKSKEESrresvaeksplpskeaSRP 3922
Cdd:PHA00430  132 RGRRIVNLADAVDDgDAVPLGQIKTWNQSAWN---ARNEANRSRNEADRARNQAERFNNES----------------GAS 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3923 TSVAESVKDEADKSKEESRRESGAEKSPLASMEASRptSVAESVKDETEKSKEESRRESVTEKSPLPSK-----EASRPT 3997
Cdd:PHA00430  193 ATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAK--GQAESSSKEANTAGDYATKAAASASAAHASEvnaanSATAAA 270
                         170
                  ....*....|....*..
gi 161077523 3998 SVAESVKDEAEKSKEES 4014
Cdd:PHA00430  271 TSANRAKQQADRAKTEA 287
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C;
3600-3874 1.40e-03

Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 44.97  E-value: 1.40e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3600 AEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAE-KSPLASKEASRPTSVAESVKDEAEKSK 3678
Cdd:PRK07735   15 ARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAaKAKAAALAKQKREGTEEVTEEEKAKAK 94
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3679 EESSRDSVAEKSPLASKEASRPASVAEsvqDEAEKSKEESRRESVAEKSPLA-SKEASRPASVAESVKDDAEKSKEESRR 3757
Cdd:PRK07735   95 AKAAAAAKAKAAALAKQKREGTEEVTE---EEKAAAKAKAAAAAKAKAAALAkQKREGTEEVTEEEEETDKEKAKAKAAA 171
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3758 ESVAEKSPLASKEASRP-ASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAEsvkDEAEKSKEESRRESVAE 3836
Cdd:PRK07735  172 AAKAKAAALAKQKAAEAgEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSG---DEDAKAKAIAAAKAKAA 248
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 161077523 3837 KSSLASKKAsrpasvAESVKDEAEKSKEESRRESVAEK 3874
Cdd:PRK07735  249 AAARAKTKG------AEGKKEEEPKQEEPSVNQPYLNK 280
PRK13735 PRK13735
conjugal transfer mating pair stabilization protein TraG; Provisional
3091-3567 1.53e-03

Pssm-ID: 184287 [Multi-domain]  Cd Length: 942  Bit Score: 45.12  E-value: 1.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3091 SEQQSRRESvaeSVKADTKKDGKSQEASRPSSvdellkdddekQESRRQSITGSHKAMsTMGDESPMDKADkSKEPSRPE 3170
Cdd:PRK13735  528 AQQEMAREA---SNQAESALHGFSSSIASAWN-----------QLSQFGSNRGSSDSV-TGGADSTMSAQD-SMMASRMR 591
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3171 SVAESI-KHENTKDEES--PLGSRRDSVAESIKSD--ITKGEKSPLPSKEVSRPESVVGSIKDEKAESRRESVAESVKPE 3245
Cdd:PRK13735  592 SAVESYaKAHNISNEQAtqELASRSTRGSAGGYGDahAEWGVGPKILGKGGKLGLGVKGGGRAGIDWSDSDGHSASSSSR 671
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3246 SSKDAtsappskEHSRPESVLGSLKDEGDKTTSRRVSVADSIKDEKSllvsqeASRPESEAESLKDAAapSQETSRPESV 3325
Cdd:PRK13735  672 SSHDA-------RHDIDAQATKDFREASDYFTSRKVSESGSHTDNNA------DSRVDQLSAALNSAK--QSYDQYTTNM 736
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3326 TESvkdgkspvasKEASRPASVAENAKDSADESKEQRPESLPQSKAGSikDEKSPLASKDEAEKSkeeSRRESVAEQFpl 3405
Cdd:PRK13735  737 TRS----------HEYAEMASRTESMSGQMSENLSQQFAQYVMKHAPQ--DAEAILTNTSSPEIA---ERRRAMAWSF-- 799
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3406 vskevsrpasVAESVKDEAEKSKEESplMSKEASRPASVAGSVKDEAEKSKEESRRESVaeksplpsKEASRPASVAESV 3485
Cdd:PRK13735  800 ----------VQEQVQPGVDNAWRES--RGDIGKGMESVPSGGGSQDIIADHQGHQAII--------EQRTQDSGIRNDV 859
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3486 KDEADKSKEESRRESGAEKSPLASKEASRPASVAE--SIKDEAEKSKEESRRESVAEKSPLPSKEAsrPTSVAESVKDEA 3563
Cdd:PRK13735  860 KHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSElqNHHKTEALSQNNKYNEEKSAQERMPGADS--PEELMKRAKEYQ 937

                  ....
gi 161077523 3564 EKSK 3567
Cdd:PRK13735  938 DKHK 941
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
1669-1696 1.53e-03

Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 39.13  E-value: 1.53e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  1669 EASRPGSVVESVKDETEK----SKEPSRRESI 1696
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3438-3809 1.60e-03

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3438 ASRPASVAGSVKDEAEKSKEESRRESVAEKSPL---PSKEASRPASVAESVKDEADKSKEESRRESGAEKSPlaskEASR 3514
Cdd:PHA03307   44 VSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGteaPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPD----PPPP 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3515 PASVAESIKDEAEKSKEESRRESvaeksplPSKEASRPTSVAESVKDEAEKSKEESRRDSVaekSPLASKEASRPASVAE 3594
Cdd:PHA03307  120 TPPPASPPPSPAPDLSEMLRPVG-------SPGPPPAASPPAAGASPAAVASDAASSRQAA---LPLSSPEETARAPSSP 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3595 SVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRP--------TSV 3666
Cdd:PHA03307  190 PAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPrpapitlpTRI 269
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3667 AESVKDEAEKSKE--ESSRDSVAEKSPLAS--------KEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASR 3736
Cdd:PHA03307  270 WEASGWNGPSSRPgpASSSSSPRERSPSPSpsspgsgpAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSR 349
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 161077523 3737 PASVAESVKDDAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEAEKSKEESRRE-SVAEKSPLPSKEAS 3809
Cdd:PHA03307  350 SPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPaGRPRPSPLDAGAAS 423
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
673-917 1.61e-03

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.91  E-value: 1.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523   673 PEPADTGDEAAPTEQEPEAetepepehepeaeqdKDVGEEKKVEVLiMKPQQA--TPAVIAASGKDGVDAASADAT-PT- 748
Cdd:pfam05109  476 PTPAGTTSGASPVTPSPSP---------------RDNGTESKAPDM-TSPTSAvtTPTPNATSPTPAVTTPTPNATsPTl 539
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523   749 GKLSKASAKGKADKPRAEVKPVVrsriDTKPPKSMDRKLAKRDEKKSSPTTTPAARAPVAQNAKPKVLSRPATKSSPSST 828
Cdd:pfam05109  540 GKTSPTSAVTTPTPNATSPTPAV----TTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSST 615
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523   829 PAKSAKEANNRKVLESKQQAARVQATSTVSRRVT--------STASERRVQQQAEAKTAATGATQATQRKPISRRPRGVS 900
Cdd:pfam05109  616 PVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSsisetlspSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVS 695
                          250
                   ....*....|....*..
gi 161077523   901 PSKRAPAPGSPVKQAKP 917
Cdd:pfam05109  696 TSSPAPRPGTTSQASGP 712
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3152-3550 1.64e-03

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3152 GDESPMDKADKSKEPSRPESVAESikhentkdEESPLGSRRDSVAESIKSDITKGEKSPLPSKEVSRPESvvGSIKDEKA 3231
Cdd:PHA03307   50 LAAVTVVAGAAACDRFEPPTGPPP--------GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGP--SSPDPPPP 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3232 ESRRESVAESVKPESSKDATSAPPSKEHSRPESVLGSLKD---EGDKTTSRRVSVADSIKDEKSLLVS--QEASRPESEA 3306
Cdd:PHA03307  120 TPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPaavASDAASSRQAALPLSSPEETARAPSspPAEPPPSTPP 199
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3307 ESLKDAAAPSQETSRPESVTESVKDGKSPVASKEASRPASVAENAKDSADESKEQRPESLPQSKAgsikdeksPLASKDE 3386
Cdd:PHA03307  200 AAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPIT--------LPTRIWE 271
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3387 AEKSKEESRRESvaeqfplvskevsrPASVAESVKDEAEKSKEESPLMSKEASRPASVAGSVKD-EAEKSKEESRRESVA 3465
Cdd:PHA03307  272 ASGWNGPSSRPG--------------PASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSrESSSSSTSSSSESSR 337
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3466 EKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEASRPASVAESIKDEAEKSKEESRRE-SVAEKSPL 3544
Cdd:PHA03307  338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPaGRPRPSPL 417

                  ....*.
gi 161077523 3545 PSKEAS 3550
Cdd:PHA03307  418 DAGAAS 423
PHA00430 PHA00430
tail fiber protein
3730-3866 1.82e-03

Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 44.88  E-value: 1.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3730 ASKEASRPASVAESVKDDAEKSKEESRRESVAEKSplASKEASRPASVAESVKDEAEKSKEESRREsvaeksplpSKEAS 3809
Cdd:PHA00430  164 ARNEANRSRNEADRARNQAERFNNESGASATNTKQ--WRSEADGSNSEANRFKGYADSMTSSVEAA---------KGQAE 232
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 161077523 3810 RPTSVAESVKDEAEKSKEEsrrESVAEKSSLASK-KASRPASVAESVKDEAEKSKEES 3866
Cdd:PHA00430  233 SSSKEANTAGDYATKAAAS---ASAAHASEVNAAnSATAAATSANRAKQQADRAKTEA 287
valS PRK14900
valyl-tRNA synthetase; Provisional
2009-2197 1.89e-03

Pssm-ID: 237855 [Multi-domain]  Cd Length: 1052  Bit Score: 44.98  E-value: 1.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2009 ESTTQSKEAGSIkDEKSPLASEEASRPASVAESVKDEAEKSKeeSRRESVAE-KSPLPSKEASRPASVAESIKDEAEKSK 2087
Cdd:PRK14900  843 ETARVDKEIGKV-DQDLAVLERKLQNPSFVQNAPPAVVEKDR--ARAEELREkRGKLEAHRAMLSGSEANSARRDTMEIQ 919
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2088 EESRResvAEKSPLPSKEASRPASVAESIKDEAEKSKE-------------ESRRESVAEKSPLPSKEA-SRPASVAESI 2153
Cdd:PRK14900  920 NEQKP---TQDGPAAEAQPAQENTVVESAEKAVAAVSEaaqqaatavasgiEKVAEAVRKTVRRSVKKAaATRAAMKKKV 996
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 161077523 2154 KDEAEKSKEESR----RESVAEKSP---LPSKEASRPASVAESIKDEAEKS 2197
Cdd:PRK14900  997 AKKAPAKKAAAKkaaaKKAAAKKKVakkAPAKKVARKPAAKKAAKKPARKA 1047
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
3599-3986 1.93e-03

Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 44.65  E-value: 1.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3599 EAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKsplasKEASRPTSVAESVKDEAEKSK 3678
Cdd:PRK02224  269 ETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDR-----DEELRDRLEECRVAAQAHNEE 343
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3679 EESSRDSVAEKSPLAsKEASRPASVAESVQDEAEkSKEESRRESVAEKSplaskeaSRPASVAESVKD-DAEKSKEESRR 3757
Cdd:PRK02224  344 AESLREDADDLEERA-EELREEAAELESELEEAR-EAVEDRREEIEELE-------EEIEELRERFGDaPVDLGNAEDFL 414
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3758 ESVAE-KSPLASKEASRPASVaesvkDEAEKSKEESRRESVAEKSP---LPSKEASRPTSVAEsvkDEAEKSKEESRRES 3833
Cdd:PRK02224  415 EELREeRDELREREAELEATL-----RTARERVEEAEALLEAGKCPecgQPVEGSPHVETIEE---DRERVEELEAELED 486
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3834 VAEKSSLASKKASRPASVAESvkdeaekskeESRRESVAEKSPLASKEASRPASVAESVKDEAEKSKEESR--RESVAEK 3911
Cdd:PRK02224  487 LEEEVEEVEERLERAEDLVEA----------EDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAelEAEAEEK 556
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 3912 SPLPSKEASRPTSVAESVKD-EADKSKEESRRES-GAEKSPLASMEASRPTSVAESVKDETEKSKEESRRESVTEKS 3986
Cdd:PRK02224  557 REAAAEAEEEAEEAREEVAElNSKLAELKERIESlERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAEKR 633
valS PRK14900
valyl-tRNA synthetase; Provisional
3667-3905 1.94e-03


Pssm-ID: 237855 [Multi-domain]  Cd Length: 1052  Bit Score: 44.98  E-value: 1.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3667 AESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEEsrresvaeKSPLASKEASRPASVAESVKD 3746
Cdd:PRK14900  842 AETARVDKEIGKVDQDLAVLERKLQNPSFVQNAPPAVVEKDRARAEELREK--------RGKLEAHRAMLSGSEANSARR 913
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3747 DAEKSKEESRResvAEKSPLASKEASRPASVAESVKDEaekskeesrresVAEKSPLPSKEASRPTSVAESVKDEAEKSK 3826
Cdd:PRK14900  914 DTMEIQNEQKP---TQDGPAAEAQPAQENTVVESAEKA------------VAAVSEAAQQAATAVASGIEKVAEAVRKTV 978
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3827 EESRRESVAEKSSLASKKASRPASVAESVKDEAEKSKeesrresVAEKSPlASKEASRPASVAESVKDEAEKS--KEESR 3904
Cdd:PRK14900  979 RRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKA-------AAKKKV-AKKAPAKKVARKPAAKKAAKKParKAAGR 1050

                  .
gi 161077523 3905 R 3905
Cdd:PRK14900 1051 K 1051
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
2409-2436 2.20e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 38.74  E-value: 2.20e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  2409 EVSRPESVAESVKDDPVK----SKEPSRRESV 2436
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
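
Reading aid (not part of the CD-Search output): the two rows of the DUF1213 alignment above can be compared column by column to count identical positions. The sketch below is a minimal illustration that assumes dashes mark gaps and lowercase letters mark unaligned residues; the function name `aligned_identity` is hypothetical and the convention is an assumption about this display, not something stated on the page.

```python
def aligned_identity(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length alignment rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        # Assumption: '-' is a gap and lowercase residues are unaligned,
        # so such columns are left out of the comparison.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the alignment above (query residues 2409-2436 vs pfam06740 1-32).
query = "EVSRPESVAESVKDDPVK----SKEPSRRESV"
subject = "EASRPESVAESVKDEAEKpeskSKEPSRRESV"

identical, aligned = aligned_identity(query, subject)
print(f"{identical} of {aligned} aligned columns are identical")
```

The same function can be applied to any query/Cdd row pair in this listing, provided the two rows are copied at equal length.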
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3918-4078 2.24e-03


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 44.20  E-value: 2.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3918 EASRPTSVAESVKDEADKSKEESRRESGAEK--SPLASMEASRPTSVAESVKDETEKSKEESRRESVteksplpSKEASR 3995
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASavGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESV-------VQVADR 355
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3996 PTSVAESVKDEAEKSKEESRRESVAEKSPLASKESSRPASVA--ESIKDEAEGTKQESRRESMPESGKAESIKGDQSSLA 4073
Cdd:PRK13108  356 DGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAApeEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVA 435

                  ....*
gi 161077523 4074 SKETS 4078
Cdd:PRK13108  436 GPGDD 440
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
3083-3365 2.48e-03


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 44.65  E-value: 2.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3083 SVTAE--DEKSEQQSRRESVAESVKADTKKDGKSQEASRpssVDELLKDDDEKQESRRQsiTGSHKAMSTMGDESPMDKA 3160
Cdd:PTZ00108 1098 SLTKEkvEKLNAELEKKEKELEKLKNTTPKDMWLEDLDK---FEEALEEQEEVEEKEIA--KEQRLKSKTKGKASKLRKP 1172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3161 DKSKEPSRPESVAESIKHENTKDEESPLGSRRDSVAESIKSDITKGEKSPLPSKEVSRPESVVGSIKDEKAESRRESVAE 3240
Cdd:PTZ00108 1173 KLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSK 1252
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3241 SVKPESSKDATSAPPSKEHSRPESVLGSLKDEGDKTTSRRVSVADSIKDEKSLLVSQEASRPESEAESLKDAAAPSQETS 3320
Cdd:PTZ00108 1253 SSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTA 1332
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*.
gi 161077523 3321 RPESVTESVKDGK-SPVASKEASRPASVAENAKDSADESKEQRPES 3365
Cdd:PTZ00108 1333 RKKKSKTRVKQASaSQSSRLLRRPRKKKSDSSSEDDDDSEVDDSED 1378
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
3881-3908 2.50e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 38.36  E-value: 2.50e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  3881 EASRPASVAESVKDEAEK----SKEESRRESV 3908
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
3770-3797 2.50e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 38.36  E-value: 2.50e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  3770 EASRPASVAESVKDEAEK----SKEESRRESV 3797
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
2031-2058 2.50e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 38.36  E-value: 2.50e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  2031 EASRPASVAESVKDEAEK----SKEESRRESV 2058
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
PLN03237 PLN03237
DNA topoisomerase 2; Provisional
3432-3829 2.61e-03


Pssm-ID: 215641 [Multi-domain]  Cd Length: 1465  Bit Score: 44.47  E-value: 2.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3432 PLMSKEASRPASVAGSVKDEAEKSKEesrrESVAEKSPLPSKEASRPASVA------ESVKDE-ADKSKEESRRESGAEK 3504
Cdd:PLN03237 1073 PFPKKAKSVEAAVAGATDDAAEEEEE----IDVSSSSGVRGSDYDYLLSMAigtltlEKVQELcADRDKLNIEVEDLKKT 1148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3505 SPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDSVAEKSPLASk 3584
Cdd:PLN03237 1149 TPKSLWLKDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSS- 1227
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3585 eASRPASVAESVQDEAekskeesrRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPT 3664
Cdd:PLN03237 1228 -AMETENVAEVVKPKG--------RAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAA 1298
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3665 SVAESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKeasrpasVAESV 3744
Cdd:PLN03237 1299 ARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKL-------LTEML 1371
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3745 KDDAEKSKEESRRESVAEKSPLASKEASRPASVAesvKDEAEKSKEESRRESVAEKSPLPSKEASRPTS-----VAESVK 3819
Cdd:PLN03237 1372 KPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAA---TNKETESSENVSGSSSSEKDEIDVSAKPRPQRanrkqTTYVLS 1448
                         410
                  ....*....|
gi 161077523 3820 DEAEKSKEES 3829
Cdd:PLN03237 1449 DSESESADDS 1458
rad2 TIGR00600
DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
1489-1954 2.64e-03


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 44.50  E-value: 2.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1489 VQAKQEaQKPVPAPEEAIKTEKSPLASKETSRPESATGSVKEDTEQTKSKKSPVPSRPE------SEAKDKKSPFASGEA 1562
Cdd:TIGR00600  297 IQGKTA-VKAVDSDDESLPSLSSQLDSNSEDLKSSPWEKLKPESESIVEAEPPSPRTLLakqaamSESSSEDSDESEWER 375
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1563 SRPESVAESVKDEAGKAESRRESIAKTHKDESSLD-KAKEQESRRESLAESIKPESGIDEK----SALASKEASRPESVT 1637
Cdd:TIGR00600  376 QELKRNNVAFVDDGSLSPRTLQAIGQALDDDEDKKvSASSDDQASPSKKTKMLLISRIEVEdddlDYLDQGEGIPLMAAL 455
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1638 DKSKEPSRRESIAeslkaeSTKDEKSAPPSKEASRPGSVVESVKDETEKSKEPSrrESIAESAKPPIEFREVSRPESvid 1717
Cdd:TIGR00600  456 QLSSVNSKPEAVA------STKIAREVTSSGHEAVPKAVQSLLLGATNDSPIPS--EFTILDRKSELSIERTVKPVS--- 524
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1718 gikDESAKPESRRDSPLASKEASRPESVLESVKDEPIKSTEKSRRESVAESFKADStkDEKSPLTSKDISRPESAVENVM 1797
Cdd:TIGR00600  525 ---SEFGLPSQREDKLAIPTEGTQNLQGISDHPEQFEFQNELSPLETKNNESNLSS--DAETEGSPNPEMPSWSSVTVPS 599
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1798 DAVGSAERSQPESVTASRDvsrpesVAESEKDDTDKPESVVESVIPASDVV---EIEKgAADKEKGVFVSLEIGKPDSPS 1874
Cdd:TIGR00600  600 EALDNYETTNPSNAKEVRN------FAETGIQTTNVGESADLLLISNPMEVepmESEK-EESESDGSFIEVDSVSSTLEL 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  1875 EVisrpgPVVESVKPESrRESSTEIVlpchaeDSKEPSRPESKVECLKDESEVLKGSTRRESVAESDKSSQPFKETSRPE 1954
Cdd:TIGR00600  673 QV-----PSKSQPTDES-EENAENKV------ASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKDADDFKNEWQDISLEE 740
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
2171-2490 3.09e-03


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 44.13  E-value: 3.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2171 EKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEkSPLP 2250
Cdd:NF033609  553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD-SDSA 631
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2251 SKEASRPASVAESIKDEAEKSKEESRRESVAEkSPLPSKEASRPASVAESIKDEAEKSKEETRRESVAEK-SPLPSKEAS 2329
Cdd:NF033609  632 SDSDSASDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDS 710
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2330 RPASVAESIKDEAEKSKEESRRESAAEkSPLPSKEASRPASVAESVKDEADKSKEESRRESMAESGK-AQSIKGDQSPLK 2408
Cdd:NF033609  711 DSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSD 789
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2409 EVSRPESVAESVKDDPVKSKEPSRRESVAGSVT-ADSARDDQSPLESKGASRPESVVDSVKDEAEKQESRRESKTES--- 2484
Cdd:NF033609  790 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESgsn 869

                  ....*...
gi 161077523 2485 --VIPPKA 2490
Cdd:NF033609  870 nnVVPPNS 877
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
2216-2243 3.17e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.97  E-value: 3.17e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  2216 EASRPASVAESIKDEAEK----SKEESRRESV 2243
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
2253-2280 3.17e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.97  E-value: 3.17e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  2253 EASRPASVAESIKDEAEK----SKEESRRESV 2280
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
2179-2206 3.17e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.97  E-value: 3.17e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  2179 EASRPASVAESIKDEAEK----SKEESRRESV 2206
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
3511-3538 3.17e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.97  E-value: 3.17e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  3511 EASRPASVAESIKDEAEK----SKEESRRESV 3538
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
2142-2169 3.17e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.97  E-value: 3.17e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  2142 EASRPASVAESIKDEAEK----SKEESRRESV 2169
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
2105-2132 3.17e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.97  E-value: 3.17e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  2105 EASRPASVAESIKDEAEK----SKEESRRESV 2132
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
2068-2095 3.17e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.97  E-value: 3.17e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  2068 EASRPASVAESIKDEAEK----SKEESRRESV 2095
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
3622-3649 3.17e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.97  E-value: 3.17e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  3622 EASRPASVAESIKDEAEK----SKEESRRESV 3649
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
3263-3800 3.20e-03


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 44.26  E-value: 3.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3263 ESVLGSLKDEGDKTTSRRVSVADSIKDEKSLLVSQEASRpeSEAESLKDAAAPSQET-----SRPESVTESVKDGKSPVA 3337
Cdd:PRK02224  212 ESELAELDEEIERYEEQREQARETRDEADEVLEEHEERR--EELETLEAEIEDLRETiaeteREREELAEEVRDLRERLE 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3338 SKEASRPASVAENAKDSAD-ESKEQRPESLpqskagsikdeksplaskdeaekskeESRRESVAEQFPLVSKEVSRPASV 3416
Cdd:PRK02224  290 ELEEERDDLLAEAGLDDADaEAVEARREEL--------------------------EDRDEELRDRLEECRVAAQAHNEE 343
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3417 AESVKDEAEKSKEESPLMSKEASRPASVAGSVKDEAEKskeesRRESVAEKSplpskeaSRPASVAESVKD-EADKSKEE 3495
Cdd:PRK02224  344 AESLREDADDLEERAEELREEAAELESELEEAREAVED-----RREEIEELE-------EEIEELRERFGDaPVDLGNAE 411
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3496 SRRESGAE-KSPLASKEASRPASVaesikDEAEKSKEESRRESVAEKSP---LPSKEASRPTSVAEsvkDEAEKSKEESR 3571
Cdd:PRK02224  412 DFLEELREeRDELREREAELEATL-----RTARERVEEAEALLEAGKCPecgQPVEGSPHVETIEE---DRERVEELEAE 483
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3572 RDSVAEKSPLASKEASRPASVAESvqdEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRresvaE 3651
Cdd:PRK02224  484 LEDLEEEVEEVEERLERAEDLVEA---EDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAE-----E 555
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3652 KSPLASKEASRPTSVAESVKD-EAEKSKEESSRDSVAEKSPLASKEA---SRPASVAESVQDEAEksKEESRRESVAEKS 3727
Cdd:PRK02224  556 KREAAAEAEEEAEEAREEVAElNSKLAELKERIESLERIRTLLAAIAdaeDEIERLREKREALAE--LNDERRERLAEKR 633
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3728 plaskeaSRPASVAESVKDD------AEKSKEESRRESVAEKspLASKEASRPASVAE--SVKDEAEKSKE-ESRRESVA 3798
Cdd:PRK02224  634 -------ERKRELEAEFDEArieearEDKERAEEYLEQVEEK--LDELREERDDLQAEigAVENELEELEElRERREALE 704

                  ..
gi 161077523 3799 EK 3800
Cdd:PRK02224  705 NR 706
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
2780-2945 3.41e-03


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 43.81  E-value: 3.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2780 RSPVASTEISRPASAGETASSPIEEAPKDFAEFEQAEKAVLPLTIELKGNLptlSSPVDVAHGDFPQTSTPTSSPTVASV 2859
Cdd:PRK13108  298 REPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAES---VVQVADRDGESTPAVEETSEADIERE 374
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2860 QPAELSKvdiEKTASSPIDEAPKSLI-GCPAEERPESPAESAKDAAESVEKSKDASRP-PSVVESTKADSTKGDISPSPE 2937
Cdd:PRK13108  375 QPGDLAG---QAPAAHQVDAEAASAApEEPAALASEAHDETEPEVPEKAAPIPDPAKPdELAVAGPGDDPAEPDGIRRQD 451

                  ....*...
gi 161077523 2938 SVLEGPKD 2945
Cdd:PRK13108  452 DFSSRRRR 459
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.
3733-3760 3.46e-03


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.97  E-value: 3.46e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  3733 EASRPASVAESVKDDAEK----SKEESRRESV 3760
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
PHA00430 PHA00430
tail fiber protein
3693-3824 3.60e-03


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 43.73  E-value: 3.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3693 ASKEASRPASVAESVQDEAEKSKEESRRESVAEKSplASKEASRPASVAESVKDDAEKSKEESRResvaeksplASKEAS 3772
Cdd:PHA00430  164 ARNEANRSRNEADRARNQAERFNNESGASATNTKQ--WRSEADGSNSEANRFKGYADSMTSSVEA---------AKGQAE 232
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 3773 RPASVAESVKDEAEKSKEESRR----ESVAEKSPLPSK-EASRPTSVAESVKDEAEK 3824
Cdd:PHA00430  233 SSSKEANTAGDYATKAAASASAahasEVNAANSATAAAtSANRAKQQADRAKTEADK 289
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
2222-2429 3.94e-03


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 44.01  E-value: 3.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2222 SVAESIKDEAEKSKEESRRESVAEKSplpskEASRPASVAESIKDEAEKSKEESRRESVAEksplpSKEASRPASVAESI 2301
Cdd:PTZ00341  934 NVPEHLKEHAEANIEEDAEENVEEDA-----EENVEENVEENVEENVEENVEENVEENVEE-----NVEENVEENVEENI 1003
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2302 KDEAEKSKEETRRESVAEKSPLPSKEASRPASV-----AESIKDEAEKSKEESRRESAAEKSPLPSKEASRpaSVAESVK 2376
Cdd:PTZ00341 1004 EENVEENVEENIEENVEEYDEENVEEVEENVEEydeenVEEIEENAEENVEENIEENIEEYDEENVEEIEE--NIEENIE 1081
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 161077523 2377 DEADKSKEESRREsmAESGKAQSIKGDQSPLKEVSRPESVAESVKDDPVKSKE 2429
Cdd:PTZ00341 1082 ENVEENVEENVEE--IEENVEENVEENAEENAEENAEENAEEYDDENPEEHNE 1132
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
2040-2392 3.95e-03


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 43.88  E-value: 3.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2040 ESVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKsplpsKEASRPASVAESIKDE 2119
Cdd:PRK02224  264 RETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDR-----DEELRDRLEECRVAAQ 338
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2120 AEKSKEESRRESVAEKsplpSKEASRPASVAESIKDEAEKSKE--ESRRESVAEkspLPSKEASRPASVAESikdEAEKS 2197
Cdd:PRK02224  339 AHNEEAESLREDADDL----EERAEELREEAAELESELEEAREavEDRREEIEE---LEEEIEELRERFGDA---PVDLG 408
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2198 KEESRRESVAE-KSPLPSKEASRPASVaesikDEAEKSKEESRRESVAEKSP---LPSKEASRPASVAEsikDEAEKSKE 2273
Cdd:PRK02224  409 NAEDFLEELREeRDELREREAELEATL-----RTARERVEEAEALLEAGKCPecgQPVEGSPHVETIEE---DRERVEEL 480
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2274 ESRRESVAEKSPLPSKEASRPASVAESikdEAEKSKEETRRESVAEKSPLPSKEASRPASVAESIKDEAEK--SKEESRR 2351
Cdd:PRK02224  481 EAELEDLEEEVEEVEERLERAEDLVEA---EDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAEleAEAEEKR 557
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 161077523 2352 ESAAEKsplpskeASRPASVAESVKD-EADKSKEESRRESMA 2392
Cdd:PRK02224  558 EAAAEA-------EEEAEEAREEVAElNSKLAELKERIESLE 592
DUF612 pfam04747
Protein of unknown function, DUF612; This family includes several uncharacterized proteins from Caenorhabditis elegans.
3662-4056 4.27e-03


Pssm-ID: 282585 [Multi-domain]  Cd Length: 511  Bit Score: 43.51  E-value: 4.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3662 RPTSVAESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASVAESvQDEAEKSKEESRRESVAEKSPLASKEASRPASVA 3741
Cdd:pfam04747   63 QPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEA-EAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLK 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3742 esvKDDAEKSKEESRRESVAEKSPlASKEASRPASVAESVkdeAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDE 3821
Cdd:pfam04747  142 ---KLQAEKKKEKAVKAEKAEKAE-KTKKASTPAPVEEEI---VVKKVANDRSAAPAPEPKTPTNTPAEPAEQVQEITGK 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3822 AEKSKEESRRESVAEKSSLASKKASRPASVAESVKDEA---EKSKEESRRESVAEKSPLASKEASRPASVAESVKDEAEK 3898
Cdd:pfam04747  215 KNKKNKKKSESEATAAPASVEQVVEQPKVVTEEPHQQAapqEKKNKKNKRKSESENVPAASETPVEPVVETTPPASENQK 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3899 SKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAD-----KSKEESRRESGaeKSPLASMEASRPTSVAESVKDETEKS 3973
Cdd:pfam04747  295 KNKKDKKKSESEKVVEEPVQAEAPKSKKPTADDNMDfldfvTAKEEPKDEPA--ETPAAPVEEVVENVVENVVEKSTTPP 372
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3974 KEESRRESVTEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRESVAEKSPLASKESSRPASVAESIKDEAEGTKQESRR 4053
Cdd:pfam04747  373 ATENKKKNKKDKKKSESEKVTEQPVESAPAPPQVEQVVETTPPASENKKKNKKDKKKSESEKAVEEPVQAAPSSKKPTAD 452

                   ...
gi 161077523  4054 ESM 4056
Cdd:pfam04747  453 DNM 455
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
2105-2238 4.30e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 43.43  E-value: 4.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2105 EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSK--EASRPASVAESIKDEAEKSKEESRRESVAE------KSPLP 2176
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGpgEPNQPDDVAEAVKAEVAEVTDEVAAESVVQvadrdgESTPA 362
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 161077523 2177 SKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEES 2238
Cdd:PRK13108  363 VEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIP 424
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
2142-2275 4.30e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 43.43  E-value: 4.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2142 EASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSK--EASRPASVAESIKDEAEKSKEESRRESVAE------KSPLP 2213
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGpgEPNQPDDVAEAVKAEVAEVTDEVAAESVVQvadrdgESTPA 362
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 161077523 2214 SKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEES 2275
Cdd:PRK13108  363 VEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIP 424
Borrelia_P83 pfam05262
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
2267-2430 4.33e-03

Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.


Pssm-ID: 114011 [Multi-domain]  Cd Length: 489  Bit Score: 43.45  E-value: 4.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  2267 EAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEETRRESVAEK-SPLPSKEASRPAS--VAESIKDEAE 2343
Cdd:pfam05262  205 ERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKnLPKPADTSSPKEDkqVAENQKREIE 284
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  2344 KSKEESRResaAEKSPLPSKEASrpasvAESVKDEADKSKEESRRESMAESGKAQSIKGDQSPLKEVSRPESVAESvkDD 2423
Cdd:pfam05262  285 KAQIEIKK---NDEEALKAKDHK-----AFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLN--ED 354

                   ....*..
gi 161077523  2424 PVKSKEP 2430
Cdd:pfam05262  355 AIDSSNP 361
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
3688-3877 4.44e-03

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 43.26  E-value: 4.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3688 EKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKSPLA 3767
Cdd:PRK09510   70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKA 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3768 SKEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRESVAEKSSLASKKASR 3847
Cdd:PRK09510  150 EAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAK 229
                         170       180       190
                  ....*....|....*....|....*....|
gi 161077523 3848 PASVAESVKDEAEKSKEESRRESVAEKSPL 3877
Cdd:PRK09510  230 AAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
Borrelia_P83 pfam05262
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
3710-3876 4.64e-03

Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.


Pssm-ID: 114011 [Multi-domain]  Cd Length: 489  Bit Score: 43.45  E-value: 4.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3710 EAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDDAEKSKEESRRESVAEKS---PLASKEASRPASVAESVKDEAE 3786
Cdd:pfam05262  205 ERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNlpkPADTSSPKEDKQVAENQKREIE 284
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523  3787 KSKEESRResvAEKSPLPSKEASrptsvAESVKDEaekSKEESRResvAEKSSLASKKASRP-ASVAESVKDEAEKSKEE 3865
Cdd:pfam05262  285 KAQIEIKK---NDEEALKAKDHK-----AFDLKQE---SKASEKE---AEDKELEAQKKREPvAEDLQKTKPQVEAQPTS 350
                          170
                   ....*....|.
gi 161077523  3866 SRRESVAEKSP 3876
Cdd:pfam05262  351 LNEDAIDSSNP 361
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within ...
3955-3982 4.84e-03

Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.58  E-value: 4.84e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  3955 EASRPTSVAESVKDETEK----SKEESRRESV 3982
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within ...
3807-3834 4.89e-03

Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.58  E-value: 4.89e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  3807 EASRPTSVAESVKDEAEK----SKEESRRESV 3834
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within ...
3992-4019 4.89e-03

Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.58  E-value: 4.89e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  3992 EASRPTSVAESVKDEAEK----SKEESRRESV 4019
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
3515-3729 5.01e-03

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 42.87  E-value: 5.01e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3515 PASVAESIKDEAEKSKEESRRESVAEKsplpsKEASRPTSVAESVKDEAEKSKEEsrrdsvaEKSPLASKEASRPASVAE 3594
Cdd:PRK09510   57 PGAVVEQYNRQQQQQKSAKRAEEQRKK-----KEQQQAEELQQKQAAEQERLKQL-------EKERLAAQEQKKQAEEAA 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3595 SVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAESVKDEA 3674
Cdd:PRK09510  125 KQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAE 204
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 161077523 3675 EKSKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPL 3729
Cdd:PRK09510  205 AEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3845-4019 5.06e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 43.04  E-value: 5.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3845 ASRPASVAESVKDEAEKSKEESRRESVAEK--SPLASKEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRP 3922
Cdd:PRK13108  284 ALRGSEYVVDEALEREPAELAAAAVASAASavGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAV 363
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3923 TSvaesvKDEADKSKEESrRESGAEKSPLASMEASRPTSVAESVKDETEKSKEESRRESVTEKSPLPskEASRPT-SVAE 4001
Cdd:PRK13108  364 EE-----TSEADIEREQP-GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIP--DPAKPDeLAVA 435
                         170
                  ....*....|....*...
gi 161077523 4002 SVKDEAEKSKEESRRESV 4019
Cdd:PRK13108  436 GPGDDPAEPDGIRRQDDF 453
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C;
3745-4022 5.33e-03

NADH-quinone oxidoreductase subunit C;


Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 43.05  E-value: 5.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3745 KDDAEKSKEESRRESVAEKSPLASKEASRPASVAESVKDEAEKSKEESRRESVAE-KSPLPSKEASRPTSVAESVKDEAE 3823
Cdd:PRK07735   12 KEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAaKAKAAALAKQKREGTEEVTEEEKA 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3824 KSKEESRRESVAEKSSLASKKASRPASVAESVKDEAEKSKEESRRESVAEKSPlaSKEASRPASVAESVKDEAEKSKEES 3903
Cdd:PRK07735   92 KAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAK--QKREGTEEVTEEEEETDKEKAKAKA 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3904 RRESVAEKSPLPSKEASRPTSVAESVKDEAD-KSKEESRRESGAEKSPLASMEASRPTSVAEsvkDETEKSKEESRRESV 3982
Cdd:PRK07735  170 AAAAKAKAAALAKQKAAEAGEGTEEVTEEEKaKAKAKAAAAAKAKAAALAKQKASQGNGDSG---DEDAKAKAIAAAKAK 246
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 161077523 3983 TEKSplpskeASRPTSVAESVKDEAEKSKEESRRESVAEK 4022
Cdd:PRK07735  247 AAAA------ARAKTKGAEGKKEEEPKQEEPSVNQPYLNK 280
PLN03237 PLN03237
DNA topoisomerase 2; Provisional
1979-2275 5.65e-03

DNA topoisomerase 2; Provisional


Pssm-ID: 215641 [Multi-domain]  Cd Length: 1465  Bit Score: 43.31  E-value: 5.65e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1979 AAQSRETSRPASVAESAKDGADDLKELSRPESTTQSKEAGSIKDEKSPLASEEASRPASVAESVKdeaEKSKEESRRESV 2058
Cdd:PLN03237 1175 AEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVK---PKGRAGAKKKAP 1251
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2059 AEKSPLPskEASRPASVAESIKDEAEKSKEEsrrESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPL 2138
Cdd:PLN03237 1252 AAAKEKE--EEDEILDLKDRLAAYNLDSAPA---QSAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSL 1326
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2139 PSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPlPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEAS 2218
Cdd:PLN03237 1327 AERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQS-GQKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAA 1405
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 2219 RPASVAESikDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESikDEAEKSKEES 2275
Cdd:PLN03237 1406 TNKETESS--ENVSGSSSSEKDEIDVSAKPRPQRANRKQTTYVLS--DSESESADDS 1458
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C;
3453-3721 5.91e-03

NADH-quinone oxidoreductase subunit C;


Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 42.66  E-value: 5.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3453 EKSKEESRRESVAEKSPLPSKEASRPASVAESVKDEADKSKEESRRESGAEKSPLASKEAS--RPASVAESIKDEAEKSK 3530
Cdd:PRK07735    4 EKDLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAaaAKAKAAALAKQKREGTE 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3531 EESRRESVAEKSPLPSKEASRPTSVAESVKDEAEKSKEESRRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRRE 3610
Cdd:PRK07735   84 EVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKE 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3611 SVAEKSPLASKeASRPASVAESIKDEAEKSKEESRRESVAEKSPLASKEASRPTSVAesvKDEAEKSKEESSRDSVAEKS 3690
Cdd:PRK07735  164 KAKAKAAAAAK-AKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALA---KQKASQGNGDSGDEDAKAKA 239
                         250       260       270
                  ....*....|....*....|....*....|.
gi 161077523 3691 PLASKEASRPASVAESVQDEAEKSKEESRRE 3721
Cdd:PRK07735  240 IAAAKAKAAAAARAKTKGAEGKKEEEPKQEE 270
PLN03237 PLN03237
DNA topoisomerase 2; Provisional
2063-2405 6.30e-03

DNA topoisomerase 2; Provisional


Pssm-ID: 215641 [Multi-domain]  Cd Length: 1465  Bit Score: 43.31  E-value: 6.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2063 PLPSKEASRPASVAESIKDEAEKSKEesrrESVAEKSPLPSKEASRPASVA------ESIKDE-AEKSKEESRRESVAEK 2135
Cdd:PLN03237 1073 PFPKKAKSVEAAVAGATDDAAEEEEE----IDVSSSSGVRGSDYDYLLSMAigtltlEKVQELcADRDKLNIEVEDLKKT 1148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2136 SP----LPSKEASRPASVAESIKD-EAEKSKEESRRESVAEKSPLPSKeASRPASVAESIKDEAEKSKEESRRESVAEKS 2210
Cdd:PLN03237 1149 TPkslwLKDLDALEKELDKLDKEDaKAEEAREKLQRAAARGESGAAKK-VSRQAPKKPAPKKTTKKASESETTEETYGSS 1227
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2211 PLPSKEASRP------ASVAESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKS 2284
Cdd:PLN03237 1228 AMETENVAEVvkpkgrAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARKKPLASV 1307
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2285 PLPSKEASRPASVAESIKDEAEKSKEETRRESVAEKSPLPSKEASR---PASVAESIKDEAEKSKEESRRESAAEKSplP 2361
Cdd:PLN03237 1308 SVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKkrgPATVQSGQKLLTEMLKPAEAIGISPEKK--V 1385
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 161077523 2362 SKEASRPASV-AESVKDEADKSKEESRRESMAESGKAQSIKGDQS 2405
Cdd:PLN03237 1386 RKMRASPFNKkSGSVLGRAATNKETESSENVSGSSSSEKDEIDVS 1430
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within ...
3437-3464 6.37e-03

Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 37.20  E-value: 6.37e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 161077523  3437 EASRPASVAGSVKDEAEK----SKEESRRESV 3464
Cdd:pfam06740    1 EASRPESVAESVKDEAEKpeskSKEPSRRESV 32
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
3733-3908 6.48e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 42.66  E-value: 6.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3733 EASRPASVAESVKDDAEKSKEESRRESVAEK--SPLASKEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASR 3810
Cdd:PRK13108  283 GALRGSEYVVDEALEREPAELAAAAVASAASavGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPA 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3811 PTSvaesvKDEAEKSKEESrRESVAEKSSLASKKASRPASVAESVKDEAEKSKEESRREsVAEKSPLASKEASRPASVAE 3890
Cdd:PRK13108  363 VEE-----TSEADIEREQP-GDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPE-VPEKAAPIPDPAKPDELAVA 435
                         170
                  ....*....|....*...
gi 161077523 3891 SVKDEAEKSKEESRRESV 3908
Cdd:PRK13108  436 GPGDDPAEPDGIRRQDDF 453
PHA00430 PHA00430
tail fiber protein
3473-3607 6.75e-03

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 42.96  E-value: 6.75e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3473 KEASRPASVAESVKDEADKSKEESRRESGAEKSplASKEASRPASVAESIKDEAEKSKEESRREsvaeksplpSKEASRP 3552
Cdd:PHA00430  166 NEANRSRNEADRARNQAERFNNESGASATNTKQ--WRSEADGSNSEANRFKGYADSMTSSVEAA---------KGQAESS 234
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 161077523 3553 TSVAESVKDEAEKSKEESRRDSVAEKSplASKEASRPASVAESVQDEAEKSKEES 3607
Cdd:PHA00430  235 SKEANTAGDYATKAAASASAAHASEVN--AANSATAAATSANRAKQQADRAKTEA 287
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1408-1744 7.40e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 7.40e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1408 EQVKDKEEHEQKIESGIITEKEAKKSASTPEEKETSDIT---SDDELPAQLADPTTVPPKSAKDREDTGSIESPPTIEEA 1484
Cdd:PHA03307   59 AAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPaspAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML 138
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1485 IEVEVQAKQEAQKPVPAPEEAIKTEKSPLASKETSRPESATGSVkedteqtkskkSPVPSRPESEAKDKKSPFASGEASR 1564
Cdd:PHA03307  139 RPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEET-----------ARAPSSPPAEPPPSTPPAAASPRPP 207
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1565 PESVAESVKDEAGKAESRRESIAKTHKDESSLDKAKEQESRRESLAESIKP-ESGIDEKSALASKEASRPESVTDKSKEP 1643
Cdd:PHA03307  208 RRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPrPAPITLPTRIWEASGWNGPSSRPGPASS 287
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1644 SRRESIAESLKAESTKDEKSAPPSKEASRPGSVVESVKDETEKSKEPSRRESIAESAKPPIEFREVSRPESVIDGikdes 1723
Cdd:PHA03307  288 SSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP----- 362
                         330       340
                  ....*....|....*....|.
gi 161077523 1724 aKPESRRDSPLASKEASRPES 1744
Cdd:PHA03307  363 -SSPRKRPRPSRAPSSPAASA 382
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
3577-3761 7.71e-03

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 42.49  E-value: 7.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3577 EKSPLASKEASRPASVAESVQDEAEKSKEESRRESVAEKSPLASKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLA 3656
Cdd:PRK09510   70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKA 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 3657 SKEASRPTSVAESVKDEAEK-SKEESSRDSVAEKSPLASKEASRPASVAESVQDEAE---KSKEESRRESVAEKSPLASK 3732
Cdd:PRK09510  150 EAEAKRAAAAAKKAAAEAKKkAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEakkKAAAEAKKKAAAEAKAAAAK 229
                         170       180
                  ....*....|....*....|....*....
gi 161077523 3733 EASRPASVAESVKDDAEKSKEESRRESVA 3761
Cdd:PRK09510  230 AAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within ...
2549-2584 8.15e-03

Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 36.81  E-value: 8.15e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 161077523  2549 EASRPGSVAESIKYDLDKPQiikdDKSTEHSRRESL 2584
Cdd:pfam06740    1 EASRPESVAESVKDEAEKPE----SKSKEPSRRESV 32
valS PRK14900
valyl-tRNA synthetase; Provisional
1965-2160 8.48e-03

valyl-tRNA synthetase; Provisional


Pssm-ID: 237855 [Multi-domain]  Cd Length: 1052  Bit Score: 42.67  E-value: 8.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1965 MSKEPSRREsvKDGAAQSRETSRPASVAESAKDGADdlKELSRPESTTQSKeaGSIKDEKSPLASEEasrPASVAESVKD 2044
Cdd:PRK14900  847 VDKEIGKVD--QDLAVLERKLQNPSFVQNAPPAVVE--KDRARAEELREKR--GKLEAHRAMLSGSE---ANSARRDTME 917
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2045 EAEKSKEesrresvAEKSPLPSKEASRPASVAESIKDEAEKSKE-------------ESRRESVAEKSPLPSKEA-SRPA 2110
Cdd:PRK14900  918 IQNEQKP-------TQDGPAAEAQPAQENTVVESAEKAVAAVSEaaqqaatavasgiEKVAEAVRKTVRRSVKKAaATRA 990
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 161077523 2111 SVAESIKDEAEKSKEESR----RESVAEKSP---LPSKEASRPASVAESIKDEAEKS 2160
Cdd:PRK14900  991 AMKKKVAKKAPAKKAAAKkaaaKKAAAKKKVakkAPAKKVARKPAAKKAAKKPARKA 1047
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within ...
4076-4111 8.56e-03

Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 36.81  E-value: 8.56e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 161077523  4076 ETSRPDSVVESVKDETEKPEGsaidKSQVASRPESV 4111
Cdd:pfam06740    1 EASRPESVAESVKDEAEKPES----KSKEPSRRESV 32
DUF1213 pfam06740
Protein of unknown function (DUF1213); This family represents a short conserved repeat within ...
3845-3871 9.63e-03

Protein of unknown function (DUF1213); This family represents a short conserved repeat within Drosophila melanogaster proteins of unknown function. Approximately 50 copies of this repeat are present in each protein.


Pssm-ID: 429090 [Multi-domain]  Cd Length: 32  Bit Score: 36.81  E-value: 9.63e-03
                           10        20        30
                   ....*....|....*....|....*....|.
gi 161077523  3845 ASRPASVAESVKDEAEK----SKEESRRESV 3871
Cdd:pfam06740    2 ASRPESVAESVKDEAEKpeskSKEPSRRESV 32
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
1776-2153 9.91e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 42.40  E-value: 9.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1776 DEKSPLTSKDISRPESAVENVMDAVGSAER----SQPESVTASRDVSRPESVAESEKDDTDKPESVVESVIPASDVVEIE 1851
Cdd:PRK14949  369 DDPAEISLPEGQTPSALAAAVQAPHANEPQfvnaAPAEKKTALTEQTTAQQQVQAANAEAVAEADASAEPADTVEQALDD 448
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1852 KGAADK----EKGVFVS--------LEIGKPDSPSEVISRPGPvvESVKPESRRESSTEIV--LPCHAEDSKEPSRPESK 1917
Cdd:PRK14949  449 ESELLAalnaEQAVILSqaqsqgfeASSSLDADNSAVPEQIDS--TAEQSVVNPSVTDTQVddTSASNNSAADNTVDDNY 526
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1918 VECLKDESEVLKGSTRRESVAESDKSSQPFKETSRPE--SAVGSMKDESMSKEPSRRESVKDGAAQSRETSRPASVAESA 1995
Cdd:PRK14949  527 SAEDTLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESynALSDDEQHSANVQSAQSAAEAQPSSQSLSPISAVTTAAASL 606
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 1996 KD------------------GADDLKELSRPESTTQSKE---------AGSIKDEKSPLASEEASRPASVAESVKDEAEK 2048
Cdd:PRK14949  607 ADddildavlaardsllsdlDALSPKEGDGKKSSADRKPktppsrappASLSKPASSPDASQTSASFDLDPDFELATHQS 686
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 161077523 2049 SKEESRRESVAEKSPLPSKEASRP----ASVAESIKDEAEKSKEESRRESVaEKSPLPSKEASRPASVAESiKDEAEKSK 2124
Cdd:PRK14949  687 VPEAALASGSAPAPPPVPDPYDRPpweeAPEVASANDGPNNAAEGNLSESV-EDASNSELQAVEQQATHQP-QVQAEAQS 764
                         410       420
                  ....*....|....*....|....*....
gi 161077523 2125 EESRRESVAEKSPLPSKEASRPASVAESI 2153
Cdd:PRK14949  765 PASTTALTQTSSEVQDTELNLVLLSSGSI 793
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
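
The interval and E-value lines in the hit list above (for example `3662-4056 4.27e-03`) can be checked against the 0.01 E-value threshold with a short script. The following is a minimal sketch, not part of the CD-Search output: the names `HIT_LINE`, `parse_hit`, and `passes_threshold` are hypothetical, and it assumes a hit is reported when its E-value is at or below the threshold.

```python
import re

# Hypothetical pattern for a summary line such as "3662-4056 4.27e-03":
# query start, query end, then the E-value in scientific notation.
HIT_LINE = re.compile(r"^(\d+)-(\d+)\s+(\d+(?:\.\d+)?e[+-]\d+)$")


def parse_hit(line):
    """Return (start, end, evalue) for an interval line, or None if it does not match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2)), float(match.group(3))


def passes_threshold(evalue, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return evalue <= threshold


if __name__ == "__main__":
    for text in ("3662-4056 4.27e-03", "1776-2153 9.91e-03"):
        start, end, evalue = parse_hit(text)
        # Interval bounds are inclusive, so the span is end - start + 1 residues.
        print(start, end, end - start + 1, evalue, passes_threshold(evalue))
```

Alignment rows and description lines do not match the pattern, so `parse_hit` returns `None` for them and they can be skipped when scanning the page text.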

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.