NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|332205947|ref|NP_001193769|]
View 

trans-Golgi network integral membrane protein 2 isoform 2 precursor [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MSCRAMM_ClfA super family cl41352
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
44-383 7.13e-16

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


The actual alignment was detected with superfamily member NF033609:

Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 80.72  E-value: 7.13e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  44 PSLSQRPGGSTKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQK--DSSNKSGAEAKTQKGSTSKSGSEAQTTKD 121
Cdd:NF033609 542 PVVPEQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSgsDSASDSDSASDSDSASDSDSASDSDSASD 621
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 122 STSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQ 201
Cdd:NF033609 622 SDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 701
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 202 TPKDVPNKSGAEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTN 281
Cdd:NF033609 702 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 781
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 282 QLADKGKLSPHAFKTESGEETDLISPPQEEVKS---SEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGT 358
Cdd:NF033609 782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSN 861
                        330       340
                 ....*....|....*....|....*
gi 332205947 359 LSDSTGSEKDDLYPNGSGNGSAESS 383
Cdd:NF033609 862 SDSESGSNNNVVPPNSPKNGTNASN 886
 
Name Accession Description Interval E-value
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
44-383 7.13e-16

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 80.72  E-value: 7.13e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  44 PSLSQRPGGSTKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQK--DSSNKSGAEAKTQKGSTSKSGSEAQTTKD 121
Cdd:NF033609 542 PVVPEQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSgsDSASDSDSASDSDSASDSDSASDSDSASD 621
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 122 STSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQ 201
Cdd:NF033609 622 SDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 701
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 202 TPKDVPNKSGAEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTN 281
Cdd:NF033609 702 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 781
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 282 QLADKGKLSPHAFKTESGEETDLISPPQEEVKS---SEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGT 358
Cdd:NF033609 782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSN 861
                        330       340
                 ....*....|....*....|....*
gi 332205947 359 LSDSTGSEKDDLYPNGSGNGSAESS 383
Cdd:NF033609 862 SDSESGSNNNVVPPNSPKNGTNASN 886
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
52-384 8.43e-12

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 67.63  E-value: 8.43e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  52 GSTKSHPEPQTPkDSPSKSSAEAQTPEDTPNKSGAEAKTqkDSSNkSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQ 131
Cdd:NF033609 534 GSGDGIDKPVVP-EQPDEPGEIEPIPEDSDSDPGSDSGS--DSSN-SDSGSDSGSDSTSDSGSDSASDSDSASDSDSASD 609
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 132 TPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQTPKDVPNKSG 211
Cdd:NF033609 610 SDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 689
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 212 AEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTNQLADKGKLSP 291
Cdd:NF033609 690 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 769
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 292 HAFKTESGEETDLISPPQEEvKSSEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGTLSDSTGSEKDDLY 371
Cdd:NF033609 770 SDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 848
                        330
                 ....*....|...
gi 332205947 372 PNGSGNGSAESSH 384
Cdd:NF033609 849 DSDSDSDSESDSN 861
KCT2 pfam17818
Keratinocyte-associated gene product; This entry includes Keratinocyte-associated ...
306-435 4.52e-08

Keratinocyte-associated gene product; This entry includes Keratinocyte-associated transmembrane protein 2 found in humans. Functional studies show that KCP2 localizes to the endoplasmic reticulum, consistent with a role in protein biosynthesis, and has a functional KKxx retrieval signal at its cytosolic C-terminus.


Pssm-ID: 407686 [Multi-domain]  Cd Length: 187  Bit Score: 53.01  E-value: 4.52e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  306 SPPQEEVKSSEptedvEPKEAEDDDTGPEEGSPPKEE------------------KEKMSGSASSENRE-GTLSDSTGSE 366
Cdd:pfam17818  33 SVPPEEADNNE-----DPSIEEEDLLTLNSSPPTAKDtldngdygepdydwttspRDEESDEILEENRGyKEIEQSVKSF 107
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332205947  367 KddlypNGSGNGSAESSHFFAYLVTAAILVAVLYIAHHNKRKIIAFVlEGKRSKVTRRPKASDYQRLDQ 435
Cdd:pfam17818 108 K-----SPPSNVEEEDSHFFFHLIIFAFCVAVVYVTYHNKRKIFLLV-QSRKWRDGLCSKTVEYHRLDQ 170
PTZ00121 PTZ00121
MAEBL; Provisional
55-330 6.30e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 55.53  E-value: 6.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947   55 KSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKtQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPK 134
Cdd:PTZ00121 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAK-KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  135 DSTGKSGAEAQTPEDSPNRSG------AEAKTQKDSPSKSGSEAQTTKDVPNKSGadgQTPKDGSSKSGAEDQTPKDVPN 208
Cdd:PTZ00121 1387 AEEKKKADEAKKKAEEDKKKAdelkkaAAAKKKADEAKKKAEEKKKADEAKKKAE---EAKKADEAKKKAEEAKKAEEAK 1463
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  209 KSGAEKQTPKDGSNKsgAEEQGPIDgPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTNQLADKGK 288
Cdd:PTZ00121 1464 KKAEEAKKADEAKKK--AEEAKKAD-EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAK 1540
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|..
gi 332205947  289 LSPHAFKTESGEETdlisppqEEVKSSEPTEDVEPKEAEDDD 330
Cdd:PTZ00121 1541 KAEEKKKADELKKA-------EELKKAEEKKKAEEAKKAEED 1575
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
53-210 2.48e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.83  E-value: 2.48e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  53 STKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQT 132
Cdd:NF033609 751 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 830
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332205947 133 PKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGS-SKSGAEDQTPKDVPNKS 210
Cdd:NF033609 831 DSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKdSKEPLPDTGSEDEANTS 909
 
Name Accession Description Interval E-value
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
44-383 7.13e-16

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 80.72  E-value: 7.13e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  44 PSLSQRPGGSTKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQK--DSSNKSGAEAKTQKGSTSKSGSEAQTTKD 121
Cdd:NF033609 542 PVVPEQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSgsDSASDSDSASDSDSASDSDSASDSDSASD 621
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 122 STSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQ 201
Cdd:NF033609 622 SDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 701
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 202 TPKDVPNKSGAEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTN 281
Cdd:NF033609 702 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 781
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 282 QLADKGKLSPHAFKTESGEETDLISPPQEEVKS---SEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGT 358
Cdd:NF033609 782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSN 861
                        330       340
                 ....*....|....*....|....*
gi 332205947 359 LSDSTGSEKDDLYPNGSGNGSAESS 383
Cdd:NF033609 862 SDSESGSNNNVVPPNSPKNGTNASN 886
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
52-384 8.43e-12

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 67.63  E-value: 8.43e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  52 GSTKSHPEPQTPkDSPSKSSAEAQTPEDTPNKSGAEAKTqkDSSNkSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQ 131
Cdd:NF033609 534 GSGDGIDKPVVP-EQPDEPGEIEPIPEDSDSDPGSDSGS--DSSN-SDSGSDSGSDSTSDSGSDSASDSDSASDSDSASD 609
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 132 TPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQTPKDVPNKSG 211
Cdd:NF033609 610 SDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 689
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 212 AEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTNQLADKGKLSP 291
Cdd:NF033609 690 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 769
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 292 HAFKTESGEETDLISPPQEEvKSSEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGTLSDSTGSEKDDLY 371
Cdd:NF033609 770 SDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 848
                        330
                 ....*....|...
gi 332205947 372 PNGSGNGSAESSH 384
Cdd:NF033609 849 DSDSDSDSESDSN 861
KCT2 pfam17818
Keratinocyte-associated gene product; This entry includes Keratinocyte-associated ...
306-435 4.52e-08

Keratinocyte-associated gene product; This entry includes Keratinocyte-associated transmembrane protein 2 found in humans. Functional studies show that KCP2 localizes to the endoplasmic reticulum, consistent with a role in protein biosynthesis, and has a functional KKxx retrieval signal at its cytosolic C-terminus.


Pssm-ID: 407686 [Multi-domain]  Cd Length: 187  Bit Score: 53.01  E-value: 4.52e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  306 SPPQEEVKSSEptedvEPKEAEDDDTGPEEGSPPKEE------------------KEKMSGSASSENRE-GTLSDSTGSE 366
Cdd:pfam17818  33 SVPPEEADNNE-----DPSIEEEDLLTLNSSPPTAKDtldngdygepdydwttspRDEESDEILEENRGyKEIEQSVKSF 107
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332205947  367 KddlypNGSGNGSAESSHFFAYLVTAAILVAVLYIAHHNKRKIIAFVlEGKRSKVTRRPKASDYQRLDQ 435
Cdd:pfam17818 108 K-----SPPSNVEEEDSHFFFHLIIFAFCVAVVYVTYHNKRKIFLLV-QSRKWRDGLCSKTVEYHRLDQ 170
PTZ00121 PTZ00121
MAEBL; Provisional
55-330 6.30e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 55.53  E-value: 6.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947   55 KSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKtQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPK 134
Cdd:PTZ00121 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAK-KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  135 DSTGKSGAEAQTPEDSPNRSG------AEAKTQKDSPSKSGSEAQTTKDVPNKSGadgQTPKDGSSKSGAEDQTPKDVPN 208
Cdd:PTZ00121 1387 AEEKKKADEAKKKAEEDKKKAdelkkaAAAKKKADEAKKKAEEKKKADEAKKKAE---EAKKADEAKKKAEEAKKAEEAK 1463
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  209 KSGAEKQTPKDGSNKsgAEEQGPIDgPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTNQLADKGK 288
Cdd:PTZ00121 1464 KKAEEAKKADEAKKK--AEEAKKAD-EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAK 1540
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|..
gi 332205947  289 LSPHAFKTESGEETdlisppqEEVKSSEPTEDVEPKEAEDDD 330
Cdd:PTZ00121 1541 KAEEKKKADELKKA-------EELKKAEEKKKAEEAKKAEED 1575
PLN03237 PLN03237
DNA topoisomerase 2; Provisional
64-369 2.30e-06

DNA topoisomerase 2; Provisional


Pssm-ID: 215641 [Multi-domain]  Cd Length: 1465  Bit Score: 50.25  E-value: 2.30e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947   64 KDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKS--- 140
Cdd:PLN03237 1171 EDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKkka 1250
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  141 -GAEAQTPEDSPNRSGAEAKTQ---KDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQTPKDVPNKSGAEKQT 216
Cdd:PLN03237 1251 pAAAKEKEEEDEILDLKDRLAAynlDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERL 1330
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  217 PKDGSNKSGAEEQGPIDGPSKsgaeeqTSKDSPNKVVPEQPSRKDHSKPI--SNPSDNKELPKAdtnqladkgKLSPhaF 294
Cdd:PLN03237 1331 KKKGGRKPAAANKKAAKPPAA------AKKRGPATVQSGQKLLTEMLKPAeaIGISPEKKVRKM---------RASP--F 1393
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 332205947  295 KTESGEETDLISPPqeevKSSEPTEDVEPKEAEDDDtgpEEGSPPKEEkekmsgsASSENREGT---LSDSTGSEKDD 369
Cdd:PLN03237 1394 NKKSGSVLGRAATN----KETESSENVSGSSSSEKD---EIDVSAKPR-------PQRANRKQTtyvLSDSESESADD 1457
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
13-365 1.95e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.47  E-value: 1.95e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947   13 AAAGAVPLLATESVKQEEAGVRPSAGN-----VSTHPSLSQRPGGSTKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAE 87
Cdd:PHA03307   13 AAAEGGEFFPRPPATPGDAADDLLSGSqgqlvSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSL 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947   88 AKTQKDSSNKSGAEakTQKGSTSKSGSEAQTTKDSTSKSH-PELQTPKDSTGKSGAEAQT-PEDSPNRSGAEAKTQKDS- 164
Cdd:PHA03307   93 STLAPASPAREGSP--TPPGPSSPDPPPPTPPPASPPPSPaPDLSEMLRPVGSPGPPPAAsPPAAGASPAAVASDAASSr 170
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  165 ----PSKSGSEAQTTKDVPNKS---------GADGQTPKDGSSKSGAEDQTPKDVPNKSGAEKQTPKDGSNKSGAE-EQG 230
Cdd:PHA03307  171 qaalPLSSPEETARAPSSPPAEpppstppaaASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGcGWG 250
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  231 PIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTNQlADKGKLSPHAFKTESGEetdlispPQE 310
Cdd:PHA03307  251 PENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSS-PGSGPAPSSPRASSSSS-------SSR 322
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 332205947  311 EVKSSEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGTLSDSTGS 365
Cdd:PHA03307  323 ESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSS 377
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
53-210 2.48e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.83  E-value: 2.48e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  53 STKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQT 132
Cdd:NF033609 751 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 830
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332205947 133 PKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGS-SKSGAEDQTPKDVPNKS 210
Cdd:NF033609 831 DSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKdSKEPLPDTGSEDEANTS 909
PHA00430 PHA00430
tail fiber protein
46-177 4.23e-05

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 46.04  E-value: 4.23e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  46 LSQRPGGSTKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSK 125
Cdd:PHA00430 154 IKTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAES 233
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 332205947 126 SHPELQTPKDSTGKSGAE---AQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKD 177
Cdd:PHA00430 234 SSKEANTAGDYATKAAASasaAHASEVNAANSATAAATSANRAKQQADRAKTEAD 288
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
62-377 5.78e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 45.68  E-value: 5.78e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947   62 TPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSH-PELQTPKDSTGKS 140
Cdd:pfam05109 358 TETDFKCKWTLTSGTPSGCENISGAFASNRTFDITVSGLGTAPKTLIITRTATNATTTTHKVIFSKaPESTTTSPTLNTT 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  141 G-AEAQTPEDSPnrSGAEAKTQKDSPSKSGseaqttkdvPNKSGADGQTPKDGSSKSGAEDQTPKDVPNKSGAEKQTPKD 219
Cdd:pfam05109 438 GfAAPNTTTGLP--SSTHVPTNLTAPASTG---------PTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDM 506
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  220 GSNKSGAEEQGP--------IDGPSKSGAEEQTSKDSPNKVVPE-QPSRKDHSKPISNPSDNKELPK-ADTNQLADKGKL 289
Cdd:pfam05109 507 TSPTSAVTTPTPnatsptpaVTTPTPNATSPTLGKTSPTSAVTTpTPNATSPTPAVTTPTPNATIPTlGKTSPTSAVTTP 586
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  290 SPHAFKTESGEETDLISPPQEEV--KSSEPTEDVEPKEAEDD-DTGPEEGSPPKEEKEKMSGSASSENREGTLSDSTGSE 366
Cdd:pfam05109 587 TPNATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNATSAvTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSH 666
                         330
                  ....*....|....
gi 332205947  367 KDDL---YPNGSGN 377
Cdd:pfam05109 667 MPLLtsaHPTGGEN 680
PHA00430 PHA00430
tail fiber protein
61-197 1.74e-04

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 44.11  E-value: 1.74e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  61 QTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKS 140
Cdd:PHA00430 155 KTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESS 234
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 332205947 141 GAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSG 197
Cdd:PHA00430 235 SKEANTAGDYATKAAASASAAHASEVNAANSATAAATSANRAKQQADRAKTEADKLG 291
PRK08581 PRK08581
amidase domain-containing protein;
64-305 2.08e-04

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 44.01  E-value: 2.08e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  64 KDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSgaEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGAE 143
Cdd:PRK08581  58 KDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNII--DFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISD 135
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 144 AQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKD-VPNKSGADGQTPKDGSSKSGAEDQTPKDvPNKSGAEKQTPKDGSN 222
Cdd:PRK08581 136 YEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDkADNQKAPSSNNTKPSTSNKQPNSPKPTQ-PNQSNSQPASDDTANQ 214
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 223 KSGAEEQGPIDGPSKSGAEEQTSKDSP-NKVVPEQPSRKDHSKpiSNPSDNKELPKADTNQLADKGKLS-PHAFKTESGE 300
Cdd:PRK08581 215 KSSSKDNQSMSDSALDSILDQYSEDAKkTQKDYASQSKKDKTE--TSNTKNPQLPTQDELKHKSKPAQSfENDVNQSNTR 292

                 ....*
gi 332205947 301 ETDLI 305
Cdd:PRK08581 293 STSLF 297
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
50-358 5.84e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.37  E-value: 5.84e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  50 PGGSTKSHPEPQTPKDSPSKSSAEAQTPE-DTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSE-AQTTKDSTSKSH 127
Cdd:PTZ00449 511 PEGPEASGLPPKAPGDKEGEEGEHEDSKEsDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKkPEFPKDPKHPKD 590
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 128 PElqTPKDStgKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSS-KSGAEDQTPK-- 204
Cdd:PTZ00449 591 PE--EPKKP--KRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIiKSPKPPKSPKpp 666
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 205 -----------DVPNKSGAEKQTPKDGSNKSGAEEQGPIDGPSKSGaeeqTSKDSPNKVVPEQPsrKDHSKPISNPSDnk 273
Cdd:PTZ00449 667 fdpkfkekfydDYLDAAAKSKETKTTVVLDESFESILKETLPETPG----TPFTTPRPLPPKLP--RDEEFPFEPIGD-- 738
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 274 elpkadtnqlADKGKLSPHAFKTESGEETDLIsppqEEVKSSEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSE 353
Cdd:PTZ00449 739 ----------PDAEQPDDIEFFTPPEEERTFF----HETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSEHED 804

                 ....*
gi 332205947 354 NREGT 358
Cdd:PTZ00449 805 KPPGD 809
PHA03169 PHA03169
hypothetical protein; Provisional
134-347 6.09e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 42.27  E-value: 6.09e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 134 KDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQT-TKDVPNKSGADGQTPKDgsSKSGAEDQTPKDVPNKSGA 212
Cdd:PHA03169  20 RGHCKRHGGTREQAGRRRGTAARAAKPAPPAPTTSGPQVRAvAEQGHRQTESDTETAEE--SRHGEKEERGQGGPSGSGS 97
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 213 EKQTPKDGSNKSGAEEQGpidgpskSGAEEQTSKDSPnkvvPEQPSrkDHSKPISNPSDNKELPKADTNQLADKGKLSPH 292
Cdd:PHA03169  98 ESVGSPTPSPSGSAEELA-------SGLSPENTSGSS----PESPA--SHSPPPSPPSHPGPHEPAPPESHNPSPNQQPS 164
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 332205947 293 AFKTESGEETDlisPPQEEVKSSEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMS 347
Cdd:PHA03169 165 SFLQPSHEDSP---EEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQS 216
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
107-336 9.69e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 9.69e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 107 GSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADG 186
Cdd:PRK07764 589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDG 668
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 187 QTPKDGSSKSGAEDQTPKDVPNKSGAEKQTPkdgsnksgAEEQGPIDGPSKSGAEEQTSKDspnkvvpeQPSRKDHSKPI 266
Cdd:PRK07764 669 WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPA--------QPAPAPAATPPAGQADDPAAQP--------PQAAQGASAPS 732
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 267 SNPSDNKELPKADTNQLADKGKLSPHAFKTESGEETDlISPPQEEVKSSEPTEDVEPKEAEDDDTGPEEG 336
Cdd:PRK07764 733 PAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA-PAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDA 801
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
25-356 1.16e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.60  E-value: 1.16e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  25 SVKQEEAGVRPSAG---NVSTHPSLSQRPGGSTKS-HP-EPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSG 99
Cdd:PTZ00449 551 ETKEGEVGKKPGPAkehKPSKIPTLSKKPEFPKDPkHPkDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESP 630
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 100 AEAKTQKGSTSKSGSEAQTTKDS--TSKSHPELQTPKDSTGK-------SGAEAQTPE---DSPNRSGAEAKTQKDSPSK 167
Cdd:PTZ00449 631 KSPKRPPPPQRPSSPERPEGPKIikSPKPPKSPKPPFDPKFKekfyddyLDAAAKSKEtktTVVLDESFESILKETLPET 710
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 168 SGSEAQTTKDVPNKSGADGQTP----KDGSSKSGAEDQTPKDVPNKSGAEKQTPKDGSNKS-GAEEQGPIDGPSKSGAEE 242
Cdd:PTZ00449 711 PGTPFTTPRPLPPKLPRDEEFPfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDiLAEEFKEEDIHAETGEPD 790
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 243 QTSKDspnkvvPEQPSRKDHSKPISNPSDNKELPKAD------TNQLADKGKLSPHAF-------KTESGEE-TDLISPP 308
Cdd:PTZ00449 791 EAMKR------PDSPSEHEDKPPGDHPSLPKKRHRLDglalstTDLESDAGRIAKDASgkivklkRSKSFDDlTTVEEAE 864
                        330       340       350       360       370
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 332205947 309 QEEVKSSEPTEDVEPKEAEDDDTGPEEGS---------PPKEEKEKMSGSASSENRE 356
Cdd:PTZ00449 865 EMGAEARKIVVDDDGTEADDEDTHPPEEKhksevrrrrPPKKPSKPKKPSKPKKPKK 921
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
129-316 2.20e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 40.46  E-value: 2.20e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 129 ELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQtPKDGSSKSGAEDQTPKDVPN 208
Cdd:PRK08691 376 ELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQENNDVP-PWEDAPDEAQTAAGTAQTSA 454
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 209 KS---GAEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNK--VVPEQPSRKDHSKPISN---------PSDNKE 274
Cdd:PRK08691 455 KSiqtASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDeaVETETFAHEAPAEPFYGygfpdndcpPEDGAE 534
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|..
gi 332205947 275 LPKADTNQLADKGklSPHAFKTESGEETDLISPPQEEVKSSE 316
Cdd:PRK08691 535 IPPPDWEHAAPAD--TAGGGADEEAEAGGIGGNNTPSAPPPE 574
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
189-334 2.34e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 40.35  E-value: 2.34e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 189 PKDGSSKSGAEDQTPKDVPNKSGAEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISN 268
Cdd:PRK13108 298 REPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIEREQPG 377
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 269 PSDnKELPKADTNQLADKGKL--SPHAFKTESGEETDLISPPQEEVKsSEPTEDVEP--KEAEDDDTGPE 334
Cdd:PRK13108 378 DLA-GQAPAAHQVDAEAASAApeEPAALASEAHDETEPEVPEKAAPI-PDPAKPDELavAGPGDDPAEPD 445
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
59-233 2.45e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 40.35  E-value: 2.45e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  59 EPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPElqtpkdSTG 138
Cdd:PRK13108 294 EALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVE------ETS 367
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 139 KSGAEAQTPEDspnrsgaeakTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQTPK---DVPNKSGAEKQ 215
Cdd:PRK13108 368 EADIEREQPGD----------LAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIpdpAKPDELAVAGP 437
                        170
                 ....*....|....*...
gi 332205947 216 TPkDGSNKSGAEEQGPID 233
Cdd:PRK13108 438 GD-DPAEPDGIRRQDDFS 454
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
12-241 2.87e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 40.22  E-value: 2.87e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  12 VAAAGAVPLLATESVKQEEAGVRPSAGNVSTHPSLSQRPGGSTkshPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQ 91
Cdd:PRK07003 374 ARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA---AAAATRAEAPPAAPAPPATADRGDDAADGDAPVP 450
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947  92 KDSSNKSGAEAKTQKGST---SKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGA---------------EAQTPEDSPNR 153
Cdd:PRK07003 451 AKANARASADSRCDERDAqppADSGSASAPASDAPPDAAFEPAPRAAAPSAATPaavpdarapaaasreDAPAAAAPPAP 530
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 154 SGAEAKTQKDSPSKSGSEAQTTKDVPNKSG----ADGQTPKDGSSKSGAEdQTPKDVPNKSGAEKQTPKDGSNKSGAEEQ 229
Cdd:PRK07003 531 EARPPTPAAAAPAARAGGAAAALDVLRNAGmrvsSDRGARAAAAAKPAAA-PAAAPKPAAPRVAVQVPTPRARAATGDAP 609
                        250
                 ....*....|..
gi 332205947 230 GPIDGPSKSGAE 241
Cdd:PRK07003 610 PNGAARAEQAAE 621
PHA00430 PHA00430
tail fiber protein
103-199 5.83e-03

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 39.10  E-value: 5.83e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 103 KTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKS 182
Cdd:PHA00430 155 KTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESS 234
                         90
                 ....*....|....*..
gi 332205947 183 GADGQTPKDGSSKSGAE 199
Cdd:PHA00430 235 SKEANTAGDYATKAAAS 251
PHA03169 PHA03169
hypothetical protein; Provisional
106-355 7.52e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 38.80  E-value: 7.52e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 106 KGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSgAEAKTQKDSPSKSGSEAQTTKDVPNKSGAD 185
Cdd:PHA03169  20 RGHCKRHGGTREQAGRRRGTAARAAKPAPPAPTTSGPQVRAVAEQGHRQ-TESDTETAEESRHGEKEERGQGGPSGSGSE 98
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 186 GQTPKDGSSKSGAEDQTpkdvpnkSGAEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSkp 265
Cdd:PHA03169  99 SVGSPTPSPSGSAEELA-------SGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQP-- 169
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205947 266 isNPSDNKELPKADTNQLADKGKLSPHAFKTESGEetdlisPPQEEVKSSEPTEDVEPKEAEDDDTGPEEGSPPKEEKEK 345
Cdd:PHA03169 170 --SHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSP------PPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPE 241
                        250
                 ....*....|
gi 332205947 346 MSGSASSENR 355
Cdd:PHA03169 242 REGPPFPGHR 251
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH