Conserved domains on  [gi|1622856808|ref|XP_014967098|]

trafficking protein particle complex subunit 12 isoform X1 [Macaca mulatta]

Protein Classification

tetratricopeptide repeat protein (domain architecture ID 11469162)

tetratricopeptide repeat (TPR) protein may adopt a right-handed helical structure with an amphipathic channel and may function as an interaction scaffold in the formation of multi-protein complexes

CATH:  1.25.40.10
Gene Ontology:  GO:0005515
PubMed:  10517866|30708253
SCOP:  3001345

List of domain hits

Name Accession Description Interval E-value
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];
601-746 7.89e-10

Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 57.89  E-value: 7.89e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 601 SRLGRVMYSMANCLLLMKDYVLAVEAYHAVIKYYPEQePQLLSGIGRISLQIGDIKTAEKYFQDVekvtqkLDGLQGKIM 680
Cdd:COG4783     1 AACAEALYALAQALLLAGDYDEAEALLEKALELDPDN-PEAFALLGEILLQLGDLDEAIVLLHEA------LELDPDEPE 73
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622856808 681 VLMNRAFLHLGQNNFAEAHRFFTEILRMDPTNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDPRH 746
Cdd:COG4783    74 ARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAYRALGRPDEAIAALEKALELDPDD 139
PHA03247 super family cl33720
large tegument protein UL36; Provisional
35-319 4.28e-08



The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 4.28e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808   35 RWSRPQVTPFP-QPQWAGGRGMEDAGGGEETPAPEAPHPPQLAPPEEQGLLFHEEtidlGGDEFGSEENETASEGSSPLA 113
Cdd:PHA03247  2585 RARRPDAPPQSaRPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPD----PHPPPTVPPPERPRDDPAPGR 2660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  114 DKLNEHmmesvliSDSPNNSEGDAGDLGRAREEAEPGGEG---------DPGPEPAS--------TPSPTGEVHGDCAPE 176
Cdd:PHA03247  2661 VSRPRR-------ARRLGRAAQASSPPQRPRRRAARPTVGsltsladppPPPPTPEPaphalvsaTPLPPGPAAARQASP 2733
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  177 DAPEDAAPRSGGA----PRQDAAREAPGSEAARPEQEPPAAEPVPVctifSQRAPPAPRDGFEPQMVKSPSFGGASEAPA 252
Cdd:PHA03247  2734 ALPAAPAPPAVPAgpatPGGPARPARPPTTAGPPAPAPPAAPAAGP----PRRLTRPAVASLSESRESLPSPWDPADPPA 2809
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622856808  253 RTP------PQAVQPSPSLSTFFGDAAASHSLASDFF-DSFTTSAFVSVSNPGAGSPASASPPPvsVPGTEGRP 319
Cdd:PHA03247  2810 AVLapaaalPPAASPAGPLPPPTSAQPTAPPPPPGPPpPSLPLGGSVAPGGDVRRRPPSRSPAA--KPAAPARP 2881
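
Each hit above carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). As a minimal sketch of how such a line could be turned into a record, assuming the labels appear exactly as printed on this page, the following Python uses only the standard library. The function name `parse_summary` and the dictionary keys are hypothetical and not part of any NCBI tool.

```python
import re

# Pattern built from the labels as they appear in the summary lines on this page.
# The bracketed tag (e.g. "[Multi-domain]") is treated as optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "kind": match.group("kind"),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 57.89  E-value: 7.89e-10"
    print(parse_summary(line))
```

Records parsed this way can then be sorted or filtered by `e_value`, for example to separate the stronger TPR-family hits from the weaker low-complexity matches.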
 
tol_pal_ybgF TIGR02795
tol-pal system protein YbgF; Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
608-667 6.97e-05


Pssm-ID: 188247 [Multi-domain]  Cd Length: 117  Bit Score: 43.04  E-value: 6.97e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622856808 608 YSMANCLLLMKDYVLAVEAYHAVIKYYPEQE--PQLLSGIGRISLQIGDIKTAEKYFQDVEK 667
Cdd:TIGR02795  41 YWLGEAYYAQGDYADAAKAFLAVVKKYPKSPkaPDALLKLGMSLQELGDKEKAKATLQQVIK 102
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
156-323 9.92e-05


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 9.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 156 GPEpASTPSPTGEVHGDCAPEDApeDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAaePVPVCTIFSQRAP-PAPRD-G 233
Cdd:pfam05109 424 APE-STTTSPTLNTTGFAAPNTT--TGLPSSTHVPTNLTAPASTGPTVSTADVTSPT--PAGTTSGASPVTPsPSPRDnG 498
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 234 FEPqmvKSPSFGGASEAPARTPPQAVQPSPSLSTFFGDaAASHSLASdffDSFTTSAFVSVSNPGAGSPASASPPPVSVP 313
Cdd:pfam05109 499 TES---KAPDMTSPTSAVTTPTPNATSPTPAVTTPTPN-ATSPTLGK---TSPTSAVTTPTPNATSPTPAVTTPTPNATI 571
                         170
                  ....*....|
gi 1622856808 314 GTEGRPEPAA 323
Cdd:pfam05109 572 PTLGKTSPTS 581
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondria pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1 that are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.
141-316 8.31e-04


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 42.55  E-value: 8.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 141 GRAREEAEPGGEGdPGPEPASTPSPTGEVHGDCAP--EDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPV 218
Cdd:cd23959    59 GAVSPEGENPFDG-PGLVTASTVSDCYVGNANFYEvdMSDAFAMAPDESLGPFRAARVPNPFSASSSTQRETHKTAQVAP 137
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 219 cTIFSQRAPPAPRDGfepQMvksPSFGGASEAPARTPPQAV-QPSPSLSTFFGDAAASHSLASDFFDSFTTSAFVSVSNP 297
Cdd:cd23959   138 -PKAEPQTAPVTPFG---QL---PMFGQHPPPAKPLPAAAAaQQSSASPGEVASPFASGTVSASPFATATDTAPSSGAPD 210
                         170
                  ....*....|....*....
gi 1622856808 298 GAGSPASAsPPPVSVPGTE 316
Cdd:cd23959   211 GFPAEASA-PSPFAAPASA 228
TPR_2 pfam07719
Tetratricopeptide repeat; This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515.
681-711 4.21e-03


Pssm-ID: 429619 [Multi-domain]  Cd Length: 33  Bit Score: 35.58  E-value: 4.21e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1622856808 681 VLMNRAFLHLGQNNFAEAHRFFTEILRMDPT 711
Cdd:pfam07719   3 ALYNLGLAYYKLGDYEEALEAYEKALELDPN 33
TPR smart00028
Tetratricopeptide repeats; Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
605-636 9.20e-03


Pssm-ID: 197478 [Multi-domain]  Cd Length: 34  Bit Score: 34.34  E-value: 9.20e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1622856808  605 RVMYSMANCLLLMKDYVLAVEAYHAVIKYYPE 636
Cdd:smart00028   2 EALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
43-263 9.91e-03


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 39.28  E-value: 9.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  43 PFPQPQWAGGRGMEDAGGgEETPAPEAPHPPQLAPPEEQGLLFHEETIDLGGDEfgseenetasegsSPLADKLNEHMME 122
Cdd:COG5180   203 KVEVKDEAQEEPPDLTGG-ADHPRPEAASSPKVDPPSTSEARSRPATVDAQPEM-------------RPPADAKERRRAA 268
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 123 SVLISDSPNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEdAPEDAAPRsggaPRQDAAREAPGSE 202
Cdd:COG5180   269 IGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPIDVKGVASAPPATRPVRPPGG-ARDPGTPR----PGQPTERPAGVPE 343
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622856808 203 AARPEQEPPAAEPVPvctifSQRAPPAPRDGFEPqmvkSPSFGGASEAPARTPPQAVQPSP 263
Cdd:COG5180   344 AASDAGQPPSAYPPA-----EEAVPGKPLEQGAP----RPGSSGGDGAPFQPPNGAPQPGL 395
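
The paired `gi` / `Cdd:` rows above can be compared column by column. The sketch below estimates percent identity over aligned columns; it assumes that `-` marks a gap and that lowercase letters mark residues shown but not aligned to the model, which is an interpretation of the display convention rather than something stated on this page. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query, subject):
    """Percent of aligned columns in which the two rows have the same residue.

    Columns containing a gap ('-') or a lowercase (assumed unaligned) residue
    in either row are skipped. Returns 0.0 when no aligned columns remain.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")

    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # gap column
        if q.islower() or s.islower():
            continue  # assumed unaligned residue
        aligned += 1
        if q == s:
            identical += 1

    return 100.0 * identical / aligned if aligned else 0.0


if __name__ == "__main__":
    # Rows taken from the pfam07719 (TPR_2) hit, residues 681-711.
    query = "VLMNRAFLHLGQNNFAEAHRFFTEILRMDPT"
    model = "ALYNLGLAYYKLGDYEEALEAYEKALELDPN"
    print(round(percent_identity(query, model), 1))
```

Identity computed this way is only a rough descriptor; the bit score and E-value reported with each hit come from the position-specific scoring matrix and are not derived from it.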
 
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
605-761 1.45e-09



Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 59.25  E-value: 1.45e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 605 RVMYSMANCLLLMKDYVLAVEAYHAVIKYYPEqEPQLLSGIGRISLQIGDIKTAEKYFQDVEKVTQKLdglqgkIMVLMN 684
Cdd:COG0457     9 EAYNNLGLAYRRLGRYEEAIEDYEKALELDPD-DAEALYNLGLAYLRLGRYEEALADYEQALELDPDD------AEALNN 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622856808 685 RAFLHLGQNNFAEAHRFFTEILRMDPTNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDPRHYlheSVLFNLTTMYE 761
Cdd:COG0457    82 LGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDPDDA---DALYNLGIALE 155
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
605-777 1.26e-08



Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 56.55  E-value: 1.26e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 605 RVMYSMANCLLLMKDYVLAVEAYHAVIKYYPEQePQLLSGIGRISLQIGDIKTAEKYFQdvekvtQKLDGLQGKIMVLMN 684
Cdd:COG0457    43 EALYNLGLAYLRLGRYEEALADYEQALELDPDD-AEALNNLGLALQALGRYEEALEDYD------KALELDPDDAEALYN 115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 685 RAFLHLGQNNFAEAHRFFTEILRMDPTNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDPRHYLHESVLFNLTTMYELES 764
Cdd:COG0457   116 LGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEKLGRYEEALELLEKLEAAALAALLAAALGEAALALAAAEV 195
                         170
                  ....*....|...
gi 1622856808 765 SRSMQKKQALLEA 777
Cdd:COG0457   196 LLALLLALEQALR 208
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
605-785 1.81e-08


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 56.28  E-value: 1.81e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 605 RVMYSMANCLLLMKDYVLAVEAYHAVIKYYPEQePQLLSGIGRISLQIGDIKTAEKYFQDVEKVTQKldglqgKIMVLMN 684
Cdd:COG2956    77 EALLELAQDYLKAGLLDRAEELLEKLLELDPDD-AEALRLLAEIYEQEGDWEKAIEVLERLLKLGPE------NAHAYCE 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 685 RAFLHLGQNNFAEAHRFFTEILRMDPTNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDPRHYlheSVLFNLTTMYElES 764
Cdd:COG2956   150 LAELYLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAALERALEQDPDYL---PALPRLAELYE-KL 225
                         170       180
                  ....*....|....*....|.
gi 1622856808 765 SRSMQKKQALLEAVAGKEGDS 785
Cdd:COG2956   226 GDPEEALELLRKALELDPSDD 246
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
143-265 2.79e-07



Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.22  E-value: 2.79e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 143 AREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQdAAREAPGSEAARPEQEPPAAEPVPVCTIF 222
Cdd:PRK07764  376 ARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAP-AAAPQPAPAPAPAPAPPSPAGNAPAGGAP 454
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1622856808 223 SQR------APPAPRDGFEPQMVK----SPSFGGASEAPARTPPQAVQPSPSL 265
Cdd:PRK07764  455 SPPpaaapsAQPAPAPAAAPEPTAapapAPPAAPAPAAAPAAPAAPAAPAGAD 507
PHA03169 PHA03169
hypothetical protein; Provisional
51-217 6.82e-07



Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 52.28  E-value: 6.82e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  51 GGRGMEDAGGGEETPAPEAPHPPQLAPPEEQGLlfheetidlGGDEFGSEENETASEGSSPladklnEHMMESVLISDSP 130
Cdd:PHA03169   93 SGSGSESVGSPTPSPSGSAEELASGLSPENTSG---------SSPESPASHSPPPSPPSHP------GPHEPAPPESHNP 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 131 N-NSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAP--EDAAPRSGGAPRQDAAREAPGSEAARPE 207
Cdd:PHA03169  158 SpNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPpdEPGEPQSPTPQQAPSPNTQQAVEHEDEP 237
                         170
                  ....*....|
gi 1622856808 208 QEPPAAEPVP 217
Cdd:PHA03169  238 TEPEREGPPF 247
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
141-325 9.16e-07



Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 52.54  E-value: 9.16e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 141 GRAREEAEPGGEGDPGPEPASTP-----SPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGseAARPEQEPPAAEP 215
Cdd:PRK07003  369 GGGVPARVAGAVPAPGARAAAAVgasavPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPP--ATADRGDDAADGD 446
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 216 VPVCTIFSQRAPPAPRD---GFEPQMVKSPSFGGASEAPartPPQAVQPSPSlstffGDAAAShslasdffdsfTTSAFV 292
Cdd:PRK07003  447 APVPAKANARASADSRCderDAQPPADSGSASAPASDAP---PDAAFEPAPR-----AAAPSA-----------ATPAAV 507
                         170       180       190
                  ....*....|....*....|....*....|....
gi 1622856808 293 SVSNPGAGSPASASPPPVSVPGTEGR-PEPAAMR 325
Cdd:PRK07003  508 PDARAPAAASREDAPAAAAPPAPEARpPTPAAAA 541
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
147-325 9.40e-07



Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 52.54  E-value: 9.40e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 147 AEPGGEGDPGPEPASTPSPtgevhgdcAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPA--AEPVPVCTIFSQ 224
Cdd:PRK07003  430 PAPPATADRGDDAADGDAP--------VPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDaaFEPAPRAAAPSA 501
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 225 RAPPAPRDGFEPQMVKSPSfggASEAPARTPPQAVQPSPSLSTffgdAAASHSLASDFFDSFtTSAFVSVSNPGAGSPAS 304
Cdd:PRK07003  502 ATPAAVPDARAPAAASRED---APAAAAPPAPEARPPTPAAAA----PAARAGGAAAALDVL-RNAGMRVSSDRGARAAA 573
                         170       180
                  ....*....|....*....|.
gi 1622856808 305 ASPPPVSVPGTegrPEPAAMR 325
Cdd:PRK07003  574 AAKPAAAPAAA---PKPAAPR 591
PHA03247 PHA03247
large tegument protein UL36; Provisional
155-323 1.13e-06



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 1.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  155 PGPEPASTPSPTGEVHGDCAPEDAPEDAAP-------------RSGGA------PRQDAAREAPGSEA--ARPEQEPPAA 213
Cdd:PHA03247  2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsrprrarrlgRAAQAssppqrPRRRAARPTVGSLTslADPPPPPPTP 2708
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  214 EPVPVCTIFS-----------QRAPPAPRDGFEPQMVKSPSFGGASEAPAR----------TPPQA-VQPSPSLSTFFGD 271
Cdd:PHA03247  2709 EPAPHALVSAtplppgpaaarQASPALPAAPAPPAVPAGPATPGGPARPARppttagppapAPPAApAAGPPRRLTRPAV 2788
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1622856808  272 AAASHSLASDFFDSFTTSAFVSVSNPGAGSPASASPPPVSVPGTEGRPEPAA 323
Cdd:PHA03247  2789 ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
50-264 1.30e-06



Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.91  E-value: 1.30e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  50 AGGRGMEDAGGGEETPAPEAPHPPQLAPPEEQGllfheetidlggdefGSEENETASEGSSPladklnehmmesvliSDS 129
Cdd:PRK07764  593 GAAGGEGPPAPASSGPPEEAARPAAPAAPAAPA---------------APAPAGAAAAPAEA---------------SAA 642
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 130 PNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQE 209
Cdd:PRK07764  643 PAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP 722
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1622856808 210 PPAAEPVPVCTIFSQRAPPAPRDGFEPQMVKSPSFGGASEAPARTPPQAVQPSPS 264
Cdd:PRK07764  723 QAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPS 777
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
143-323 2.55e-06



Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.14  E-value: 2.55e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 143 AREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPrqdAAREAPGSEAARPEQEPPAAEPVPVctif 222
Cdd:PRK07764  613 ARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP---DASDGGDGWPAKAGGAAPAAPPPAP---- 685
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 223 SQRAPPAPRDGFEPQmvkspsfgGASEAPARTPPQAVQPSPSLSTFFGDAAASHSLASDffdsFTTSAFVSVSNPGAGSP 302
Cdd:PRK07764  686 APAAPAAPAGAAPAQ--------PAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAAD----DPVPLPPEPDDPPDPAG 753
                         170       180
                  ....*....|....*....|.
gi 1622856808 303 ASASPPPVSVPGTEGRPEPAA 323
Cdd:PRK07764  754 APAQPPPPPAPAPAAAPAAAP 774
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
608-750 3.51e-06


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 49.34  E-value: 3.51e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 608 YSMANCLLLMKDYVLAVEAYHAVIKYYPEQePQLLSGIGRISLQIGDIKTAEKYFQDVEKVTQKldglqgKIMVLMNRAF 687
Cdd:COG2956    12 YFKGLNYLLNGQPDKAIDLLEEALELDPET-VEAHLALGNLYRRRGEYDRAIRIHQKLLERDPD------RAEALLELAQ 84
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622856808 688 LHLGQNNFAEAHRFFTEILRMDPTNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDPR--HYLHE 750
Cdd:COG2956    85 DYLKAGLLDRAEELLEKLLELDPDDAEALRLLAEIYEQEGDWEKAIEVLERLLKLGPEnaHAYCE 149
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];
571-779 7.35e-06


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 49.61  E-value: 7.35e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 571 LANLEQGLAEDGAMSSVTQEGRQASVRLWRSRLGRVMYSMANCLLLMKDYVLAVEAYHAVIKYYPEqEPQLLSGIGRISL 650
Cdd:COG3914    45 LLLLAALAEAAAAALLALAAGEAAAAAAALLLLAALLELAALLLQALGRYEEALALYRRALALNPD-NAEALFNLGNLLL 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 651 QIGDIKTAEKYFQDVEKVTQKLdglqgkIMVLMNRAFLHLGQNNFAEAHRFFTEILRMDPTNAVANNNAAVCLLYLGKLK 730
Cdd:COG3914   124 ALGRLEEALAALRRALALNPDF------AEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNALQDLGRLE 197
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 1622856808 731 DSLRQLEAMVQQDPRH-YLHESVLFnltTMYELESSRSMQKKQALLEAVA 779
Cdd:COG3914   198 EAIAAYRRALELDPDNaDAHSNLLF---ALRQACDWEVYDRFEELLAALA 244
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
612-710 8.52e-06


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 46.49  E-value: 8.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 612 NCLLLMKDYVLAVEAYHAVIKYYPEqEPQLLSGIGRISLQIGDIKTAEKYFQDVekvtqkLDGLQGKIMVLMNRAFLHLG 691
Cdd:COG5010    62 NLYNKLGDFEESLALLEQALQLDPN-NPELYYNLALLYSRSGDKDEAKEYYEKA------LALSPDNPNAYSNLAALLLS 134
                          90
                  ....*....|....*....
gi 1622856808 692 QNNFAEAHRFFTEILRMDP 710
Cdd:COG5010   135 LGQDDEAKAALQRALGTSP 153
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
129-376 8.62e-06

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 49.30  E-value: 8.62e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 129 SPNNSEGDAGDLGRAREEAEPGGEGDPG----------PEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREA 198
Cdd:PTZ00449  523 APGDKEGEEGEHEDSKESDEPKEGGKPGetkegevgkkPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRS 602
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 199 PGSEAARPEQEPPAAEPVP-----VCTIFSQRAPPAPRDGFEPQMVKSPSFGGASEAPaRTPPQAVQPS--PSLSTFFGD 271
Cdd:PTZ00449  603 AQRPTRPKSPKLPELLDIPkspkrPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPP-KSPKPPFDPKfkEKFYDDYLD 681
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 272 AAA--SHSLASDFFDSFTTSAFVSVSNPGAGSPASASPP-PVSVPGTEGRP-----EPAAMRGPQAAAPPASPEPFAHIQ 343
Cdd:PTZ00449  682 AAAksKETKTTVVLDESFESILKETLPETPGTPFTTPRPlPPKLPRDEEFPfepigDPDAEQPDDIEFFTPPEEERTFFH 761
                         250       260       270
                  ....*....|....*....|....*....|...
gi 1622856808 344 AVFAGSDDPFATALSMSEMDRRNDAWLPGEATR 376
Cdd:PTZ00449  762 ETPADTPLPDILAEEFKEEDIHAETGEPDEAMK 794
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
150-385 9.68e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 49.10  E-value: 9.68e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 150 GGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPVCTIFSQRAPPA 229
Cdd:PRK12323  367 QSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 230 pRDGFEPQMVKSPSFGGASEAPARTPPQAVQPSPSLSTffGDAAASHSLASDFFDSFTTSAFVSVSNPGAGSPASASPPP 309
Cdd:PRK12323  447 -APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARA--APAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVA 523
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622856808 310 VSVPGTEGRPEPAAMRGPQAAAPPASPEPFAHIQAVFAGSDDPFATALSMSEMdrrNDAWLPGEATRGVLRAVAAQ 385
Cdd:PRK12323  524 ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM---FDGDWPALAARLPVRGLAQQ 596
NrfG COG4235
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ...
604-710 1.02e-05

Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443378 [Multi-domain]  Cd Length: 131  Bit Score: 45.77  E-value: 1.02e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 604 GRVMYSMANCLLLMKDYVLAVEAYHAVIKYYPEqEPQLLSGIGRISLQIGDIKTAEKYFQDVekvtQKLDGLQgkIMVLM 683
Cdd:COG4235    17 AEGWLLLGRAYLRLGRYDEALAAYEKALRLDPD-NADALLDLAEALLAAGDTEEAEELLERA----LALDPDN--PEALY 89
                          90       100
                  ....*....|....*....|....*..
gi 1622856808 684 NRAFLHLGQNNFAEAHRFFTEILRMDP 710
Cdd:COG4235    90 LLGLAAFQQGDYAEAIAAWQKLLALLP 116
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
142-323 1.26e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 48.72  E-value: 1.26e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 142 RAREEAEPGGEGDPGPEPASTPSPTGEVHGDcAPEDAPEDAAPRSGGAPRQDAAREAPGseAARPEQEPPAAePVPVCTI 221
Cdd:PRK12323  389 AAAPAAAAPAPAAPPAAPAAAPAAAAAARAV-AAAPARRSPAPEALAAARQASARGPGG--APAPAPAPAAA-PAAAARP 464
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 222 FSQRAPPAPRDGFEPQMVKSPSfGGASEAPARTPPQAVQPS--PSLSTFFGDAAASHSLASDFFDSFTTSAfvsvsnPGA 299
Cdd:PRK12323  465 AAAGPRPVAAAAAAAPARAAPA-AAPAPADDDPPPWEELPPefASPAPAQPDAAPAGWVAESIPDPATADP------DDA 537
                         170       180
                  ....*....|....*....|....
gi 1622856808 300 GSPASASPPPVSVPGTEGRPEPAA 323
Cdd:PRK12323  538 FETLAPAPAAAPAPRAAAATEPVV 561
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
35-261 1.58e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.44  E-value: 1.58e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  35 RWSRPQVTPFPQPQWAgGRGMEDAGGGEETPAPEAPHPPQLAPPEEqgllfheetidlggDEFGSEENETASEGSSPlad 114
Cdd:PRK07764  607 GPPEEAARPAAPAAPA-APAAPAPAGAAAAPAEASAAPAPGVAAPE--------------HHPKHVAVPDASDGGDG--- 668
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 115 klnehmmesvliSDSPNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTgevhgdcaPEDAPEDAAPRSGGAPRQDA 194
Cdd:PRK07764  669 ------------WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT--------PPAGQADDPAAQPPQAAQGA 728
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622856808 195 AREAPGSEAARPEQEPPAAEPVPVCTIFSQRAPPAPRDGFEPQMVKSPSFGGASEAPARTPPQAVQP 261
Cdd:PRK07764  729 SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDD 795
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
140-264 2.42e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.95  E-value: 2.42e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 140 LGRAREEAEPGGEGDPGPEPASTPSPtgevhgdcAPEDAPEDAAPRSGGAPRQDA-AREAPGSEAARPEQEPPAAEPVPV 218
Cdd:PRK12323  433 LAAARQASARGPGGAPAPAPAPAAAP--------AAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPP 504
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1622856808 219 CtiFSQRAPPAPRDGFEPQMVKSPSFGGASEAPARTPPQAVQPSPS 264
Cdd:PRK12323  505 E--FASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAA 548
CpoB COG1729
Cell division protein CpoB, coordinates peptidoglycan biosynthesis and outer membrane ...
601-667 2.82e-05

Cell division protein CpoB, coordinates peptidoglycan biosynthesis and outer membrane constriction [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 441335 [Multi-domain]  Cd Length: 113  Bit Score: 43.83  E-value: 2.82e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1622856808 601 SRLGRVMYSMANCLLLMKDYVLAVEAYHAVIKYYPEQE--PQLLSGIGRISLQIGDIKTAEKYFQDVEK 667
Cdd:COG1729    27 PLAPDALYWLGEAYYALGDYDEAAEAFEKLLKRYPDSPkaPDALLKLGLSYLELGDYDKARATLEELIK 95
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
129-297 3.06e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 47.37  E-value: 3.06e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 129 SPNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPE---------DAAPRSGGAPRQDAAREAP 199
Cdd:PRK14959  380 APSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSprvpwddapPAPPRSGIPPRPAPRMPEA 459
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 200 GSEAARPEQEPPAAEPVPVCTIFSQRAPPAPR-----DGFEPQMVKSPSFGGASEAPAR-TPPQAVQPSPSLSTF----- 268
Cdd:PRK14959  460 SPVPGAPDSVASASDAPPTLGDPSDTAEHTPSgprtwDGFLEFCQGRNGQGGRLATVLRqATPEHADGRLRLATMssvqy 539
                         170       180       190
                  ....*....|....*....|....*....|.
gi 1622856808 269 --FGDAAASHSLASDFFDSFTTSAFVSVSNP 297
Cdd:PRK14959  540 erLTDAATETTLAGLVRDYFGDACRVEVLPP 570
PHA03247 PHA03247
large tegument protein UL36; Provisional
19-323 3.52e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 3.52e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808   19 RAAGSSRIPGFCHLLGRWSRPQVTPFPQPQWAggrgMEDAGGGEETPAPeAPHPPQLAPPEEQGllfheetidlggdefg 98
Cdd:PHA03247  2666 RARRLGRAAQASSPPQRPRRRAARPTVGSLTS----LADPPPPPPTPEP-APHALVSATPLPPG---------------- 2724
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808   99 seenETASEGSSPLADklnehmmesvLISDSPNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPsPTGEVHGDCAPEDA 178
Cdd:PHA03247  2725 ----PAAARQASPALP----------AAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP-AAGPPRRLTRPAVA 2789
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  179 PEDAAPRSGGAPRQ--DAAREAPGSEAARPEQEPPAAePVPVCTIFSQRAPPAPRDGFEPQMVKSPSFGGASEAPARTPP 256
Cdd:PHA03247  2790 SLSESRESLPSPWDpaDPPAAVLAPAAALPPAASPAG-PLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPS 2868
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622856808  257 QAVQPSPslstffgdAAASHSLASdffdSFTTSAFVSVSNPGAGSPASASPPPVSVPGTEGRPEPAA 323
Cdd:PHA03247  2869 RSPAAKP--------AAPARPPVR----RLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
tol_pal_ybgF TIGR02795
tol-pal system protein YbgF; Members of this protein family are the product of one of seven ...
608-667 6.97e-05

tol-pal system protein YbgF; Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, and a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, which is involved in protein-protein interaction.


Pssm-ID: 188247 [Multi-domain]  Cd Length: 117  Bit Score: 43.04  E-value: 6.97e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622856808 608 YSMANCLLLMKDYVLAVEAYHAVIKYYPEQE--PQLLSGIGRISLQIGDIKTAEKYFQDVEK 667
Cdd:TIGR02795  41 YWLGEAYYAQGDYADAAKAFLAVVKKYPKSPkaPDALLKLGMSLQELGDKEKAKATLQQVIK 102
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
650-747 7.75e-05

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 42.08  E-value: 7.75e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 650 LQIGDIKTAEKYFQDVekvtQKLDGlqGKIMVLMNRAFLHLGQNNFAEAHRFfTEILRMDPTNAVANNNAAVCLLYLGKL 729
Cdd:COG3063     3 LKLGDLEEAEEYYEKA----LELDP--DNADALNNLGLLLLEQGRYDEAIAL-EKALKLDPNNAEALLNLAELLLELGDY 75
                          90
                  ....*....|....*...
gi 1622856808 730 KDSLRQLEAMVQQDPRHY 747
Cdd:COG3063    76 DEALAYLERALELDPSAL 93
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
613-710 8.97e-05

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 42.08  E-value: 8.97e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 613 CLLLMKDYVLAVEAYHAVIKYYPEQePQLLSGIGRISLQIGDIKTAEKYFQdvekvTQKLDglQGKIMVLMNRAFLHLGQ 692
Cdd:COG3063     1 LYLKLGDLEEAEEYYEKALELDPDN-ADALNNLGLLLLEQGRYDEAIALEK-----ALKLD--PNNAEALLNLAELLLEL 72
                          90
                  ....*....|....*...
gi 1622856808 693 NNFAEAHRFFTEILRMDP 710
Cdd:COG3063    73 GDYDEALAYLERALELDP 90
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
156-323 9.92e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 9.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 156 GPEpASTPSPTGEVHGDCAPEDApeDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAaePVPVCTIFSQRAP-PAPRD-G 233
Cdd:pfam05109 424 APE-STTTSPTLNTTGFAAPNTT--TGLPSSTHVPTNLTAPASTGPTVSTADVTSPT--PAGTTSGASPVTPsPSPRDnG 498
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 234 FEPqmvKSPSFGGASEAPARTPPQAVQPSPSLSTFFGDaAASHSLASdffDSFTTSAFVSVSNPGAGSPASASPPPVSVP 313
Cdd:pfam05109 499 TES---KAPDMTSPTSAVTTPTPNATSPTPAVTTPTPN-ATSPTLGK---TSPTSAVTTPTPNATSPTPAVTTPTPNATI 571
                         170
                  ....*....|
gi 1622856808 314 GTEGRPEPAA 323
Cdd:pfam05109 572 PTLGKTSPTS 581
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
179-316 1.02e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.86  E-value: 1.02e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 179 PEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPVCTIFSQRAPPAPRDGFEPQMVKSPsfggASEAPARTPPQA 258
Cdd:PRK14951  366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAP----AAAAPAAAPAAA 441
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1622856808 259 VQPSPSlstffgdAAASHSLASDFFDSFTTSAFVSVSNPGAGSPASASPPPVSVPGTE 316
Cdd:PRK14951  442 PAAVAL-------APAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
66-326 1.09e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.93  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808   66 APEAPHPPQLAPPEEQGLLFHEETIDLGGDEFGSEENETASEGSSPLADKLNEHMMESVLISDSPNNSEGDAGDL----- 140
Cdd:PHA03307    18 GEFFPRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLstlap 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  141 GRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPVCT 220
Cdd:PHA03307    98 ASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLS 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  221 IfSQRAPPAPRDGFEPQMVKSPSFGGASEAPARTPPQAvQPSPSLSTFFGDAAASHSLASDffDSFTTSAFVSVSNPGAG 300
Cdd:PHA03307   178 S-PEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPIS-ASASSPAPAPGRSAADDAGASS--SDSSSSESSGCGWGPEN 253
                          250       260
                   ....*....|....*....|....*.
gi 1622856808  301 SPASASPPPVSVPGTEGRPEPAAMRG 326
Cdd:PHA03307   254 ECPLPRPAPITLPTRIWEASGWNGPS 279
CpoB COG1729
Cell division protein CpoB, coordinates peptidoglycan biosynthesis and outer membrane ...
614-710 1.20e-04

Cell division protein CpoB, coordinates peptidoglycan biosynthesis and outer membrane constriction [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 441335 [Multi-domain]  Cd Length: 113  Bit Score: 42.29  E-value: 1.20e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 614 LLLMKDYVLAVEAYHAVIKYYPEQE--PQLLSGIGRISLQIGDIKTAEKYFQDVEKV---TQKLDglqgkiMVLMNRAFL 688
Cdd:COG1729     3 LLKAGDYDEAIAAFKAFLKRYPNSPlaPDALYWLGEAYYALGDYDEAAEAFEKLLKRypdSPKAP------DALLKLGLS 76
                          90       100
                  ....*....|....*....|..
gi 1622856808 689 HLGQNNFAEAHRFFTEILRMDP 710
Cdd:COG1729    77 YLELGDYDKARATLEELIKKYP 98
PHA03169 PHA03169
hypothetical protein; Provisional
46-180 1.28e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 45.35  E-value: 1.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  46 QPQWAGGRGMEDAGGGEETPAPEA-PHPPQLAPPEEqgllfHEETIDLGGDEFGSEENETASEGSSPlADKLNEHMMESV 124
Cdd:PHA03169  119 SPENTSGSSPESPASHSPPPSPPShPGPHEPAPPES-----HNPSPNQQPSSFLQPSHEDSPEEPEP-PTSEPEPDSPGP 192
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1622856808 125 LISDSPNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPE 180
Cdd:PHA03169  193 PQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFP 248
PHA03169 PHA03169
hypothetical protein; Provisional
51-217 1.29e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 45.35  E-value: 1.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  51 GGRGMEDAGGGEETPAPEAPHPPQLAPPEEQGLlfHEETIDlggDEFGSEENETASEGSSPLADklnehmmesvlisDSP 130
Cdd:PHA03169  114 LASGLSPENTSGSSPESPASHSPPPSPPSHPGP--HEPAPP---ESHNPSPNQQPSSFLQPSHE-------------DSP 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 131 NNSEGdagdlgrAREEAEPGGEGDP-GPEPASTPSPTGEvhgdcaPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQE 209
Cdd:PHA03169  176 EEPEP-------PTSEPEPDSPGPPqSETPTSSPPPQSP------PDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPER 242

                  ....*...
gi 1622856808 210 PPAAEPVP 217
Cdd:PHA03169  243 EGPPFPGH 250
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
145-275 1.32e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 45.44  E-value: 1.32e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 145 EEAEPGGEGDPGPEPASTPSPTGEVHGDCAP--EDAPEDAAPRSGGAPrqdaAREAPGSEAarpeqepPAAEPVPvcTIF 222
Cdd:PRK14959  369 ESLRPSGGGASAPSGSAAEGPASGGAATIPTpgTQGPQGTAPAAGMTP----SSAAPATPA-------PSAAPSP--RVP 435
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1622856808 223 SQRAPPAP-RDGFEPQmvKSPSFGGASEAPARtpPQAVQPSPSLSTFFGDAAAS 275
Cdd:PRK14959  436 WDDAPPAPpRSGIPPR--PAPRMPEASPVPGA--PDSVASASDAPPTLGDPSDT 485
PHA03378 PHA03378
EBNA-3B; Provisional
123-323 1.56e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.44  E-value: 1.56e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 123 SVLISDSPNNSEGDAGDLGRARE-EAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGs 201
Cdd:PHA03378  684 MLPIQWAPGTMQPPPRAPTPMRPpAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG- 762
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 202 EAARPEQEPPAAEPVPvctifSQRAPPAPRdgfepqmvKSPSFGGASEAPARTPPQAVQPSPSLSTffGDAAASHSLASD 281
Cdd:PHA03378  763 RARPPAAAPGAPTPQP-----PPQAPPAPQ--------QRPRGAPTPQPPPQAGPTSMQLMPRAAP--GQQGPTKQILRQ 827
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622856808 282 FFDSFTTSAFVSVSNPGAG----------SPASAS-----------PP---PVSVPGTEGRPEPAA 323
Cdd:PHA03378  828 LLTGGVKRGRPSLKKPAALerqaaagptpSPGSGTsdkivqapvfyPPvlqPIQVMRQLGSVRAAA 893
PHA03247 PHA03247
large tegument protein UL36; Provisional
136-326 1.62e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 1.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  136 DAGDLGRAREEAEPGGEGD---PGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGS----EAARPEQ 208
Cdd:PHA03247  2547 DAGDPPPPLPPAAPPAAPDrsvPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSplppDTHAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  209 EPPA----------AEPVPVCTIFSQRAPPAPRDGFEPQMVKSPSFGGASEAPARTP-PQAVQPSPSLSTFFGD------ 271
Cdd:PHA03247  2627 PPPSpspaanepdpHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPrRRAARPTVGSLTSLADpppppp 2706
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622856808  272 --AAASHSLASDffdSFTTSAFVSVSNPGAGSPASASPPPV----SVPGTEGRP-EPAAMRG 326
Cdd:PHA03247  2707 tpEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVpagpATPGGPARPaRPPTTAG 2765
NrfG COG4235
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ...
623-744 1.69e-04

Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443378 [Multi-domain]  Cd Length: 131  Bit Score: 42.30  E-value: 1.69e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 623 AVEAYHAVIKYYPEQePQLLSGIGRISLQIGDIKTAEKYFQDVekvtQKLDGlqGKIMVLMNRAFLHLGQNNFAEAHRFF 702
Cdd:COG4235     2 AIARLRQALAANPND-AEGWLLLGRAYLRLGRYDEALAAYEKA----LRLDP--DNADALLDLAEALLAAGDTEEAEELL 74
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 1622856808 703 TEILRMDPTNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDP 744
Cdd:COG4235    75 ERALALDPDNPEALYLLGLAAFQQGDYAEAIAAWQKLLALLP 116
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
136-441 1.78e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 45.08  E-value: 1.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 136 DAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAA---------PRSGGAPRQDAAREAPGSEAARP 206
Cdd:PRK08691  367 DANAVIENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAmpsegktagPVSNQENNDVPPWEDAPDEAQTA 446
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 207 EqePPAAEPVPVCTIFSQRAPPAPRDGFEPQMVKSPSFGGASEAPARTPPQAVQPSPSLSTffgdAAASHSLASDFFDSF 286
Cdd:PRK08691  447 A--GTAQTSAKSIQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVET----ETFAHEAPAEPFYGY 520
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 287 TTSAFVSVSNPGAGSP----ASASPPPVSVPGTEGRPEPAAMRGPQAAAPPASPEPFAHIQAVFAGsddpFATALSMSEM 362
Cdd:PRK08691  521 GFPDNDCPPEDGAEIPppdwEHAAPADTAGGGADEEAEAGGIGGNNTPSAPPPEFSTENWAAIVRH----FARKLGAAQM 596
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 363 DRRNDAWLPGEATRGVLR-AVAAQQR---GAVFVDKENLTMP---GLRFDNIQGDAVKDLML--------RFLGEKAAAK 427
Cdd:PRK08691  597 PAQHSAWTEYHPDTGLMVlAMTAEARataDKKRLDKIRDTLAqayGLQLTLQTQDWRDEAGRetpamqdkRVQAEDRQKA 676
                         330
                  ....*....|....
gi 1622856808 428 RQVLNADSVEQSFV 441
Cdd:PRK08691  677 QALLEADPAAQKIL 690
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
167-401 2.16e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.87  E-value: 2.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 167 GEVHGDCAPedAPEDAAPRSGGAP---RQDAAREAPGSEAARPEQEPPAAEPVPVCTIFSQRAPPAPrDGFEPQMVKSPS 243
Cdd:PRK12323  366 GQSGGGAGP--ATAAAAPVAQPAPaaaAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP-EALAAARQASAR 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 244 FGGASEAPARTPPQAvqPSPSLSTffgdAAAShslasdffdsfttsaFVSVSNPGAGSPASASPPPVSVPGTEGRPEPAA 323
Cdd:PRK12323  443 GPGGAPAPAPAPAAA--PAAAARP----AAAG---------------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEE 501
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622856808 324 MRGPQAAAPPASPEPFAHIQAVFAGSDDPFATALSMSEMDRRNDAWLPGEATRGVLRAVAAQQRGAVFVDKENLTMPG 401
Cdd:PRK12323  502 LPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDG 579
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
39-323 2.41e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 2.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  39 PQVTPFPQPQwaggrgmedagGGEETPAPEAPHPPQLAPPEEQgllfHEetidLGGDEFGSEENETASEGSSPLADKLNE 118
Cdd:pfam03154 259 SQVSPQPLPQ-----------PSLHGQMPPMPHSLQTGPSHMQ----HP----VPPQPFPLTPQSSQSQVPPGPSPAAPG 319
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 119 HMMESvlISDSPNNSEGDAGDlgRAREEAEPGGegdPGPEPASTPSPTGEVHGDCAPEdAPEDAAPRSGGAPRQDAAREA 198
Cdd:pfam03154 320 QSQQR--IHTPPSQSQLQSQQ--PPREQPLPPA---PLSMPHIKPPPTTPIPQLPNPQ-SHKHPPHLSGPSPFQMNSNLP 391
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 199 PGSE----AARPEQEPPAAEPVPVCTI-FSQRAPPAPRDgfEPQMVKSPSF-GGASEAPartPPQAVQPSPSLSTFfgda 272
Cdd:pfam03154 392 PPPAlkplSSLSTHHPPSAHPPPLQLMpQSQQLPPPPAQ--PPVLTQSQSLpPPAASHP---PTSGLHQVPSQSPF---- 462
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1622856808 273 aASHSlasdffdsFTTSAFVSVSNPgAGSPASASP------PPVSVPGTEGRPEPAA 323
Cdd:pfam03154 463 -PQHP--------FVPGGPPPITPP-SGPPTSTSSampgiqPPSSASVSSSGPVPAA 509
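
The Atrophin-1 family description above attributes DRPLA to an expanded CAG repeat that is translated into a longer polyglutamine tract, since CAG encodes glutamine. The following is a minimal sketch of how such a repeat could be measured in a DNA string. The function names `longest_cag_run` and `polyglutamine_tract` are hypothetical, the example sequence is invented for illustration, and the search ignores reading frame, so this is not a diagnostic method.

```python
import re


def longest_cag_run(dna: str) -> int:
    """Return the largest number of back-to-back CAG triplets in `dna`.

    Assumption: reading frame is ignored; any uninterrupted CAGCAG... run counts.
    """
    runs = re.findall(r"(?:CAG)+", dna.upper())
    return max((len(run) // 3 for run in runs), default=0)


def polyglutamine_tract(n_repeats: int) -> str:
    """Translate n CAG codons into the glutamine (Q) tract they encode."""
    return "Q" * n_repeats


# Invented example sequence: five CAG triplets flanked by other bases.
example = "ATGGCC" + "CAG" * 5 + "TTTGA"
n = longest_cag_run(example)
print(n, polyglutamine_tract(n))
```

Counting triplets rather than bases keeps the result directly comparable to the length of the encoded glutamine tract.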
PHA03269 PHA03269
envelope glycoprotein C; Provisional
158-309 2.81e-04

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 44.33  E-value: 2.81e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 158 EPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAArEAPGSEAARPEQepPAAEPVPVCtifSQRAPPAPRDgfEPQ 237
Cdd:PHA03269   20 ANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPA-VAPTSAASRKPD--LAQAPTPAA---SEKFDPAPAP--HQA 91
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622856808 238 MVKSPSfggaseaPARTPPQAVQPSPSLSTFFGDAAASHSLASDffdsFTTSAFVSVSNPGAGSpaSASPPP 309
Cdd:PHA03269   92 ASRAPD-------PAVAPQLAAAPKPDAAEAFTSAAQAHEAPAD----AGTSAASKKPDPAAHT--QHSPPP 150
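
Each hit in this listing carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The sketch below shows one way such a line might be parsed into typed fields. The names `HitSummary` and `parse_hit_summary` are hypothetical, and the regular expression is an assumption based only on the lines visible on this page, where the `[Multi-domain]` tag is sometimes absent.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern inferred from the summary lines on this page (an assumption,
# not an official format specification).
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*(\S+)"
)


def parse_hit_summary(line: str) -> Optional[HitSummary]:
    """Parse one summary line; return None if the line does not match."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


hit = parse_hit_summary(
    "Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 44.33  E-value: 2.81e-04"
)
print(hit)
```

Returning `None` for non-matching lines lets a caller scan the whole listing line by line and keep only the summaries, for example to sort hits by E-value.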
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
605-709 2.87e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 43.56  E-value: 2.87e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 605 RVMYSMANCLLLMKDYVLAVEAYHAVIKYYPEQEPqLLSGIGRISLQIGDIKTAEKYFQDVEKvtqkldgLQGKIMVLMN 684
Cdd:COG2956   179 RALLLLAELYLEQGDYEEAIAALERALEQDPDYLP-ALPRLAELYEKLGDPEEALELLRKALE-------LDPSDDLLLA 250
                          90       100
                  ....*....|....*....|....*
gi 1622856808 685 RAFLHLGQNNFAEAHRFFTEILRMD 709
Cdd:COG2956   251 LADLLERKEGLEAALALLERQLRRH 275
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
149-272 4.78e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.55  E-value: 4.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 149 PGGEGDPGPEPASTPSPTGEVHGDCAPE------DAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPVCTIF 222
Cdd:PRK14951  367 AAAAEAAAPAEKKTPARPEAAAPAAAPVaqaaaaPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVA 446
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1622856808 223 SQRAPPAPRDGFEPQMVKSPSFGGASEAPARTPPQAVQPSPSLSTFFGDA 272
Cdd:PRK14951  447 LAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGDV 496
PHA03418 PHA03418
hypothetical E4 protein; Provisional
144-263 5.33e-04

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 42.42  E-value: 5.33e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 144 REEAEPGGEGDPgpEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARP-----EQEPPAAEPVPV 218
Cdd:PHA03418   63 RPPAQPNGHNKP--PVTKQPGGEGTEEDHQAPLAADADDDPRPGKRSKADEHGPAPGRAALAPfkldlDQDPLHGDPDPP 140
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1622856808 219 CTIFSQRAPPAPRDGFEPQmvksPSFG---GASEAPARTPPQAVQPSP 263
Cdd:PHA03418  141 PGATGGQGEEPPEGGEESQ----PPLGegeGAVEGHPPPLPPAPEPKP 184
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
141-235 6.52e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.44  E-value: 6.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 141 GRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPvct 220
Cdd:PRK07764  421 AAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAP--- 497
                          90
                  ....*....|....*
gi 1622856808 221 ifSQRAPPAPRDGFE 235
Cdd:PRK07764  498 --AAPAAPAGADDAA 510
PEP_TPR_lipo TIGR02917
putative PEP-CTERM system TPR-repeat lipoprotein; This protein family occurs strictly ...
614-794 7.31e-04

putative PEP-CTERM system TPR-repeat lipoprotein; This protein family occurs strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The presence of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.


Pssm-ID: 274350 [Multi-domain]  Cd Length: 899  Bit Score: 43.15  E-value: 7.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 614 LLLMKDYVLAVEAYHAVIKYYPEqEPQLLSGIGRISLQIGDIKTAEKYFQDV-EKVTQKLDGLQGKIMVlmnraflHLGQ 692
Cdd:TIGR02917 203 LLSLGNIELALAAYRKAIALRPN-NIAVLLALATILIEAGEFEEAEKHADALlKKAPNSPLAHYLKALV-------DFQK 274
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 693 NNFAEAHRFFTEILRMDPTNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDPRhyLHESVLfnLTTMYELESSRSMQkKQ 772
Cdd:TIGR02917 275 KNYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPN--SHQARR--LLASIQLRLGRVDE-AI 349
                         170       180
                  ....*....|....*....|..
gi 1622856808 773 ALLEAVAGKegDSFNTQCLKLA 794
Cdd:TIGR02917 350 ATLSPALGL--DPDDPAALSLL 369
PHA03247 PHA03247
large tegument protein UL36; Provisional
143-311 7.53e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 7.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  143 AREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPVctif 222
Cdd:PHA03247  2913 APPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREA---- 2988
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  223 SQRAPPAPRDGFEPQMVKSPSFGGASEAPARTP---------PQAVQPSPSLSTFFGDAAASHSLASDFFDSFTTSAFVS 293
Cdd:PHA03247  2989 PASSTPPLTGHSLSRVSSWASSLALHEETDPPPvslkqtlwpPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAH 3068
                          170       180
                   ....*....|....*....|....*
gi 1622856808  294 VSNPG---AGSPASAS----PPPVS 311
Cdd:PHA03247  3069 EPDPAtpeAGARESPSsqfgPPPLS 3093
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
141-316 8.31e-04

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 42.55  E-value: 8.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 141 GRAREEAEPGGEGdPGPEPASTPSPTGEVHGDCAP--EDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPV 218
Cdd:cd23959    59 GAVSPEGENPFDG-PGLVTASTVSDCYVGNANFYEvdMSDAFAMAPDESLGPFRAARVPNPFSASSSTQRETHKTAQVAP 137
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 219 cTIFSQRAPPAPRDGfepQMvksPSFGGASEAPARTPPQAV-QPSPSLSTFFGDAAASHSLASDFFDSFTTSAFVSVSNP 297
Cdd:cd23959   138 -PKAEPQTAPVTPFG---QL---PMFGQHPPPAKPLPAAAAaQQSSASPGEVASPFASGTVSASPFATATDTAPSSGAPD 210
                         170
                  ....*....|....*....
gi 1622856808 298 GAGSPASAsPPPVSVPGTE 316
Cdd:cd23959   211 GFPAEASA-PSPFAAPASA 228
PHA03378 PHA03378
EBNA-3B; Provisional
34-321 8.76e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.13  E-value: 8.76e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  34 GRWSRPQVTPFPQP-----QWAGGRGMEDAGGGEETPAPEAPHPPqLAPPEEQGLLFHEEtidlggdefgSEENETASEG 108
Cdd:PHA03378  436 ARTEQPRATPHSQAptvvlHRPPTQPLEGPTGPLSVQAPLEPWQP-LPHPQVTPVILHQP----------PAQGVQAHGS 504
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 109 SSPLADKLNEHMMESV---LISDSPNNSEGD-------AGDLGRarEEAEPGGEGD--------PGPEP----------- 159
Cdd:PHA03378  505 MLDLLEKDDEDMEQRVmatLLPPSPPQPRAGrrapcvyTEDLDI--ESDEPASTEPvhdqllpaPGLGPlqiqpltsptt 582
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 160 ----ASTPSPTGE----VHGDCAPED------APEDAAPRSGGAP------RQDAAREAPGSEAARPE-QEPPAAEPVPV 218
Cdd:PHA03378  583 sqlaSSAPSYAQTpwpvPHPSQTPEPpttqshIPETSAPRQWPMPlrpipmRPLRMQPITFNVLVFPTpHQPPQVEITPY 662
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 219 CTIFSQ--RAPPAPRDGFEPQMVKSPSFGGASEAPARTP----PQAVQPSPSLSTFFGDAAASHSLASdffdsfTTSAFV 292
Cdd:PHA03378  663 KPTWTQigHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPtpmrPPAAPPGRAQRPAAATGRARPPAAA------PGRARP 736
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|
gi 1622856808 293 SVSNPGAGSPASASP----PPVSVPG-------TEGRPEP 321
Cdd:PHA03378  737 PAAAPGRARPPAAAPgrarPPAAAPGrarppaaAPGAPTP 776
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
37-400 9.69e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 9.69e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808   37 SRPQVTPFPQPQWAGGRGMEDAGGGEETPAPEAPHPPQLAPPEEQGllfHEETIDLGGDEFGSEENETASEGSSPLADKL 116
Cdd:PHA03307    82 NESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP---PSPAPDLSEMLRPVGSPGPPPAASPPAAGAS 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  117 NEHmmesvliSDSPNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEvhGDCAPEDAPE-DAAPRSGGAPRQDAA 195
Cdd:PHA03307   159 PAA-------VASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAAS--PRPPRRSSPIsASASSPAPAPGRSAA 229
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  196 REAPGSEaarpeqEPPAAEPVPVCTIFSQRAPPAPRDGfePQMVKSPSFGGASEAPARTPPQAVQPSPSLSTFFGDAAAS 275
Cdd:PHA03307   230 DDAGASS------SDSSSSESSGCGWGPENECPLPRPA--PITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPS 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  276 HSLASDFFDSFTTSAFVSVSNPGAGSPASASPPPVSVPGTEGRPEPAAMRGPQAAAPPAS---------PEPFAHIQAVF 346
Cdd:PHA03307   302 SPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADpssprkrprPSRAPSSPAAS 381
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1622856808  347 AGSDDPFATALSMSEMDRRNDAWLPGEATRGVLRAVAAQQRGAVFVDKENLTMP 400
Cdd:PHA03307   382 AGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTP 435
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
139-233 1.13e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 42.24  E-value: 1.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 139 DLGRAREEAEPGGEGDPgPEPASTPSPtgevhgdcAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPV 218
Cdd:PRK14954  372 ELVRNDGGVAPSPAGSP-DVKKKAPEP--------DLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPL 442
                          90
                  ....*....|....*
gi 1622856808 219 CTIFSQRAPPAPRDG 233
Cdd:PRK14954  443 PPSPQASAPRNVASG 457
PHA03378 PHA03378
EBNA-3B; Provisional
36-263 1.17e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.36  E-value: 1.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  36 WSRPQVTPFpQPQWAGGRGM--EDAGGGEETPAPEAP---HPPQLAPPEEQGLLFHEETIDLGGDEFGSEENETASEGSS 110
Cdd:PHA03378  666 WTQIGHIPY-QPSPTGANTMlpIQWAPGTMQPPPRAPtpmRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRA 744
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 111 PLAdklnehmmesvliSDSPNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAP 190
Cdd:PHA03378  745 RPP-------------AAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMP 811
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 191 RQDAAREAP-----------GSEAARPEQEPPAAepvpvctiFSQRAP----PAPRDGFEPQMVKSPSFGGASEAPARTP 255
Cdd:PHA03378  812 RAAPGQQGPtkqilrqlltgGVKRGRPSLKKPAA--------LERQAAagptPSPGSGTSDKIVQAPVFYPPVLQPIQVM 883

                  ....*...
gi 1622856808 256 PQAVQPSP 263
Cdd:PHA03378  884 RQLGSVRA 891
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
175-304 1.28e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 42.04  E-value: 1.28e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 175 PEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPvctifSQRAPPAPRDGFE----------PQMV----- 239
Cdd:PRK14965  385 PSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAP-----PAAAAPPARSADPaaaasagdrwRAFVafvkg 459
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622856808 240 KSPSFGGASE--APARTPPQAVQPS-PSLSTFF-----GDAAAS-HSLASDFFDSFTTSAFVSVSNPGAGSPAS 304
Cdd:PRK14965  460 KKPALGASLEqgSPLGVSAGLLEIGfPEGSFELsamqdPDSRAElKALAEQFFGRPTRLRITVLAAPPGAAPPS 533
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
631-761 1.30e-03

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 41.15  E-value: 1.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 631 IKYYPEQePQLLSGIGRISLQIGDIKTAEKYFQDVEKVTQKLdglqgkIMVLMNRAFLHLGQNNFAEAHRFFTEILRMDP 710
Cdd:COG0457     1 LELDPDD-AEAYNNLGLAYRRLGRYEEAIEDYEKALELDPDD------AEALYNLGLAYLRLGRYEEALADYEQALELDP 73
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1622856808 711 TNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDPRHYlheSVLFNLTTMYE 761
Cdd:COG0457    74 DDAEALNNLGLALQALGRYEEALEDYDKALELDPDDA---EALYNLGLALL 121
PHA03379 PHA03379
EBNA-3A; Provisional
64-263 1.33e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 42.35  E-value: 1.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  64 TPAPEAPHPPQLAPPEEqgllfheETIDLGGDEFGSEENETASEGSSPLADklnEHMMESVLISDSPNNSEGD--AGDLg 141
Cdd:PHA03379  415 TPRPPVEKPRPEVPQSL-------ETATSHGSAQVPEPPPVHDLEPGPLHD---QHSMAPCPVAQLPPGPLQDlePGDQ- 483
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 142 rareeaEPGGEGDPGPEPASTPSPTGevhgdcaPEDAPEDAAPRSggAPRQDAAREAPGSEAARPEQEPPAAEPVPVCTI 221
Cdd:PHA03379  484 ------LPGVVQDGRPACAPVPAPAG-------PIVRPWEASLSQ--VPGVAFAPVMPQPMPVEPVPVPTVALERPVCPA 548
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 1622856808 222 FSQRAPPAPRdgfEPQMVKSPSFGGASEAPARTPPQAVQPSP 263
Cdd:PHA03379  549 PPLIAMQGPG---ETSGIVRVRERWRPAPWTPNPPRSPSQMS 587
PHA03169 PHA03169
hypothetical protein; Provisional
134-263 1.66e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 41.49  E-value: 1.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 134 EGDAGDLGRAREEAE--PGGEGDPGPEPASTPSP-TGEVHGDCAPEDAPEDAAPRSGGAPRQDA-------AREAPGSEA 203
Cdd:PHA03169   87 RGQGGPSGSGSESVGspTPSPSGSAEELASGLSPeNTSGSSPESPASHSPPPSPPSHPGPHEPAppeshnpSPNQQPSSF 166
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 204 ARPEQEPPAAEPVPVCTIFSQRAPPAPrDGFEPQMVKSPSFGGASEAPARTPPQAVQPSP 263
Cdd:PHA03169  167 LQPSHEDSPEEPEPPTSEPEPDSPGPP-QSETPTSSPPPQSPPDEPGEPQSPTPQQAPSP 225
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
175-321 1.98e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 41.62  E-value: 1.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 175 PEDAPEDAAPrsggaprqdAAREAPgseaARPEQEPPAAEPVPVCTIFSQRAPPAPRDGFEPQMVKSPSFGGASEAPART 254
Cdd:PRK14951  366 PAAAAEAAAP---------AEKKTP----ARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAA 432
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622856808 255 PPQAVQPSPSlstffgdAAASHSLASDFFDSFTTSAFVSVSNPGAGSPASASPPPVSVPGTEGRPEP 321
Cdd:PRK14951  433 APAAAPAAAP-------AAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
168-324 2.38e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 2.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 168 EVHGDCAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPAAEPVPvctifsQRAPPAPRDGFEPQMVKSPSFGga 247
Cdd:PRK07764  577 ELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAA------PAPAGAAAAPAEASAAPAPGVA-- 648
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622856808 248 seAPARTPPQAVQPSPSLSTFFGDAAAshslasdffDSFTTSAFVSVSNPGAGSPASASPPPVSVPGTEGRPEPAAM 324
Cdd:PRK07764  649 --APEHHPKHVAVPDASDGGDGWPAKA---------GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA 714
TPR_2 pfam07719
Tetratricopeptide repeat; This Pfam entry includes outlying Tetratricopeptide-like repeats ...
681-711 4.21e-03

Tetratricopeptide repeat; This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515.


Pssm-ID: 429619 [Multi-domain]  Cd Length: 33  Bit Score: 35.58  E-value: 4.21e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1622856808 681 VLMNRAFLHLGQNNFAEAHRFFTEILRMDPT 711
Cdd:pfam07719   3 ALYNLGLAYYKLGDYEEALEAYEKALELDPN 33
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
133-233 4.25e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 4.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 133 SEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHgdcAPEDAPEDAAPRSGGAPRQDAAREAPGSEAARPEQEPPA 212
Cdd:PRK07764  409 APAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAG---NAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPP 485
                          90       100
                  ....*....|....*....|.
gi 1622856808 213 AEPVPVCTIFSQRAPPAPRDG 233
Cdd:PRK07764  486 AAPAPAAAPAAPAAPAAPAGA 506
PHA03247 PHA03247
large tegument protein UL36; Provisional
143-322 4.52e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.69  E-value: 4.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  143 AREEAEPGGEGDPGPEPASTPS----------PTGE---------VHG-----------------DCAPEDAPEDAAPRS 186
Cdd:PHA03247  2492 AGAAPDPGGGGPPDPDAPPAPSrlapailpdePVGEpvhprmltwIRGleelasddagdpppplpPAAPPAAPDRSVPPP 2571
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  187 GGAPRqdAAREAPGSEAARPEQEPPAAEPvpvctifsqRAPPAPRDGFEPQMVKSPsfggASEAPARTPPQAVQPSPSLS 266
Cdd:PHA03247  2572 RPAPR--PSEPAVTSRARRPDAPPQSARP---------RAPVDDRGDPRGPAPPSP----LPPDTHAPDPPPPSPSPAAN 2636
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1622856808  267 TFFGDAAASHSLASDFFD-------SFTTSAFVSVSNPGAGSP----------------ASASPPPVSVPGTEGRPEPA 322
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDdpapgrvSRPRRARRLGRAAQASSPpqrprrraarptvgslTSLADPPPPPPTPEPAPHAL 2715
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
59-266 4.79e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 40.63  E-value: 4.79e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  59 GGGEETPAPEAPHPPQLAPPEEQGllfheetidlggdEFGSEENETASEGSSPLADKLNEhMMESVLISDSPNNSEGDAG 138
Cdd:PRK12323  368 SGGGAGPATAAAAPVAQPAPAAAA-------------PAAAAPAPAAPPAAPAAAPAAAA-AARAVAAAPARRSPAPEAL 433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 139 DLGRAREEAEPGGEGDPG------PEPASTPSPTGEVHGDCAPEDAPEDAAPRSGGAPRQDAA---REAPGSEAARPEQE 209
Cdd:PRK12323  434 AAARQASARGPGGAPAPApapaaaPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPppwEELPPEFASPAPAQ 513
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1622856808 210 PPAAEPVPVCTIFSQRAPPAPRDGFEPQMVkSPSFGGASEAPARTPPQAVQPSPSLS 266
Cdd:PRK12323  514 PDAAPAGWVAESIPDPATADPDDAFETLAP-APAAAPAPRAAAATEPVVAPRPPRAS 569
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
604-672 4.86e-03

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 37.07  E-value: 4.86e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1622856808 604 GRVMYSMANCLLLMKDYVLAvEAYHAVIKYYPEQePQLLSGIGRISLQIGDIKTAEKYFQDVEKVTQKL 672
Cdd:COG3063    26 ADALNNLGLLLLEQGRYDEA-IALEKALKLDPNN-AEALLNLAELLLELGDYDEALAYLERALELDPSA 92
PHA03247 PHA03247
large tegument protein UL36; Provisional
39-315 6.13e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.31  E-value: 6.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808   39 PQVTPFPQPQ-WAGGRGM---EDAGGGEETPAPEAPHPPqlAPPEEQGLLFHEETIDLGGDEFGSEENETASEGSSPLAD 114
Cdd:PHA03247  2704 PPPTPEPAPHaLVSATPLppgPAAARQASPALPAAPAPP--AVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  115 KLNEHMMESVLISDSPNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEDAPEDAAPRSGG-APRQD 193
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSvAPGGD 2861
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  194 AAREAP-GSEAARPEQEP---------PAAEPVPVCTIFSQRAPPAPRDGFEPQMVKSPSFGGASEAPARTPPQAVQPSP 263
Cdd:PHA03247  2862 VRRRPPsRSPAAKPAAPArppvrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1622856808  264 SLSTFFGDAAASHS--LASDFFDSFTTSAFVSVSNPGAGSPASASPPPVSVPGT 315
Cdd:PHA03247  2942 PLAPTTDPAGAGEPsgAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
PHA03325 PHA03325
nuclear-egress-membrane-like protein; Provisional
109-242 6.22e-03

nuclear-egress-membrane-like protein; Provisional


Pssm-ID: 223044  Cd Length: 418  Bit Score: 39.87  E-value: 6.22e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 109 SSPLADKLNEHMMESVLISDSPNNSEGDAGDLGRAREEAEPGGE-GDPGPEPASTPSP---TGEVHGDCAPED------A 178
Cdd:PHA03325  257 LTSSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEhSDPEPLPASLPPPpvrRPRVKHPEAGKEepdgarN 336
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622856808 179 PEDAAPRSGGAPRQ----------DAAREAPGSEAARPEQEP--PAAEPVPVCTIFSQRAPPAPRDGFEPQMVKSP 242
Cdd:PHA03325  337 AEAKEPAQPATSTSskgsssaqnkDSGSTGPGSSLAAASSFLedDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIP 412
TPR_12 pfam13424
Tetratricopeptide repeat;
641-708 6.47e-03

Tetratricopeptide repeat;


Pssm-ID: 315987 [Multi-domain]  Cd Length: 77  Bit Score: 36.21  E-value: 6.47e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 641 LLSGIGRISLQIGDIKTAEKYFQDVEKVTQKLDGLQ--GKIMVLMNRAFLHLGQNNFAEAHRFFTEILRM 708
Cdd:pfam13424   5 ALNNLAAVLRRLGRYDEALELLEKALEIARRLLGPDhpLTATTLLNLGRLYLELGRYEEALELLERALAL 74
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ...
614-744 7.07e-03

Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 38.02  E-value: 7.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 614 LLLMKDYVLAVEAYHAVIKYYPEQEPQLLSGIGRISLQIGDIKTAEKYFQdvekvtQKLDGLQGKIMVLMNRAFLHLGQN 693
Cdd:COG5010    29 AALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLGDFEESLALLE------QALQLDPNNPELYYNLALLYSRSG 102
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1622856808 694 NFAEAHRFFTEILRMDPTNAVANNNAAVCLLYLGKLKDSLRQLEAMVQQDP 744
Cdd:COG5010   103 DKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTSP 153
PHA03291 PHA03291
envelope glycoprotein I; Provisional
159-258 9.00e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 39.17  E-value: 9.00e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 159 PASTPSPTGEVHGDCAPeDAPEdAAPRSG-------GAPRQDAAREAPGSEAARPEQEPPAAEPVPVCTIFSQRAPPAPR 231
Cdd:PHA03291  172 LAAPPLGEGSADGSCDP-ALPL-SAPRLGpadvfvpATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGT 249
                          90       100
                  ....*....|....*....|....*..
gi 1622856808 232 DGFEPQMVKSPSFGGASEAPARTPPQA 258
Cdd:PHA03291  250 TPEAEGTPAPPTPGGGEAPPANATPAP 276
TPR smart00028
Tetratricopeptide repeats; Repeats present in 4 or more copies in proteins. Contain a minimum ...
605-636 9.20e-03

Tetratricopeptide repeats; Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.


Pssm-ID: 197478 [Multi-domain]  Cd Length: 34  Bit Score: 34.34  E-value: 9.20e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1622856808  605 RVMYSMANCLLLMKDYVLAVEAYHAVIKYYPE 636
Cdd:smart00028   2 EALYNLGNAYLKLGDYDEALEYYEKALELDPN 33
PHA03169 PHA03169
hypothetical protein; Provisional
50-191 9.70e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 39.18  E-value: 9.70e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  50 AGGRGMEDAGGGEETPAPEAPHPPQLAPPEEqgllfheetidlGGDEFGSEENETASEGSSPlADKLNEHMMESVLISDS 129
Cdd:PHA03169  158 SPNQQPSSFLQPSHEDSPEEPEPPTSEPEPD------------SPGPPQSETPTSSPPPQSP-PDEPGEPQSPTPQQAPS 224
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622856808 130 PNNSEGdagdlgrAREEAEPGGEGDPGPEPASTPSPTGEVHGdcapedapEDAAPRSGGAPR 191
Cdd:PHA03169  225 PNTQQA-------VEHEDEPTEPEREGPPFPGHRSHSYTVVG--------WKPSTRPGGVPK 271
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
43-263 9.91e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 39.28  E-value: 9.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808  43 PFPQPQWAGGRGMEDAGGgEETPAPEAPHPPQLAPPEEQGLLFHEETIDLGGDEfgseenetasegsSPLADKLNEHMME 122
Cdd:COG5180   203 KVEVKDEAQEEPPDLTGG-ADHPRPEAASSPKVDPPSTSEARSRPATVDAQPEM-------------RPPADAKERRRAA 268
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622856808 123 SVLISDSPNNSEGDAGDLGRAREEAEPGGEGDPGPEPASTPSPTGEVHGDCAPEdAPEDAAPRsggaPRQDAAREAPGSE 202
Cdd:COG5180   269 IGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPIDVKGVASAPPATRPVRPPGG-ARDPGTPR----PGQPTERPAGVPE 343
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622856808 203 AARPEQEPPAAEPVPvctifSQRAPPAPRDGFEPqmvkSPSFGGASEAPARTPPQAVQPSP 263
Cdd:COG5180   344 AASDAGQPPSAYPPA-----EEAVPGKPLEQGAP----RPGSSGGDGAPFQPPNGAPQPGL 395
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
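
Every hit listed above was reported under the E-value threshold of 0.01 given in the preset options. As a rough illustration only, the per-hit summary lines ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") could be parsed from a plain-text copy of this page and checked against that threshold as sketched below. The names `SUMMARY_RE`, `parse_summary`, and `passes_threshold` are hypothetical helpers, not part of any NCBI tool, and treating the threshold as inclusive is an assumption.

```python
import re

# Hypothetical pattern for the per-hit summary lines in this report, e.g.
# "Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 38.02  E-value: 7.07e-03"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


# Summary lines copied from the hits listed in this section.
lines = [
    "Pssm-ID: 315987 [Multi-domain]  Cd Length: 77  Bit Score: 36.21  E-value: 6.47e-03",
    "Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 38.02  E-value: 7.07e-03",
    "Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 39.17  E-value: 9.00e-03",
    "Pssm-ID: 197478 [Multi-domain]  Cd Length: 34  Bit Score: 34.34  E-value: 9.20e-03",
    "Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 39.18  E-value: 9.70e-03",
    "Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 39.28  E-value: 9.91e-03",
]

hits = [parse_summary(line) for line in lines]
kept = [hit for hit in hits if hit is not None and passes_threshold(hit)]
```

A stricter cutoff, such as `passes_threshold(hit, threshold=0.008)`, would keep only the two TPR-related hits with Pssm-IDs 315987 and 444034 from the lines above.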
