Conserved domains on  [gi|1119494255|ref|WP_072465175|]
PPE family protein, partial [Mycobacterium tuberculosis]

Protein Classification

PPE family protein (domain architecture ID 11475754)

proline-proline-glutamate (PPE) family protein containing pentapeptide repeats, similar to various Mycobacterium tuberculosis PPE virulence/immunogenicity factors

CATH:  1.10.287.850
SCOP:  4001235

List of domain hits

Name Accession Description Interval E-value
PPE pfam00823
PPE family; This family is named after a PPE motif near the amino terminus of the domain. The ...
5-162 6.75e-50

PPE family; This family is named after a PPE motif near the amino terminus of the domain. The PPE family of proteins all contain an amino-terminal region of about 180 amino acids. The carboxyl terminus of this family is variable, and on the basis of this region the proteins fall into at least three groups. The MPTR subgroup has tandem copies of the motif NXGXGNXG. The second subgroup contains a conserved motif at about position 350. The third group is related only in the amino-terminal region. The function of these proteins is uncertain, but it has been suggested that they may be related to antigenic variation of Mycobacterium tuberculosis.


Pssm-ID: 425887  Cd Length: 158  Bit Score: 172.38  E-value: 6.75e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255   5 VLPPEINSALIFAGAGPEPMAAAATAWDGLAMELASAAASFGSVTSGLVGGAWQGASSSAMAAAAAPYAAWLAAAAVQAE 84
Cdd:pfam00823   1 ALPPEVNSARLYAGPGSGPLLAAAAAWDGLAAELASAAASYSSVLAGLTGGAWQGPSSAAMAAAAAPYVAWLTAAAAQAE 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1119494255  85 QTAAQAAAMIAEFEAVKTAVVQPMLVAANRADLVSLVMSNLFGQNAPAIAAIEATYEQMWAADVSAMSAYHAGASAIA 162
Cdd:pfam00823  81 QAAAQAEAAAAAYEAALAAMVPPAEIAANRAELAVLVATNFFGQNTPAIAATEADYAEMWAQDAAAMYGYAAASAAAA 158
PPE COG5651
PPE-repeat protein [Function unknown];
1-388 1.03e-49

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 179.70  E-value: 1.03e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255   1 MNFSVLPPEINSALIFAGAGPEPMAAAATAWDGLAMELASAAASFGSVTSGLVGGAWQGASSSAMAAAAAPYAAWLAAAA 80
Cdd:COG5651     1 MDFMALPPEVNSARMYAGPGSGPLLAAAAAWDGLAAELASAAASLESVLSGLTGGSWQGPAAAAMAAAAAPYVAWLTAAA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  81 VQAEQTAAQAAAMIAEFEAVKTAVVQPMLVAANRADLVSLVMSNLFGQNAPAIAAIEATYEQMWAADVSAMSAYhAGASA 160
Cdd:COG5651    81 AQAEQAAAQAEAAAAAYEAALAAMVPPAEVAANRAQLAVLVATNFFGQNTPAIAANEADYAEMWAQDAAAMYGY-AAASA 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 161 IASALSPFSKPLQNLAGLPAWLASGAPAAAMTAAAGIPALAGGPTAINLGIANVGGGNVGNAnnglANIGNANLGNYNFG 240
Cdd:COG5651   160 AAVALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIG----LNSGPGNTGFAGTG 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 241 SGNFGNSNIGSASLGNNNIGFGNLGSNNVGVGNLGNLNTGFANTGLGNFGFGNTGNNNIGIGLTGNNQIGIGGLNSGTGN 320
Cdd:COG5651   236 AAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGA 315
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1119494255 321 FGLFNSGSGNVGFFNSGNGNFGIGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGS 388
Cdd:COG5651   316 AGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
379-417 9.23e-07

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 46.01  E-value: 9.23e-07
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 379 ANTGSLNTGSYNMGDFNPGSSNTGTFNTGNANTGFLNAG 417
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
400-437 1.86e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 42.16  E-value: 1.86e-05
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1119494255 400 NTGTFNTGNANTGFLNAGNINTGVFNIGHMNNGLFNTG 437
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
FhaB super family cl27105
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
230-799 3.82e-05

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


The actual alignment was detected with superfamily member COG3210:

Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 47.45  E-value: 3.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  230 GNANLGNYNFGSGNFGNSNIGSASlgnnniGFGNLGSNNVGV-GNLGNLNTGFANTGLGNFGFGNTGNNNIGIGLTGNNQ 308
Cdd:COG3210    779 GNTSAGATLDNAGAEISIDITADG------TITAAGTTAINVtGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGT 852
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  309 IGIGGLNSGTGNFGLFNSGSGNVGFFNSGNGNFGIGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGS 388
Cdd:COG3210    853 TSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAA 932
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  389 YNMGDFNPGSSNTGTFNTGNANTGFLNAGNINTGVFNIGHMNNGLFNTGDMNNGVFYRGVGQGSLQFSITTPDLTLPPLQ 468
Cdd:COG3210    933 AGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGS 1012
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  469 IPGISVPAFSLPAITLPSLTIPAATTPANITVGAFSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLNIPAATTPA 548
Cdd:COG3210   1013 GAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTH 1092
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  549 NITVSGFQLPPLSIPSVAIPPVTVPPITVGAFNLPPLQIPEVTIPQLTIPAGITIGGFSLPAIHTQPITVGQIGVGQFGL 628
Cdd:COG3210   1093 TLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAA 1172
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  629 PSIGWDVFLSTPRITVPAFGIPFTLQFQTNVPALQPPGGGLSTFTNGALIFGEFDLPQLVVHPYTLTGPIVIGSFFLPAF 708
Cdd:COG3210   1173 TTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASG 1252
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  709 NIPGIDVPAINVDGFTLPQITTPAITTPEFAIPPIGVGGFTLPQITTQEIITPELTINSIGVGGFTLPQITTPPITTPPL 788
Cdd:COG3210   1253 TGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVN 1332
                          570
                   ....*....|.
gi 1119494255  789 TIDPINLTGFT 799
Cdd:COG3210   1333 SGGVNAGGGTI 1343
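
The motif descriptions in the hit list above (NXGXGNXG for the MPTR subgroup, XNXGX for the pentapeptide repeats) can be turned into simple pattern searches. The following is a minimal sketch, not the method CDD itself uses: the function names are hypothetical, X is assumed to stand for any single residue, and the test segment is the query interval 379-417 copied from the Pentapeptide_2 alignment.

```python
import re


def motif_to_regex(motif: str) -> str:
    """Turn a motif such as 'NXGXGNXG' into a regex, treating X as any residue."""
    return "".join("[A-Z]" if ch == "X" else re.escape(ch) for ch in motif.upper())


def find_motif(sequence: str, motif: str) -> list[int]:
    """Return 1-based start positions of all motif matches, overlaps included."""
    # A lookahead lets matches overlap, which matters for tandem repeats.
    pattern = re.compile(f"(?=({motif_to_regex(motif)}))")
    return [m.start() + 1 for m in pattern.finditer(sequence.upper())]


def longest_tandem_run(sequence: str, unit: str = "XNXGX") -> int:
    """Return the largest number of back-to-back copies of `unit` in the sequence."""
    unit_re = motif_to_regex(unit)
    # Try every start position so that runs in any reading frame are considered.
    run = re.compile(f"(?=((?:{unit_re})+))")
    return max(
        (len(m.group(1)) // len(unit) for m in run.finditer(sequence.upper())),
        default=0,
    )


# Query residues 379-417, as shown in the Pentapeptide_2 alignment.
segment = "ANTGSLNTGSYNMGDFNPGSSNTGTFNTGNANTGFLNAG"
print(longest_tandem_run(segment))        # number of consecutive XNXGX units
print(find_motif(segment, "NXGXGNXG"))    # start positions of the MPTR-style motif
```

A plain pattern search like this ignores the position-specific scoring that underlies the reported E-values, so it is only a quick way to locate candidate repeats.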
 
Name Accession Description Interval E-value
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
319-357 3.50e-07

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 47.17  E-value: 3.50e-07
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 319 GNFGLFNSGSGNVGFFNSGNGNFGIGNSGNFNTGGWNSG 357
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
PTZ00395 PTZ00395
Sec24-related protein; Provisional
321-436 3.33e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 47.76  E-value: 3.33e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  321 FGLFNSGSGNVGffNSGNGNFGIGN---SGNFNT------GGWNSGHGNTGFFNAGSFNtgmlDVGNANTGSLNTGSYNM 391
Cdd:PTZ00395   339 YGGFHDGSPNAA--SAGAPFNGLGNqadGGHINQvhpdarGAWAGGPHSNASYNCAAYS----NAAQSNAAQSNAGFSNA 412
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1119494255  392 GDFNPGSSNTGTFNTGNANTGFLNAGNINTGVFNIGHMNNGLFNT 436
Cdd:PTZ00395   413 GYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNL 457
PHA03247 PHA03247
large tegument protein UL36; Provisional
458-630 1.42e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 1.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  458 TTPDLTLPPLQIPGISVPAFSLPAITLPSlTIPAATTPANITVGAFSLPGLTLPSLNIPAATTPA--------NITVGAF 529
Cdd:PHA03247  2777 AGPPRRLTRPAVASLSESRESLPSPWDPA-DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPpppgppppSLPLGGS 2855
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  530 SLPGLTL----PSLNIPAA-TTPANITVS------------GFQLPPLSI-----PSVAIPPVTVPPITVGAFNLPPLQI 587
Cdd:PHA03247  2856 VAPGGDVrrrpPSRSPAAKpAAPARPPVRrlarpavsrsteSFALPPDQPerppqPQAPPPPQPQPQPPPPPQPQPPPPP 2935
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1119494255  588 PEVTIPQL---TIPAGITIGGFSLPAIHTQPITVGQIGVGQFGLPS 630
Cdd:PHA03247  2936 PPRPQPPLaptTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQ 2981
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
412-633 3.13e-04

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 44.29  E-value: 3.13e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 412 GFLNAGNINTGVFNIGHMNngLFNTGdmnnGVFYRgVGQgslqfSITTPDLTLPPLQIPGISvpafslPAITLPSLTIPA 491
Cdd:TIGR01645 249 GFIEYNNLQSQSEAIASMN--LFDLG----GQYLR-VGK-----CVTPPDALLQPATVSAIP------AAAAVAAAAATA 310
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 492 ATTPANITVGAfSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLNIP-AATTPANITVSGFQLPPLSIPSVAIPPV 570
Cdd:TIGR01645 311 KIMAAEAVAGA-AVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPlPQAAPAVVKPGPMEIPTPVPPPGLAIPS 389
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1119494255 571 TVPPitvgafnlpplqiPEVTIPQLTIPagitiggfSLPAIHTQPITVGQIGVGQFGL-PSIGW 633
Cdd:TIGR01645 390 LVAP-------------PGLVAPTEINP--------SFLASPRKKMKREKLPVTFGALdDTLAW 432
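
Each alignment above pairs a query row (`gi 1119494255`) with a domain-model row (`Cdd:...`). As a rough aid for reading them, the sketch below counts identical columns between two aligned rows. It is an illustration only: the function name is hypothetical, gap columns (`-`) are skipped, and lowercase residues are compared case-insensitively, which is an assumption about how they should be treated.

```python
def alignment_identity(query_row: str, subject_row: str) -> tuple[int, int, float]:
    """Return (identical, aligned, percent identity) for two aligned rows.

    Both rows must have the same length. Columns where either row
    contains a gap character '-' are not counted as aligned.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have the same length")

    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue  # gap column: no residue pair to compare
        aligned += 1
        if q.upper() == s.upper():
            identical += 1

    percent = 100.0 * identical / aligned if aligned else 0.0
    return identical, aligned, percent


# First five columns of the COG5651 alignment: query MNFSV vs. model MDFMA.
print(alignment_identity("MNFSV", "MDFMA"))
```

Percent identity computed this way is not the statistic CDD reports; the bit scores and E-values in the table come from position-specific scoring, so the two should not be compared directly.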
 
Name Accession Description Interval E-value
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
182-554 2.73e-07

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 54.39  E-value: 2.73e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  182 LASGAPAAAMTAAAGIPALAGGPTAINLGIANVGGGNVGNANNGLANIGNANLGNYNFGSGNFGNSNIGSASLGNNNIGF 261
Cdd:COG3210    345 SGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAG 424
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  262 GNLGSNNVGVGNLGNLNTGFANTGLGNFGFGNTGNNNIGIGLTGNNQIGIGGLNSGTGNFGLFNSGSGNVGFFNSGNGNF 341
Cdd:COG3210    425 GFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAG 504
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  342 GIGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGSYNMGDFNPGSSNTGTFNTGNANTGFLNAGNINT 421
Cdd:COG3210    505 GDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGN 584
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  422 GVFNIGHMNNGLFNTGDMNNGVFYRGVGQGSLQFSITTPDLTLPPLQIPGISVPAFSLPAITLPSLTIPAATTPANITVG 501
Cdd:COG3210    585 STSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGV 664
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1119494255  502 AFSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLNIPAATTPANITVSG 554
Cdd:COG3210    665 NTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTG 717
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
329-367 6.49e-07

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 46.40  E-value: 6.49e-07
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 329 GNVGFFNSGNGNFGIGNSGNFNTGGWNSGHGNTGFFNAG 367
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
311-347 1.09e-06

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 45.63  E-value: 1.09e-06
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 1119494255 311 IGGLNSGTGNFGLFNSGSGNVGFFNSGNGNFGIGNSG 347
Cdd:pfam01469   3 TGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
359-397 2.19e-06

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 44.86  E-value: 2.19e-06
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 359 GNTGFFNAGSFNTGMLDVGNANTGSLNTGSYNMGDFNPG 397
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
339-377 2.39e-06

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 44.86  E-value: 2.39e-06
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 339 GNFGIGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVG 377
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
258-295 3.34e-06

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 44.47  E-value: 3.34e-06
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1119494255 258 NIGFGNLGSNNVGVGNLGNLNTGFANTGLGNFGFGNTG 295
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
390-427 7.03e-06

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 43.31  E-value: 7.03e-06
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1119494255 390 NMGDFNPGSSNTGTFNTGNANTGFLNAGNINTGVFNIG 427
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
262-300 7.32e-06

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 43.31  E-value: 7.32e-06
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 262 GNLGSNNVGVGNLGNLNTGFANTGLGNFGFGNTGNNNIG 300
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
400-437 1.86e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 42.16  E-value: 1.86e-05
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1119494255 400 NTGTFNTGNANTGFLNAGNINTGVFNIGHMNNGLFNTG 437
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
268-305 2.89e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 41.77  E-value: 2.89e-05
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1119494255 268 NVGVGNLGNLNTGFANTGLGNFGFGNTGNNNIGIGLTG 305
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
143-383 3.26e-05

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 47.63  E-value: 3.26e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 143 MWAADVSAMSAYHAGASAIASALSPFSKPLQNLAGLPAWLASGAPAAAMTAAAGIPALAGGPTAINLGIANVGGGNVGNA 222
Cdd:COG3468   192 GSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTG 271
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 223 NNGLANIGNANLGNYNFGSGNFGNSNIGSASLGNNNIGFGNLGSNNVGVGNLGNLNTGFANTGLGNFGFGNTGNNNIGIG 302
Cdd:COG3468   272 SGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGG 351
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 303 LTGNNQIGIGGLNSGTGNFGLFNSGSGNVGFFNSGNGNFGIGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTG 382
Cdd:COG3468   352 GTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGGTGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTG 431

                  .
gi 1119494255 383 S 383
Cdd:COG3468   432 N 432
PTZ00395 PTZ00395
Sec24-related protein; Provisional
321-436 3.33e-05

Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 47.76  E-value: 3.33e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  321 FGLFNSGSGNVGffNSGNGNFGIGN---SGNFNT------GGWNSGHGNTGFFNAGSFNtgmlDVGNANTGSLNTGSYNM 391
Cdd:PTZ00395   339 YGGFHDGSPNAA--SAGAPFNGLGNqadGGHINQvhpdarGAWAGGPHSNASYNCAAYS----NAAQSNAAQSNAGFSNA 412
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1119494255  392 GDFNPGSSNTGTFNTGNANTGFLNAGNINTGVFNIGHMNNGLFNT 436
Cdd:PTZ00395   413 GYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNL 457
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
230-799 3.82e-05

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 47.45  E-value: 3.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  230 GNANLGNYNFGSGNFGNSNIGSASlgnnniGFGNLGSNNVGV-GNLGNLNTGFANTGLGNFGFGNTGNNNIGIGLTGNNQ 308
Cdd:COG3210    779 GNTSAGATLDNAGAEISIDITADG------TITAAGTTAINVtGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGT 852
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  309 IGIGGLNSGTGNFGLFNSGSGNVGFFNSGNGNFGIGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGS 388
Cdd:COG3210    853 TSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAA 932
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  389 YNMGDFNPGSSNTGTFNTGNANTGFLNAGNINTGVFNIGHMNNGLFNTGDMNNGVFYRGVGQGSLQFSITTPDLTLPPLQ 468
Cdd:COG3210    933 AGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGS 1012
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  469 IPGISVPAFSLPAITLPSLTIPAATTPANITVGAFSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLNIPAATTPA 548
Cdd:COG3210   1013 GAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTH 1092
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  549 NITVSGFQLPPLSIPSVAIPPVTVPPITVGAFNLPPLQIPEVTIPQLTIPAGITIGGFSLPAIHTQPITVGQIGVGQFGL 628
Cdd:COG3210   1093 TLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAA 1172
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  629 PSIGWDVFLSTPRITVPAFGIPFTLQFQTNVPALQPPGGGLSTFTNGALIFGEFDLPQLVVHPYTLTGPIVIGSFFLPAF 708
Cdd:COG3210   1173 TTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASG 1252
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  709 NIPGIDVPAINVDGFTLPQITTPAITTPEFAIPPIGVGGFTLPQITTQEIITPELTINSIGVGGFTLPQITTPPITTPPL 788
Cdd:COG3210   1253 TGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVN 1332
                          570
                   ....*....|.
gi 1119494255  789 TIDPINLTGFT 799
Cdd:COG3210   1333 SGGVNAGGGTI 1343
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
248-285 4.59e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 41.00  E-value: 4.59e-05
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1119494255 248 NIGSASLGNNNIGFGNLGSNNVGVGNLGNLNTGFANTG 285
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
227-265 5.75e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 41.00  E-value: 5.75e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 227 ANIGNANLGNYNFGSGNFGNSNIGSASLGNNNIGFGNLG 265
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
238-275 9.04e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 40.23  E-value: 9.04e-05
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1119494255 238 NFGSGNFGNSNIGSASLGNNNIGFGNLGSNNVGVGNLG 275
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
PHA03247 PHA03247
large tegument protein UL36; Provisional
458-630 1.42e-04

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 1.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  458 TTPDLTLPPLQIPGISVPAFSLPAITLPSlTIPAATTPANITVGAFSLPGLTLPSLNIPAATTPA--------NITVGAF 529
Cdd:PHA03247  2777 AGPPRRLTRPAVASLSESRESLPSPWDPA-DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPpppgppppSLPLGGS 2855
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  530 SLPGLTL----PSLNIPAA-TTPANITVS------------GFQLPPLSI-----PSVAIPPVTVPPITVGAFNLPPLQI 587
Cdd:PHA03247  2856 VAPGGDVrrrpPSRSPAAKpAAPARPPVRrlarpavsrsteSFALPPDQPerppqPQAPPPPQPQPQPPPPPQPQPPPPP 2935
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1119494255  588 PEVTIPQL---TIPAGITIGGFSLPAIHTQPITVGQIGVGQFGLPS 630
Cdd:PHA03247  2936 PPRPQPPLaptTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQ 2981
PTZ00395 PTZ00395
Sec24-related protein; Provisional
342-430 2.13e-04

Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 45.07  E-value: 2.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  342 GIGNSGNFNTGGW-NSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGSYNMGDFNPGSSNTGTFNTGNANTGFLNAGNIN 420
Cdd:PTZ00395   382 GPHSNASYNCAAYsNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSN 461
                           90
                   ....*....|
gi 1119494255  421 TGVFNIGHMN 430
Cdd:PTZ00395   462 TPYSNAPLSN 471
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
350-387 2.51e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.08  E-value: 2.51e-04
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1119494255 350 NTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTG 387
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
272-310 2.83e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.08  E-value: 2.83e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 272 GNLGNLNTGFANTGLGNFGFGNTGNNNIGIGLTGNNQIG 310
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
412-633 3.13e-04

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 44.29  E-value: 3.13e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 412 GFLNAGNINTGVFNIGHMNngLFNTGdmnnGVFYRgVGQgslqfSITTPDLTLPPLQIPGISvpafslPAITLPSLTIPA 491
Cdd:TIGR01645 249 GFIEYNNLQSQSEAIASMN--LFDLG----GQYLR-VGK-----CVTPPDALLQPATVSAIP------AAAAVAAAAATA 310
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 492 ATTPANITVGAfSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLNIP-AATTPANITVSGFQLPPLSIPSVAIPPV 570
Cdd:TIGR01645 311 KIMAAEAVAGA-AVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPlPQAAPAVVKPGPMEIPTPVPPPGLAIPS 389
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1119494255 571 TVPPitvgafnlpplqiPEVTIPQLTIPagitiggfSLPAIHTQPITVGQIGVGQFGL-PSIGW 633
Cdd:TIGR01645 390 LVAP-------------PGLVAPTEINP--------SFLASPRKKMKREKLPVTFGALdDTLAW 432
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
232-270 3.62e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 38.69  E-value: 3.62e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 232 ANLGNYNFGSGNFGNSNIGSASLGNNNIGFGNLGSNNVG 270
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
PHA03247 PHA03247
large tegument protein UL36; Provisional
457-665 1.16e-03

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  457 ITTPDLTLPPLQIPGISVPAfslpaitlPSLTIPAATTPAnitvgafslPGLTLPSLNIPAATTPAnitvGAFSLPGLTL 536
Cdd:PHA03247  2811 VLAPAAALPPAASPAGPLPP--------PTSAQPTAPPPP---------PGPPPPSLPLGGSVAPG----GDVRRRPPSR 2869
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  537 PSLNIPAAttPANITVSGFQLPPLSIP--SVAIPPVTVPPITVGAFNLPPLQIPEVTIPQLTIPAGITIGgfsLPAIHTQ 614
Cdd:PHA03247  2870 SPAAKPAA--PARPPVRRLARPAVSRSteSFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP---RPQPPLA 2944
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1119494255  615 PITVGQIGVGQFGLPSIGWDVFLSTPRITVPAFGIPftlQFQTNVPALQPP 665
Cdd:PHA03247  2945 PTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVP---QPAPSREAPASS 2992
PTZ00395 PTZ00395
Sec24-related protein; Provisional
325-400 1.73e-03

Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 41.98  E-value: 1.73e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1119494255  325 NSGSGNVGFFNSGNGNFGIGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGSYNMGDFNPGSSN 400
Cdd:PTZ00395   396 NAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN 471
34 PHA02584
long tail fiber, proximal subunit; Provisional
250-422 2.13e-03

Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 41.66  E-value: 2.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  250 GSASLGNNNIGFGNLGSNnvGVGNLGNLNTGFANTGLGNfgfGNTGNNNIGigltgnNQIGIGGLNSGTGNFGLFNSGSG 329
Cdd:PHA02584   909 GSLTFTKNTNLSAPLVSS--STATFGGSVTANSTLTTQN---TSNGTVVVV------DETSIAFYSQNNTTGNIVFNIDG 977
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  330 NVGFFNSgNGNFGIGNSGNFNTGGWNSGHG----NTGFFNAGSFNTGMLDVGNANTGSLNTGSYNmGDFNPGSSNTGTFN 405
Cdd:PHA02584   978 TVDPINV-NANGTLNATGVATNGRAVYAEGggiaRTNNAARAITGGFTIRNDGSTTVFLLTAAGD-QTGGFNGLKSLIIN 1055
                          170
                   ....*....|....*..
gi 1119494255  406 TGNANTGFLNAGNINTG 422
Cdd:PHA02584  1056 NANGQVTINDNYIINAG 1072
COG4935 COG4935
Regulatory P domain of the subtilisin-like proprotein convertases and other proteases ...
23-573 2.46e-03

Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443962 [Multi-domain]  Cd Length: 641  Bit Score: 41.35  E-value: 2.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  23 PMAAAATAWDGLAMELASAAASFGSVTSGLVGGAWQGASSSAMAAAAAPYAAWLAAAAVQAEQTAAQAAAMIAEFEAVKT 102
Cdd:COG4935     2 AAGGAGSTTGLAAAVLAAAAGTGSAATAEGGAASTATSAAVAGASAAAAAATAVGAGASSLAASAAAAAAAASGAAAGAV 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 103 AVVQPMLVAANRADLVSLVMSNLFGQNAPAIAAIEATYEQMWAADVSAMSAYHAGASAIASALSPFSKPLQNLAGLPAWL 182
Cdd:COG4935    82 DAAPAAATVVGAALGVVAVAGAGLAATASGAAAGAVAAAANGNTGAGPGSGGTGGGSGGAGAAAAAAALSAAGAAVGVAA 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 183 ASGAPAAAMTAAAGIPALAGGPTAINLGIANVGGGNVGNANNGLANIGNANLGNYNFGSGNFGNSNIGSASLGNNNIGFG 262
Cdd:COG4935   162 VAGAAGGGGGVGVAAAVGVVLGAGLVADGGNGGGGAVAGGAAGGGGGGGGGGGLGGAAGGGGAGLAAAGGGGGGAAAAAA 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 263 NLGSNNVGVGNLGNLNTGFANTGLGNFGFGNTGNNNIGIGLTGNNQIGIGGLNSGTGNFGLFNSGSGNVGFFNSGNGNFG 342
Cdd:COG4935   242 AGVGGLGAAATAAAADGGGGGGAGAAGAGGSAGAAAGGAGAGVVGAAAGGGDAALGGAVGAAGTGNAAAAAAASAGSGGG 321
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 343 IGNSGNFNTGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGSYNMGDFNPGSSNTGTFNTGNANTGFLNAGNINTG 422
Cdd:COG4935   322 GGSAAAAGAAAAAAAAAAGAAAGVSGAASVVAGASGGGAGTAAAAGGGAAAAAAGGAAAAGAAAGAAAGAAAGAAAAGGV 401
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 423 VFNIGHMNNGLFNTGDMNNGVFYRGVGQGSLQFSITTPDLTLPPLQIPGISVPAFSLPAITLPSLTIPAATTPANITVGA 502
Cdd:COG4935   402 ASAAGAVGAGTAAGASATAAVSTGAASGSSTTSSTGTTATATGLGGGADAGSTSTGTGSAAGAAGGTTTATSGLASSTTA 481
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1119494255 503 FSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLNIPAATTPANITVSgfqlPPLSIPSVAIPPVTVP 573
Cdd:COG4935   482 AAAAAAAGLATTAAVAAGAAGAAAAAATAASVGGATGAAGTTNSTATFSNT----TDVAIPDNGPAGVTST 548
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
39-426 2.47e-03

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 41.68  E-value: 2.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255   39 ASAAASFGSVTSGLVGGAWQGASSSAMAAAAAPYAAWLAAAAVQAEQTAAQAAAMIAEFEAVKTAVVQPMLVAANRADLV 118
Cdd:COG3210    352 NGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGG 431
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  119 SLVMSNLFGQNAPAIAAIEATYEQMWAADVSAMSAYHAGASAIASALSPFSKPLQNLAGLPAWLASGAPAAAMTAAAGIP 198
Cdd:COG3210    432 VLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIA 511
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  199 ALAGGPTAINLGIANVGGGNVGNANNGLANIGNANLGNYNFGSGNFGNSNIGSASLGNNNIGFGNLGSNNVGVGNLGNLN 278
Cdd:COG3210    512 TGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGG 591
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255  279 TGFANTGLGNFGFGNTGNNNIGI---GLTGNNQIGIGGLNSGTGNFGLFNSGSGNVGFFNSGNGNFGIGNSGNFNTGGWN 355
Cdd:COG3210    592 TGTNSGGTVLSIGTGSAGATGTItlgAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTG 671
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1119494255  356 SGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGSYNMGDFNPGSSNTGTFNTGNANTGFLNAGNINTGVFNI 426
Cdd:COG3210    672 GGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGATLT 742
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
111-421 5.54e-03

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 40.53  E-value: 5.54e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 111 AANRADLVSLVMSNLFGQNAPAIAAIEATYEQMWAADVSAMSAYHAGASAIASALSPFSKPLQNLAGLPAWLASGAPAAA 190
Cdd:COG4625   216 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGG 295
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 191 MTAAAGIPALAGGPTAINLGIANVGGGNVGNANNGLANIGNANLGNYNFGSGNFGNSNIGSASLGNNNIGFGNLGSNNVG 270
Cdd:COG4625   296 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGG 375
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1119494255 271 VGNLGNLNTGFANTGLGNFGFGNTGNNNIGIGLTGNNQIGIGGLNSGTGNFGLFNSGSGNVGFFNSGNGNFGIGNSGNFN 350
Cdd:COG4625   376 GSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGG 455
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1119494255 351 TGGWNSGHGNTGFFNAGsfnTGMLDVGNANTGSLNTGSYNMGDFNPGSSNTGTFNTGNANTGFLNAGNINT 421
Cdd:COG4625   456 AGGSGGGAGAGGGSGSG---AGTLTLTGNNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLVVTGTAT 523
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
409-447 5.81e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 35.23  E-value: 5.81e-03
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1119494255 409 ANTGFLNAGNINTGVFNIGHMNNGLFNTGDMNNGVFYRG 447
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
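The approximate XNXGX description can be checked by hand against the query segment aligned above (residues 409-447). The following is a minimal sketch, assuming a literal regular-expression reading of XNXGX; the names `QUERY_SEGMENT`, `REPEAT_RE`, and `find_repeats` are hypothetical, and the actual pfam01469 hit comes from a position-specific scoring model, not from a pattern match like this one.

```python
import re

# Query segment and its first residue number, copied from the alignment above.
QUERY_START = 409
QUERY_SEGMENT = "ANTGFLNAGNINTGVFNIGHMNNGLFNTGDMNNGVFYRG"

# Literal reading of "XNXGX": any residue, N, any residue, G, any residue.
# Assumption: this regex is only a rough stand-in for the pfam01469 profile.
REPEAT_RE = re.compile(r".N.G.")


def find_repeats(segment, start):
    """Return (query_position, pentapeptide) for each non-overlapping match."""
    return [(start + m.start(), m.group()) for m in REPEAT_RE.finditer(segment)]


repeats = find_repeats(QUERY_SEGMENT, QUERY_START)
for position, pentapeptide in repeats:
    print(position, pentapeptide)
```

Under this reading the segment contains seven consecutive pentapeptides in the same frame, starting at residue 409; the final four residues (FYRG) are too short to form another one.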
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
278-327 6.67e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 34.84  E-value: 6.67e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1119494255 278 NTGFANTGLGNFGFGNTGNNNIGIGltgnnqigigglNSGTGNFGLFNSG 327
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSF------------NLGSGNTGNGNTG 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
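
To illustrate how the E-value threshold of 0.01 relates to the hits listed above, here is a minimal sketch that parses three of the per-hit summary lines from this report and keeps those at or below the threshold. The regular expression, the field names, and the inclusive comparison are assumptions made for illustration; they are not part of any NCBI tool or API.

```python
import re

# Per-hit summary lines copied from the report above.
SUMMARY_LINES = [
    "Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 40.53  E-value: 5.54e-03",
    "Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 35.23  E-value: 5.81e-03",
    "Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 34.84  E-value: 6.67e-03",
]

# Hypothetical pattern for the summary-line layout shown in this report.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

# Threshold taken from the preset options; treating it as inclusive is an assumption.
E_VALUE_THRESHOLD = 0.01


def parse_summary(line):
    """Turn one summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognized summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


hits = [parse_summary(line) for line in SUMMARY_LINES]
passing = [hit for hit in hits if hit["e_value"] <= E_VALUE_THRESHOLD]
for hit in passing:
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

All three parsed hits fall below 0.01, which is consistent with their appearing in the report at all.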
