Conserved domains on [gi|1701408716|ref|WP_141961105|]
PPE family protein, partial [Mycobacterium tuberculosis]

Protein Classification

PPE family protein (domain architecture ID 10467123)

proline-proline-glutamate (PPE) family protein containing pentapeptide repeats, similar to various Mycobacterium tuberculosis PPE virulence/immunogenicity factors

List of domain hits

Name Accession Description Interval E-value
PPE pfam00823
PPE family; This family named after a PPE motif near to the amino terminus of the domain. The ...
5-157 3.24e-38

PPE family; This family is named after a PPE motif near the amino terminus of the domain. PPE family proteins all contain an amino-terminal region of about 180 amino acids. The carboxyl terminus is variable, and on the basis of this region the proteins fall into at least three groups. The MPTR subgroup has tandem copies of the motif NXGXGNXG. The second subgroup contains a conserved motif at about position 350. Members of the third group are related only in the amino-terminal region. The function of these proteins is uncertain, but it has been suggested that they may be related to antigenic variation of Mycobacterium tuberculosis.


Pssm-ID: 425887  Cd Length: 158  Bit Score: 131.16  E-value: 3.24e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1701408716   5 VMPPEINSLLIYTGAGPGPLLAAAAAWDELAAELGSAAAAFGSVTSGLVGGIWQGPSSVAMAAAAAPYAGWLSAAAASAE 84
Cdd:pfam00823   1 ALPPEVNSARLYAGPGSGPLLAAAAAWDGLAAELASAAASYSSVLAGLTGGAWQGPSSAAMAAAAAPYVAWLTAAAAQAE 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1701408716  85 SAAGQARAVVGVFEAALAATVDPFVIAANRSRLVSLALSNLFGQNTPAIAAAEFDYELMWAQDVAAMLGYHTG 157
Cdd:pfam00823  81 QAAAQAEAAAAAYEAALAAMVPPAEIAANRAELAVLVATNFFGQNTPAIAATEADYAEMWAQDAAAMYGYAAA 153
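
The pairwise alignment above can be summarized as a percent identity. The following is a minimal sketch, not part of the CD-Search output: the function name `percent_identity` is hypothetical, and the two strings are the first alignment row (query residues 5-84, model positions 1-80) copied from the block above.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of gap-free alignment columns where both rows have the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    aligned = identical = 0
    # Upper-case both rows so lowercase residues are compared like any other.
    for q, s in zip(query.upper(), subject.upper()):
        if q == "-" or s == "-":
            continue  # gap columns are not counted
        aligned += 1
        identical += q == s
    return 100.0 * identical / aligned if aligned else 0.0


# First row of the pfam00823 alignment (80 columns, no gaps).
query_row = "VMPPEINSLLIYTGAGPGPLLAAAAAWDELAAELGSAAAAFGSVTSGLVGGIWQGPSSVAMAAAAAPYAGWLSAAAASAE"
model_row = "ALPPEVNSARLYAGPGSGPLLAAAAAWDGLAAELASAAASYSSVLAGLTGGAWQGPSSAAMAAAAAPYVAWLTAAAAQAE"

print(f"{percent_identity(query_row, model_row):.1f}% identity over {len(query_row)} columns")
```

Identity is only one view of the match; the bit score and E-value reported by CD-Search come from the position-specific scoring matrix, not from a simple residue count.
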
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
187-216 1.45e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 35.61  E-value: 1.45e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1701408716 187 VNLGLANVGLFNAGSGNVGSYNVGAGNVGS 216
Cdd:pfam01469   6 GNSGSGNTGFFNLGSGNTGSFNLGSGNTGN 35
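
The PPE family description mentions the MPTR motif NXGXGNXG. As an illustrative sketch (the names `MPTR_MOTIF` and `find_motif` are hypothetical, and a plain regular expression is an assumption, not how CD-Search scores hits), the query segment 187-216 aligned above can be scanned for that motif:

```python
import re

# N-x-G-x-G-N-x-G, wrapped in a lookahead so overlapping occurrences are reported too.
MPTR_MOTIF = re.compile(r"(?=(N.G.GN.G))")


def find_motif(segment: str, start: int = 1):
    """Return (protein_position, matched_text) pairs.

    `start` is the 1-based protein coordinate of the first residue in `segment`.
    """
    return [(m.start() + start, m.group(1)) for m in MPTR_MOTIF.finditer(segment)]


segment = "VNLGLANVGLFNAGSGNVGSYNVGAGNVGS"  # query residues 187-216
for position, hit in find_motif(segment, start=187):
    print(position, hit)
# 198 NAGSGNVG
# 208 NVGAGNVG
```

The pattern is deliberately strict; the looser XNXGX description of the pentapeptide repeat would match far more positions and is less useful as a filter.
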
 
PPE COG5651
PPE-repeat protein [Function unknown];
1-256 1.63e-29


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 114.22  E-value: 1.63e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1701408716   1 MSFVVMPPEINSLLIYTGAGPGPLLAAAAAWDELAAELGSAAAAFGSVTSGLVGGIWQGPSSVAMAAAAAPYAGWLSAAA 80
Cdd:COG5651     1 MDFMALPPEVNSARMYAGPGSGPLLAAAAAWDGLAAELASAAASLESVLSGLTGGSWQGPAAAAMAAAAAPYVAWLTAAA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1701408716  81 ASAESAAGQARAVVGVFEAALAATVDPFVIAANRSRLVSLALSNLFGQNTPAIAAAEFDYELMWAQDVAAMLGYhTGASA 160
Cdd:COG5651    81 AQAEQAAAQAEAAAAAYEAALAAMVPPAEVAANRAQLAVLVATNFFGQNTPAIAANEADYAEMWAQDAAAMYGY-AAASA 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1701408716 161 AAEALAPF-------GSPLASLAAAAEPAKSLAVNLGLANVGLFNAGSGNVGSYNVGAGNVGSYNVGGGNIGGNNVGLGN 233
Cdd:COG5651   160 AAVALTPFtqppptiTNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAG 239
                         250       260
                  ....*....|....*....|...
gi 1701408716 234 VGWGNFGLGNSGLTPGLMGLGNI 256
Cdd:COG5651   240 AAAAAAAAAAAAGAGASAALASL 262
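
Each hit reports a model length ("Cd Length") and the alignment shows which model positions were matched, so the fraction of each domain model spanned by the query can be worked out directly. This is a small sketch under the assumption that the span of model coordinates is a fair proxy for coverage; the function name `model_coverage` is hypothetical, and the coordinates are read from the "Cdd:" rows of the alignments on this page.

```python
def model_coverage(model_from: int, model_to: int, model_length: int) -> float:
    """Percent of the domain model spanned by the aligned region (inclusive coordinates)."""
    return 100.0 * (model_to - model_from + 1) / model_length


# accession: (first model position, last model position, Cd Length)
hits = {
    "pfam00823": (1, 153, 158),
    "pfam01469": (6, 35, 39),
    "COG5651": (1, 262, 385),
}

for accession, (first, last, length) in hits.items():
    print(f"{accession}: {model_coverage(first, last, length):.1f}% of model spanned")
# pfam00823: 96.8% of model spanned
# pfam01469: 76.9% of model spanned
# COG5651: 68.1% of model spanned
```

The lower figure for COG5651 is in line with the query being annotated as a partial protein, although the span alone does not show which end is missing.
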
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
