NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|18390636|ref|NP_563765|]
View 

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 13599426)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
169-217 2.52e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 52.37  E-value: 2.52e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18390636   169 PDTGTCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDVESYGAVINSLC 217
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_long super family cl38513
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
162-285 3.60e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


The actual alignment was detected with superfamily member pfam17177:

Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 38.15  E-value: 3.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18390636   162 MKLIGYSPDTGTCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDVESYGAVINSLCLARkttDVVKI--VKEMVSKAGIS 239
Cdd:pfam17177  81 MKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAG---DADKAyeVEEHMLAHGVE 157
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 18390636   240 PRKgmlTKVAAALRANREIWKAIEMIEFVES-RDY--PVEFESYEVVVE 285
Cdd:pfam17177 158 LEE---PELAALLKVSAKAGRADKVYAYLHRlRDAvrQVSESTAGVLEE 203
 
Name Accession Description Interval E-value
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
169-217 2.52e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 52.37  E-value: 2.52e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18390636   169 PDTGTCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDVESYGAVINSLC 217
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
173-206 1.38e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.29  E-value: 1.38e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18390636   173 TCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDV 206
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PLN03218 PLN03218
maturation of RBCL 1; Provisional
161-244 1.61e-03

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 40.25  E-value: 1.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18390636   161 EMKLIGYSPDTGTCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDVESYgAVINSLCLARKTTDVVKIVKEMVSKAGISP 240
Cdd:PLN03218  709 DIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITY-SILLVASERKDDADVGLDLLSQAKEDGIKP 787

                  ....
gi 18390636   241 RKGM 244
Cdd:PLN03218  788 NLVM 791
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
162-285 3.60e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 38.15  E-value: 3.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18390636   162 MKLIGYSPDTGTCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDVESYGAVINSLCLARkttDVVKI--VKEMVSKAGIS 239
Cdd:pfam17177  81 MKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAG---DADKAyeVEEHMLAHGVE 157
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 18390636   240 PRKgmlTKVAAALRANREIWKAIEMIEFVES-RDY--PVEFESYEVVVE 285
Cdd:pfam17177 158 LEE---PELAALLKVSAKAGRADKVYAYLHRlRDAvrQVSESTAGVLEE 203
 
Name Accession Description Interval E-value
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
169-217 2.52e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 52.37  E-value: 2.52e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18390636   169 PDTGTCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDVESYGAVINSLC 217
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
173-206 1.38e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.29  E-value: 1.38e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18390636   173 TCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDV 206
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
166-198 1.09e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.87  E-value: 1.09e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 18390636   166 GYSPDTGTCNYLVSSLCAVDKLDEAIKVVEEMS 198
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
158-214 2.66e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.49  E-value: 2.66e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 18390636   158 IINEMKLIGYSPDTGTCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDVESYGAVIN 214
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
201-233 1.39e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.78  E-value: 1.39e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 18390636   201 GCIPDVESYGAVINSLCLARKTTDVVKIVKEMV 233
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PLN03218 PLN03218
maturation of RBCL 1; Provisional
161-244 1.61e-03

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 40.25  E-value: 1.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18390636   161 EMKLIGYSPDTGTCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDVESYgAVINSLCLARKTTDVVKIVKEMVSKAGISP 240
Cdd:PLN03218  709 DIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITY-SILLVASERKDDADVGLDLLSQAKEDGIKP 787

                  ....
gi 18390636   241 RKGM 244
Cdd:PLN03218  788 NLVM 791
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
162-285 3.60e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 38.15  E-value: 3.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18390636   162 MKLIGYSPDTGTCNYLVSSLCAVDKLDEAIKVVEEMSAAGCIPDVESYGAVINSLCLARkttDVVKI--VKEMVSKAGIS 239
Cdd:pfam17177  81 MKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAG---DADKAyeVEEHMLAHGVE 157
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 18390636   240 PRKgmlTKVAAALRANREIWKAIEMIEFVES-RDY--PVEFESYEVVVE 285
Cdd:pfam17177 158 LEE---PELAALLKVSAKAGRADKVYAYLHRlRDAvrQVSESTAGVLEE 203
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
185-283 4.21e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 38.15  E-value: 4.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18390636   185 DKLDEAIKVVEEMSAAGCIPDVESYGAVINsLCLARKTTDVV-KIVKEMVsKAGISPRKGMLTKVAAALRANREIWKAIE 263
Cdd:pfam17177  69 LAADRGFEVFEAMKAQGVSPNEATYTAVAR-LAAAKGDGDLAfDLVKEME-AAGVSPRLRSYSPALHAYCEAGDADKAYE 146
                          90       100
                  ....*....|....*....|
gi 18390636   264 MIEFVESRDypVEFESYEVV 283
Cdd:pfam17177 147 VEEHMLAHG--VELEEPELA 164
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH