NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|15231358|ref|NP_187990|]
View 

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1000225)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

Gene Ontology:  GO:0003723|GO:0009451
PubMed:  24471833|19004664

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03081 super family cl33631
pentatricopeptide (PPR) repeat-containing protein; Provisional
27-628 4.64e-169

pentatricopeptide (PPR) repeat-containing protein; Provisional


The actual alignment was detected with superfamily member PLN03081:

Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 498.63  E-value: 4.64e-169
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   27 ISQLCSNGRLQEA--LLEMAMLGP--EMGFHGYDALLNACLDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDC 102
Cdd:PLN03081  94 IEKLVACGRHREAleLFEILEAGCpfTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGM 173
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  103 LEDARKVLDEMPEKNVVSWTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLtsciRAS-GLGL---GKQIHGL 178
Cdd:PLN03081 174 LIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVML----RASaGLGSaraGQQLHCC 249
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  179 IVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPNYVTYA 258
Cdd:PLN03081 250 VLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFS 329
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  259 SLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGRE 338
Cdd:PLN03081 330 IMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTK 409
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  339 VLELFRLMRDEKrVKPDAVTLLAVLSGCSHGRMEDTGLNIFDGMvAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKR 418
Cdd:PLN03081 410 AVEMFERMIAEG-VAPNHVTFLAVLSACRYSGLSEQGWEIFQSM-SENHRIKPRAMHYACMIELLGREGLLDEAYAMIRR 487
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  419 MPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNNVRAMMMQKAVTKEPGRSW 498
Cdd:PLN03081 488 APFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTW 567
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  499 IQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPDLSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIP 578
Cdd:PLN03081 568 IEVKKQDHSFFSGDRLHPQSREIYQKLDELMKEISEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLINTSEWTP 647
                        570       580       590       600       610
                 ....*....|....*....|....*....|....*....|....*....|
gi 15231358  579 IRVFKNLRICVDCHNFAKIFSKVFEREVSLRDKNRFHQIVDGICSCGDYW 628
Cdd:PLN03081 648 LQITQSHRICKDCHKVIKFIALVTKREIVVRDASRFHHFKLGKCSCGDYW 697
 
Name Accession Description Interval E-value
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
27-628 4.64e-169

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 498.63  E-value: 4.64e-169
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   27 ISQLCSNGRLQEA--LLEMAMLGP--EMGFHGYDALLNACLDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDC 102
Cdd:PLN03081  94 IEKLVACGRHREAleLFEILEAGCpfTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGM 173
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  103 LEDARKVLDEMPEKNVVSWTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLtsciRAS-GLGL---GKQIHGL 178
Cdd:PLN03081 174 LIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVML----RASaGLGSaraGQQLHCC 249
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  179 IVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPNYVTYA 258
Cdd:PLN03081 250 VLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFS 329
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  259 SLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGRE 338
Cdd:PLN03081 330 IMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTK 409
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  339 VLELFRLMRDEKrVKPDAVTLLAVLSGCSHGRMEDTGLNIFDGMvAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKR 418
Cdd:PLN03081 410 AVEMFERMIAEG-VAPNHVTFLAVLSACRYSGLSEQGWEIFQSM-SENHRIKPRAMHYACMIELLGREGLLDEAYAMIRR 487
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  419 MPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNNVRAMMMQKAVTKEPGRSW 498
Cdd:PLN03081 488 APFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTW 567
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  499 IQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPDLSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIP 578
Cdd:PLN03081 568 IEVKKQDHSFFSGDRLHPQSREIYQKLDELMKEISEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLINTSEWTP 647
                        570       580       590       600       610
                 ....*....|....*....|....*....|....*....|....*....|
gi 15231358  579 IRVFKNLRICVDCHNFAKIFSKVFEREVSLRDKNRFHQIVDGICSCGDYW 628
Cdd:PLN03081 648 LQITQSHRICKDCHKVIKFIALVTKREIVVRDASRFHHFKLGKCSCGDYW 697
DYW_deaminase pfam14432
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ...
536-628 2.11e-41

DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.


Pssm-ID: 464172 [Multi-domain]  Cd Length: 93  Bit Score: 144.87  E-value: 2.11e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   536 GYVPDLSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIFSKVFEREVSLRDKNRFH 615
Cdd:pfam14432   1 GYVPDLRFVLHDVDDEEKKQLLCGHSEKLALAYGLLTTPDGATIRITKNLRVCGDCHTAFKYISKIRGREIVVRDASRFH 80
                          90
                  ....*....|...
gi 15231358   616 QIVDGICSCGDYW 628
Cdd:pfam14432  81 HFKNGLCSCGDFW 93
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
220-253 3.02e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.21  E-value: 3.02e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15231358   220 VSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPN 253
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
 
Name Accession Description Interval E-value
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
27-628 4.64e-169

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 498.63  E-value: 4.64e-169
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   27 ISQLCSNGRLQEA--LLEMAMLGP--EMGFHGYDALLNACLDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDC 102
Cdd:PLN03081  94 IEKLVACGRHREAleLFEILEAGCpfTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGM 173
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  103 LEDARKVLDEMPEKNVVSWTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLtsciRAS-GLGL---GKQIHGL 178
Cdd:PLN03081 174 LIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVML----RASaGLGSaraGQQLHCC 249
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  179 IVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPNYVTYA 258
Cdd:PLN03081 250 VLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFS 329
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  259 SLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGRE 338
Cdd:PLN03081 330 IMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTK 409
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  339 VLELFRLMRDEKrVKPDAVTLLAVLSGCSHGRMEDTGLNIFDGMvAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKR 418
Cdd:PLN03081 410 AVEMFERMIAEG-VAPNHVTFLAVLSACRYSGLSEQGWEIFQSM-SENHRIKPRAMHYACMIELLGREGLLDEAYAMIRR 487
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  419 MPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNNVRAMMMQKAVTKEPGRSW 498
Cdd:PLN03081 488 APFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTW 567
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  499 IQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPDLSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIP 578
Cdd:PLN03081 568 IEVKKQDHSFFSGDRLHPQSREIYQKLDELMKEISEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLINTSEWTP 647
                        570       580       590       600       610
                 ....*....|....*....|....*....|....*....|....*....|
gi 15231358  579 IRVFKNLRICVDCHNFAKIFSKVFEREVSLRDKNRFHQIVDGICSCGDYW 628
Cdd:PLN03081 648 LQITQSHRICKDCHKVIKFIALVTKREIVVRDASRFHHFKLGKCSCGDYW 697
PLN03077 PLN03077
Protein ECB2; Provisional
69-626 1.11e-149

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 453.92  E-value: 1.11e-149
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   69 RDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVSWTAMISRYSQTGHSSEALTVFAEMMRSD 148
Cdd:PLN03077 305 RLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDN 384
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  149 GKPNEFTFATVLTSCIRASGLGLGKQIHGLIVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAG 228
Cdd:PLN03077 385 VSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAG 464
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  229 YAQLGLDEEALEMFHRLHSEgMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLIDMYSKCGNLSYA 308
Cdd:PLN03077 465 LRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYA 543
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  309 RRLFdNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRdEKRVKPDAVTLLAVLSGCSHGRMEDTGLNIFDGMvAGEYG 388
Cdd:PLN03077 544 WNQF-NSHEKDVVSWNILLTGYVAHGKGSMAVELFNRMV-ESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSM-EEKYS 620
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  389 TKPGTEHYGCIVDMLGRAGRIDEAFEFIKRMPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNL 468
Cdd:PLN03077 621 ITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNL 700
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358  469 YASAGRWADVNNVRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGYVPDLSCVLyDV 548
Cdd:PLN03077 701 YADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKVHAFLTDDESHPQIKEINTVLEGFYEKMKASGLAGSESSSM-DE 779
                        490       500       510       520       530       540       550
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 15231358  549 DEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIFSKVFEREVSLRDKNRFHQIVDGICSCGD 626
Cdd:PLN03077 780 IEVSKDDIFCGHSERLAIAFGLINTVPGMPIWVTKNLYMCENCHNTVKFISKIVRREISVRDTEQFHHFKDGECSCGD 857
DYW_deaminase pfam14432
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ...
536-628 2.11e-41

DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.


Pssm-ID: 464172 [Multi-domain]  Cd Length: 93  Bit Score: 144.87  E-value: 2.11e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   536 GYVPDLSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKIFSKVFEREVSLRDKNRFH 615
Cdd:pfam14432   1 GYVPDLRFVLHDVDDEEKKQLLCGHSEKLALAYGLLTTPDGATIRITKNLRVCGDCHTAFKYISKIRGREIVVRDASRFH 80
                          90
                  ....*....|...
gi 15231358   616 QIVDGICSCGDYW 628
Cdd:pfam14432  81 HFKNGLCSCGDFW 93
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
438-500 1.25e-17

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 77.20  E-value: 1.25e-17
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 15231358   438 HLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWADVNNVRAMMMQKAVTKEPGRSWIQ 500
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
PLN03218 PLN03218
maturation of RBCL 1; Provisional
194-435 1.27e-11

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 67.98  E-value: 1.27e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   194 LLDMYAKAGQIKEAREIFECLPER----DVVSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPNYVTYASLLTALSGLAL 269
Cdd:PLN03218  443 LMSVCASSQDIDGALRVLRLVQEAglkaDCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQ 522
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   270 LDHGKQAHCHVLRRELPFYAVLQNSLIdmySKCGNLSYARRLFDNMPERTA---------ISWNAMLVGYSKHGLGREVL 340
Cdd:PLN03218  523 VAKAFGAYGIMRSKNVKPDRVVFNALI---SACGQSGAVDRAFDVLAEMKAethpidpdhITVGALMKACANAGQVDRAK 599
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   341 ELFRLMRdEKRVK--PDAVTllAVLSGCSHGRMEDTGLNIFDGMVagEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIKR 418
Cdd:PLN03218  600 EVYQMIH-EYNIKgtPEVYT--IAVNSCSQKGDWDFALSIYDDMK--KKGVKPDEVFFSALVDVAGHAGDLDKAFEILQD 674
                         250       260
                  ....*....|....*....|
gi 15231358   419 MPS---KPTAGVLGSLLGAC 435
Cdd:PLN03218  675 ARKqgiKLGTVSYSSLMGAC 694
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
116-163 2.61e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 50.44  E-value: 2.61e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15231358   116 KNVVSWTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSC 163
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PLN03218 PLN03218
maturation of RBCL 1; Provisional
5-434 5.92e-07

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 52.96  E-value: 5.92e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358     5 MRLIHRSFSSSPTNYVLQTILPISQLCSNGRLQEA--LLE----MAMLGPEMGFHGydALLNACLDKRALRDGQRVhAHM 78
Cdd:PLN03218  355 NSLAAYNGGVSGKRKSPEYIDAYNRLLRDGRIKDCidLLEdmekRGLLDMDKIYHA--KFFKACKKQRAVKEAFRF-AKL 431
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358    79 IKTRYLpATYlrTRLLIFYGKCDCLEDARKVLDEMPEK----NVVSWTAMISRYSQTGHSSEALTVFAEMMRSDGKPNEF 154
Cdd:PLN03218  432 IRNPTL-STF--NMLMSVCASSQDIDGALRVLRLVQEAglkaDCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVH 508
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   155 TFATVLTSCirasglglgkqihglivkwnydshifvgsslldmyAKAGQIKEAREIFECLPERDV----VSCTAIIAGYA 230
Cdd:PLN03218  509 TFGALIDGC-----------------------------------ARAGQVAKAFGAYGIMRSKNVkpdrVVFNALISACG 553
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   231 QLGLDEEALEMFHRLHSEG--MSPNYVTYASLLTALSGLALLDHGKQA----HCHVLRRELPFYAVLQNSlidmYSKCGN 304
Cdd:PLN03218  554 QSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEVyqmiHEYNIKGTPEVYTIAVNS----CSQKGD 629
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15231358   305 LSYARRLFDNMpertaiswnamlvgyskhglgrevlelfrlmrDEKRVKPDAVTLLAVLSGCSHGRMEDTGLNIFDGMVA 384
Cdd:PLN03218  630 WDFALSIYDDM--------------------------------KKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARK 677
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|...
gi 15231358   385 GeyGTKPGTEHYGCIVDMLGRAGRIDEA---FEFIKRMPSKPTAGVLGSLLGA 434
Cdd:PLN03218  678 Q--GIKLGTVSYSSLMGACSNAKNWKKAlelYEDIKSIKLRPTVSTMNALITA 728
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
217-264 7.11e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.20  E-value: 7.11e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15231358   217 RDVVSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPNYVTYASLLTAL 264
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
318-368 3.96e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 44.28  E-value: 3.96e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 15231358   318 RTAISWNAMLVGYSKHGLGREVLELFRLMRdEKRVKPDAVTLLAVLSGCSH 368
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMK-KRGVKPNVYTYTILINGLCK 50
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
321-349 1.88e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.99  E-value: 1.88e-04
                          10        20
                  ....*....|....*....|....*....
gi 15231358   321 ISWNAMLVGYSKHGLGREVLELFRLMRDE 349
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEK 29
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
218-266 2.12e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 39.65  E-value: 2.12e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 15231358   218 DVVSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPNYVTYASLLTALSG 266
Cdd:pfam13812  14 NVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
220-253 3.02e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.21  E-value: 3.02e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15231358   220 VSCTAIIAGYAQLGLDEEALEMFHRLHSEGMSPN 253
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
119-153 7.99e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.05  E-value: 7.99e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15231358   119 VSWTAMISRYSQTGHSSEALTVFAEMMRSDGKPNE 153
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
321-356 9.17e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.05  E-value: 9.17e-04
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 15231358   321 ISWNAMLVGYSKHGLGREVLELFRLMRdEKRVKPDA 356
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMK-ERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
119-147 1.34e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 36.29  E-value: 1.34e-03
                          10        20
                  ....*....|....*....|....*....
gi 15231358   119 VSWTAMISRYSQTGHSSEALTVFAEMMRS 147
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEK 29
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
220-250 3.41e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.52  E-value: 3.41e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15231358   220 VSCTAIIAGYAQLGLDEEALEMFHRLHSEGM 250
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
390-419 3.88e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.40  E-value: 3.88e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 15231358   390 KPGTEHYGCIVDMLGRAGRIDEAFEFIKRM 419
Cdd:pfam12854   4 KPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH