NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1063685727|ref|NP_001319064|]
View 

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1003726)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner; similar to Arabidopsis pentatricopeptide repeat proteins that play constitutive, often essential roles in mitochondria and chloroplasts, probably via binding to organellar transcripts

Gene Ontology:  GO:0003723
PubMed:  23635770

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
252-654 1.29e-27

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 119.60  E-value: 1.29e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  252 GDLERVDKIWLeMKRRNIEFSEVTynilingfSKNGKMEEARRFHGDMRRSgfavTPYSFNPLIEGYCKQGLFDDAWGVT 331
Cdd:PLN03218   394 EDMEKRGLLDM-DKIYHAKFFKAC--------KKQRAVKEAFRFAKLIRNP----TLSTFNMLMSVCASSQDIDGALRVL 460
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  332 DEMLNAGIYPTTSTYNIYICALCDFGRIDDARELLSSMAA----PDVVSYNTLMHGYIKMGKFVEASLLFDDLRAGDIHP 407
Cdd:PLN03218   461 RLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNagveANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKP 540
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  408 SIVTYNTLIDGLCESGNLEGAQRLKEEMT--TQLIFPDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTTR 485
Cdd:PLN03218   541 DRVVFNALISACGQSGAVDRAFDVLAEMKaeTHPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIA 620
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  486 AVGELRLGDSDKAFRLHEEMvATDHHAPDLTIYNVRIDGLCKVGNLVKAIEFQRKIFRVGLVPDHVTYTTVIRGYLENGQ 565
Cdd:PLN03218   621 VNSCSQKGDWDFALSIYDDM-KKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKN 699
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  566 FKMARNLYDEMLRKRLYPSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGNIDEAYRYLCK 645
Cdd:PLN03218   700 WKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQ 779

                   ....*....
gi 1063685727  646 MEEEGIPPN 654
Cdd:PLN03218   780 AKEDGIKPN 788
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
201-250 1.52e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 45.43  E-value: 1.52e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063685727 201 PSVRNCNIVLKVLRDSRMMNKASAVYETMIEHGIMPTVITFNTMLDSCFK 250
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
653-700 6.39e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.51  E-value: 6.39e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063685727 653 PNKYSYTMLISKNCDFEKWEEVVKLYKEMLDKEIEPDGYTHRALFKHL 700
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
252-654 1.29e-27

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 119.60  E-value: 1.29e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  252 GDLERVDKIWLeMKRRNIEFSEVTynilingfSKNGKMEEARRFHGDMRRSgfavTPYSFNPLIEGYCKQGLFDDAWGVT 331
Cdd:PLN03218   394 EDMEKRGLLDM-DKIYHAKFFKAC--------KKQRAVKEAFRFAKLIRNP----TLSTFNMLMSVCASSQDIDGALRVL 460
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  332 DEMLNAGIYPTTSTYNIYICALCDFGRIDDARELLSSMAA----PDVVSYNTLMHGYIKMGKFVEASLLFDDLRAGDIHP 407
Cdd:PLN03218   461 RLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNagveANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKP 540
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  408 SIVTYNTLIDGLCESGNLEGAQRLKEEMT--TQLIFPDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTTR 485
Cdd:PLN03218   541 DRVVFNALISACGQSGAVDRAFDVLAEMKaeTHPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIA 620
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  486 AVGELRLGDSDKAFRLHEEMvATDHHAPDLTIYNVRIDGLCKVGNLVKAIEFQRKIFRVGLVPDHVTYTTVIRGYLENGQ 565
Cdd:PLN03218   621 VNSCSQKGDWDFALSIYDDM-KKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKN 699
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  566 FKMARNLYDEMLRKRLYPSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGNIDEAYRYLCK 645
Cdd:PLN03218   700 WKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQ 779

                   ....*....
gi 1063685727  646 MEEEGIPPN 654
Cdd:PLN03218   780 AKEDGIKPN 788
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
372-420 2.88e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.39  E-value: 2.88e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063685727 372 PDVVSYNTLMHGYIKMGKFVEASLLFDDLRAGDIHPSIVTYNTLIDGLC 420
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
439-716 1.09e-08

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 57.05  E-value: 1.09e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 439 LIFPDVITYTTLVKG--FVKNGNLSMATEVYDEMLRkgIKPDgYAYTTRAVGEL--RLGDSDKAFRLHEEMVATDHHAPD 514
Cdd:COG2956     1 LLLPVAAALGWYFKGlnYLLNGQPDKAIDLLEEALE--LDPE-TVEAHLALGNLyrRRGEYDRAIRIHQKLLERDPDRAE 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 515 LT-----IYNvridglcKVGNLVKAIEFQRKIfrVGLVPDHVT-YTTVIRGYLENGQFKMARNLYDEMLRKRLYPSVItY 588
Cdd:COG2956    78 ALlelaqDYL-------KAGLLDRAEELLEKL--LELDPDDAEaLRLLAEIYEQEGDWEKAIEVLERLLKLGPENAHA-Y 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 589 FVLIYGHAKAGRLEQAFQY---STEMKKRGVRPNVMThnALLYGmcKAGNIDEAYRYLCKMEEEGiPPNKYSYTMLISKN 665
Cdd:COG2956   148 CELAELYLEQGDYDEAIEAlekALKLDPDCARALLLL--AELYL--EQGDYEEAIAALERALEQD-PDYLPALPRLAELY 222
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1063685727 666 CDFEKWEEVVKLYKEMLdkEIEPDGYTHRALFKHLEKDHESRE-VEFLERLL 716
Cdd:COG2956   223 EKLGDPEEALELLRKAL--ELDPSDDLLLALADLLERKEGLEAaLALLERQL 272
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
445-478 3.66e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.68  E-value: 3.66e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063685727 445 ITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPD 478
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
201-250 1.52e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 45.43  E-value: 1.52e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063685727 201 PSVRNCNIVLKVLRDSRMMNKASAVYETMIEHGIMPTVITFNTMLDSCFK 250
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
653-700 6.39e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.51  E-value: 6.39e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063685727 653 PNKYSYTMLISKNCDFEKWEEVVKLYKEMLDKEIEPDGYTHRALFKHL 700
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
252-654 1.29e-27

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 119.60  E-value: 1.29e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  252 GDLERVDKIWLeMKRRNIEFSEVTynilingfSKNGKMEEARRFHGDMRRSgfavTPYSFNPLIEGYCKQGLFDDAWGVT 331
Cdd:PLN03218   394 EDMEKRGLLDM-DKIYHAKFFKAC--------KKQRAVKEAFRFAKLIRNP----TLSTFNMLMSVCASSQDIDGALRVL 460
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  332 DEMLNAGIYPTTSTYNIYICALCDFGRIDDARELLSSMAA----PDVVSYNTLMHGYIKMGKFVEASLLFDDLRAGDIHP 407
Cdd:PLN03218   461 RLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNagveANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKP 540
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  408 SIVTYNTLIDGLCESGNLEGAQRLKEEMT--TQLIFPDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTTR 485
Cdd:PLN03218   541 DRVVFNALISACGQSGAVDRAFDVLAEMKaeTHPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIA 620
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  486 AVGELRLGDSDKAFRLHEEMvATDHHAPDLTIYNVRIDGLCKVGNLVKAIEFQRKIFRVGLVPDHVTYTTVIRGYLENGQ 565
Cdd:PLN03218   621 VNSCSQKGDWDFALSIYDDM-KKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKN 699
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  566 FKMARNLYDEMLRKRLYPSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGNIDEAYRYLCK 645
Cdd:PLN03218   700 WKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQ 779

                   ....*....
gi 1063685727  646 MEEEGIPPN 654
Cdd:PLN03218   780 AKEDGIKPN 788
PLN03218 PLN03218
maturation of RBCL 1; Provisional
357-715 1.74e-27

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 119.21  E-value: 1.74e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  357 GRIDDARELLSSMAAPDVVSYNTLMHGYI----KMGKFVEASLLFDDLRagdIHPSIVTYNTLIDGLCESGNLEGAQRLK 432
Cdd:PLN03218   384 GRIKDCIDLLEDMEKRGLLDMDKIYHAKFfkacKKQRAVKEAFRFAKLI---RNPTLSTFNMLMSVCASSQDIDGALRVL 460
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  433 EEMTTQLIFPDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTTRAVGELRLGDSDKAFRLHEEMVATDHHa 512
Cdd:PLN03218   461 RLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVK- 539
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  513 PDLTIYNVRIDGLCKVGNLVKAIEF--QRKIFRVGLVPDHVTYTTVIRGYLENGQFKMARNLYdEMLRK---RLYPSVit 587
Cdd:PLN03218   540 PDRVVFNALISACGQSGAVDRAFDVlaEMKAETHPIDPDHITVGALMKACANAGQVDRAKEVY-QMIHEyniKGTPEV-- 616
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  588 YFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGNIDEAYRYLCKMEEEGIPPNKYSYTMLISKNCD 667
Cdd:PLN03218   617 YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSN 696
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1063685727  668 FEKWEEVVKLYKEMLDKEIEPDGYTHRALFKHL-EKDHESREVEFLERL 715
Cdd:PLN03218   697 AKNWKKALELYEDIKSIKLRPTVSTMNALITALcEGNQLPKALEVLSEM 745
PLN03218 PLN03218
maturation of RBCL 1; Provisional
236-632 6.07e-26

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 114.20  E-value: 6.07e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  236 PTVITFNTMLDSCFKAGDLERVDKIWLEMKRRNIEFSEVTYNILINGFSKNGK---MEEArrFHgDMRRSGFAVTPYSFN 312
Cdd:PLN03218   435 PTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKvdaMFEV--FH-EMVNAGVEANVHTFG 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  313 PLIEGYCKQGLFDDAWGVTDEMLNAGIYPTTSTYNIYICALCDFGRIDDARELLSSMAA------PDVVSYNTLMHGYIK 386
Cdd:PLN03218   512 ALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAethpidPDHITVGALMKACAN 591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  387 MGKfVEasllfddlRAGDIHPSIVTYNtlIDGlcesgnlegaqrlkeemttqliFPDVitYTTLVKGFVKNGNLSMATEV 466
Cdd:PLN03218   592 AGQ-VD--------RAKEVYQMIHEYN--IKG----------------------TPEV--YTIAVNSCSQKGDWDFALSI 636
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  467 YDEMLRKGIKPD-----------GYAyttravgelrlGDSDKAFRLHEEMvATDHHAPDLTIYNVRIDGLCKVGNLVKAI 535
Cdd:PLN03218   637 YDDMKKKGVKPDevffsalvdvaGHA-----------GDLDKAFEILQDA-RKQGIKLGTVSYSSLMGACSNAKNWKKAL 704
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  536 EFQRKIFRVGLVPDHVTYTTVIRGYLENGQFKMARNLYDEMLRKRLYPSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRG 615
Cdd:PLN03218   705 ELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDG 784
                          410
                   ....*....|....*..
gi 1063685727  616 VRPNVMTHNALLyGMCK 632
Cdd:PLN03218   785 IKPNLVMCRCIT-GLCL 800
PLN03218 PLN03218
maturation of RBCL 1; Provisional
219-478 1.69e-24

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 109.58  E-value: 1.69e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  219 MNKASAVYETMIEHGIMPTVITFNTMLDSCFKAGDLERVDKIWLEMKRRN--IEFSEVTYNILINGFSKNGKMEEARRFH 296
Cdd:PLN03218   523 VAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEVY 602
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  297 GDMRRSGFAVTPYSFNPLIEGYCKQGLFDDAWGVTDEMLNAGIYPTtstyNIYICALCDF----GRIDDARELLSSMAA- 371
Cdd:PLN03218   603 QMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPD----EVFFSALVDVaghaGDLDKAFEILQDARKq 678
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  372 ---PDVVSYNTLMHGYIKMGKFVEASLLFDDLRAGDIHPSIVTYNTLIDGLCESGNLEGAQRLKEEMTTQLIFPDVITYT 448
Cdd:PLN03218   679 gikLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYS 758
                          250       260       270
                   ....*....|....*....|....*....|
gi 1063685727  449 TLVKGFVKNGNLSMATEVYDEMLRKGIKPD 478
Cdd:PLN03218   759 ILLVASERKDDADVGLDLLSQAKEDGIKPN 788
PLN03218 PLN03218
maturation of RBCL 1; Provisional
267-689 2.21e-23

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 106.11  E-value: 2.21e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  267 RNIEFSEVTYNILIngfsKNGKMEEARRFHGDMRRSGF----AVTPYSFNPLiegyCK-QGLFDDAWGVTDEMLNagiyP 341
Cdd:PLN03218   368 RKSPEYIDAYNRLL----RDGRIKDCIDLLEDMEKRGLldmdKIYHAKFFKA----CKkQRAVKEAFRFAKLIRN----P 435
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  342 TTSTYNIYICALCDFGRIDDARELLssmaapdvvsyntlmhgyikmgKFVEASLLFDDlragdihpsIVTYNTLIDGLCE 421
Cdd:PLN03218   436 TLSTFNMLMSVCASSQDIDGALRVL----------------------RLVQEAGLKAD---------CKLYTTLISTCAK 484
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  422 SGNLEGAQRLKEEMTTQLIFPDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTTRAVGELRLGDSDKAFRL 501
Cdd:PLN03218   485 SGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDV 564
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  502 HEEMVATDHHA-PDLTIYNVRIDGLCKVGNLVKAIEFQRKIFRVGLVPDHVTYTTVIRGYLENGQFKMARNLYDEMLRKR 580
Cdd:PLN03218   565 LAEMKAETHPIdPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKG 644
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  581 LYPSVITYFVLI--YGHakAGRLEQAFQYSTEMKKRGV-----------------------------------RPNVMTH 623
Cdd:PLN03218   645 VKPDEVFFSALVdvAGH--AGDLDKAFEILQDARKQGIklgtvsysslmgacsnaknwkkalelyediksiklRPTVSTM 722
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1063685727  624 NALLYGMCKAGNIDEAYRYLCKMEEEGIPPNKYSYTMLIsKNCDFEKWEEV-VKLYKEMLDKEIEPD 689
Cdd:PLN03218   723 NALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILL-VASERKDDADVgLDLLSQAKEDGIKPN 788
PLN03077 PLN03077
Protein ECB2; Provisional
277-696 3.47e-20

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 95.69  E-value: 3.47e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 277 NILINGFSKNGKMEEARRFHGDMRRSGFavtpYSFNPLIEGYCKQGLFDDAWGVTDEMLNAGIYPTTSTY---------- 346
Cdd:PLN03077  125 NAMLSMFVRFGELVHAWYVFGKMPERDL----FSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFpcvlrtcggi 200
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 347 -------------------------NIYICALCDFGRIDDARELLSSMAAPDVVSYNTLMHGYIKMGKFVEASLLFDDLR 401
Cdd:PLN03077  201 pdlargrevhahvvrfgfeldvdvvNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMR 280
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 402 AGDIHPSIVT-----------------------------------YNTLIDGLCESGNLEGAQRLKEEMTTQlifpDVIT 446
Cdd:PLN03077  281 ELSVDPDLMTitsvisacellgderlgremhgyvvktgfavdvsvCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVS 356
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 447 YTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTTRAVGELRLGDSDKAFRLHEEMVATDHHAPdLTIYNVRIDGLC 526
Cdd:PLN03077  357 WTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISY-VVVANALIEMYS 435
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 527 KVGNLVKAIEfqrkIFRVGLVPDHVTYTTVIRGYLENGQFKMARNLYDEMLRKrLYPSVITYFVLIYGHAKAGRLEQAFQ 606
Cdd:PLN03077  436 KCKCIDKALE----VFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKE 510
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 607 YSTEMKKRGVRPNVMTHNALLYGMCKAGNIDEAYRYLCKMEEEGIppnkySYTMLISKNCDFEKWEEVVKLYKEMLDKEI 686
Cdd:PLN03077  511 IHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFNSHEKDVV-----SWNILLTGYVAHGKGSMAVELFNRMVESGV 585
                         490
                  ....*....|
gi 1063685727 687 EPDGYTHRAL 696
Cdd:PLN03077  586 NPDEVTFISL 595
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
222-627 8.67e-18

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 87.62  E-value: 8.67e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 222 ASAVYETMIEHGIMPTVITFNTMLDSCFKAGDLERVDKIWLEMKRRNIefseVTYNILINGFSKNGKMEEARRFHGDMRR 301
Cdd:PLN03081  142 VKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNL----ASWGTIIGGLVDAGNYREAFALFREMWE 217
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 302 SGFAVTPYSFNPLIEGYCKQGLFDDAWGVTDEMLNAGIYPttstyNIYI-CALCDF----GRIDDARELLSSMAAPDVVS 376
Cdd:PLN03081  218 DGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVG-----DTFVsCALIDMyskcGDIEDARCVFDGMPEKTTVA 292
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 377 YNTLMHGYIKMGKFVEASLLFDDLRAGDIHPSIVTYNTLIDGLCESGNLEGAQRLKEEMTTQLIFPDVITYTTLVKGFVK 456
Cdd:PLN03081  293 WNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSK 372
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 457 NGNLSMATEVYDEMLRKGIKP-----DGYAYTtravgelrlGDSDKAFRLHEEMVATdhhapdltiynvridglckvgnl 531
Cdd:PLN03081  373 WGRMEDARNVFDRMPRKNLISwnaliAGYGNH---------GRGTKAVEMFERMIAE----------------------- 420
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 532 vkaiefqrkifrvGLVPDHVTYTTVIRGYLENGQFKMARNLYDEMLR-KRLYPSVITYFVLIYGHAKAGRLEQAFQYste 610
Cdd:PLN03081  421 -------------GVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSEnHRIKPRAMHYACMIELLGREGLLDEAYAM--- 484
                         410
                  ....*....|....*..
gi 1063685727 611 MKKRGVRPNVMTHNALL 627
Cdd:PLN03081  485 IRRAPFKPTVNMWAALL 501
PLN03218 PLN03218
maturation of RBCL 1; Provisional
221-508 9.60e-18

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 88.01  E-value: 9.60e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  221 KASAVYETMIEHGIMPTVITFNTMLDSCFKAGDLERVDKIWLEMKRRNIEFSEVTYNILINGFSKNGKMEEARRFHGDMR 300
Cdd:PLN03218   597 RAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDAR 676
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  301 RSGFAVTPYSFNPLIEGYCKQGLFDDAWGVTDEMLNAGIYPTTSTYNIYICALCDFGRIDDARELLSSMA----APDVVS 376
Cdd:PLN03218   677 KQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKrlglCPNTIT 756
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727  377 YNTLMHGYIKMGKFVEASLLFDDLRAGDIHPSIVTYNTLIdGLCesgnlegaqrlKEEMTTQLIFPDVITYTTLVKGFVK 456
Cdd:PLN03218   757 YSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLC-----------LRRFEKACALGEPVVSFDSGRPQIE 824
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1063685727  457 NGNLSMATEVYDEMLRKGIKPdgyayTTRAVGELrLG------DSDKAFRLHEEMVAT 508
Cdd:PLN03218   825 NKWTSWALMVYRETISAGTLP-----TMEVLSQV-LGclqlphDATLRNRLIENLGIS 876
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
334-662 1.15e-15

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 81.07  E-value: 1.15e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 334 MLNAGIYPTTSTYNIYICALCDFGRIDDARELLSSMAAPDVVSYNTLMHGYIKMGKFVEASLLFDDLRAGDIHPSIVTYN 413
Cdd:PLN03081  149 VESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFV 228
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 414 TLIDGLCESGNLEGAQRLKEEMTTQLIFPDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIkpdgYAYTTRAVGELRLG 493
Cdd:PLN03081  229 VMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTT----VAWNSMLAGYALHG 304
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 494 DSDKAFRLHEEM----VATDHHApdltiYNVRIDGLCKVGNLVKAIEFQRKIFRVGLVPDHVTYTTVIRGYLENGQFKMA 569
Cdd:PLN03081  305 YSEEALCLYYEMrdsgVSIDQFT-----FSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDA 379
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 570 RNLYDEMLRKrlypSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGNIDEAYRYLCKM-EE 648
Cdd:PLN03081  380 RNVFDRMPRK----NLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMsEN 455
                         330
                  ....*....|....
gi 1063685727 649 EGIPPNKYSYTMLI 662
Cdd:PLN03081  456 HRIKPRAMHYACMI 469
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
372-420 2.88e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.39  E-value: 2.88e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063685727 372 PDVVSYNTLMHGYIKMGKFVEASLLFDDLRAGDIHPSIVTYNTLIDGLC 420
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
236-285 1.65e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 65.08  E-value: 1.65e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063685727 236 PTVITFNTMLDSCFKAGDLERVDKIWLEMKRRNIEFSEVTYNILINGFSK 285
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
407-456 1.99e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 65.08  E-value: 1.99e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063685727 407 PSIVTYNTLIDGLCESGNLEGAQRLKEEMTTQLIFPDVITYTTLVKGFVK 456
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
548-597 1.61e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 62.38  E-value: 1.61e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063685727 548 PDHVTYTTVIRGYLENGQFKMARNLYDEMLRKRLYPSVITYFVLIYGHAK 597
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
342-692 1.86e-12

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 70.65  E-value: 1.86e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 342 TTSTYNIYICALCDFGRIDDARELLSSMA----APDVVSYNTLM------HGY---IKMGKFVEASLLFDDLRAGdihps 408
Cdd:PLN03077   50 STHDSNSQLRALCSHGQLEQALKLLESMQelrvPVDEDAYVALFrlcewkRAVeegSRVCSRALSSHPSLGVRLG----- 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 409 ivtyNTLIDGLCESGNLEGAQRLKEEMTTQlifpDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAY-----T 483
Cdd:PLN03077  125 ----NAMLSMFVRFGELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFpcvlrT 196
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 484 TRAVGELRLGDsdkafRLHEEMVATDhHAPDLTIYNVRIDGLCKVGNLVKAiefqRKIFRVGLVPDHVTYTTVIRGYLEN 563
Cdd:PLN03077  197 CGGIPDLARGR-----EVHAHVVRFG-FELDVDVVNALITMYVKCGDVVSA----RLVFDRMPRRDCISWNAMISGYFEN 266
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 564 GQFKMARNLYDEMLRKRLYPSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGNIDEAYRYL 643
Cdd:PLN03077  267 GECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVF 346
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*....
gi 1063685727 644 CKMEEEGIppnkYSYTMLISKNCDFEKWEEVVKLYKEMLDKEIEPDGYT 692
Cdd:PLN03077  347 SRMETKDA----VSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEIT 391
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
365-668 6.47e-12

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 68.74  E-value: 6.47e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 365 LLSSMAAPDVVSYNTLMHGYIKMGKFVEASLLFDDLRAgdihPSIVTYNTLIDGLCESGNLEGAQRLKEEMTTQLIFPDV 444
Cdd:PLN03081  149 VESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPE----RNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEP 224
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 445 ITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTTRAVGELRLGDSDKAFRLHEEMvatdhhaPDLTI--YNVRI 522
Cdd:PLN03081  225 RTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGM-------PEKTTvaWNSML 297
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 523 DGLCKVGNLVKAIEFQRKIFRVGLVPDHVTYTTVIRGYLENGQFKMARNLYDEMLRKRLYPSVITYFVLIYGHAKAGRLE 602
Cdd:PLN03081  298 AGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRME 377
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1063685727 603 QAFQYSTEMKKRgvrpNVMTHNALLYGMCKAGNIDEAYRYLCKMEEEGIPPNKYSYTMLISKnCDF 668
Cdd:PLN03081  378 DARNVFDRMPRK----NLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSA-CRY 438
PLN03077 PLN03077
Protein ECB2; Provisional
195-616 7.38e-12

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 68.72  E-value: 7.38e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 195 IRKGFLPSVRNCNIVLKVLRDSRMMNKASAVYETMIEHGIMptviTFNTMLDSCFKAGDLERVDKIWLEMKRRNIEFSEV 274
Cdd:PLN03077  315 VKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAV----SWTAMISGYEKNGLPDKALETYALMEQDNVSPDEI 390
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 275 TYNILINGFSKNGKMEEARRFHGDMRRSGFAVTPYSFNPLIEGYCKQGLFDDAWGVTDEMLNAGIYPTTSTyniyICALC 354
Cdd:PLN03077  391 TIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSI----IAGLR 466
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 355 DFGRIDDARELLSSMAA---PDVVSYNTLMH-----GYIKMGKFVEASLLFDDLRAGDIHPsivtyNTLIDGLCESGNLE 426
Cdd:PLN03077  467 LNNRCFEALIFFRQMLLtlkPNSVTLIAALSacariGALMCGKEIHAHVLRTGIGFDGFLP-----NALLDLYVRCGRMN 541
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 427 GA--QRLKEEmttqlifPDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTTRAVGELRLGDSDKAFRLHEE 504
Cdd:PLN03077  542 YAwnQFNSHE-------KDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHS 614
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 505 MVATDHHAPDLTIYNVRIDGLCKVGNLVKAIEFqrkIFRVGLVPDhvtytTVIRGYLENGqFKMARNL-YDEMLRKRLY- 582
Cdd:PLN03077  615 MEEKYSITPNLKHYACVVDLLGRAGKLTEAYNF---INKMPITPD-----PAVWGALLNA-CRIHRHVeLGELAAQHIFe 685
                         410       420       430
                  ....*....|....*....|....*....|....*..
gi 1063685727 583 --PSVITYFVLIYG-HAKAGRLEQAFQYSTEMKKRGV 616
Cdd:PLN03077  686 ldPNSVGYYILLCNlYADAGKWDEVARVRKTMRENGL 722
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
442-484 1.57e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 59.68  E-value: 1.57e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1063685727 442 PDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTT 484
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTI 43
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
272-320 2.34e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 58.91  E-value: 2.34e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063685727 272 SEVTYNILINGFSKNGKMEEARRFHGDMRRSGFAVTPYSFNPLIEGYCK 320
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
618-663 2.03e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 56.22  E-value: 2.03e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1063685727 618 PNVMTHNALLYGMCKAGNIDEAYRYLCKMEEEGIPPNKYSYTMLIS 663
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILIN 46
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
583-632 2.65e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 56.22  E-value: 2.65e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063685727 583 PSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCK 632
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
310-355 6.20e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 55.06  E-value: 6.20e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1063685727 310 SFNPLIEGYCKQGLFDDAWGVTDEMLNAGIYPTTSTYNIYICALCD 355
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
405-436 4.64e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 51.96  E-value: 4.64e-09
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063685727 405 IHPSIVTYNTLIDGLCESGNLEGAQRLKEEMT 436
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
491-649 4.83e-09

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 57.02  E-value: 4.83e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 491 RLGDSDKAFRLHEEMVATD-----HHapdltiYNVRIDgLCKVGNLVKAIEFQRKIFRV----------GLVPDHVTYTT 555
Cdd:pfam17177  23 KHADATGALALYDAAKAEGvrlaqYH------YNVLLY-LCSKAADATDLKPQLAADRGfevfeamkaqGVSPNEATYTA 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 556 VIRGYLENGQFKMARNLYDEMLRKRLYPSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGN 635
Cdd:pfam17177  96 VARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGR 175
                         170
                  ....*....|....
gi 1063685727 636 IDEAYRYLCKMEEE 649
Cdd:pfam17177 176 ADKVYAYLHRLRDA 189
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
245-375 6.40e-09

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 56.64  E-value: 6.40e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 245 LDSCFKAGDLERVDKIWLEMKRRNIEFSEVTYNILINGFSK---------NGKMEEARRFHGDMRRSGFAVTPYSFNPLI 315
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLYLCSKaadatdlkpQLAADRGFEVFEAMKAQGVSPNEATYTAVA 97
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 316 EGYCKQGLFDDAWGVTDEMLNAGIYPTTSTYNIYICALCDFGRIDDARELLSSMAAPDVV 375
Cdd:pfam17177  98 RLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVE 157
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
439-716 1.09e-08

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 57.05  E-value: 1.09e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 439 LIFPDVITYTTLVKG--FVKNGNLSMATEVYDEMLRkgIKPDgYAYTTRAVGEL--RLGDSDKAFRLHEEMVATDHHAPD 514
Cdd:COG2956     1 LLLPVAAALGWYFKGlnYLLNGQPDKAIDLLEEALE--LDPE-TVEAHLALGNLyrRRGEYDRAIRIHQKLLERDPDRAE 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 515 LT-----IYNvridglcKVGNLVKAIEFQRKIfrVGLVPDHVT-YTTVIRGYLENGQFKMARNLYDEMLRKRLYPSVItY 588
Cdd:COG2956    78 ALlelaqDYL-------KAGLLDRAEELLEKL--LELDPDDAEaLRLLAEIYEQEGDWEKAIEVLERLLKLGPENAHA-Y 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 589 FVLIYGHAKAGRLEQAFQY---STEMKKRGVRPNVMThnALLYGmcKAGNIDEAYRYLCKMEEEGiPPNKYSYTMLISKN 665
Cdd:COG2956   148 CELAELYLEQGDYDEAIEAlekALKLDPDCARALLLL--AELYL--EQGDYEEAIAALERALEQD-PDYLPALPRLAELY 222
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1063685727 666 CDFEKWEEVVKLYKEMLdkEIEPDGYTHRALFKHLEKDHESRE-VEFLERLL 716
Cdd:COG2956   223 EKLGDPEEALELLRKAL--ELDPSDDLLLALADLLERKEGLEAaLALLERQL 272
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
572-631 3.29e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 50.43  E-value: 3.29e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 572 LYDEMLRKRLYPSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLyGMC 631
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVI 60
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
614-647 3.88e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 49.65  E-value: 3.88e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063685727 614 RGVRPNVMTHNALLYGMCKAGNIDEAYRYLCKME 647
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
341-386 7.89e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.90  E-value: 7.89e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063685727 341 PTTSTYNIYICALCDFGRIDDARELLSSM----AAPDVVSYNTLMHGYIK 386
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMkkrgVKPNVYTYTILINGLCK 50
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
430-477 3.22e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 47.74  E-value: 3.22e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063685727 430 RLKEEMTTQLIFPDVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKP 477
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKP 48
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
445-478 3.66e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.68  E-value: 3.66e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063685727 445 ITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPD 478
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
274-304 4.11e-07

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 46.69  E-value: 4.11e-07
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063685727 274 VTYNILINGFSKNGKMEEARRFHGDMRRSGF 304
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
513-560 5.34e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.59  E-value: 5.34e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063685727 513 PDLTIYNVRIDGLCKVGNLVKAIEFQRKIFRVGLVPDHVTYTTVIRGY 560
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
610-663 6.28e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 46.97  E-value: 6.28e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1063685727 610 EMKKRGVRPNVMTHNALLYGMCKAGNIDEAYRYLCKMEEEGIPPNKYSYTMLIS 663
Cdd:pfam13812   5 EMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
440-470 6.51e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 46.18  E-value: 6.51e-07
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063685727 440 IFPDVITYTTLVKGFVKNGNLSMATEVYDEM 470
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
222-366 6.98e-07

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 50.47  E-value: 6.98e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 222 ASAVYETMIEHGIMPTVITFNTMLDSCFKAGD---------LERVDKIWLEMKRRNIEFSEVTYNILINGFSKNGKMEEA 292
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYNVLLYLCSKAADatdlkpqlaADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLA 109
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1063685727 293 RRFHGDMRRSGFAVTPYSFNPLIEGYCKQGLFDDAWGVTDEMLNAGIYPTTSTYNIYICALCDFGRIDDARELL 366
Cdd:pfam17177 110 FDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAYL 183
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
201-250 1.52e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 45.43  E-value: 1.52e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063685727 201 PSVRNCNIVLKVLRDSRMMNKASAVYETMIEHGIMPTVITFNTMLDSCFK 250
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
274-307 1.84e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.75  E-value: 1.84e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063685727 274 VTYNILINGFSKNGKMEEARRFHGDMRRSGFAVT 307
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
621-654 3.38e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.98  E-value: 3.38e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063685727 621 MTHNALLYGMCKAGNIDEAYRYLCKMEEEGIPPN 654
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
225-286 3.46e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 45.04  E-value: 3.46e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063685727 225 VYETMIEHGIMPTVITFNTMLDSCFKAGDLERVDKIWLEMKRRNIEFSEVTYNILINGFSKN 286
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
569-662 3.52e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 48.55  E-value: 3.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 569 ARNLYDEMLRKRLYPSVITYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGNIDEAYRYLCKMEE 648
Cdd:pfam17177  74 GFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLA 153
                          90
                  ....*....|....
gi 1063685727 649 EGIPPNKYSYTMLI 662
Cdd:pfam17177 154 HGVELEEPELAALL 167
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
267-300 4.53e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.49  E-value: 4.53e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063685727 267 RNIEFSEVTYNILINGFSKNGKMEEARRFHGDMR 300
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
653-700 6.39e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.51  E-value: 6.39e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063685727 653 PNKYSYTMLISKNCDFEKWEEVVKLYKEMLDKEIEPDGYTHRALFKHL 700
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
395-457 7.97e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.89  E-value: 7.97e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063685727 395 LLFDDLRAGDIHPSIVTYNTLIDGLCESGNLEGAQRLKEEMTTQLIFPDVITYTTLVKGFVKN 457
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
445-475 8.17e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.84  E-value: 8.17e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063685727 445 ITYTTLVKGFVKNGNLSMATEVYDEMLRKGI 475
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
551-585 9.38e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.83  E-value: 9.38e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063685727 551 VTYTTVIRGYLENGQFKMARNLYDEMLRKRLYPSV 585
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
338-369 1.03e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 42.72  E-value: 1.03e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063685727 338 GIYPTTSTYNIYICALCDFGRIDDARELLSSM 369
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
309-343 1.98e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.06  E-value: 1.98e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063685727 309 YSFNPLIEGYCKQGLFDDAWGVTDEMLNAGIYPTT 343
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
545-576 6.73e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 40.41  E-value: 6.73e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063685727 545 GLVPDHVTYTTVIRGYLENGQFKMARNLYDEM 576
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
443-662 7.28e-05

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 46.02  E-value: 7.28e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 443 DVITYTTLVKGFVKNGNLSMATEVYDEMLRKGIKPDGYAYTTRAVGELRLGDSDKAFRLHEEMVATdhhapDLTIYNVRI 522
Cdd:PLN03081  122 PASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPER-----NLASWGTII 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 523 DGLCKVGNLVKAIEFQRKIFRVGLVPDHVTYTTVIRGYLENGQFKMARNLYDEMLRKRLYPSVITYFVLIYGHAKAGRLE 602
Cdd:PLN03081  197 GGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIE 276
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 603 QAFQYSTEMKKRgvrpNVMTHNALLYGMCKAGNIDEAYRYLCKMEEEGIPPNKYSYTMLI 662
Cdd:PLN03081  277 DARCVFDGMPEK----TTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMI 332
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
239-270 8.80e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 8.80e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063685727 239 ITFNTMLDSCFKAGDLERVDKIWLEMKRRNIE 270
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIE 32
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
551-581 9.97e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 39.76  E-value: 9.97e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063685727 551 VTYTTVIRGYLENGQFKMARNLYDEMLRKRL 581
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
539-592 1.50e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.42  E-value: 1.50e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1063685727 539 RKIFRVGLVPDHVTYTTVIRGYLENGQFKMARNLYDEMLRKRLYPSVITYFVLI 592
Cdd:pfam13812   4 REMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
239-269 1.76e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.99  E-value: 1.76e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063685727 239 ITFNTMLDSCFKAGDLERVDKIWLEMKRRNI 269
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
372-401 1.91e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 39.25  E-value: 1.91e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063685727 372 PDVVSYNTLMHGYIKMGKFVEASLLFDDLR 401
Cdd:pfam12854   5 PDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
396-558 1.92e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 43.15  E-value: 1.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 396 LFDDLRAGDIHPSIVTYNTLIDgLCESGNLEgaqrlkeemttqlifPDVITYTTLVKGFvkngnlsmatEVYDEMLRKGI 475
Cdd:pfam17177  33 LYDAAKAEGVRLAQYHYNVLLY-LCSKAADA---------------TDLKPQLAADRGF----------EVFEAMKAQGV 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 476 KPDGYAYTTRAvgelRL----GDSDKAFRLHEEMVATDhHAPDLTIYNVRIDGLCKVGNLVKAIEFQRKIFRVGLVPDHV 551
Cdd:pfam17177  87 SPNEATYTAVA----RLaaakGDGDLAFDLVKEMEAAG-VSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEP 161

                  ....*..
gi 1063685727 552 TYTTVIR 558
Cdd:pfam17177 162 ELAALLK 168
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
375-409 2.19e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.98  E-value: 2.19e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063685727 375 VSYNTLMHGYIKMGKFVEASLLFDDLRAGDIHPSI 409
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
596-688 2.46e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 43.15  E-value: 2.46e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 596 AKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGN---------IDEAYRYLCKMEEEGIPPNKYSYTMLI---S 663
Cdd:pfam17177  22 SKHADATGALALYDAAKAEGVRLAQYHYNVLLYLCSKAADatdlkpqlaADRGFEVFEAMKAQGVSPNEATYTAVArlaA 101
                          90       100
                  ....*....|....*....|....*
gi 1063685727 664 KNCDFEKWEEVVklyKEMLDKEIEP 688
Cdd:pfam17177 102 AKGDGDLAFDLV---KEMEAAGVSP 123
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
191-248 4.34e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.88  E-value: 4.34e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1063685727 191 FEKMIRKGFLPSVRNCNIVLKVLRDSRMMNKASAVYETMIEHGIMPTVITFNTMLDSC 248
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
309-334 5.96e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 37.71  E-value: 5.96e-04
                          10        20
                  ....*....|....*....|....*.
gi 1063685727 309 YSFNPLIEGYCKQGLFDDAWGVTDEM 334
Cdd:pfam12854   8 VTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
259-318 6.19e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.49  E-value: 6.19e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 259 KIWLEMKRRNIEFSEVTYNILINGFSKNGKMEEARRFHGDMRRSGFAVTPYSFNPLIEGY 318
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
191-435 6.29e-04

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 42.94  E-value: 6.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 191 FEKMIRKGFLpsvrNCNIVLKVLRDSRMMNKASAVYETMIEHGIMPTVITFNTMLDSCFKAGDLERVDKIWLEMKR-RNI 269
Cdd:PLN03081  383 FDRMPRKNLI----SWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSEnHRI 458
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 270 EFSEVTYNILINGFSKNGKMEEARRFhgdMRRSGFAVTPysfnpliegyckqglfdDAWGVtdeMLNAgiypttstyniy 349
Cdd:PLN03081  459 KPRAMHYACMIELLGREGLLDEAYAM---IRRAPFKPTV-----------------NMWAA---LLTA------------ 503
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 350 iCAL---CDFGRIddARELLSSMAAPDVVSYNTLMHGYIKMGKFVEASLLFDDL--RAGDIHPSIV-------TYNTLID 417
Cdd:PLN03081  504 -CRIhknLELGRL--AAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLkrKGLSMHPACTwievkkqDHSFFSG 580
                         250
                  ....*....|....*...
gi 1063685727 418 GLCESGNLEGAQRLKEEM 435
Cdd:PLN03081  581 DRLHPQSREIYQKLDELM 598
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
207-396 7.59e-04

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 42.94  E-value: 7.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 207 NIVLKVLRDSRMMNKASAVYETMIEHGiMPTVITFNTML-DSCFKAGDLERVDKIWLEMKRRNIefseVTYNILINGFSK 285
Cdd:PLN03081  329 SIMIRIFSRLALLEHAKQAHAGLIRTG-FPLDIVANTALvDLYSKWGRMEDARNVFDRMPRKNL----ISWNALIAGYGN 403
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 286 NGKMEEARRFHGDMRRSGFAVTPYSFNPLIEGYCKQGLFDDAWGVTDEML-NAGIYPTTSTYNIYICALCDFGRIDDARE 364
Cdd:PLN03081  404 HGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSeNHRIKPRAMHYACMIELLGREGLLDEAYA 483
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 1063685727 365 LLSSmaAPDVVSYN--------TLMHGYIKMGKFVEASLL 396
Cdd:PLN03081  484 MIRR--APFKPTVNmwaalltaCRIHKNLELGRLAAEKLY 521
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
410-435 8.83e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 8.83e-04
                          10        20
                  ....*....|....*....|....*.
gi 1063685727 410 VTYNTLIDGLCESGNLEGAQRLKEEM 435
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEM 26
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
372-416 9.17e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.11  E-value: 9.17e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 1063685727 372 PDVVSYNTLMHGYIKMGKFVEASLLFDDLRAGDIHPSIVTYNTLI 416
Cdd:pfam13812  13 LNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
622-651 9.27e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 9.27e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063685727 622 THNALLYGMCKAGNIDEAYRYLCKMEEEGI 651
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
569-715 1.61e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 40.46  E-value: 1.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 569 ARNLYDEMLRKRLYPSVITYFVLIYGHAKAG---------RLEQAFQYSTEMKKRGVRPNVMTHNALLYGMCKAGNIDEA 639
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYNVLLYLCSKAAdatdlkpqlAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLA 109
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1063685727 640 YRYLCKMEEEGIPPNKYSYTMLISKNCDFEKWEEVVKLYKEMLDKEIEPDGYTHRALFK-HLEKDHESREVEFLERL 715
Cdd:pfam17177 110 FDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKvSAKAGRADKVYAYLHRL 186
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
490-579 2.14e-03

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 37.84  E-value: 2.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 490 LRLGDSDKAFRLHEEMVATDHHAPDltIYNVRIDGLCKVGNLVKAIEFQRKIfrvGLVPDHV-TYTTVIRGYLENGQFKM 568
Cdd:COG3063     3 LKLGDLEEAEEYYEKALELDPDNAD--ALNNLGLLLLEQGRYDEAIALEKAL---KLDPNNAeALLNLAELLLELGDYDE 77
                          90
                  ....*....|.
gi 1063685727 569 ARNLYDEMLRK 579
Cdd:COG3063    78 ALAYLERALEL 88
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
466-522 2.38e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 36.95  E-value: 2.38e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1063685727 466 VYDEMLRKGIKPDGYAYTTRAVGELRLGDSDKAFRLHEEMvATDHHAPDLTIYNVRI 522
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERM-KKKGIKPTLDTYNAIL 57
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
232-265 2.91e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.78  E-value: 2.91e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063685727 232 HGIMPTVITFNTMLDSCFKAGDLERVDKIWLEMK 265
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
309-339 3.67e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.52  E-value: 3.67e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063685727 309 YSFNPLIEGYCKQGLFDDAWGVTDEMLNAGI 339
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
560-648 3.73e-03

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 37.07  E-value: 3.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063685727 560 YLENGQFKMARNLYDEMLRkrLYPSVI-TYFVLIYGHAKAGRLEQAFQYSTEMKKRGVRPNVMTHNALLYGmcKAGNIDE 638
Cdd:COG3063     2 YLKLGDLEEAEEYYEKALE--LDPDNAdALNNLGLLLLEQGRYDEAIALEKALKLDPNNAEALLNLAELLL--ELGDYDE 77
                          90
                  ....*....|
gi 1063685727 639 AYRYLCKMEE 648
Cdd:COG3063    78 ALAYLERALE 87
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
375-401 6.07e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 6.07e-03
                          10        20
                  ....*....|....*....|....*..
gi 1063685727 375 VSYNTLMHGYIKMGKFVEASLLFDDLR 401
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMK 27
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
590-620 6.41e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 34.74  E-value: 6.41e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063685727 590 VLIYGHAKAGRLEQAFQYSTEMKKRGVRPNV 620
Cdd:TIGR00756   5 TLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH