NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1063703982|ref|NP_001325411|]
View 

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03081 super family cl33631
pentatricopeptide (PPR) repeat-containing protein; Provisional
57-627 1.37e-27

pentatricopeptide (PPR) repeat-containing protein; Provisional


The actual alignment was detected with superfamily member PLN03081:

Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 119.59  E-value: 1.37e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  57 EIQELHNLILSSSIQKTKLSSLLSVVSIFAKSNHIDKAFPQFQLVRSRFPENKPsVYLYNLLLESCIKERRVEFVSWLYK 136
Cdd:PLN03081   69 EVSESKDARLDDTQIRKSGVSLCSQIEKLVACGRHREALELFEILEAGCPFTLP-ASTYDALVEACIALKSIRCVKAVYW 147
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 137 DMVLCGIAPQTYTFNLLIRALCDSSCVDAARELFDEMPEKgckpNEFTFGILVRGYCKAGLTDKGLELLNAMesfgvlpn 216
Cdd:PLN03081  148 HVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREM-------- 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 217 kviyntivssfCREGRNDDSEKMVEKMREeglvpdivtfnSRISALCKEGKVLDASRIFSDMELDEYLGLPrpnsitynl 296
Cdd:PLN03081  216 -----------WEDGSDAEPRTFVVMLRA-----------SAGLGSARAGQQLHCCVLKTGVVGDTFVSCA--------- 264
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 297 MLKGFCKVGLLEDAKTLFESIREnddlASLQSYNIWLQGLVRHGKFIEAETVLKQMTDKGIGPSIYSYNILMDGLCKLGM 376
Cdd:PLN03081  265 LIDMYSKCGDIEDARCVFDGMPE----KTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLAL 340
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 377 LSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLpnayTCNILLHSLWKMGRISEAEELLRK 456
Cdd:PLN03081  341 LEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLI----SWNALIAGYGNHGRGTKAVEMFER 416
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 457 MNEKGYGLDTVTCNIIVDGLCGSGELDKAIEIVKGM-RVHG--------SAALGNLGNSyiGLVDD--SLIENNCL-PDL 524
Cdd:PLN03081  417 MIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMsENHRikpramhyACMIELLGRE--GLLDEayAMIRRAPFkPTV 494
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 525 ITYSTLLNGlCKAGRFAEAKNLFAE----MMGEKLQPDSVAYNIFIhhfcKQGKISSAFRVLKDMEKKG------C---- 590
Cdd:PLN03081  495 NMWAALLTA-CRIHKNLELGRLAAEklygMGPEKLNNYVVLLNLYN----SSGRQAEAAKVVETLKRKGlsmhpaCtwie 569
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....*...
gi 1063703982 591 -----HK-----SLETYNSlilglgiknQIFE-IHGLMDEMKEKGISP 627
Cdd:PLN03081  570 vkkqdHSffsgdRLHPQSR---------EIYQkLDELMKEISEYGYVA 608
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
627-676 5.74e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 61.23  E-value: 5.74e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063703982 627 PNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNVFSFKYLIEAFCK 676
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
57-627 1.37e-27

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 119.59  E-value: 1.37e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  57 EIQELHNLILSSSIQKTKLSSLLSVVSIFAKSNHIDKAFPQFQLVRSRFPENKPsVYLYNLLLESCIKERRVEFVSWLYK 136
Cdd:PLN03081   69 EVSESKDARLDDTQIRKSGVSLCSQIEKLVACGRHREALELFEILEAGCPFTLP-ASTYDALVEACIALKSIRCVKAVYW 147
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 137 DMVLCGIAPQTYTFNLLIRALCDSSCVDAARELFDEMPEKgckpNEFTFGILVRGYCKAGLTDKGLELLNAMesfgvlpn 216
Cdd:PLN03081  148 HVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREM-------- 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 217 kviyntivssfCREGRNDDSEKMVEKMREeglvpdivtfnSRISALCKEGKVLDASRIFSDMELDEYLGLPrpnsitynl 296
Cdd:PLN03081  216 -----------WEDGSDAEPRTFVVMLRA-----------SAGLGSARAGQQLHCCVLKTGVVGDTFVSCA--------- 264
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 297 MLKGFCKVGLLEDAKTLFESIREnddlASLQSYNIWLQGLVRHGKFIEAETVLKQMTDKGIGPSIYSYNILMDGLCKLGM 376
Cdd:PLN03081  265 LIDMYSKCGDIEDARCVFDGMPE----KTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLAL 340
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 377 LSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLpnayTCNILLHSLWKMGRISEAEELLRK 456
Cdd:PLN03081  341 LEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLI----SWNALIAGYGNHGRGTKAVEMFER 416
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 457 MNEKGYGLDTVTCNIIVDGLCGSGELDKAIEIVKGM-RVHG--------SAALGNLGNSyiGLVDD--SLIENNCL-PDL 524
Cdd:PLN03081  417 MIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMsENHRikpramhyACMIELLGRE--GLLDEayAMIRRAPFkPTV 494
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 525 ITYSTLLNGlCKAGRFAEAKNLFAE----MMGEKLQPDSVAYNIFIhhfcKQGKISSAFRVLKDMEKKG------C---- 590
Cdd:PLN03081  495 NMWAALLTA-CRIHKNLELGRLAAEklygMGPEKLNNYVVLLNLYN----SSGRQAEAAKVVETLKRKGlsmhpaCtwie 569
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....*...
gi 1063703982 591 -----HK-----SLETYNSlilglgiknQIFE-IHGLMDEMKEKGISP 627
Cdd:PLN03081  570 vkkqdHSffsgdRLHPQSR---------EIYQkLDELMKEISEYGYVA 608
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
145-194 4.14e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.00  E-value: 4.14e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063703982 145 PQTYTFNLLIRALCDSSCVDAARELFDEMPEKGCKPNEFTFGILVRGYCK 194
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
627-676 5.74e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 61.23  E-value: 5.74e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063703982 627 PNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNVFSFKYLIEAFCK 676
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
362-396 4.27e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.68  E-value: 4.27e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063703982 362 YSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDA 396
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
630-664 4.76e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.21  E-value: 4.76e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063703982 630 CTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNV 664
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
 
Name Accession Description Interval E-value
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
57-627 1.37e-27

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 119.59  E-value: 1.37e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  57 EIQELHNLILSSSIQKTKLSSLLSVVSIFAKSNHIDKAFPQFQLVRSRFPENKPsVYLYNLLLESCIKERRVEFVSWLYK 136
Cdd:PLN03081   69 EVSESKDARLDDTQIRKSGVSLCSQIEKLVACGRHREALELFEILEAGCPFTLP-ASTYDALVEACIALKSIRCVKAVYW 147
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 137 DMVLCGIAPQTYTFNLLIRALCDSSCVDAARELFDEMPEKgckpNEFTFGILVRGYCKAGLTDKGLELLNAMesfgvlpn 216
Cdd:PLN03081  148 HVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREM-------- 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 217 kviyntivssfCREGRNDDSEKMVEKMREeglvpdivtfnSRISALCKEGKVLDASRIFSDMELDEYLGLPrpnsitynl 296
Cdd:PLN03081  216 -----------WEDGSDAEPRTFVVMLRA-----------SAGLGSARAGQQLHCCVLKTGVVGDTFVSCA--------- 264
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 297 MLKGFCKVGLLEDAKTLFESIREnddlASLQSYNIWLQGLVRHGKFIEAETVLKQMTDKGIGPSIYSYNILMDGLCKLGM 376
Cdd:PLN03081  265 LIDMYSKCGDIEDARCVFDGMPE----KTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLAL 340
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 377 LSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLpnayTCNILLHSLWKMGRISEAEELLRK 456
Cdd:PLN03081  341 LEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLI----SWNALIAGYGNHGRGTKAVEMFER 416
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 457 MNEKGYGLDTVTCNIIVDGLCGSGELDKAIEIVKGM-RVHG--------SAALGNLGNSyiGLVDD--SLIENNCL-PDL 524
Cdd:PLN03081  417 MIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMsENHRikpramhyACMIELLGRE--GLLDEayAMIRRAPFkPTV 494
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 525 ITYSTLLNGlCKAGRFAEAKNLFAE----MMGEKLQPDSVAYNIFIhhfcKQGKISSAFRVLKDMEKKG------C---- 590
Cdd:PLN03081  495 NMWAALLTA-CRIHKNLELGRLAAEklygMGPEKLNNYVVLLNLYN----SSGRQAEAAKVVETLKRKGlsmhpaCtwie 569
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....*...
gi 1063703982 591 -----HK-----SLETYNSlilglgiknQIFE-IHGLMDEMKEKGISP 627
Cdd:PLN03081  570 vkkqdHSffsgdRLHPQSR---------EIYQkLDELMKEISEYGYVA 608
PLN03218 PLN03218
maturation of RBCL 1; Provisional
220-713 6.32e-26

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 114.97  E-value: 6.32e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  220 YNTIVSSFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCK-EGKVLDASRiFSDMeldeylgLPRPNSITYNLML 298
Cdd:PLN03218   373 YIDAYNRLLRDGRIKDCIDLLEDMEKRGLLDMDKIYHAKFFKACKkQRAVKEAFR-FAKL-------IRNPTLSTFNMLM 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  299 KGFCKVGLLEDAKTLFESIRENDDLASLQSYNIWLQGLVRHGKFIEAETVLKQMTDKGIGPSIYSYNILMDGLCKLGMLS 378
Cdd:PLN03218   445 SVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVA 524
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  379 DAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMrnnclpnaytcnillhslwkmgriSEAEELLRkmn 458
Cdd:PLN03218   525 KAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMK------------------------AETHPIDP--- 577
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  459 ekgyglDTVTCNIIVDGLCGSGELDKAIEIVKGMRvhgsaalgnlgnsyiglvddsliENNCLPDLITYSTLLNGLCKAG 538
Cdd:PLN03218   578 ------DHITVGALMKACANAGQVDRAKEVYQMIH-----------------------EYNIKGTPEVYTIAVNSCSQKG 628
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  539 RFAEAKNLFAEMMGEKLQPDSVAYNIFIHHFCKQGKISSAFRVLKDMEKKGCHKSLETYNSLiLGL--GIKN-----QIF 611
Cdd:PLN03218   629 DWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSL-MGAcsNAKNwkkalELY 707
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  612 EihglmdEMKEKGISPNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNVFSFKYLIEAFCKVPDFDMAQEVFETA- 690
Cdd:PLN03218   708 E------DIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAk 781
                          490       500       510
                   ....*....|....*....|....*....|
gi 1063703982  691 -------VSICGQKEGLYSLMFNELLAAGQ 713
Cdd:PLN03218   782 edgikpnLVMCRCITGLCLRRFEKACALGE 811
PLN03218 PLN03218
maturation of RBCL 1; Provisional
70-477 1.12e-24

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 111.12  E-value: 1.12e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982   70 IQKTKLSSLLSVVSIFAKSNHIDKAFPQFQLVRSRfpENKPSVYLYNLLLESCIKerrvefvswlykdmvlcgiapqtyt 149
Cdd:PLN03218   432 IRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEA--GLKADCKLYTTLISTCAK------------------------- 484
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  150 fnlliralcdSSCVDAARELFDEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVSSFCR 229
Cdd:PLN03218   485 ----------SGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQ 554
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  230 EGRNDDSEKMVEKMREEG--LVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLGLPRPNSITYNlmlkgFC-KVGL 306
Cdd:PLN03218   555 SGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVN-----SCsQKGD 629
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  307 LEDAKTLFESIRENDdlasLQSYNIWLQGLVR---HGKFI-EAETVLKQMTDKGIGPSIYSYNILMDGLCKLGMLSDAKT 382
Cdd:PLN03218   630 WDFALSIYDDMKKKG----VKPDEVFFSALVDvagHAGDLdKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALE 705
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  383 IVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCNILLHSLWKMGRISEAEELLRKMNEKGY 462
Cdd:PLN03218   706 LYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGI 785
                          410
                   ....*....|....*
gi 1063703982  463 GLDTVTCNIIVdGLC 477
Cdd:PLN03218   786 KPNLVMCRCIT-GLC 799
PLN03077 PLN03077
Protein ECB2; Provisional
163-655 6.49e-20

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 95.30  E-value: 6.49e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 163 VDAAReLFDEMPEKgckpNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVSSfC------REGRndds 236
Cdd:PLN03077  138 VHAWY-VFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRT-CggipdlARGR---- 207
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 237 EKMVEKMReEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMeldeylglPRPNSITYNLMLKGFCKVGLLEDAKTLFES 316
Cdd:PLN03077  208 EVHAHVVR-FGFELDVDVVNALITMYVKCGDVVSARLVFDRM--------PRRDCISWNAMISGYFENGECLEGLELFFT 278
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 317 IREND---DLASLQS--------------------------------YNIWLQGLVRHGKFIEAETVLKQMTDKgigpSI 361
Cdd:PLN03077  279 MRELSvdpDLMTITSvisacellgderlgremhgyvvktgfavdvsvCNSLIQMYLSLGSWGEAEKVFSRMETK----DA 354
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 362 YSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCNILLHSL 441
Cdd:PLN03077  355 VSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMY 434
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 442 WKMGRISEAEELLRKMNEKgyglDTVTCNIIVDGLCGSGELDKAIEIVKGMRVH-------------GSAALGNLG---- 504
Cdd:PLN03077  435 SKCKCIDKALEVFHNIPEK----DVISWTSIIAGLRLNNRCFEALIFFRQMLLTlkpnsvtliaalsACARIGALMcgke 510
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 505 -NSYI---GLVDDSLIENnclpdlitysTLLNGLCKAGRFAEAKNLFAemMGEKlqpDSVAYNIFIHHFCKQGKISSAFR 580
Cdd:PLN03077  511 iHAHVlrtGIGFDGFLPN----------ALLDLYVRCGRMNYAWNQFN--SHEK---DVVSWNILLTGYVAHGKGSMAVE 575
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1063703982 581 VLKDMEKKGCHKSLETYNSLILGLGIKNQIFEIHGLMDEMKEK-GISPNICTYNTAIQYLCEGEKVEDATNLLDEM 655
Cdd:PLN03077  576 LFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKySITPNLKHYACVVDLLGRAGKLTEAYNFINKM 651
PLN03218 PLN03218
maturation of RBCL 1; Provisional
285-772 4.73e-19

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 92.63  E-value: 4.73e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  285 GLPRPNSIT-----YNLMLKGfckvGLLEDAKTLFESIRENDDLASLQSYniwlqglvrHGKFIEAETVLKQMTD----- 354
Cdd:PLN03218   363 GVSGKRKSPeyidaYNRLLRD----GRIKDCIDLLEDMEKRGLLDMDKIY---------HAKFFKACKKQRAVKEafrfa 429
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  355 KGI-GPSIYSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYT 433
Cdd:PLN03218   430 KLIrNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHT 509
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  434 CNILLHSLWKMGRISEAEELLRKMNEKGYGLDTVTCNIIVDGlCG-SGELDKAIEIVKGMRVHGSAALgnlgnsyiglvd 512
Cdd:PLN03218   510 FGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISA-CGqSGAVDRAFDVLAEMKAETHPID------------ 576
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  513 dsliennclPDLITYSTLLNGLCKAGRFAEAKNLFaEMMGE---KLQPDsvAYNIFIHHFCKQGKISSAFRVLKDMEKKG 589
Cdd:PLN03218   577 ---------PDHITVGALMKACANAGQVDRAKEVY-QMIHEyniKGTPE--VYTIAVNSCSQKGDWDFALSIYDDMKKKG 644
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  590 CHKSLETYNSLILGLGIKNQIFEIHGLMDEMKEKGISPNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNVFSFKY 669
Cdd:PLN03218   645 VKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNA 724
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  670 LIEAFCKvpdfdmaqevfetavsicgqkeglyslmfnellaAGQLLKATELLEAVLDRGFELGTFLYKDLVESLCKKDEL 749
Cdd:PLN03218   725 LITALCE----------------------------------GNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDA 770
                          490       500
                   ....*....|....*....|...
gi 1063703982  750 EVASGILhkMIDRGYGFDPAALM 772
Cdd:PLN03218   771 DVGLDLL--SQAKEDGIKPNLVM 791
PLN03077 PLN03077
Protein ECB2; Provisional
142-589 2.67e-18

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 90.29  E-value: 2.67e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 142 GIAPQTYTFNLLIRALCDSSCVDAARELFDEMPEKgckpNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYN 221
Cdd:PLN03077  318 GFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIA 393
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 222 TIVSSFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMeldeylglPRPNSITYNLMLKGF 301
Cdd:PLN03077  394 SVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNI--------PEKDVISWTSIIAGL 465
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 302 CKVGLLEDAKTLFESIrenddLASLQSYNIWL---------QGLVRHGKFIEAEtVLKqmtdKGIGPSIYSYNILMDGLC 372
Cdd:PLN03077  466 RLNNRCFEALIFFRQM-----LLTLKPNSVTLiaalsacarIGALMCGKEIHAH-VLR----TGIGFDGFLPNALLDLYV 535
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 373 KLGMLSDAKTIVGLMKRngvcpDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCNILLHSLWKMGRISEAEE 452
Cdd:PLN03077  536 RCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLE 610
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 453 LLRKMNEKgYGldtVTCNI-----IVDGLCGSGELDKAIEIVKGMRVHgsaalgnlgnsyiglvddsliennclPDLITY 527
Cdd:PLN03077  611 YFHSMEEK-YS---ITPNLkhyacVVDLLGRAGKLTEAYNFINKMPIT--------------------------PDPAVW 660
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063703982 528 STLLNGlCKAGRFAEAKNLFAEMMGEkLQPDSVAYNIFI-HHFCKQGKISSAFRVLKDMEKKG 589
Cdd:PLN03077  661 GALLNA-CRIHRHVELGELAAQHIFE-LDPNSVGYYILLcNLYADAGKWDEVARVRKTMRENG 721
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
145-194 4.14e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.00  E-value: 4.14e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063703982 145 PQTYTFNLLIRALCDSSCVDAARELFDEMPEKGCKPNEFTFGILVRGYCK 194
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
394-441 2.70e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 64.69  E-value: 2.70e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063703982 394 PDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCNILLHSL 441
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
522-571 8.55e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.54  E-value: 8.55e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063703982 522 PDLITYSTLLNGLCKAGRFAEAKNLFAEMMGEKLQPDSVAYNIFIHHFCK 571
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
557-604 9.90e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.15  E-value: 9.90e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063703982 557 PDSVAYNIFIHHFCKQGKISSAFRVLKDMEKKGCHKSLETYNSLILGL 604
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PLN03218 PLN03218
maturation of RBCL 1; Provisional
75-257 5.17e-12

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 69.91  E-value: 5.17e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982   75 LSSLLSVVsifAKSNHIDKAFPQFQLVRSRfpENKPSVYLYNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLI 154
Cdd:PLN03218   652 FSALVDVA---GHAGDLDKAFEILQDARKQ--GIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALI 726
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  155 RALCDSSCVDAARELFDEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVsSFCRegrnd 234
Cdd:PLN03218   727 TALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLCL----- 800
                          170       180
                   ....*....|....*....|...
gi 1063703982  235 dseKMVEKMREEGLVpdIVTFNS 257
Cdd:PLN03218   801 ---RRFEKACALGEP--VVSFDS 818
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
627-676 5.74e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 61.23  E-value: 5.74e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063703982 627 PNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNVFSFKYLIEAFCK 676
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
250-303 1.33e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 60.07  E-value: 1.33e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1063703982 250 PDIVTFNSRISALCKEGKVLDASRIFSDMELdeyLGLPrPNSITYNLMLKGFCK 303
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKK---RGVK-PNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
359-407 1.59e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 59.68  E-value: 1.59e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063703982 359 PSIYSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDAVTYGCLLHGYC 407
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PLN03218 PLN03218
maturation of RBCL 1; Provisional
86-265 5.38e-11

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 66.82  E-value: 5.38e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982   86 AKSNHIDKAFPQFQLVRSRFPENKPSVYlyNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCDSSCVDA 165
Cdd:PLN03218   590 ANAGQVDRAKEVYQMIHEYNIKGTPEVY--TIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDK 667
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982  166 ARELFDEMPEKG--------------C---------------------KPNEFTFGILVRGYCKAGLTDKGLELLNAMES 210
Cdd:PLN03218   668 AFEILQDARKQGiklgtvsysslmgaCsnaknwkkalelyediksiklRPTVSTMNALITALCEGNQLPKALEVLSEMKR 747
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1063703982  211 FGVLPNKVIYNTIVSSFCREGRNDDSEKMVEKMREEGLVPDIVTFNSrISALCKE 265
Cdd:PLN03218   748 LGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRC-ITGLCLR 801
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
215-264 2.14e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 56.60  E-value: 2.14e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063703982 215 PNKVIYNTIVSSFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCK 264
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
518-550 6.68e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 54.66  E-value: 6.68e-10
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063703982 518 NNCLPDLITYSTLLNGLCKAGRFAEAKNLFAEM 550
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
328-373 7.78e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 55.06  E-value: 7.78e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1063703982 328 SYNIWLQGLVRHGKFIEAETVLKQMTDKGIGPSIYSYNILMDGLCK 373
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
102-264 1.64e-09

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 58.56  E-value: 1.64e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 102 RSRFPENKP-SVYLYNLllESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRaLCDSS----------CVDAARELF 170
Cdd:pfam17177   2 RKKKGKQTPeSELRFQL--DKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKAadatdlkpqlAADRGFEVF 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 171 DEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVSSFCREGRNDDSEKMVEKMREEGLVP 250
Cdd:pfam17177  79 EAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
                         170
                  ....*....|....
gi 1063703982 251 DivtfNSRISALCK 264
Cdd:pfam17177 159 E----EPELAALLK 168
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
176-209 7.41e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 51.96  E-value: 7.41e-09
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063703982 176 KGCKPNEFTFGILVRGYCKAGLTDKGLELLNAME 209
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
465-536 8.07e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.98  E-value: 8.07e-09
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063703982 465 DTVTCNIIVDGLCGSGELDKAIEIVKGMRvhgsaalgnlgnsyiglvddsliENNCLPDLITYSTLLNGLCK 536
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMK-----------------------KRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
211-244 8.66e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 51.58  E-value: 8.66e-09
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063703982 211 FGVLPNKVIYNTIVSSFCREGRNDDSEKMVEKMR 244
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
390-422 1.57e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.81  E-value: 1.57e-08
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063703982 390 NGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEM 422
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
180-229 1.85e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.21  E-value: 1.85e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063703982 180 PNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVSSFCR 229
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
246-279 3.03e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.04  E-value: 3.03e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063703982 246 EGLVPDIVTFNSRISALCKEGKVLDASRIFSDME 279
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
395-688 1.58e-07

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 55.26  E-value: 1.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 395 DAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCNILLHSLWKMGRISEAEELLRKMNEKgyglDTVTCNIIVD 474
Cdd:PLN03081  122 PASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPER----NLASWGTIIG 197
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 475 GLCGSGELDKAIEIVKGMRVHGSA-----------ALGNLGNSYIG-----------LVDDSLIenNClpdlitysTLLN 532
Cdd:PLN03081  198 GLVDAGNYREAFALFREMWEDGSDaeprtfvvmlrASAGLGSARAGqqlhccvlktgVVGDTFV--SC--------ALID 267
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 533 GLCKAGRFAEAKNLFaEMMGEKlqpDSVAYNIFIHHFCKQGKISSAFRVLKDMEKKGCHKSLETYNSLIlglgiknQIFE 612
Cdd:PLN03081  268 MYSKCGDIEDARCVF-DGMPEK---TTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMI-------RIFS 336
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 613 IHGLMDEMKE-------KGISPNICTyNTA-IQYLCEGEKVEDATNLLDEMMQKNIapnvFSFKYLIEAFCKVPDFDMAQ 684
Cdd:PLN03081  337 RLALLEHAKQahaglirTGFPLDIVA-NTAlVDLYSKWGRMEDARNVFDRMPRKNL----ISWNALIAGYGNHGRGTKAV 411

                  ....
gi 1063703982 685 EVFE 688
Cdd:PLN03081  412 EMFE 415
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
362-396 4.27e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.68  E-value: 4.27e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063703982 362 YSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDA 396
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
525-559 6.14e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.29  E-value: 6.14e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063703982 525 ITYSTLLNGLCKAGRFAEAKNLFAEMMGEKLQPDS 559
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
397-431 7.92e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.91  E-value: 7.92e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063703982 397 VTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNA 431
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
148-182 8.24e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.91  E-value: 8.24e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063703982 148 YTFNLLIRALCDSSCVDAARELFDEMPEKGCKPNE 182
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
555-586 1.11e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 45.80  E-value: 1.11e-06
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063703982 555 LQPDSVAYNIFIHHFCKQGKISSAFRVLKDME 586
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
110-159 1.38e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 45.82  E-value: 1.38e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063703982 110 PSVYLYNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCD 159
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
580-643 1.75e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 45.81  E-value: 1.75e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1063703982 580 RVLKDMEKKGCHKSLETYNSLILGL---GIKNQIFEIhglMDEMKEKGISPNICTYNtAIQYLCEGE 643
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYanvGNLKLALEI---FERMKKKGIKPTLDTYN-AILGVIGGR 63
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
546-601 1.97e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 45.81  E-value: 1.97e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1063703982 546 LFAEMMGEKLQPDSVAYNIFIHHFCKQGKISSAFRVLKDMEKKGCHKSLETYNSLI 601
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
616-674 2.54e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 45.43  E-value: 2.54e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1063703982 616 LMDEMKEKGISPNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNVFSFKYLIEAF 674
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
623-655 2.88e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 44.64  E-value: 2.88e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063703982 623 KGISPNICTYNTAIQYLCEGEKVEDATNLLDEM 655
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
204-265 4.95e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 44.66  E-value: 4.95e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063703982 204 LLNAMESFGVLPNKVIYNTIVSSFCREGRNDDSEKMVEKMREEGLVPDIVTFNSRISALCKE 265
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
289-315 1.37e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 42.72  E-value: 1.37e-05
                          10        20
                  ....*....|....*....|....*..
gi 1063703982 289 PNSITYNLMLKGFCKVGLLEDAKTLFE 315
Cdd:pfam12854   5 PDVVTYNTLINGLCRAGRVDEAFELLD 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
560-590 1.39e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.44  E-value: 1.39e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063703982 560 VAYNIFIHHFCKQGKISSAFRVLKDMEKKGC 590
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
425-457 1.84e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 42.33  E-value: 1.84e-05
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063703982 425 NNCLPNAYTCNILLHSLWKMGRISEAEELLRKM 457
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
218-252 3.10e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 3.10e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063703982 218 VIYNTIVSSFCREGRNDDSEKMVEKMREEGLVPDI 252
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
134-192 6.42e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 41.57  E-value: 6.42e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1063703982 134 LYKDMVLCGIAPQTYTFNLLIRALCDSSCVDAARELFDEMPEKGCKPNEFTFGILVRGY 192
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
183-216 6.48e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 6.48e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063703982 183 FTFGILVRGYCKAGLTDKGLELLNAMESFGVLPN 216
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
387-438 6.54e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 41.57  E-value: 6.54e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1063703982 387 MKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCNILL 438
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
526-794 7.25e-05

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 46.40  E-value: 7.25e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 526 TYSTLLNGLCKAGRFAEAKNLFAEMMGEKLQPDSVAYNIFIHHFCKQGKISSAFRVLKDMEKKgchkSLETYNSLILGLG 605
Cdd:PLN03081  125 TYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPER----NLASWGTIIGGLV 200
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 606 IKNQIFEIHGLMDEMKEKGISPNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNVFSFKYLIEAFCKVPDFDMAQE 685
Cdd:PLN03081  201 DAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARC 280
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 686 VFETAvsicGQKEglySLMFNELLAA----GQLLKATELLEAVLDRGFELGTFLYKDLVESLCKKDELEVASGILHKMID 761
Cdd:PLN03081  281 VFDGM----PEKT---TVAWNSMLAGyalhGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIR 353
                         250       260       270
                  ....*....|....*....|....*....|...
gi 1063703982 762 RGYGFDPAALMPVIDGLGKMGNKKEANSFADKM 794
Cdd:PLN03081  354 TGFPLDIVANTALVDLYSKWGRMEDARNVFDRM 386
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
397-427 8.10e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.14  E-value: 8.10e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063703982 397 VTYGCLLHGYCSVGKVDAAKSLLQEMMRNNC 427
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
355-388 8.59e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 40.41  E-value: 8.59e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063703982 355 KGIGPSIYSYNILMDGLCKLGMLSDAKTIVGLMK 388
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
571-688 8.79e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 44.69  E-value: 8.79e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 571 KQGKISSAFRVLKDMEKKGCHKSLETYNSLI-------------LGLGIKNqIFEIhglMDEMKEKGISPNICTYnTAIQ 637
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLylcskaadatdlkPQLAADR-GFEV---FEAMKAQGVSPNEATY-TAVA 97
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1063703982 638 YLCEgeKVED---ATNLLDEMMQKNIAPNVFSFKYLIEAFCKVPDFDMAQEVFE 688
Cdd:pfam17177  98 RLAA--AKGDgdlAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEE 149
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
362-392 1.21e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 39.76  E-value: 1.21e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063703982 362 YSYNILMDGLCKLGMLSDAKTIVGLMKRNGV 392
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
348-403 1.22e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.80  E-value: 1.22e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1063703982 348 VLKQMTDKGIGPSIYSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDAVTYGCLL 403
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
460-493 1.35e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 39.64  E-value: 1.35e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063703982 460 KGYGLDTVTCNIIVDGLCGSGELDKAIEIVKGMR 493
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
432-466 1.85e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.36  E-value: 1.85e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063703982 432 YTCNILLHSLWKMGRISEAEELLRKMNEKGYGLDT 466
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
148-178 1.96e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 39.37  E-value: 1.96e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063703982 148 YTFNLLIRALCDSSCVDAARELFDEMPEKGC 178
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
560-590 2.20e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.99  E-value: 2.20e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063703982 560 VAYNIFIHHFCKQGKISSAFRVLKDMEKKGC 590
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
525-555 3.11e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.60  E-value: 3.11e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063703982 525 ITYSTLLNGLCKAGRFAEAKNLFAEMMGEKL 555
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
328-361 4.45e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.21  E-value: 4.45e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063703982 328 SYNIWLQGLVRHGKFIEAETVLKQMTDKGIGPSI 361
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
630-664 4.76e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.21  E-value: 4.76e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063703982 630 CTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNV 664
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
142-173 5.24e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.10  E-value: 5.24e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063703982 142 GIAPQTYTFNLLIRALCDSSCVDAARELFDEM 173
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
183-213 5.44e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 5.44e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063703982 183 FTFGILVRGYCKAGLTDKGLELLNAMESFGV 213
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
218-248 6.76e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 6.76e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063703982 218 VIYNTIVSSFCREGRNDDSEKMVEKMREEGL 248
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
292-320 7.75e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.44  E-value: 7.75e-04
                          10        20
                  ....*....|....*....|....*....
gi 1063703982 292 ITYNLMLKGFCKVGLLEDAKTLFESIREN 320
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEK 29
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
467-496 8.85e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 8.85e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063703982 467 VTCNIIVDGLCGSGELDKAIEIVKGMRVHG 496
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERG 30
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
328-357 1.00e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 1.00e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063703982 328 SYNIWLQGLVRHGKFIEAETVLKQMTDKGI 357
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
238-301 1.19e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.11  E-value: 1.19e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1063703982 238 KMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDeylGLpRPNSITYNLMLKGF 301
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKK---GI-KPTLDTYNAILGVI 60
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
417-478 1.37e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 37.72  E-value: 1.37e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063703982 417 SLLQEMMRNNCLPNAYTCNILLHSLWKMGRISEAEELLRKMNEKGYGLDTVTCNIIVdGLCG 478
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVIG 61
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
292-321 1.42e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.05  E-value: 1.42e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063703982 292 ITYNLMLKGFCKVGLLEDAKTLFESIREND 321
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERG 30
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
531-662 1.46e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 40.84  E-value: 1.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 531 LNGLCKAGRFAEAKNLFAEMMGEKLQPDSVAYNIFIHhFC----------KQGKISSAFRVLKDMEKKGCHKSLETYNSL 600
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCskaadatdlkPQLAADRGFEVFEAMKAQGVSPNEATYTAV 96
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063703982 601 IlGLGIKNQIFEI-HGLMDEMKEKGISPNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAP 662
Cdd:pfam17177  97 A-RLAAAKGDGDLaFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
169-227 1.54e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 37.72  E-value: 1.54e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1063703982 169 LFDEMPEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVSSF 227
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
596-629 2.23e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.28  E-value: 2.23e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063703982 596 TYNSLILGLGIKNQIFEIHGLMDEMKEKGISPNI 629
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
432-461 3.26e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.90  E-value: 3.26e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063703982 432 YTCNILLHSLWKMGRISEAEELLRKMNEKG 461
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
253-279 4.21e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.52  E-value: 4.21e-03
                          10        20
                  ....*....|....*....|....*..
gi 1063703982 253 VTFNSRISALCKEGKVLDASRIFSDME 279
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMK 27
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
338-457 4.78e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 39.30  E-value: 4.78e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063703982 338 RHGKFIEAETVLKQMTDKGIGPSIYSYNILMDGLCKLGMLSDAKTIVGL---------MKRNGVCPDAVTYGCLLHGYCS 408
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLYLCSKAADATDLKPQLAAdrgfevfeaMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 1063703982 409 VGKVDAAKSLLQEMMRNNCLPNAYTCNILLHSLWKMGRISEAEELLRKM 457
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHM 151
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH