NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|18407744|ref|NP_564809|]
View 

Tetratricopeptide repeat (TPR)-like superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 13595575)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
168-600 4.40e-37

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 148.49  E-value: 4.40e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   168 RISDAVALVDQMVEMG------------YK--------PDTFTFTTLIHglflhnkaseavalvdqmvqrgcQPDLVTYG 227
Cdd:PLN03218  385 RIKDCIDLLEDMEKRGlldmdkiyhakfFKackkqravKEAFRFAKLIR-----------------------NPTLSTFN 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   228 TVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLINCLCNYG 307
Cdd:PLN03218  442 MLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAG 521
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   308 RWSDASRLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRS--IDPDTITYNLLINGFCMHNRLDEAKQM 385
Cdd:PLN03218  522 QVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEV 601
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   386 FKfmvskdclpNIQTYNtlINGFCKCkrvedgvelfremsqrglvgntvtYTTIIQGFFQAGDCDSAQMVFKQMVSNRVP 465
Cdd:PLN03218  602 YQ---------MIHEYN--IKGTPEV------------------------YTIAVNSCSQKGDWDFALSIYDDMKKKGVK 646
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   466 TDIMTYSILLHGLCSYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKVGEAWDLFC---SLSIKPDVVTYNTMI 542
Cdd:PLN03218  647 PDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEdikSIKLRPTVSTMNALI 726
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 18407744   543 SGLCSKRLLQEADDLFRKMKEDGTLPNSGTYNTLIRANLRDCDRAASAELIKEMRSSG 600
Cdd:PLN03218  727 TALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDG 784
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
117-165 5.79e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.98  E-value: 5.79e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18407744   117 DLYTYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGYCH 165
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
81-130 4.53e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.59  E-value: 4.53e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 18407744    81 PSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCR 130
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
168-600 4.40e-37

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 148.49  E-value: 4.40e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   168 RISDAVALVDQMVEMG------------YK--------PDTFTFTTLIHglflhnkaseavalvdqmvqrgcQPDLVTYG 227
Cdd:PLN03218  385 RIKDCIDLLEDMEKRGlldmdkiyhakfFKackkqravKEAFRFAKLIR-----------------------NPTLSTFN 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   228 TVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLINCLCNYG 307
Cdd:PLN03218  442 MLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAG 521
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   308 RWSDASRLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRS--IDPDTITYNLLINGFCMHNRLDEAKQM 385
Cdd:PLN03218  522 QVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEV 601
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   386 FKfmvskdclpNIQTYNtlINGFCKCkrvedgvelfremsqrglvgntvtYTTIIQGFFQAGDCDSAQMVFKQMVSNRVP 465
Cdd:PLN03218  602 YQ---------MIHEYN--IKGTPEV------------------------YTIAVNSCSQKGDWDFALSIYDDMKKKGVK 646
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   466 TDIMTYSILLHGLCSYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKVGEAWDLFC---SLSIKPDVVTYNTMI 542
Cdd:PLN03218  647 PDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEdikSIKLRPTVSTMNALI 726
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 18407744   543 SGLCSKRLLQEADDLFRKMKEDGTLPNSGTYNTLIRANLRDCDRAASAELIKEMRSSG 600
Cdd:PLN03218  727 TALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDG 784
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
326-374 1.24e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.94  E-value: 1.24e-16
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18407744   326 PNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDTITYNLLINGFC 374
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
117-165 5.79e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.98  E-value: 5.79e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18407744   117 DLYTYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGYCH 165
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
364-398 4.48e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.29  E-value: 4.48e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   364 ITYNLLINGFCMHNRLDEAKQMFKFMVSKDCLPNI 398
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
81-130 4.53e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.59  E-value: 4.53e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 18407744    81 PSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCR 130
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
119-153 2.03e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 2.03e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   119 YTYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDI 153
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
307-458 2.48e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 43.18  E-value: 2.48e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744 307 GRWSDASRLLSNMLEkkINP-NVVTFNALIDAFFKEGKLVEAEKLHEEMIQRsiDPDTITYNLLINGFCMH-NRLDEAKQ 384
Cdd:COG2956  22 GQPDKAIDLLEEALE--LDPeTVEAHLALGNLYRRRGEYDRAIRIHQKLLER--DPDRAEALLELAQDYLKaGLLDRAEE 97
                        90       100       110       120       130       140       150
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 18407744 385 MFKFMVSKDCLpNIQTYNTLINGFCKCKRVEDGVELFREMSQRGLvGNTVTYTTIIQGFFQAGDCDSAQMVFKQ 458
Cdd:COG2956  98 LLEKLLELDPD-DAEALRLLAEIYEQEGDWEKAIEVLERLLKLGP-ENAHAYCELAELYLEQGDYDEAIEALEK 169
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
168-600 4.40e-37

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 148.49  E-value: 4.40e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   168 RISDAVALVDQMVEMG------------YK--------PDTFTFTTLIHglflhnkaseavalvdqmvqrgcQPDLVTYG 227
Cdd:PLN03218  385 RIKDCIDLLEDMEKRGlldmdkiyhakfFKackkqravKEAFRFAKLIR-----------------------NPTLSTFN 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   228 TVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLINCLCNYG 307
Cdd:PLN03218  442 MLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAG 521
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   308 RWSDASRLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRS--IDPDTITYNLLINGFCMHNRLDEAKQM 385
Cdd:PLN03218  522 QVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEV 601
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   386 FKfmvskdclpNIQTYNtlINGFCKCkrvedgvelfremsqrglvgntvtYTTIIQGFFQAGDCDSAQMVFKQMVSNRVP 465
Cdd:PLN03218  602 YQ---------MIHEYN--IKGTPEV------------------------YTIAVNSCSQKGDWDFALSIYDDMKKKGVK 646
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   466 TDIMTYSILLHGLCSYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKVGEAWDLFC---SLSIKPDVVTYNTMI 542
Cdd:PLN03218  647 PDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEdikSIKLRPTVSTMNALI 726
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 18407744   543 SGLCSKRLLQEADDLFRKMKEDGTLPNSGTYNTLIRANLRDCDRAASAELIKEMRSSG 600
Cdd:PLN03218  727 TALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDG 784
PLN03077 PLN03077
Protein ECB2; Provisional
66-587 2.53e-34

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 139.60  E-value: 2.53e-34
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   66 DAVDLFGDMVKSRPFpsivEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQLSLALAVLAKMM 145
Cdd:PLN03077 139 HAWYVFGKMPERDLF----SWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVV 214
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  146 KLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQMVEMgykpDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVT 225
Cdd:PLN03077 215 RFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMT 290
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  226 YGTVVNGLCKRGDIDLAlnllnkmeaARIKANVVifntiidslckyrhvevavdlftemeTKGIRPNVVTYNSLINCLCN 305
Cdd:PLN03077 291 ITSVISACELLGDERLG---------REMHGYVV--------------------------KTGFAVDVSVCNSLIQMYLS 335
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  306 YGRWSDASRLLSNMLEKkinpNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDTITYNLLINGFCMHNRLDEAKQM 385
Cdd:PLN03077 336 LGSWGEAEKVFSRMETK----DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKL 411
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  386 FKFMVSKDCLPNIQTYNTLINGFCKCKRVEDGVELFREMSQRglvgNTVTYTTIIQGFFQAGDCDSAQMVFKQMVSNRVP 465
Cdd:PLN03077 412 HELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEK----DVISWTSIIAGLRLNNRCFEALIFFRQMLLTLKP 487
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  466 TDImTYSILLHGLCSYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKVGEAWDLFCSLsiKPDVVTYNTMISGL 545
Cdd:PLN03077 488 NSV-TLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFNSH--EKDVVSWNILLTGY 564
                        490       500       510       520
                 ....*....|....*....|....*....|....*....|..
gi 18407744  546 CSKRLLQEADDLFRKMKEDGTLPNSGTYNTLIRAnlrdCDRA 587
Cdd:PLN03077 565 VAHGKGSMAVELFNRMVESGVNPDEVTFISLLCA----CSRS 602
PLN03218 PLN03218
maturation of RBCL 1; Provisional
86-479 1.73e-31

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 131.15  E-value: 1.73e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744    86 FNKLLSAvakmNKFELVISLGEQMQTLGI--SHDLYTYSIFINCfcrRSQLSLALAV-LAKMMKlgyEPDIVTLSSLLNG 162
Cdd:PLN03218  377 YNRLLRD----GRIKDCIDLLEDMEKRGLldMDKIYHAKFFKAC---KKQRAVKEAFrFAKLIR---NPTLSTFNMLMSV 446
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   163 YCHSKRISDAVALVDQMVEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLA 242
Cdd:PLN03218  447 CASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKA 526
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   243 LNLLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEM--ETKGIRPNVVTYNSLINCLCNYGR------------ 308
Cdd:PLN03218  527 FGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMkaETHPIDPDHITVGALMKACANAGQvdrakevyqmih 606
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   309 -----------------------WSDASRLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDTIT 365
Cdd:PLN03218  607 eynikgtpevytiavnscsqkgdWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVS 686
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   366 YNLLINGFCMHNRLDEAKQMFKFMVSKDCLPNIQTYNTLINGFCKCKRVEDGVELFREMSQRGLVGNTVTYTTIIQGFFQ 445
Cdd:PLN03218  687 YSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASER 766
                         410       420       430
                  ....*....|....*....|....*....|....*
gi 18407744   446 AGDCDSAQMVFKQMVSNRV-PTDIMTYSILlhGLC 479
Cdd:PLN03218  767 KDDADVGLDLLSQAKEDGIkPNLVMCRCIT--GLC 799
PLN03077 PLN03077
Protein ECB2; Provisional
157-627 7.80e-26

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 113.02  E-value: 7.80e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  157 SSLLNGYCHSKRISDAVALVDQMVEMGYKPDTFTFTTLIH----------GLFLHNKASEAV---------ALVDQMVQR 217
Cdd:PLN03077  55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRlcewkraveeGSRVCSRALSSHpslgvrlgnAMLSMFVRF 134
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  218 G------------CQPDLVTYGTVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTIIDS------LCKYRHVEVAVD 279
Cdd:PLN03077 135 GelvhawyvfgkmPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTcggipdLARGREVHAHVV 214
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  280 LFtemetkGIRPNVVTYNSLINCLCNYGRWSDASRLLSNMLEKKInpnvVTFNALIDAFFKEGKLVEAEKLHEEMIQRSI 359
Cdd:PLN03077 215 RF------GFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDC----ISWNAMISGYFENGECLEGLELFFTMRELSV 284
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  360 DPDTITYNLLINGFCMHNRLDEAKQMFKFMVSKDCLPNIQTYNTLINGFCKCKRVEDGVELFREMSQRglvgNTVTYTTI 439
Cdd:PLN03077 285 DPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAM 360
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  440 IQGFFQAGDCDSAQMVFKQMVSNRVPTDIMTYSILLHGLCSYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKV 519
Cdd:PLN03077 361 ISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCI 440
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  520 GEAWDLFCSLSIKpDVVTYNTMISGLCSKRLLQEADDLFRKMKEDgTLPNSgtyNTLIRAnLRDCDRAASAELIKEMRSS 599
Cdd:PLN03077 441 DKALEVFHNIPEK-DVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNS---VTLIAA-LSACARIGALMCGKEIHAH 514
                        490       500
                 ....*....|....*....|....*...
gi 18407744  600 gfvgdastiSLVTNMLHDGRLDKSFLNM 627
Cdd:PLN03077 515 ---------VLRTGIGFDGFLPNALLDL 533
PLN03218 PLN03218
maturation of RBCL 1; Provisional
89-410 1.52e-25

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 112.28  E-value: 1.52e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744    89 LLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGYCHSKR 168
Cdd:PLN03218  478 LISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGA 557
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   169 ISDAValvDQMVEMG-----YKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLAL 243
Cdd:PLN03218  558 VDRAF---DVLAEMKaethpIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFAL 634
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   244 NLLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLINCLCNYGRWSDASRLLSNMLEKK 323
Cdd:PLN03218  635 SIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIK 714
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   324 INPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDTITYNLLingFCMHNRLDEAKQMFKFMVS--KDCL-PNIQT 400
Cdd:PLN03218  715 LRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSIL---LVASERKDDADVGLDLLSQakEDGIkPNLVM 791
                         330
                  ....*....|
gi 18407744   401 YNTLInGFCK 410
Cdd:PLN03218  792 CRCIT-GLCL 800
PLN03077 PLN03077
Protein ECB2; Provisional
72-544 9.50e-24

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 106.47  E-value: 9.50e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   72 GDMVKSR------PFPSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQLSLALAVLAKMM 145
Cdd:PLN03077 236 GDVVSARlvfdrmPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVV 315
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  146 KLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQMvemgYKPDTFTFTTLIHGL---FLHNKASEAVALvdqMVQRGCQPD 222
Cdd:PLN03077 316 KTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRM----ETKDAVSWTAMISGYeknGLPDKALETYAL---MEQDNVSPD 388
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  223 LVTYGTVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKgirpNVVTYNSLINC 302
Cdd:PLN03077 389 EITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEK----DVISWTSIIAG 464
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  303 LCNYGRWSDASRLLSNMLeKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDtitynllinGFcmhnrldea 382
Cdd:PLN03077 465 LRLNNRCFEALIFFRQML-LTLKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFD---------GF--------- 525
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  383 kqmfkfmvskdcLPniqtyNTLINGFCKCKRVEDGVELFREMSQrglvgNTVTYTTIIQGFFQAGDCDSAQMVFKQMVSN 462
Cdd:PLN03077 526 ------------LP-----NALLDLYVRCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSMAVELFNRMVES 583
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  463 RVPTDIMTYSILLHGLCSYGKLDTALVIFKYLQ-KSEMELNIFIYNTMIEGMCKAGKVGEAWDLFCSLSIKPDVVTYNTM 541
Cdd:PLN03077 584 GVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEeKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGAL 663

                 ...
gi 18407744  542 ISG 544
Cdd:PLN03077 664 LNA 666
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
120-492 6.95e-23

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 103.41  E-value: 6.95e-23
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  120 TYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQMVEMgykpDTFTFTTLIHGLF 199
Cdd:PLN03081 125 TYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPER----NLASWGTIIGGLV 200
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  200 LHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVD 279
Cdd:PLN03081 201 DAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARC 280
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  280 LFTEMETKgirpNVVTYNSLINCLCNYGRWSDASRLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSI 359
Cdd:PLN03081 281 VFDGMPEK----TTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGF 356
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  360 DPDTITYNLLINGFCMHNRLDEAKQMFKFMVSKdclpNIQTYNTLINGFCKCKRVEDGVELFREMSQRGLVGNTVTYTTI 439
Cdd:PLN03081 357 PLDIVANTALVDLYSKWGRMEDARNVFDRMPRK----NLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAV 432
                        330       340       350       360       370
                 ....*....|....*....|....*....|....*....|....*....|....
gi 18407744  440 IQGFFQAGDCDSAQMVFKQMVSN-RVPTDIMTYSILLHGLCSYGKLDTALVIFK 492
Cdd:PLN03081 433 LSACRYSGLSEQGWEIFQSMSENhRIKPRAMHYACMIELLGREGLLDEAYAMIR 486
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
233-579 5.02e-18

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 88.00  E-value: 5.02e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  233 LCKRgdIDLALNLLNKMEA-ARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLINCLCNYGRWSD 311
Cdd:PLN03081  99 ACGR--HREALELFEILEAgCPFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLID 176
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  312 ASRLLSNMLEKkinpNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDTITYNLLINGFCMHNRLDEAKQMFKFMVS 391
Cdd:PLN03081 177 ARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLK 252
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  392 KDCLPNIQTYNTLINGFCKCKRVEDGVELFREMSQRglvgNTVTYTTIIQGFFQAGDCDSAQMVFKQMVSNRVPTDIMTY 471
Cdd:PLN03081 253 TGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEK----TTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTF 328
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  472 SILLHGLCSYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKVGEAWDLFCSLSIKpDVVTYNTMISGLCSKRLL 551
Cdd:PLN03081 329 SIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRK-NLISWNALIAGYGNHGRG 407
                        330       340
                 ....*....|....*....|....*...
gi 18407744  552 QEADDLFRKMKEDGTLPNSGTYNTLIRA 579
Cdd:PLN03081 408 TKAVEMFERMIAEGVAPNHVTFLAVLSA 435
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
195-577 2.29e-17

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 86.08  E-value: 2.29e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  195 IHGLFLHNKASEAVALVdQMVQRGCQPDL--VTYGTVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTIIDSLCKYR 272
Cdd:PLN03081  94 IEKLVACGRHREALELF-EILEAGCPFTLpaSTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCG 172
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  273 HVEVAVDLFTEMETKgirpNVVTYNSLINCLCNYGRWSDASRLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHE 352
Cdd:PLN03081 173 MLIDARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHC 248
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  353 EMIQRSIDPDTITYNLLINGFCMHNRLDEAKQMFkfmvskDCLPNIQT--YNTLINGFCKCKRVEDGVELFREMSQRGLV 430
Cdd:PLN03081 249 CVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVF------DGMPEKTTvaWNSMLAGYALHGYSEEALCLYYEMRDSGVS 322
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  431 GNTVTYTTIIQGFFQAGDCDSAQMVFKQMVSNRVPTDIMTYSILLHGLCSYGKLDTALVIFKYLQKSemelNIFIYNTMI 510
Cdd:PLN03081 323 IDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRK----NLISWNALI 398
                        330       340       350       360       370       380       390
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 18407744  511 EGMCKAGKVGEAWDLFCSL---SIKPDVVTYNTMISGLCSKRLLQEADDLFRKMKED-GTLPNSGTYNTLI 577
Cdd:PLN03081 399 AGYGNHGRGTKAVEMFERMiaeGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSENhRIKPRAMHYACMI 469
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
326-374 1.24e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.94  E-value: 1.24e-16
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18407744   326 PNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDTITYNLLINGFC 374
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
361-410 1.64e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.55  E-value: 1.64e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 18407744   361 PDTITYNLLINGFCMHNRLDEAKQMFKFMVSKDCLPNIQTYNTLINGFCK 410
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
257-305 1.96e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.17  E-value: 1.96e-16
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18407744   257 NVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLINCLCN 305
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
64-304 4.95e-16

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 82.23  E-value: 4.95e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744    64 VDDAVDLFGDM-VKSRPF-PSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQLSLALAVL 141
Cdd:PLN03218  558 VDRAFDVLAEMkAETHPIdPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIY 637
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   142 AKMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQMVEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQP 221
Cdd:PLN03218  638 DDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRP 717
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   222 DLVTYGTVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLIN 301
Cdd:PLN03218  718 TVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITG 797

                  ...
gi 18407744   302 cLC 304
Cdd:PLN03218  798 -LC 799
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
63-430 5.81e-15

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 78.37  E-value: 5.81e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   63 KVDDAVDLFGDMvksrPFPSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQLSLALAVLA 142
Cdd:PLN03081 274 DIEDARCVFDGM----PEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHA 349
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  143 KMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQMVemgyKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPD 222
Cdd:PLN03081 350 GLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMP----RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPN 425
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  223 LVTYGTVVNGLCKRGDIDLALNLLNKM-EAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKgirPNVVTYNSLIN 301
Cdd:PLN03081 426 HVTFLAVLSACRYSGLSEQGWEIFQSMsENHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFK---PTVNMWAALLT 502
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  302 ClCNYGRWSDASRL----LSNMLEKKINPNVVTFNALIDAffkeGKLVEAEKLHEEMIQRsidpdtitynllinGFCMHN 377
Cdd:PLN03081 503 A-CRIHKNLELGRLaaekLYGMGPEKLNNYVVLLNLYNSS----GRQAEAAKVVETLKRK--------------GLSMHP 563
                        330       340       350       360       370
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 18407744  378 R---LDEAKQMFKFMVSKDCLP-NIQTYntlingfckcKRVEdgvELFREMSQRGLV 430
Cdd:PLN03081 564 ActwIEVKKQDHSFFSGDRLHPqSREIY----------QKLD---ELMKEISEYGYV 607
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
533-579 1.01e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 68.54  E-value: 1.01e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 18407744   533 PDVVTYNTMISGLCSKRLLQEADDLFRKMKEDGTLPNSGTYNTLIRA 579
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILING 47
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
291-340 1.26e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 68.16  E-value: 1.26e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 18407744   291 PNVVTYNSLINCLCNYGRWSDASRLLSNMLEKKINPNVVTFNALIDAFFK 340
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
396-443 2.29e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.39  E-value: 2.29e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 18407744   396 PNIQTYNTLINGFCKCKRVEDGVELFREMSQRGLVGNTVTYTTIIQGF 443
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
502-546 2.68e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.39  E-value: 2.68e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 18407744   502 NIFIYNTMIEGMCKAGKVGEAWDLFC---SLSIKPDVVTYNTMISGLC 546
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNemkKRGVKPNVYTYTILINGLC 49
PLN03077 PLN03077
Protein ECB2; Provisional
65-429 5.27e-14

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 75.66  E-value: 5.27e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   65 DDAVDLFGDMVKSRPFPSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQLSLALAVLAKM 144
Cdd:PLN03077 371 DKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNI 450
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  145 MklgyEPDIVTLSSLLNGYCHSKRISDAVALVDQMVeMGYKPDTFTFTTLIH-----GLFLHNKASEAVALVDQMVQRGC 219
Cdd:PLN03077 451 P----EKDVISWTSIIAGLRLNNRCFEALIFFRQML-LTLKPNSVTLIAALSacariGALMCGKEIHAHVLRTGIGFDGF 525
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  220 QPDlvtygTVVNGLCKRGDIDLALNLLNKMEAarikaNVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSL 299
Cdd:PLN03077 526 LPN-----ALLDLYVRCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISL 595
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  300 InCLCNY-GRWSDASRLLSNMLEK-KINPNVVTFNALIDAFFKEGKLVEAEKLHEEMiqrSIDPDTITYNLLINGFCMHN 377
Cdd:PLN03077 596 L-CACSRsGMVTQGLEYFHSMEEKySITPNLKHYACVVDLLGRAGKLTEAYNFINKM---PITPDPAVWGALLNACRIHR 671
                        330       340       350       360       370
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 18407744  378 RLD----EAKQMFKfMVSKDclpnIQTYNTLINGFCKCKRVEDGVELFREMSQRGL 429
Cdd:PLN03077 672 HVElgelAAQHIFE-LDPNS----VGYYILLCNLYADAGKWDEVARVRKTMRENGL 722
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
186-235 9.35e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 62.77  E-value: 9.35e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 18407744   186 PDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCK 235
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
321-599 2.35e-12

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 69.90  E-value: 2.35e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  321 EKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEM-IQRSIDPDTITYNLLINGFCMHNRLDEAKQMFKFMVSKDCLPNIQ 399
Cdd:PLN03081  80 DTQIRKSGVSLCSQIEKLVACGRHREALELFEILeAGCPFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQY 159
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  400 TYNTLINGFCKCKRVEDGVELFREMSQRglvgNTVTYTTIIQGFFQAGDCDSAQMVFKQMVSNRVPTDIMTYSILLHGLC 479
Cdd:PLN03081 160 MMNRVLLMHVKCGMLIDARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASA 235
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744  480 SYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKVGEAWDLFCSLSIKpDVVTYNTMISGLCSKRLLQEADDLFR 559
Cdd:PLN03081 236 GLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEK-TTVAWNSMLAGYALHGYSEEALCLYY 314
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|
gi 18407744  560 KMKEDGTLPNSGTYNTLIRAnlrdCDRAASAELIKEMRSS 599
Cdd:PLN03081 315 EMRDSGVSIDQFTFSIMIRI----FSRLALLEHAKQAHAG 350
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
151-198 5.48e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 60.84  E-value: 5.48e-12
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 18407744   151 PDIVTLSSLLNGYCHSKRISDAVALVDQMVEMGYKPDTFTFTTLIHGL 198
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PLN03218 PLN03218
maturation of RBCL 1; Provisional
397-595 5.77e-12

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 69.14  E-value: 5.77e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   397 NIQTYNTLINGFCKCKRVEDGVELFREMSQRGL--------------------VGNTVTYTTII-----QGF-------F 444
Cdd:PLN03218  369 KSPEYIDAYNRLLRDGRIKDCIDLLEDMEKRGLldmdkiyhakffkackkqraVKEAFRFAKLIrnptlSTFnmlmsvcA 448
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   445 QAGDCDSAQMVFKQMVSNRVPTDIMTYSILLHGLCSYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCKAGKVGE--- 521
Cdd:PLN03218  449 SSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKafg 528
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 18407744   522 AWDLFCSLSIKPDVVTYNTMISGLCSKRLLQEADDLFRKMKEDGT--LPNSGTYNTLIRAnlrdCDRAASAELIKE 595
Cdd:PLN03218  529 AYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAETHpiDPDHITVGALMKA----CANAGQVDRAKE 600
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
280-341 3.30e-11

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 58.91  E-value: 3.30e-11
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 18407744   280 LFTEMETKGIRPNVVTYNSLINCLCNYGRWSDASRLLSNMLEKKINPNVVTFNALIDAFFKE 341
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
432-479 5.45e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 57.76  E-value: 5.45e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 18407744   432 NTVTYTTIIQGFFQAGDCDSAQMVFKQMVSNRVPTDIMTYSILLHGLC 479
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
221-270 9.06e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 57.37  E-value: 9.06e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 18407744   221 PDLVTYGTVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTIIDSLCK 270
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
392-425 1.37e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 56.20  E-value: 1.37e-10
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18407744   392 KDCLPNIQTYNTLINGFCKCKRVEDGVELFREMS 425
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
217-250 2.23e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 55.81  E-value: 2.23e-10
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18407744   217 RGCQPDLVTYGTVVNGLCKRGDIDLALNLLNKME 250
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
357-390 2.29e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 52.73  E-value: 2.29e-09
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18407744   357 RSIDPDTITYNLLINGFCMHNRLDEAKQMFKFMV 390
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
531-562 2.81e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 52.73  E-value: 2.81e-09
                          10        20        30
                  ....*....|....*....|....*....|..
gi 18407744   531 IKPDVVTYNTMISGLCSKRLLQEADDLFRKMK 562
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
117-165 5.79e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.98  E-value: 5.79e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18407744   117 DLYTYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGYCH 165
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
287-319 1.70e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.42  E-value: 1.70e-08
                          10        20        30
                  ....*....|....*....|....*....|...
gi 18407744   287 KGIRPNVVTYNSLINCLCNYGRWSDASRLLSNM 319
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
324-354 2.35e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.04  E-value: 2.35e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18407744   324 INPNVVTFNALIDAFFKEGKLVEAEKLHEEM 354
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
467-515 7.54e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.90  E-value: 7.54e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18407744   467 DIMTYSILLHGLCSYGKLDTALVIFKYLQKSEMELNIFIYNTMIEGMCK 515
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
314-373 8.43e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 49.28  E-value: 8.43e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   314 RLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDTITYNLLINGF 373
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
376-501 1.59e-07

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 52.40  E-value: 1.59e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   376 HNRLDEAKQMFKFMVSKDCLPNIQTYNTLINgFC----------KCKRVEDGVELFREMSQRGLVGNTVTYTTIIQGFFQ 445
Cdd:pfam17177  24 HADATGALALYDAAKAEGVRLAQYHYNVLLY-LCskaadatdlkPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 18407744   446 AGDCDSAQMVFKQMVSNRVPTDIMTYSILLHGLCSYGKLDTALVIFKYLQKSEMEL 501
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
536-565 2.91e-07

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 46.69  E-value: 2.91e-07
                          10        20        30
                  ....*....|....*....|....*....|
gi 18407744   536 VTYNTMISGLCSKRLLQEADDLFRKMKEDG 565
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
182-215 3.78e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 46.57  E-value: 3.78e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18407744   182 MGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMV 215
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
364-398 4.48e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.29  E-value: 4.48e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   364 ITYNLLINGFCMHNRLDEAKQMFKFMVSKDCLPNI 398
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
81-130 4.53e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.59  E-value: 4.53e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 18407744    81 PSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCR 130
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
400-433 8.55e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.52  E-value: 8.55e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18407744   400 TYNTLINGFCKCKRVEDGVELFREMSQRGLVGNT 433
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
400-429 1.15e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 45.15  E-value: 1.15e-06
                          10        20        30
                  ....*....|....*....|....*....|
gi 18407744   400 TYNTLINGFCKCKRVEDGVELFREMSQRGL 429
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
294-328 1.17e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.14  E-value: 1.17e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   294 VTYNSLINCLCNYGRWSDASRLLSNMLEKKINPNV 328
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
245-301 1.21e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 46.20  E-value: 1.21e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 18407744   245 LLNKMEAARIKANVVIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLIN 301
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PLN03218 PLN03218
maturation of RBCL 1; Provisional
64-272 1.24e-06

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 51.80  E-value: 1.24e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744    64 VDDAVDLFGDMVKSRPFPSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQLSLALAVLAK 143
Cdd:PLN03218  630 WDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYED 709
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   144 MMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQMVEMGYKPDTFTFTTLIhglflhnKASEavalvdqmvqrgcqpdl 223
Cdd:PLN03218  710 IKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILL-------VASE----------------- 765
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 18407744   224 vtygtvvnglcKRGDIDLALNLLNKMEAARIKANVVIFNTIIdSLCKYR 272
Cdd:PLN03218  766 -----------RKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLCLRR 802
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
536-570 1.33e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.14  E-value: 1.33e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   536 VTYNTMISGLCSKRLLQEADDLFRKMKEDGTLPNS 570
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
499-526 3.57e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.87  E-value: 3.57e-06
                          10        20
                  ....*....|....*....|....*...
gi 18407744   499 MELNIFIYNTMIEGMCKAGKVGEAWDLF 526
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELL 30
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
329-363 3.65e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.98  E-value: 3.65e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   329 VTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDPDT 363
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
427-460 3.71e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.87  E-value: 3.71e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18407744   427 RGLVGNTVTYTTIIQGFFQAGDCDSAQMVFKQMV 460
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
259-293 4.66e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.60  E-value: 4.66e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   259 VIFNTIIDSLCKYRHVEVAVDLFTEMETKGIRPNV 293
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
210-265 4.88e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 44.27  E-value: 4.88e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 18407744   210 LVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLALNLLNKMEAARIKANVVIFNTII 265
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
254-285 6.06e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.10  E-value: 6.06e-06
                          10        20        30
                  ....*....|....*....|....*....|..
gi 18407744   254 IKANVVIFNTIIDSLCKYRHVEVAVDLFTEME 285
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
148-180 6.43e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.10  E-value: 6.43e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 18407744   148 GYEPDIVTLSSLLNGYCHSKRISDAVALVDQMV 180
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
329-359 6.44e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.84  E-value: 6.44e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18407744   329 VTFNALIDAFFKEGKLVEAEKLHEEMIQRSI 359
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
349-408 8.38e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.50  E-value: 8.38e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   349 KLHEEMIQRSIDPDTITYNLLINGFCMHNRLDEAKQMFKFMVSKDCLPNIQTYNTLINGF 408
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
235-361 1.15e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 46.62  E-value: 1.15e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   235 KRGDIDLALNLLNKMEAARIKANVVIFNTII---------DSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLINcLCN 305
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLylcskaadaTDLKPQLAADRGFEVFEAMKAQGVSPNEATYTAVAR-LAA 101
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 18407744   306 YGRWSD-ASRLLSNMLEKKINPNVVTFNALIDAFFKEGKLVEAEKLHEEMIQRSIDP 361
Cdd:pfam17177 102 AKGDGDlAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
364-394 1.19e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.45  E-value: 1.19e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18407744   364 ITYNLLINGFCMHNRLDEAKQMFKFMVSKDC 394
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
420-475 1.29e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.12  E-value: 1.29e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 18407744   420 LFREMSQRGLVGNTVTYTTIIQGFFQAGDCDSAQMVFKQMVSNRVPTDIMTYSILL 475
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
154-188 1.38e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.06  E-value: 1.38e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   154 VTLSSLLNGYCHSKRISDAVALVDQMVEMGYKPDT 188
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
119-153 2.03e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 2.03e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   119 YTYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDI 153
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
531-579 2.19e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.35  E-value: 2.19e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 18407744   531 IKPDVVTYNTMISGLCSKRLLQEADDLFRKMKEDGTLPNSGTYNTLIRA 579
Cdd:pfam13812  11 IQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGV 59
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
224-258 4.72e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 4.72e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   224 VTYGTVVNGLCKRGDIDLALNLLNKMEAARIKANV 258
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
175-236 5.63e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 41.19  E-value: 5.63e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 18407744   175 LVDQMVEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKR 236
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
189-222 8.19e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 8.19e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18407744   189 FTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPD 222
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
504-535 9.67e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.75  E-value: 9.67e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   504 FIYNTMIEGMCKAGKVGEAWDLFCSL---SIKPDV 535
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMkerGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
104-163 2.41e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 39.65  E-value: 2.41e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   104 SLGEQMQTLGISHDLYTYSIFINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLSSLLNGY 163
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
307-458 2.48e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 43.18  E-value: 2.48e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744 307 GRWSDASRLLSNMLEkkINP-NVVTFNALIDAFFKEGKLVEAEKLHEEMIQRsiDPDTITYNLLINGFCMH-NRLDEAKQ 384
Cdd:COG2956  22 GQPDKAIDLLEEALE--LDPeTVEAHLALGNLYRRRGEYDRAIRIHQKLLER--DPDRAEALLELAQDYLKaGLLDRAEE 97
                        90       100       110       120       130       140       150
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 18407744 385 MFKFMVSKDCLpNIQTYNTLINGFCKCKRVEDGVELFREMSQRGLvGNTVTYTTIIQGFFQAGDCDSAQMVFKQ 458
Cdd:COG2956  98 LLEKLLELDPD-DAEALRLLAEIYEQEGDWEKAIEVLERLLKLGP-ENAHAYCELAELYLEQGDYDEAIEALEK 169
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
201-308 3.45e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 42.38  E-value: 3.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   201 HNKASEAVALVDQMVQRGCQPDLVTYGTVVNgLCKRGDIDLALNLLNK----------MEAARIKANVVIFNTIIDSLCK 270
Cdd:pfam17177  24 HADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKAADATDLKPQLAadrgfevfeaMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 18407744   271 YRHVEVAVDLFTEMETKGIRPNVVTYNSLINCLCNYGR 308
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGD 140
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
479-600 3.85e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 42.00  E-value: 3.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   479 CS-YGKLDTALVIFKYLQKSEMELNIFIYNTMIEgmckagkvgeawdlFCSLSIKPDVVTYNtmisgLCSKRLLQeaddL 557
Cdd:pfam17177  21 CSkHADATGALALYDAAKAEGVRLAQYHYNVLLY--------------LCSKAADATDLKPQ-----LAADRGFE----V 77
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 18407744   558 FRKMKEDGTLPNSGTYNTLIR--ANLRDCDRAASaeLIKEMRSSG 600
Cdd:pfam17177  78 FEAMKAQGVSPNEATYTAVARlaAAKGDGDLAFD--LVKEMEAAG 120
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
294-324 5.03e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 5.03e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18407744   294 VTYNSLINCLCNYGRWSDASRLLSNMLEKKI 324
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
434-464 6.49e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.44  E-value: 6.49e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18407744   434 VTYTTIIQGFFQAGDCDSAQMVFKQMVSNRV 464
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
259-289 7.44e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 7.44e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18407744   259 VIFNTIIDSLCKYRHVEVAVDLFTEMETKGI 289
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
61-160 1.06e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 40.84  E-value: 1.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744    61 IIKVDDAVDLFGDMVKSRPFPSIVEFNKLLSAVAKMNKFELVISLGEQMQTLGISHDLYTYSIFINCFCRRSQLSLALAV 140
Cdd:pfam17177  68 QLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEV 147
                          90       100
                  ....*....|....*....|
gi 18407744   141 LAKMMKLGYEPDIVTLSSLL 160
Cdd:pfam17177 148 EEHMLAHGVELEEPELAALL 167
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
467-500 1.21e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 36.55  E-value: 1.21e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 18407744   467 DIMTYSILLHGLCSYGKLDTALVIFKylqksEME 500
Cdd:pfam12854   6 DVVTYNTLINGLCRAGRVDEAFELLD-----EME 34
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
265-382 1.62e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 40.46  E-value: 1.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   265 IDSLCKYRHVEVAVDLFTEMETKGIRPNVVTYNSLINcLCNYGRWSDAS----------RLLSNMLEKKINPNVVTFNAL 334
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKAADATDLkpqlaadrgfEVFEAMKAQGVSPNEATYTAV 96
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 18407744   335 IDAFFKEGKLVEAEKLHEEMIQRSIDPDTITYNLLINGFCMHNRLDEA 382
Cdd:pfam17177  97 ARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKA 144
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
224-254 1.73e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 36.29  E-value: 1.73e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18407744   224 VTYGTVVNGLCKRGDIDLALNLLNKMEAARI 254
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
MRP-S27 pfam10037
Mitochondrial 28S ribosomal protein S27; Members of this family of small ribosomal proteins ...
265-357 2.68e-03

Mitochondrial 28S ribosomal protein S27; Members of this family of small ribosomal proteins possess one of three conserved blocks of sequence found in proteins that stimulate the dissociation of guanine nucleotides from G-proteins, leaving open the possibility that MRP-S27 might be a functional partner of GTP-binding ribosomal proteins.


Pssm-ID: 462947 [Multi-domain]  Cd Length: 395  Bit Score: 40.50  E-value: 2.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744   265 IDSLCKYRHVEVAVDLftemetkgirPNVVTYnSLINCLCNYGRWSDASRLLSNMLEKKINPNVVTFNALIDAFFKEGKL 344
Cdd:pfam10037  88 EYYLYKLRHSPNCWYL----------RDWTSH-AWIRQCLKYGAPDKALYTLKNKVQYGIFPDNFTFNLLMDSFLKNGDY 156
                          90
                  ....*....|...
gi 18407744   345 VEAEKLHEEMIQR 357
Cdd:pfam10037 157 KSAASVVTELMLQ 169
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
177-252 3.80e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 39.30  E-value: 3.80e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 18407744   177 DQMVEMGYKPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLALNLLNKMEAA 252
Cdd:pfam17177  79 EAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAH 154
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
389-440 4.35e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 35.80  E-value: 4.35e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 18407744   389 MVSKDCLPNIQTYNTLINGFCKCKRVEDGVELFREMSQRGLVGNTVTYTTII 440
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
504-526 4.37e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.13  E-value: 4.37e-03
                          10        20
                  ....*....|....*....|...
gi 18407744   504 FIYNTMIEGMCKAGKVGEAWDLF 526
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELF 23
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
469-503 4.44e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.12  E-value: 4.44e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 18407744   469 MTYSILLHGLCSYGKLDTALVIFKYLQKSEMELNI 503
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
113-144 5.03e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.01  E-value: 5.03e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 18407744   113 GISHDLYTYSIFINCFCRRSQLSLALAVLAKM 144
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
154-183 5.17e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 5.17e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 18407744   154 VTLSSLLNGYCHSKRISDAVALVDQMVEMG 183
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
233-387 6.44e-03

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 38.83  E-value: 6.44e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744 233 LCKRGDIDLALNLLNKmeAARIKAN-VVIFNTIIDSLCKYRHVEVAVDLFTE-MEtkgIRPNVV-TYNSLINCLCNYGRW 309
Cdd:COG0457  18 YRRLGRYEEAIEDYEK--ALELDPDdAEALYNLGLAYLRLGRYEEALADYEQaLE---LDPDDAeALNNLGLALQALGRY 92
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 18407744 310 SDASRLLSNMLEkkINP-NVVTFNALIDAFFKEGKLVEAEKLHEEMIQrsIDPDT-ITYNLLINGFCMHNRLDEAKQMFK 387
Cdd:COG0457  93 EEALEDYDKALE--LDPdDAEALYNLGLALLELGRYDEAIEAYERALE--LDPDDaDALYNLGIALEKLGRYEEALELLE 168
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
189-219 7.66e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.36  E-value: 7.66e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 18407744   189 FTFTTLIHGLFLHNKASEAVALVDQMVQRGC 219
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
140-195 9.21e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 35.03  E-value: 9.21e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 18407744   140 VLAKMMKLGYEPDIVTLSSLLNGYCHSKRISDAVALVDQMVEMGYKPDTFTFTTLI 195
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH