NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|15235288|ref|NP_193742|]
View 

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 13595353)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
190-554 1.71e-31

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 131.54  E-value: 1.71e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   190 FNLVIKALCKLRFVDRAIEVFRGMPERKCLPDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCK 269
Cdd:PLN03218  440 FNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCAR 519
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   270 KGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVS-SKCI-PNDVTYGTLINGLVKQRRATDAV 347
Cdd:PLN03218  520 AGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAeTHPIdPDHITVGALMKACANAGQVDRAK 599
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   348 RLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVDGLCREGKPNEAKEILNRMIASG 427
Cdd:PLN03218  600 EVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQG 679
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   428 CLPNAYTYSSLMkgffktGLC------EEAVQVWKEMDKTGCSRNKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIGIKPD 501
Cdd:PLN03218  680 IKLGTVSYSSLM------GACsnaknwKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPN 753
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 15235288   502 TVAYSSIIKGLCGIGSMDAALKLYHEMLcqeEPKSQPDVVTYNILLdGLCMQK 554
Cdd:PLN03218  754 TITYSILLVASERKDDADVGLDLLSQAK---EDGIKPNLVMCRCIT-GLCLRR 802
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
538-585 6.71e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.15  E-value: 6.71e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15235288   538 PDVVTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDVITCNTFLNTL 585
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
146-199 9.69e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 34.65  E-value: 9.69e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 15235288   146 RSVKSFNSVLNVIINEGLYHRGLEFYDYvvnsnM-NMNISPNGLSFNLVIKALCK 199
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNE-----MkKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
190-554 1.71e-31

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 131.54  E-value: 1.71e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   190 FNLVIKALCKLRFVDRAIEVFRGMPERKCLPDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCK 269
Cdd:PLN03218  440 FNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCAR 519
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   270 KGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVS-SKCI-PNDVTYGTLINGLVKQRRATDAV 347
Cdd:PLN03218  520 AGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAeTHPIdPDHITVGALMKACANAGQVDRAK 599
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   348 RLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVDGLCREGKPNEAKEILNRMIASG 427
Cdd:PLN03218  600 EVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQG 679
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   428 CLPNAYTYSSLMkgffktGLC------EEAVQVWKEMDKTGCSRNKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIGIKPD 501
Cdd:PLN03218  680 IKLGTVSYSSLM------GACsnaknwKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPN 753
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 15235288   502 TVAYSSIIKGLCGIGSMDAALKLYHEMLcqeEPKSQPDVVTYNILLdGLCMQK 554
Cdd:PLN03218  754 TITYSILLVASERKDDADVGLDLLSQAK---EDGIKPNLVMCRCIT-GLCLRR 802
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
290-339 4.15e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 69.70  E-value: 4.15e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15235288   290 PNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVK 339
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
538-585 6.71e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.15  E-value: 6.71e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15235288   538 PDVVTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDVITCNTFLNTL 585
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
293-327 2.21e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.45  E-value: 2.21e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15235288   293 VTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPND 327
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
541-575 5.82e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.29  E-value: 5.82e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15235288   541 VTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDV 575
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
305-457 3.66e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 42.79  E-value: 3.66e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288 305 KGKLDKAVSLLERMVSSKciPNDVTYGT-LINGLVKQRRATDAVRLLSSMEERGYHlNQHIYSVLISGLFKEGKAEEAMS 383
Cdd:COG2956  89 AGLLDRAEELLEKLLELD--PDDAEALRlLAEIYEQEGDWEKAIEVLERLLKLGPE-NAHAYCELAELYLEQGDYDEAIE 165
                        90       100       110       120       130       140       150
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 15235288 384 LWRKMAEKgcKPNIV-VYSVLVDGLCREGKPNEAKEILNRMIASGclPN-AYTYSSLMKGFFKTGLCEEAVQVWKE 457
Cdd:COG2956 166 ALEKALKL--DPDCArALLLLAELYLEQGDYEEAIAALERALEQD--PDyLPALPRLAELYEKLGDPEEALELLRK 237
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
146-199 9.69e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 34.65  E-value: 9.69e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 15235288   146 RSVKSFNSVLNVIINEGLYHRGLEFYDYvvnsnM-NMNISPNGLSFNLVIKALCK 199
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNE-----MkKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
190-554 1.71e-31

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 131.54  E-value: 1.71e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   190 FNLVIKALCKLRFVDRAIEVFRGMPERKCLPDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCK 269
Cdd:PLN03218  440 FNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCAR 519
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   270 KGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVS-SKCI-PNDVTYGTLINGLVKQRRATDAV 347
Cdd:PLN03218  520 AGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAeTHPIdPDHITVGALMKACANAGQVDRAK 599
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   348 RLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVDGLCREGKPNEAKEILNRMIASG 427
Cdd:PLN03218  600 EVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQG 679
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   428 CLPNAYTYSSLMkgffktGLC------EEAVQVWKEMDKTGCSRNKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIGIKPD 501
Cdd:PLN03218  680 IKLGTVSYSSLM------GACsnaknwKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPN 753
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 15235288   502 TVAYSSIIKGLCGIGSMDAALKLYHEMLcqeEPKSQPDVVTYNILLdGLCMQK 554
Cdd:PLN03218  754 TITYSILLVASERKDDADVGLDLLSQAK---EDGIKPNLVMCRCIT-GLCLRR 802
PLN03218 PLN03218
maturation of RBCL 1; Provisional
232-602 2.89e-27

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 118.06  E-value: 2.89e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   232 LCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKK----GDLTRVTKLVDNmflkgcvPNEVTYNTLIHGLCLKGK 307
Cdd:PLN03218  380 LLRDGRIKDCIDLLEDMEKRGLLDMDKIYHAKFFKACKKqravKEAFRFAKLIRN-------PTLSTFNMLMSVCASSQD 452
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   308 LDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRK 387
Cdd:PLN03218  453 IDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGI 532
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   388 MAEKGCKPNIVVYSVLVDGLCREGKPNEAKEILNRMIASG--CLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTG--- 462
Cdd:PLN03218  533 MRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNikg 612
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   463 -----------CSRN---KFCYSV------------------LIDGLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSIIK 510
Cdd:PLN03218  613 tpevytiavnsCSQKgdwDFALSIyddmkkkgvkpdevffsaLVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMG 692
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   511 GLCGIGSMDAALKLYHEMlcqEEPKSQPDVVTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDVITCNTFLNTlSEKSN 590
Cdd:PLN03218  693 ACSNAKNWKKALELYEDI---KSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVA-SERKD 768
                         410
                  ....*....|..
gi 15235288   591 SCDKGRSFLEEL 602
Cdd:PLN03218  769 DADVGLDLLSQA 780
PLN03218 PLN03218
maturation of RBCL 1; Provisional
183-408 5.68e-19

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 91.86  E-value: 5.68e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   183 ISPNGLSFNLVIKALCKLRFVDRAIEVFRGMPER--KCLPDGYT----YCTLMDGLckeeriDEAVLLLDEMQSEGCSPS 256
Cdd:PLN03218  575 IDPDHITVGALMKACANAGQVDRAKEVYQMIHEYniKGTPEVYTiavnSCSQKGDW------DFALSIYDDMKKKGVKPD 648
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   257 PVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLING 336
Cdd:PLN03218  649 EVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITA 728
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 15235288   337 LVKQRRATDAVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVdGLC 408
Cdd:PLN03218  729 LCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLC 799
PLN03077 PLN03077
Protein ECB2; Provisional
78-487 1.50e-18

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 90.29  E-value: 1.50e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   78 STLSSMIESYANSGDFDSVEKLLSRIRLENRViierSFIVVFRAYGKAHLPDKAVDLFhRMVDEFRCKRSVKSFNSVLNV 157
Cdd:PLN03077 324 SVCNSLIQMYLSLGSWGEAEKVFSRMETKDAV----SWTAMISGYEKNGLPDKALETY-ALMEQDNVSPDEITIASVLSA 398
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  158 IINEGLYHRGLEFYDYVVNSN-MNMNISPNGLsfnlvIKALCKLRFVDRAIEVFRGMPERkclpDGYTYCTLMDGLCKEE 236
Cdd:PLN03077 399 CACLGDLDVGVKLHELAERKGlISYVVVANAL-----IEMYSKCKCIDKALEVFHNIPEK----DVISWTSIIAGLRLNN 469
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  237 RIDEAVLLLDEMQSEgCSPSPVIYNVLIDGLCKKGDLtRVTK------LVDNMFLKGCVPNE------------------ 292
Cdd:PLN03077 470 RCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGAL-MCGKeihahvLRTGIGFDGFLPNAlldlyvrcgrmnyawnqf 547
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  293 -------VTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERgyhlnqhiY 365
Cdd:PLN03077 548 nshekdvVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEK--------Y 619
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  366 SVLisglfkegkaeeamslwrkmaekgckPNIVVYSVLVDGLCREGKPNEAKEILNRMIASgclPNAYTYSSLMKG---F 442
Cdd:PLN03077 620 SIT--------------------------PNLKHYACVVDLLGRAGKLTEAYNFINKMPIT---PDPAVWGALLNAcriH 670
                        410       420       430       440       450
                 ....*....|....*....|....*....|....*....|....*....|..
gi 15235288  443 FKTGLCEEAVQVWKEMDKtgcsrNKFCYSVLIDGL---CG----VGRVKEAM 487
Cdd:PLN03077 671 RHVELGELAAQHIFELDP-----NSVGYYILLCNLyadAGkwdeVARVRKTM 717
PLN03218 PLN03218
maturation of RBCL 1; Provisional
182-448 1.24e-17

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 87.62  E-value: 1.24e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   182 NISPNGLSFNLVIKALCKLRFVDRAIEVFRGM-PERK-CLPDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVI 259
Cdd:PLN03218  537 NVKPDRVVFNALISACGQSGAVDRAFDVLAEMkAETHpIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEV 616
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   260 YNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVK 339
Cdd:PLN03218  617 YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSN 696
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   340 QRRATDAVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVDGLCREGKPNEAKEI 419
Cdd:PLN03218  697 AKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDL 776
                         250       260
                  ....*....|....*....|....*....
gi 15235288   420 LNRMIASGCLPNAYTYSSLmkgffkTGLC 448
Cdd:PLN03218  777 LSQAKEDGIKPNLVMCRCI------TGLC 799
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
174-621 7.52e-16

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 81.46  E-value: 7.52e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  174 VVNSNMNMNISPNGLSFNLVIKALCKLRFVDRAIEVFRGMPERkclpDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGC 253
Cdd:PLN03081 145 VYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGS 220
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  254 SPSPVIYNVLIDGLCKKGdLTRVTKLVDNMFLK-GCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCipndVTYGT 332
Cdd:PLN03081 221 DAEPRTFVVMLRASAGLG-SARAGQQLHCCVLKtGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTT----VAWNS 295
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  333 LINGLVKQRRATDAVRLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVDGLCREGK 412
Cdd:PLN03081 296 MLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGR 375
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  413 PNEAKEILNRMiasgclP--NAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSRNKFCYSVLIDGLCGVGRVKEAMMVW 490
Cdd:PLN03081 376 MEDARNVFDRM------PrkNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIF 449
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  491 SKMLTI-GIKPDTVAYSSIIKGLCGIGSMDAALKLYhemlcqeepKSQPDVVTYNI---LLDGLCMQKDISRAVDLLNSM 566
Cdd:PLN03081 450 QSMSENhRIKPRAMHYACMIELLGREGLLDEAYAMI---------RRAPFKPTVNMwaaLLTACRIHKNLELGRLAAEKL 520
                        410       420       430       440       450
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 15235288  567 LdrGCDPDVItcNTFLNTLSEKSNScdkGRSFLEELVVRLLKRQ--RVSGACTIVEV 621
Cdd:PLN03081 521 Y--GMGPEKL--NNYVVLLNLYNSS---GRQAEAAKVVETLKRKglSMHPACTWIEV 570
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
290-339 4.15e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 69.70  E-value: 4.15e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15235288   290 PNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVK 339
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
73-547 1.30e-13

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 74.50  E-value: 1.30e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   73 FKLGDSTLSSMIESYANSGDFDSVEKLLSRIRLENRViierSFIVVFRAYGKAHLPDKAVDLFHRMVdEFRCKRSVKSFN 152
Cdd:PLN03077 218 FELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCI----SWNAMISGYFENGECLEGLELFFTMR-ELSVDPDLMTIT 292
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  153 SVlnVIINEGLYHRGL--EFYDYVVNSNMNMNISPNG---------------------------LSFNLVIKALCKLRFV 203
Cdd:PLN03077 293 SV--ISACELLGDERLgrEMHGYVVKTGFAVDVSVCNsliqmylslgswgeaekvfsrmetkdaVSWTAMISGYEKNGLP 370
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  204 DRAIEVFRGMPERKCLPDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNM 283
Cdd:PLN03077 371 DKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNI 450
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  284 FLKgcvpNEVTYNTLIHGLCLKGKLDKAVSLLERMVSsKCIPNDVTygtlingLVKQRRATDAVRLLSSMEERGYHlnqh 363
Cdd:PLN03077 451 PEK----DVISWTSIIAGLRLNNRCFEALIFFRQMLL-TLKPNSVT-------LIAALSACARIGALMCGKEIHAH---- 514
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  364 iysVLISGLFKEGKAEEAM-SLWRKMAEKG--------CKPNIVVYSVLVDGLCREGKPNEAKEILNRMIASGCLPNAYT 434
Cdd:PLN03077 515 ---VLRTGIGFDGFLPNALlDLYVRCGRMNyawnqfnsHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVT 591
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  435 YSSLMKGFFKTGLCEEAVQVWKEM-DKTGCSRNKFCYSVLIDGLCGVGRVKEAMMVWSKMltiGIKPDTVAYSSIIKGlC 513
Cdd:PLN03077 592 FISLLCACSRSGMVTQGLEYFHSMeEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKM---PITPDPAVWGALLNA-C 667
                        490       500       510
                 ....*....|....*....|....*....|....
gi 15235288  514 GIGSMDAALKLYHEMLCQEEPKSqpdvVTYNILL 547
Cdd:PLN03077 668 RIHRHVELGELAAQHIFELDPNS----VGYYILL 697
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
395-444 1.90e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 65.08  E-value: 1.90e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15235288   395 PNIVVYSVLVDGLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFK 444
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
255-303 2.04e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 64.69  E-value: 2.04e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 15235288   255 PSPVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLC 303
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
286-319 5.54e-13

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 63.13  E-value: 5.54e-13
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15235288   286 KGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMV 319
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
500-551 5.85e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.54  E-value: 5.85e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 15235288   500 PDTVAYSSIIKGLCGIGSMDAALKLYHEMLCQeepKSQPDVVTYNILLDGLC 551
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKR---GVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
538-585 6.71e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.15  E-value: 6.71e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15235288   538 PDVVTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDVITCNTFLNTL 585
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PLN03077 PLN03077
Protein ECB2; Provisional
296-582 8.20e-13

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 71.80  E-value: 8.20e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  296 NTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINgLVKQRRATDA-----VRLLSSMEERGYHLNqhiySVLIS 370
Cdd:PLN03077  55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFR-LCEWKRAVEEgsrvcSRALSSHPSLGVRLG----NAMLS 129
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  371 GLFKEGKAEEAMSLWRKMAEKgckpNIVVYSVLVDGLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEE 450
Cdd:PLN03077 130 MFVRFGELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLAR 205
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  451 AVQVWKEMDKTGCSRNKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIgikpDTVAYSSIIKGLCGIGSMDAALKLYHEMLc 530
Cdd:PLN03077 206 GREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFFTMR- 280
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|..
gi 15235288  531 qeEPKSQPDVVTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDVITCNTFL 582
Cdd:PLN03077 281 --ELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLI 330
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
361-409 1.29e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 62.38  E-value: 1.29e-12
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 15235288   361 NQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVDGLCR 409
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
114-577 2.36e-12

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 70.26  E-value: 2.36e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  114 SFIVVFRAYGKAHLPDKAVDLFHRMVdEFRCKRSVKSFNSVLNVIINEGLYHRGLEFYDYVVNSNMNMNISpnglSFNLV 193
Cdd:PLN03077 154 SWNVLVGGYAKAGYFDEALCLYHRML-WAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVD----VVNAL 228
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  194 IKALCKLRFVDRAIEVFRGMPERKCLpdgyTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPspviynvlidglckkgDL 273
Cdd:PLN03077 229 ITMYVKCGDVVSARLVFDRMPRRDCI----SWNAMISGYFENGECLEGLELFFTMRELSVDP----------------DL 288
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  274 TRVTKLVDNMFLKGcvpnEVTYNTLIHGLCLKGKLDKAVSLlermvsskcipndvtYGTLINGLVKQRRATDAVRLLSSM 353
Cdd:PLN03077 289 MTITSVISACELLG----DERLGREMHGYVVKTGFAVDVSV---------------CNSLIQMYLSLGSWGEAEKVFSRM 349
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  354 EERgyhlNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPN-IVVYSVLVDGLCReGKPNEAKEILNRMIASGCLPNA 432
Cdd:PLN03077 350 ETK----DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDeITIASVLSACACL-GDLDVGVKLHELAERKGLISYV 424
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  433 YTYSSLMKGFFKTGLCEEAVQVWKEMDKtgcsRNKFCYSVLIDGLCGVGRVKEAMMVWSKMLtIGIKPDTVAYSSIIKGL 512
Cdd:PLN03077 425 VVANALIEMYSKCKCIDKALEVFHNIPE----KDVISWTSIIAGLRLNNRCFEALIFFRQML-LTLKPNSVTLIAALSAC 499
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  513 ------------------CGIGSMD----AALKLYHEMLCQEEPKSQ-----PDVVTYNILLDGLCMQKDISRAVDLLNS 565
Cdd:PLN03077 500 arigalmcgkeihahvlrTGIGFDGflpnALLDLYVRCGRMNYAWNQfnsheKDVVSWNILLTGYVAHGKGSMAVELFNR 579
                        490
                 ....*....|..
gi 15235288  566 MLDRGCDPDVIT 577
Cdd:PLN03077 580 MVESGVNPDEVT 591
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
220-269 2.79e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 61.61  E-value: 2.79e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15235288   220 PDGYTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCK 269
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
430-478 3.00e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 58.53  E-value: 3.00e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 15235288   430 PNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSRNKFCYSVLIDGLC 478
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
369-586 2.37e-10

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 63.74  E-value: 2.37e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  369 ISGLFKEGKAEEAMSLWRkMAEKGC--KPNIVVYSVLVDGLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTG 446
Cdd:PLN03081  94 IEKLVACGRHREALELFE-ILEAGCpfTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCG 172
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  447 LCEEAVQVWKEMDKtgcsRNKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSIIKGLCGIGSMDAALKLY- 525
Cdd:PLN03081 173 MLIDARRLFDEMPE----RNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHc 248
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  526 ---------------------------HEMLCQEEPKSQPDVVTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDVITC 578
Cdd:PLN03081 249 cvlktgvvgdtfvscalidmyskcgdiEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTF 328

                 ....*...
gi 15235288  579 NTFLNTLS 586
Cdd:PLN03081 329 SIMIRIFS 336
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
468-513 3.07e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 55.83  E-value: 3.07e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 15235288   468 FCYSVLIDGLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSIIKGLC 513
Cdd:pfam13041   4 VTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
185-234 6.79e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 54.68  E-value: 6.79e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15235288   185 PNGLSFNLVIKALCKLRFVDRAIEVFRGMPERKCLPDGYTYCTLMDGLCK 234
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
191-585 1.27e-09

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 61.40  E-value: 1.27e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  191 NLVIKALCKLRFVDRAIEVFRGMPERKCLPDGYTYCTLMDgLCKEER-IDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCK 269
Cdd:PLN03077  55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFR-LCEWKRaVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVR 133
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  270 KGDLTRVTklvdNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTY------------------- 330
Cdd:PLN03077 134 FGELVHAW----YVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFpcvlrtcggipdlargrev 209
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  331 ----------------GTLINGLVKQRRATDAVRLLSSMEERgyhlNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCK 394
Cdd:PLN03077 210 hahvvrfgfeldvdvvNALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFFTMRELSVD 285
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  395 PNIVVYSVLVDGLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDktgcSRNKFCYSVLI 474
Cdd:PLN03077 286 PDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRME----TKDAVSWTAMI 361
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  475 DGLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSIIKGLCGIGSMDAALKLyHEMlcQEEPKSQPDVVTYNILLDGLCMQK 554
Cdd:PLN03077 362 SGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKL-HEL--AERKGLISYVVVANALIEMYSKCK 438
                        410       420       430
                 ....*....|....*....|....*....|.
gi 15235288  555 DISRAVDLLNSMLDRgcdpDVITCNTFLNTL 585
Cdd:PLN03077 439 CIDKALEVFHNIPEK----DVISWTSIIAGL 465
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
325-374 4.45e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 52.37  E-value: 4.45e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15235288   325 PNDVTYGTLINGLVKQRRATDAVRLLSSMEERGYHLNQHIYSVLISGLFK 374
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
391-423 6.07e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 51.58  E-value: 6.07e-09
                          10        20        30
                  ....*....|....*....|....*....|...
gi 15235288   391 KGCKPNIVVYSVLVDGLCREGKPNEAKEILNRM 423
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PLN03218 PLN03218
maturation of RBCL 1; Provisional
150-270 8.80e-09

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 58.74  E-value: 8.80e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   150 SFNSVLNVIINEGLYHRGLEFYDYVVNSNMNMNISpnglSFNLVIKALCKLRFVDRAIEVFRGMPERKCLPDGYTYCTLM 229
Cdd:PLN03218  686 SYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVS----TMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILL 761
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 15235288   230 DGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLIdGLCKK 270
Cdd:PLN03218  762 VASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLCLR 801
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
374-526 1.74e-08

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 55.10  E-value: 1.74e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   374 KEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVDgLCR------EGKPNEAK----EILNRMIASGCLPNAYTYSSLMKGFF 443
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSkaadatDLKPQLAAdrgfEVFEAMKAQGVSPNEATYTAVARLAA 101
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   444 KTGLCEEAVQVWKEMDKTGCSRNKFCYSVLIDGLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSIIKGLCGIGSMDAALK 523
Cdd:pfam17177 102 AKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYA 181

                  ...
gi 15235288   524 LYH 526
Cdd:pfam17177 182 YLH 184
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
203-459 2.84e-08

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 57.19  E-value: 2.84e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  203 VDRAIEVFRGMPERKCLpdgyTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPSPVIYNVLID----------------G 266
Cdd:PLN03081 275 IEDARCVFDGMPEKTTV----AWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRifsrlallehakqahaG 350
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  267 LCKKG---DLTRVTKLVD------------NMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYG 331
Cdd:PLN03081 351 LIRTGfplDIVANTALVDlyskwgrmedarNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFL 430
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  332 TLIN-----GLVKQRRatdavRLLSSMEE------RGYHlnqhiYSVLISGLFKEGKAEEAMSLWRK------------- 387
Cdd:PLN03081 431 AVLSacrysGLSEQGW-----EIFQSMSEnhrikpRAMH-----YACMIELLGREGLLDEAYAMIRRapfkptvnmwaal 500
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288  388 ---------------MAEK--GCKP-NIVVYSVLVDGLCREGKPNEAKEILNRMIASG-CLPNAYTYSSLMKG----FFK 444
Cdd:PLN03081 501 ltacrihknlelgrlAAEKlyGMGPeKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGlSMHPACTWIEVKKQdhsfFSG 580
                        330
                 ....*....|....*
gi 15235288  445 TGLCEEAVQVWKEMD 459
Cdd:PLN03081 581 DRLHPQSREIYQKLD 595
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
218-248 4.25e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 49.27  E-value: 4.25e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15235288   218 CLPDGYTYCTLMDGLCKEERIDEAVLLLDEM 248
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
293-327 2.21e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.45  E-value: 2.21e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15235288   293 VTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPND 327
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
223-256 2.30e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.45  E-value: 2.30e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15235288   223 YTYCTLMDGLCKEERIDEAVLLLDEMQSEGCSPS 256
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
541-575 5.82e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.29  E-value: 5.82e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15235288   541 VTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDV 575
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
269-461 8.88e-07

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 50.09  E-value: 8.88e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   269 KKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHgLCLKG----------KLDKAVSLLERMVSSKCIPNDVTYgtlinglv 338
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKAadatdlkpqlAADRGFEVFEAMKAQGVSPNEATY-------- 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   339 kqrraTDAVRLLSsmeergyhlnqhiysvlisglfKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVDGLCREGKPNEAKE 418
Cdd:pfam17177  94 -----TAVARLAA----------------------AKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYE 146
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|...
gi 15235288   419 ILNRMIASGCLPNAYTYSSLMKGFFKTGLceeAVQVWKEMDKT 461
Cdd:pfam17177 147 VEEHMLAHGVELEEPELAALLKVSAKAGR---ADKVYAYLHRL 186
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
398-432 1.05e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.52  E-value: 1.05e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15235288   398 VVYSVLVDGLCREGKPNEAKEILNRMIASGCLPNA 432
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
342-464 1.50e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 49.32  E-value: 1.50e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   342 RATDAVRLLS---SMEERGYHLNQHIYSVLI---------SGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLVDGLCR 409
Cdd:pfam17177  23 KHADATGALAlydAAKAEGVRLAQYHYNVLLylcskaadaTDLKPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 15235288   410 EGKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCS 464
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVE 157
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
278-335 1.85e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 45.43  E-value: 1.85e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 15235288   278 KLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLIN 335
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
537-566 1.90e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 44.64  E-value: 1.90e-06
                          10        20        30
                  ....*....|....*....|....*....|
gi 15235288   537 QPDVVTYNILLDGLCMQKDISRAVDLLNSM 566
Cdd:pfam12854   4 KPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
293-323 2.41e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 44.38  E-value: 2.41e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15235288   293 VTYNTLIHGLCLKGKLDKAVSLLERMVSSKC 323
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
363-397 3.24e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.98  E-value: 3.24e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15235288   363 HIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNI 397
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
251-283 4.05e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.87  E-value: 4.05e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 15235288   251 EGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNM 283
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
497-528 5.76e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.10  E-value: 5.76e-06
                          10        20        30
                  ....*....|....*....|....*....|..
gi 15235288   497 GIKPDTVAYSSIIKGLCGIGSMDAALKLYHEM 528
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
462-493 9.41e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 42.72  E-value: 9.41e-06
                          10        20        30
                  ....*....|....*....|....*....|..
gi 15235288   462 GCSRNKFCYSVLIDGLCGVGRVKEAMMVWSKM 493
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
348-404 1.82e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.73  E-value: 1.82e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 15235288   348 RLLSSMEERGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSVLV 404
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
427-458 2.19e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 41.56  E-value: 2.19e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 15235288   427 GCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEM 458
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
223-253 2.32e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 41.68  E-value: 2.32e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15235288   223 YTYCTLMDGLCKEERIDEAVLLLDEMQSEGC 253
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
383-442 2.70e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.35  E-value: 2.70e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   383 SLWRKMAEKGCKPNIVVYSVLVDGLCREGKPNEAKEILNRMIASGCLPNAYTYSSLMKGF 442
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
398-428 3.58e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.91  E-value: 3.58e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15235288   398 VVYSVLVDGLCREGKPNEAKEILNRMIASGC 428
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
364-393 5.19e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.53  E-value: 5.19e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 15235288   364 IYSVLISGLFKEGKAEEAMSLWRKMAEKGC 393
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
433-466 6.27e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 6.27e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15235288   433 YTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSRN 466
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
243-299 8.68e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.80  E-value: 8.68e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 15235288   243 LLLDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLI 299
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
323-354 1.09e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 39.64  E-value: 1.09e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 15235288   323 CIPNDVTYGTLINGLVKQRRATDAVRLLSSME 354
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
258-292 1.34e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.36  E-value: 1.34e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15235288   258 VIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNE 292
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
189-221 1.66e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.36  E-value: 1.66e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 15235288   189 SFNLVIKALCKLRFVDRAIEVFRGMPERKCLPD 221
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
419-479 1.81e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.03  E-value: 1.81e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 15235288   419 ILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSRNKFCYSVLIdGLCG 479
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVIG 61
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
246-350 3.33e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 42.38  E-value: 3.33e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   246 DEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSLLERMVSSKCIP 325
Cdd:pfam17177  79 EAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
                          90       100
                  ....*....|....*....|....*
gi 15235288   326 NDVTYGTLINGLVKQRRATDAVRLL 350
Cdd:pfam17177 159 EEPELAALLKVSAKAGRADKVYAYL 183
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
541-571 3.39e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.22  E-value: 3.39e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15235288   541 VTYNILLDGLCMQKDISRAVDLLNSMLDRGC 571
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
305-457 3.66e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 42.79  E-value: 3.66e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288 305 KGKLDKAVSLLERMVSSKciPNDVTYGT-LINGLVKQRRATDAVRLLSSMEERGYHlNQHIYSVLISGLFKEGKAEEAMS 383
Cdd:COG2956  89 AGLLDRAEELLEKLLELD--PDDAEALRlLAEIYEQEGDWEKAIEVLERLLKLGPE-NAHAYCELAELYLEQGDYDEAIE 165
                        90       100       110       120       130       140       150
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 15235288 384 LWRKMAEKgcKPNIV-VYSVLVDGLCREGKPNEAKEILNRMIASGclPN-AYTYSSLMKGFFKTGLCEEAVQVWKE 457
Cdd:COG2956 166 ALEKALKL--DPDCArALLLLAELYLEQGDYEEAIAALERALEQD--PDyLPALPRLAELYEKLGDPEEALELLRK 237
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
433-463 3.85e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.22  E-value: 3.85e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15235288   433 YTYSSLMKGFFKTGLCEEAVQVWKEMDKTGC 463
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
468-502 4.18e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.21  E-value: 4.18e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15235288   468 FCYSVLIDGLCGVGRVKEAMMVWSKMLTIGIKPDT 502
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
166-321 5.84e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 41.61  E-value: 5.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   166 RGLEFYDYVVNSNMNMNISpnglSFNLVIKaLC----------KLRFVDRAIEVFRGMPERKCLPDGYTYcTLMDGLC-K 234
Cdd:pfam17177  29 GALALYDAAKAEGVRLAQY----HYNVLLY-LCskaadatdlkPQLAADRGFEVFEAMKAQGVSPNEATY-TAVARLAaA 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15235288   235 EERIDEAVLLLDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKLDKAVSL 314
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAY 182

                  ....*..
gi 15235288   315 LERMVSS 321
Cdd:pfam17177 183 LHRLRDA 189
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
356-388 6.36e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 37.33  E-value: 6.36e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 15235288   356 RGYHLNQHIYSVLISGLFKEGKAEEAMSLWRKM 388
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
328-361 6.97e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 6.97e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15235288   328 VTYGTLINGLVKQRRATDAVRLLSSMEERGYHLN 361
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
181-213 2.57e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.78  E-value: 2.57e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 15235288   181 MNISPNGLSFNLVIKALCKLRFVDRAIEVFRGM 213
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
258-288 4.45e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.13  E-value: 4.45e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15235288   258 VIYNVLIDGLCKKGDLTRVTKLVDNMFLKGC 288
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
328-357 6.10e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 6.10e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 15235288   328 VTYGTLINGLVKQRRATDAVRLLSSMEERG 357
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
468-498 6.73e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 6.73e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15235288   468 FCYSVLIDGLCGVGRVKEAMMVWSKMLTIGI 498
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
537-584 6.92e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 35.41  E-value: 6.92e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15235288   537 QPDVVTYNILLDGLCMQKDISRAVDLLNSMLDRGCDPDVITCNTFLNT 584
Cdd:pfam13812  12 QLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGV 59
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
189-218 8.44e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.36  E-value: 8.44e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 15235288   189 SFNLVIKALCKLRFVDRAIEVFRGMPERKC 218
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
146-199 9.69e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 34.65  E-value: 9.69e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 15235288   146 RSVKSFNSVLNVIINEGLYHRGLEFYDYvvnsnM-NMNISPNGLSFNLVIKALCK 199
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNE-----MkKRGVKPNVYTYTILINGLCK 50
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH