NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1063710346|ref|NP_001319507|]
View 

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
147-580 1.95e-38

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 153.49  E-value: 1.95e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  147 PNLQTYNVLIKMSCKKKEFEKARGFLDWMWKEGFKPDVFSYSTVINDLAKAGKLDDALELFDEMSERGVAPDVTCYNILI 226
Cdd:PLN03218   435 PTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALI 514
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  227 DGFLKEKDHKTAMELWDrLLEDSSVYPNVKTHNIMISGLSKCGRVDDCLKIWERMKQNER--EKDLYTYSSLIHGLCDAG 304
Cdd:PLN03218   515 DGCARAGQVAKAFGAYG-IMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAETHpiDPDHITVGALMKACANAG 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  305 NVDKAESVFNELDErkasidvvtYNtmlggfcrcgkIKESLELWRIMEHKNSvnivsynilikgllENGKIDEATMIWRL 384
Cdd:PLN03218   594 QVDRAKEVYQMIHE---------YN-----------IKGTPEVYTIAVNSCS--------------QKGDWDFALSIYDD 639
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  385 MPAKGYAADKTTYGIFIHGLCVNGYVNKALGVMQEVESSGGHLDVYAYASIIDCLCKKKRLEEASNLVKEmskhgvelns 464
Cdd:PLN03218   640 MKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYED---------- 709
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  465 hvcnaligglIRDSRLgeasfflremgkngcRPTVVSYNILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLCGL 544
Cdd:PLN03218   710 ----------IKSIKL---------------RPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVAS 764
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1063710346  545 CRDRKIDLALELWHQFLQSGLETDVMMHNILIhGLC 580
Cdd:PLN03218   765 ERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLC 799
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
603-650 2.52e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 58.91  E-value: 2.52e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063710346 603 NLVTYNTLMEGFFKVGDSNRATVIWGYMYKMGLQPDIISYNTIMKGLC 650
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
637-683 4.91e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 52.37  E-value: 4.91e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1063710346 637 PDIISYNTIMKGLCMCRGVSYAMEFFDDARNHGIFPTVYTWNILVRA 683
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILING 47
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
112-161 5.15e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.59  E-value: 5.15e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 112 PAIRSYNTLLNAFVEAKQWVKVESLFAYFETAGVAPNLQTYNVLIKMSCK 161
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
83-124 4.66e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.50  E-value: 4.66e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 1063710346  83 SVIKTYGKNSMPDQALDVFKRMREiFGCEPAIRSYNTLLNAF 124
Cdd:pfam13041   8 TLINGYCKKGKVEEAFKLFNEMKK-RGVKPNVYTYTILINGL 48
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
570-603 6.77e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


:

Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 34.74  E-value: 6.77e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063710346 570 MMHNILIHGLCSVGKLDDAMTVMANMEHRNCTAN 603
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
147-580 1.95e-38

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 153.49  E-value: 1.95e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  147 PNLQTYNVLIKMSCKKKEFEKARGFLDWMWKEGFKPDVFSYSTVINDLAKAGKLDDALELFDEMSERGVAPDVTCYNILI 226
Cdd:PLN03218   435 PTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALI 514
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  227 DGFLKEKDHKTAMELWDrLLEDSSVYPNVKTHNIMISGLSKCGRVDDCLKIWERMKQNER--EKDLYTYSSLIHGLCDAG 304
Cdd:PLN03218   515 DGCARAGQVAKAFGAYG-IMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAETHpiDPDHITVGALMKACANAG 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  305 NVDKAESVFNELDErkasidvvtYNtmlggfcrcgkIKESLELWRIMEHKNSvnivsynilikgllENGKIDEATMIWRL 384
Cdd:PLN03218   594 QVDRAKEVYQMIHE---------YN-----------IKGTPEVYTIAVNSCS--------------QKGDWDFALSIYDD 639
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  385 MPAKGYAADKTTYGIFIHGLCVNGYVNKALGVMQEVESSGGHLDVYAYASIIDCLCKKKRLEEASNLVKEmskhgvelns 464
Cdd:PLN03218   640 MKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYED---------- 709
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  465 hvcnaligglIRDSRLgeasfflremgkngcRPTVVSYNILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLCGL 544
Cdd:PLN03218   710 ----------IKSIKL---------------RPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVAS 764
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1063710346  545 CRDRKIDLALELWHQFLQSGLETDVMMHNILIhGLC 580
Cdd:PLN03218   765 ERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLC 799
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
497-546 2.31e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 70.47  E-value: 2.31e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 497 PTVVSYNILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLCGLCR 546
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
603-650 2.52e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 58.91  E-value: 2.52e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063710346 603 NLVTYNTLMEGFFKVGDSNRATVIWGYMYKMGLQPDIISYNTIMKGLC 650
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
637-683 4.91e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 52.37  E-value: 4.91e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1063710346 637 PDIISYNTIMKGLCMCRGVSYAMEFFDDARNHGIFPTVYTWNILVRA 683
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILING 47
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
500-533 8.88e-08

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 48.61  E-value: 8.88e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063710346 500 VSYNILICGLCKAGKFGEASAFVKEMLENGWKPD 533
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
112-161 5.15e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.59  E-value: 5.15e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 112 PAIRSYNTLLNAFVEAKQWVKVESLFAYFETAGVAPNLQTYNVLIKMSCK 161
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
77-247 7.17e-05

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 45.11  E-value: 7.17e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  77 DEDVALSVIKTYGKNSMPDQALDVFKRMREIFGCEPAIrsYNTLLNAFVEAKQWVKVEslfAYFETA-GVAPN-LQTYNV 154
Cdd:COG2956   109 DAEALRLLAEIYEQEGDWEKAIEVLERLLKLGPENAHA--YCELAELYLEQGDYDEAI---EALEKAlKLDPDcARALLL 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 155 LIKMSCKKKEFEKARGfldwMWKEGFKPDVfSYSTVINDLAKA----GKLDDALELFDEMSERgvAPDVTCYNILIDGFL 230
Cdd:COG2956   184 LAELYLEQGDYEEAIA----ALERALEQDP-DYLPALPRLAELyeklGDPEEALELLRKALEL--DPSDDLLLALADLLE 256
                         170
                  ....*....|....*..
gi 1063710346 231 KEKDHKTAMELWDRLLE 247
Cdd:COG2956   257 RKEGLEAALALLERQLR 273
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
83-124 4.66e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.50  E-value: 4.66e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 1063710346  83 SVIKTYGKNSMPDQALDVFKRMREiFGCEPAIRSYNTLLNAF 124
Cdd:pfam13041   8 TLINGYCKKGKVEEAFKLFNEMKK-RGVKPNVYTYTILINGL 48
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
605-639 8.32e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 8.32e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063710346 605 VTYNTLMEGFFKVGDSNRATVIWGYMYKMGLQPDI 639
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
570-603 6.77e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 34.74  E-value: 6.77e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063710346 570 MMHNILIHGLCSVGKLDDAMTVMANMEHRNCTAN 603
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
147-580 1.95e-38

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 153.49  E-value: 1.95e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  147 PNLQTYNVLIKMSCKKKEFEKARGFLDWMWKEGFKPDVFSYSTVINDLAKAGKLDDALELFDEMSERGVAPDVTCYNILI 226
Cdd:PLN03218   435 PTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALI 514
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  227 DGFLKEKDHKTAMELWDrLLEDSSVYPNVKTHNIMISGLSKCGRVDDCLKIWERMKQNER--EKDLYTYSSLIHGLCDAG 304
Cdd:PLN03218   515 DGCARAGQVAKAFGAYG-IMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAETHpiDPDHITVGALMKACANAG 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  305 NVDKAESVFNELDErkasidvvtYNtmlggfcrcgkIKESLELWRIMEHKNSvnivsynilikgllENGKIDEATMIWRL 384
Cdd:PLN03218   594 QVDRAKEVYQMIHE---------YN-----------IKGTPEVYTIAVNSCS--------------QKGDWDFALSIYDD 639
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  385 MPAKGYAADKTTYGIFIHGLCVNGYVNKALGVMQEVESSGGHLDVYAYASIIDCLCKKKRLEEASNLVKEmskhgvelns 464
Cdd:PLN03218   640 MKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYED---------- 709
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  465 hvcnaligglIRDSRLgeasfflremgkngcRPTVVSYNILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLCGL 544
Cdd:PLN03218   710 ----------IKSIKL---------------RPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVAS 764
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1063710346  545 CRDRKIDLALELWHQFLQSGLETDVMMHNILIhGLC 580
Cdd:PLN03218   765 ERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLC 799
PLN03077 PLN03077
Protein ECB2; Provisional
97-595 1.43e-23

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 106.47  E-value: 1.43e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  97 ALDVFKRM--REIFgcepairSYNTLLNAFveAKQWVKVESLFAYFET--AGVAPNLQTYNVLIKmSCKkkefekarGFL 172
Cdd:PLN03077  140 AWYVFGKMpeRDLF-------SWNVLVGGY--AKAGYFDEALCLYHRMlwAGVRPDVYTFPCVLR-TCG--------GIP 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 173 DW-MWKE--------GFKPDVFSYSTVINDLAKAGKLDDALELFDEMSERgvapDVTCYNILIDGFLKEKDHKTAMELWD 243
Cdd:PLN03077  202 DLaRGREvhahvvrfGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFF 277
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 244 RLLEDSsVYPNVKThniMISGLSKCGRVDDclkiwERMKQNER--------EKDLYTYSSLIHGLCDAGNVDKAESVFNE 315
Cdd:PLN03077  278 TMRELS-VDPDLMT---ITSVISACELLGD-----ERLGREMHgyvvktgfAVDVSVCNSLIQMYLSLGSWGEAEKVFSR 348
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 316 LDERkasiDVVTYNTMLGGFCRCGKIKESLELWRIMEHKNsvnIVSYNILIKGLLEN----GKIDEATMIWRLMPAKGYa 391
Cdd:PLN03077  349 METK----DAVSWTAMISGYEKNGLPDKALETYALMEQDN---VSPDEITIASVLSAcaclGDLDVGVKLHELAERKGL- 420
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 392 adkTTYGIFIHGLcVNGY-----VNKALGVMQEVESSgghlDVYAYASIIDCLCKKKRLEEASNLVKEMSKH-------- 458
Cdd:PLN03077  421 ---ISYVVVANAL-IEMYskckcIDKALEVFHNIPEK----DVISWTSIIAGLRLNNRCFEALIFFRQMLLTlkpnsvtl 492
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 459 ---------------GVELNSHVC-----------NALIGGLIRDSRLGEA-SFFlremgkNGCRPTVVSYNILICGLCK 511
Cdd:PLN03077  493 iaalsacarigalmcGKEIHAHVLrtgigfdgflpNALLDLYVRCGRMNYAwNQF------NSHEKDVVSWNILLTGYVA 566
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 512 AGKFGEASAFVKEMLENGWKPDLKTYSILLCGLCRDRKIDLALELWHQFLQS-GLETDVMMHNILIHGLCSVGKLDDAMT 590
Cdd:PLN03077  567 HGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKySITPNLKHYACVVDLLGRAGKLTEAYN 646

                  ....*
gi 1063710346 591 VMANM 595
Cdd:PLN03077  647 FINKM 651
PLN03218 PLN03218
maturation of RBCL 1; Provisional
265-683 6.71e-22

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 101.11  E-value: 6.71e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  265 LSKCGRVDDCLKIWERMKQnereKDLYTYSSLIHGlcdagnvdkaeSVFNELDERKASIDVVTyntmlggFCRCgkIKES 344
Cdd:PLN03218   380 LLRDGRIKDCIDLLEDMEK----RGLLDMDKIYHA-----------KFFKACKKQRAVKEAFR-------FAKL--IRNP 435
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  345 lelwrIMEHknsvnivsYNILIKGLLENGKIDEATMIWRLMPAKGYAADKTTYGIFIHGLCVNGYVNKALGVMQEVESSG 424
Cdd:PLN03218   436 -----TLST--------FNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAG 502
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  425 GHLDVYAYASIIDCLCKKKRLEEASNLVKEMSKHGVELNSHVCNALIGGLIRDSRLGEASFFLREMGKNG--CRPTVVSY 502
Cdd:PLN03218   503 VEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITV 582
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  503 NILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILL--CGLCRDrkIDLALELWHQFLQSGLETDVMMHNILIHGLC 580
Cdd:PLN03218   583 GALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVnsCSQKGD--WDFALSIYDDMKKKGVKPDEVFFSALVDVAG 660
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  581 SVGKLDDAMTVMANMEHRNCTANLVTYNTLMEGFFKVGDSNRATVIWGYMYKMGLQPDIISYNTIMKGLCMCRGVSYAME 660
Cdd:PLN03218   661 HAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALE 740
                          410       420
                   ....*....|....*....|...
gi 1063710346  661 FFDDARNHGIFPTVYTWNILVRA 683
Cdd:PLN03218   741 VLSEMKRLGLCPNTITYSILLVA 763
PLN03218 PLN03218
maturation of RBCL 1; Provisional
318-683 2.63e-20

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 96.10  E-value: 2.63e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  318 ERKASIDVVTYNTMLggfcRCGKIKESLELWRIMEHKN--SVNIVSYNILIKGLLENGKIDEATMIWRLMPAkgyaADKT 395
Cdd:PLN03218   367 KRKSPEYIDAYNRLL----RDGRIKDCIDLLEDMEKRGllDMDKIYHAKFFKACKKQRAVKEAFRFAKLIRN----PTLS 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  396 TYGIFIHGLCVNGYVNKALGVMQEVESSGGHLDVYAYASIIDCLCKKKRLEEASNLVKEMSKHGVELNSHVCNALIGGLI 475
Cdd:PLN03218   439 TFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCA 518
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  476 RDSRLGEASFFLREMGKNGCRPTVVSYNILICGLCKAGKFGEASAFVKEMLENGW--KPDLKTYSILLCGLCRDRKIDLA 553
Cdd:PLN03218   519 RAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAETHpiDPDHITVGALMKACANAGQVDRA 598
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  554 LELWHQFLQSGLETDVMMHNILIHGLCSVGKLDDAMTVMANMEHRNCTANLVTYNTLMEGFFKVGDSNRATVIWGYMYKM 633
Cdd:PLN03218   599 KEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQ 678
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1063710346  634 GLQPDIISYNTIMKGLCMCRGVSYAMEFFDDARNHGIFPTVYTWNILVRA 683
Cdd:PLN03218   679 GIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITA 728
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
95-453 4.49e-19

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 91.86  E-value: 4.49e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  95 DQALDVFKRMREIFGCEPAIRSYNTLLNAFVEAKQWVKVESLFAYFETAGVAPNLQTYNVLIKMSCKKKEFEKARGFLDW 174
Cdd:PLN03081  104 REALELFEILEAGCPFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDE 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 175 MwkegfkPD--VFSYSTVINDLAKAGKLDDALELFDEMSERGVAPDVTCYNILIDG-------FLKEKDHKTAMELwdrl 245
Cdd:PLN03081  184 M------PErnLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRAsaglgsaRAGQQLHCCVLKT---- 253
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 246 ledsSVYPNVKTHNIMISGLSKCGRVDDCLKIWERMKqnerEKDLYTYSSLIHGLCDAGNVDKAESVFNELDERKASIDV 325
Cdd:PLN03081  254 ----GVVGDTFVSCALIDMYSKCGDIEDARCVFDGMP----EKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQ 325
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 326 VTYNTMLGGFCRCGK-----------IKESLEL--------------WRIMEHKNSV-------NIVSYNILIKGLLENG 373
Cdd:PLN03081  326 FTFSIMIRIFSRLALlehakqahaglIRTGFPLdivantalvdlyskWGRMEDARNVfdrmprkNLISWNALIAGYGNHG 405
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 374 KIDEATMIWRLMPAKGYAADKTTYGIFIHGLCVNGYVNKALGVMQEV-ESSGGHLDVYAYASIIDCLCKKKRLEEASNLV 452
Cdd:PLN03081  406 RGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMsENHRIKPRAMHYACMIELLGREGLLDEAYAMI 485

                  .
gi 1063710346 453 K 453
Cdd:PLN03081  486 R 486
PLN03077 PLN03077
Protein ECB2; Provisional
83-533 1.64e-18

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 90.29  E-value: 1.64e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  83 SVIKTYGKNSMPDQALDVFKRMREIFgCEPAIRSYNTLLNAFVEAKQWVKVESLFAYFETAGVAPNLQTYNVLIKMSCKK 162
Cdd:PLN03077  258 AMISGYFENGECLEGLELFFTMRELS-VDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSL 336
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 163 KEFEKARGFLDWMWKEgfkpDVFSYSTVINDLAKAGKLDDALELFDEMSERGVAPD-VTCYNILID-GFLKEKDhkTAME 240
Cdd:PLN03077  337 GSWGEAEKVFSRMETK----DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDeITIASVLSAcACLGDLD--VGVK 410
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 241 LwDRLLEDSSVYPNVKTHNIMISGLSKCGRVDDCLKIWERMKqnerEKDLYTYSSLIHGLCDAGNVDKAESVFNE--LDE 318
Cdd:PLN03077  411 L-HELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIP----EKDVISWTSIIAGLRLNNRCFEALIFFRQmlLTL 485
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 319 RKASIDVVTY--------------------------------NTMLGGFCRCGKIKESLELWRIMEHknsvNIVSYNILI 366
Cdd:PLN03077  486 KPNSVTLIAAlsacarigalmcgkeihahvlrtgigfdgflpNALLDLYVRCGRMNYAWNQFNSHEK----DVVSWNILL 561
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 367 KGLLENGKIDEATMIWRLMPAKGYAADKTTygiFIHGLCV---NGYVNKALGVMQEVESSGGHL-DVYAYASIIDCLCKK 442
Cdd:PLN03077  562 TGYVAHGKGSMAVELFNRMVESGVNPDEVT---FISLLCAcsrSGMVTQGLEYFHSMEEKYSITpNLKHYACVVDLLGRA 638
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 443 KRLEEASNLVKEMSkhgVELNSHVCNALIGG--LIRDSRLGE-ASFFLREMGKNGcrptvVSYNILICGL-CKAGKFGEA 518
Cdd:PLN03077  639 GKLTEAYNFINKMP---ITPDPAVWGALLNAcrIHRHVELGElAAQHIFELDPNS-----VGYYILLCNLyADAGKWDEV 710
                         490
                  ....*....|....*
gi 1063710346 519 SAFVKEMLENGWKPD 533
Cdd:PLN03077  711 ARVRKTMRENGLTVD 725
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
262-672 1.19e-16

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 84.15  E-value: 1.19e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 262 ISGLSKCGRVDDCLKIWERMKQNEREK-DLYTYSSLIHGLCDAGNVDKAESVFNELDERKASIDVVTYNTMLGGFCRCGK 340
Cdd:PLN03081   94 IEKLVACGRHREALELFEILEAGCPFTlPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGM 173
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 341 IKESLELWRIMEHKNsvnIVSYNILIKGLLENGKIDEATMIWRLMPAKGYAADKTTYGIFIHGLCVNG--YVNKALGVMq 418
Cdd:PLN03081  174 LIDARRLFDEMPERN---LASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGsaRAGQQLHCC- 249
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 419 eVESSGGHLDVYAYASIIDCLCKKKRLEEASNLVKEMSkhgvelnshvcnaligglirdsrlgeasfflremgkngcRPT 498
Cdd:PLN03081  250 -VLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMP---------------------------------------EKT 289
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 499 VVSYNILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLCGLCRDRKIDLALELWHQFLQSGLETDVMMHNILIHG 578
Cdd:PLN03081  290 TVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDL 369
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 579 LCSVGKLDDAMTVMANMEHRnctaNLVTYNTLMEGFFKVGDSNRATVIWGYMYKMGLQPDIISYNTIMKGlCMCRGVS-Y 657
Cdd:PLN03081  370 YSKWGRMEDARNVFDRMPRK----NLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSA-CRYSGLSeQ 444
                         410
                  ....*....|....*.
gi 1063710346 658 AMEFFDD-ARNHGIFP 672
Cdd:PLN03081  445 GWEIFQSmSENHRIKP 460
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
497-546 2.31e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 70.47  E-value: 2.31e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 497 PTVVSYNILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLCGLCR 546
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
64-405 1.77e-14

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 77.22  E-value: 1.77e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346   64 RIVELIRSQECKCDEDVALSVIKTYGKNSMPDQALDVFKRMrEIFGCEPAIRSYNTLLNAFVEAKQWVK----------- 132
Cdd:PLN03218   458 RVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM-VNAGVEANVHTFGALIDGCARAGQVAKafgaygimrsk 536
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  133 ---------------------VESLFAYF-----ETAGVAPNLQTYNVLIKMSCKKKEFEKARGFLDWMWKEGFK--PDV 184
Cdd:PLN03218   537 nvkpdrvvfnalisacgqsgaVDRAFDVLaemkaETHPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKgtPEV 616
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  185 fsYSTVINDLAKAGKLDDALELFDEMSERGVAPDVTCYNILIDgflkekdhktamelwdrlledssvypnVKTHnimisg 264
Cdd:PLN03218   617 --YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVD---------------------------VAGH------ 661
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  265 lskCGRVDDCLKIWERMKQNEREKDLYTYSSLIHGLCDAGNVDKAESVFNELDERKASIDVVTYNTMLGGFCRCGKIKES 344
Cdd:PLN03218   662 ---AGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKA 738
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1063710346  345 LELwrIMEHKNS---VNIVSYNILIKGLLENGKIDEATMIWRLMPAKGYAADKTTYGIFIhGLC 405
Cdd:PLN03218   739 LEV--LSEMKRLglcPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLC 799
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
182-231 1.78e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.77  E-value: 1.78e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 182 PDVFSYSTVINDLAKAGKLDDALELFDEMSERGVAPDVTCYNILIDGFLK 231
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
401-646 4.02e-14

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 76.04  E-value: 4.02e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 401 IHGLCVNGYVNKALGVMQEVESSGGHLDVYAYASIIDcLCKKKRLEEASNLV-----KEMSKHGVELNshvcNALIGGLI 475
Cdd:PLN03077   58 LRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFR-LCEWKRAVEEGSRVcsralSSHPSLGVRLG----NAMLSMFV 132
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 476 RDSRLGEASFFLREMGKNgcrpTVVSYNILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILL--CG----LCRDRK 549
Cdd:PLN03077  133 RFGELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLrtCGgipdLARGRE 208
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 550 IDLalelwhQFLQSGLETDVMMHNILIHGLCSVGKLDDAMTVMANMEHRNCtanlVTYNTLMEGFFKVGDSNRATVIWGY 629
Cdd:PLN03077  209 VHA------HVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDC----ISWNAMISGYFENGECLEGLELFFT 278
                         250
                  ....*....|....*..
gi 1063710346 630 MYKMGLQPDIISYNTIM 646
Cdd:PLN03077  279 MRELSVDPDLMTITSVI 295
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
189-592 8.62e-14

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 74.91  E-value: 8.62e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 189 TVINDLAKAGKLDDALELFdEMSERGVA--PDVTCYNILIDGFLKEKDHKTAMELWDrLLEDSSVYPNVKTHNIMISGLS 266
Cdd:PLN03081   92 SQIEKLVACGRHREALELF-EILEAGCPftLPASTYDALVEACIALKSIRCVKAVYW-HVESSGFEPDQYMMNRVLLMHV 169
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 267 KCGRVDDCLKIWERMKqnerEKDLYTYSSLIHGLCDAGNVDKAESVFNELDERKASIDVVTYNTMLGGFCRCGKIKESLE 346
Cdd:PLN03081  170 KCGMLIDARRLFDEMP----ERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQ 245
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 347 LwrimeHKNSVN--IVSYNILIKGLLEN----GKIDEATMIWRLMPAKGYAAdkttYGIFIHGLCVNGYVNKALGVMQEV 420
Cdd:PLN03081  246 L-----HCCVLKtgVVGDTFVSCALIDMyskcGDIEDARCVFDGMPEKTTVA----WNSMLAGYALHGYSEEALCLYYEM 316
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 421 ESSGGHLDVYAYASIIDCLCKKKRLEEASNLVKEMSKHGVELNSHVCNALIGGLIRDSRLGEASFFLREMGkngcRPTVV 500
Cdd:PLN03081  317 RDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMP----RKNLI 392
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 501 SYNILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLCGlCRDRkiDLALELWHQFL----QSGLETDVMMHNILI 576
Cdd:PLN03081  393 SWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSA-CRYS--GLSEQGWEIFQsmseNHRIKPRAMHYACMI 469
                         410
                  ....*....|....*.
gi 1063710346 577 HGLCSVGKLDDAMTVM 592
Cdd:PLN03081  470 ELLGREGLLDEAYAMI 485
PLN03077 PLN03077
Protein ECB2; Provisional
294-681 9.74e-14

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 74.89  E-value: 9.74e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 294 SSLIHGLCDAGNVDKAESVFNELDERKASIDVVTYNTMLgGFCRCGK-IKESLELW-RIMEHKNSVNIVSYNILIKGLLE 371
Cdd:PLN03077   55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALF-RLCEWKRaVEEGSRVCsRALSSHPSLGVRLGNAMLSMFVR 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 372 NGKIDEATMIWRLMPAKgyaaDKTTYGIFIHGLCVNGYVNKALGVMQEVESSGGHLDVYAYASIIDCLCKKKRLEEASNL 451
Cdd:PLN03077  134 FGELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREV 209
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 452 VKEMSKHGVELNSHVCNALIGGLIRDSRLGEASFFLREMGKNGCrptvVSYNILICGLCKAGKFGEASAFVKEMLENGWK 531
Cdd:PLN03077  210 HAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDC----ISWNAMISGYFENGECLEGLELFFTMRELSVD 285
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 532 PDLKTYS--ILLCGLCRDRKidLALELWHQFLQSGLETDVMMHNILIHGLCSVGKLDDAMTVMANMEHRnctaNLVTYNT 609
Cdd:PLN03077  286 PDLMTITsvISACELLGDER--LGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTA 359
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063710346 610 LMEGFFKVGDSNRATVIWGYMYKMGLQPDIISYNTIMKGlCMCRG-VSYAMEFFDDARNHGIFPTVYTWNILV 681
Cdd:PLN03077  360 MISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSA-CACLGdLDVGVKLHELAERKGLISYVVVANALI 431
PLN03077 PLN03077
Protein ECB2; Provisional
194-672 4.78e-13

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 72.57  E-value: 4.78e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 194 LAKAGKLDDALELFDEMSERGVAPDVTCYNILIDGFLKEKDHKTAMELWDRLLEDSSVYP-------------------- 253
Cdd:PLN03077   61 LCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGvrlgnamlsmfvrfgelvha 140
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 254 ----------NVKTHNIMISGLSKCGRVDDCLKIWERMKQNEREKDLYTYSSLIH---GLCD------------------ 302
Cdd:PLN03077  141 wyvfgkmperDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRtcgGIPDlargrevhahvvrfgfel 220
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 303 --------------AGNVDKAESVFneldERKASIDVVTYNTMLGGFCRCGKIKESLELWRIMEHKN------------- 355
Cdd:PLN03077  221 dvdvvnalitmyvkCGDVVSARLVF----DRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSvdpdlmtitsvis 296
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 356 -----------------------SVNIVSYNILIKGLLENGKIDEATMIWRLMPAKgyaaDKTTYGIFIHGLCVNGYVNK 412
Cdd:PLN03077  297 acellgderlgremhgyvvktgfAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAMISGYEKNGLPDK 372
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 413 ALGVMQEVESSGGHLDVYAYASIIDCLCKKKRLEEASNLVKEMSKHGVELNSHVCNALIGGLIRDSRLGEASfflrEMGK 492
Cdd:PLN03077  373 ALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKAL----EVFH 448
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 493 NGCRPTVVSYNILICGLCKAGKFGEASAFVKEMLENgWKPDLKTYSILLCGLCRDRKIDLALELWHQFLQSGLETDVMMH 572
Cdd:PLN03077  449 NIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLP 527
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 573 NILIHGLCSVGKLDDAMTvmanmEHRNCTANLVTYNTLMEGFFKVGDSNRATVIWGYMYKMGLQPDIISYNTImkgLCMC 652
Cdd:PLN03077  528 NALLDLYVRCGRMNYAWN-----QFNSHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISL---LCAC 599
                         570       580
                  ....*....|....*....|....
gi 1063710346 653 -RG--VSYAMEFFDDAR-NHGIFP 672
Cdd:PLN03077  600 sRSgmVTQGLEYFHSMEeKYSITP 623
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
288-337 9.67e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 60.07  E-value: 9.67e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 288 KDLYTYSSLIHGLCDAGNVDKAESVFNELDERKASIDVVTYNTMLGGFCR 337
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
324-369 9.67e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 60.07  E-value: 9.67e-12
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1063710346 324 DVVTYNTMLGGFCRCGKIKESLELWRIMEHKN-SVNIVSYNILIKGL 369
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGvKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
253-302 1.28e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 59.68  E-value: 1.28e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 253 PNVKTHNIMISGLSKCGRVDDCLKIWERMKQNEREKDLYTYSSLIHGLCD 302
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
603-650 2.52e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 58.91  E-value: 2.52e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063710346 603 NLVTYNTLMEGFFKVGDSNRATVIWGYMYKMGLQPDIISYNTIMKGLC 650
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
358-405 1.82e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 56.60  E-value: 1.82e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063710346 358 NIVSYNILIKGLLENGKIDEATMIWRLMPAKGYAADKTTYGIFIHGLC 405
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
217-267 2.10e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 56.22  E-value: 2.10e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1063710346 217 PDVTCYNILIDGFLKEKDHKTAMELWDRLLEdSSVYPNVKTHNIMISGLSK 267
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKK-RGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
147-196 2.16e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 56.22  E-value: 2.16e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 147 PNLQTYNVLIKMSCKKKEFEKARGFLDWMWKEGFKPDVFSYSTVINDLAK 196
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
463-511 2.68e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 55.83  E-value: 2.68e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063710346 463 NSHVCNALIGGLIRDSRLGEASFFLREMGKNGCRPTVVSYNILICGLCK 511
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
493-525 9.02e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 54.27  E-value: 9.02e-10
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063710346 493 NGCRPTVVSYNILICGLCKAGKFGEASAFVKEM 525
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
178-211 1.74e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 53.12  E-value: 1.74e-09
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063710346 178 EGFKPDVFSYSTVINDLAKAGKLDDALELFDEMS 211
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
637-683 4.91e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 52.37  E-value: 4.91e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1063710346 637 PDIISYNTIMKGLCMCRGVSYAMEFFDDARNHGIFPTVYTWNILVRA 683
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILING 47
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
568-616 5.21e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 52.37  E-value: 5.21e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063710346 568 DVMMHNILIHGLCSVGKLDDAMTVMANMEHRNCTANLVTYNTLMEGFFK 616
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
500-533 8.88e-08

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 48.61  E-value: 8.88e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063710346 500 VSYNILICGLCKAGKFGEASAFVKEMLENGWKPD 533
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PLN03077 PLN03077
Protein ECB2; Provisional
80-281 1.43e-07

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 54.86  E-value: 1.43e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  80 VALSVIKTYGKNSMPDQALDVFKRMRE---------IFGCEPAIRSYNTLLnAFVEAKQWVKVES--------------- 135
Cdd:PLN03077  426 VANALIEMYSKCKCIDKALEVFHNIPEkdviswtsiIAGLRLNNRCFEALI-FFRQMLLTLKPNSvtliaalsacariga 504
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 136 ------LFAYFETAGVAPNLQTYNVLIKMSCKKKEFEKArgfldWMWKEGFKPDVFSYSTVINDLAKAGKLDDALELFDE 209
Cdd:PLN03077  505 lmcgkeIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYA-----WNQFNSHEKDVVSWNILLTGYVAHGKGSMAVELFNR 579
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063710346 210 MSERGVAPDVTCYNILIDGFLKEKDHKTAMELWDRLLEDSSVYPNVKTHNIMISGLSKCGRVDDCLKIWERM 281
Cdd:PLN03077  580 MVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKM 651
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
289-315 1.45e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 47.73  E-value: 1.45e-07
                          10        20
                  ....*....|....*....|....*..
gi 1063710346 289 DLYTYSSLIHGLCDAGNVDKAESVFNE 315
Cdd:pfam12854   6 DVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
594-647 1.61e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 48.51  E-value: 1.61e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1063710346 594 NMEHRNCTANLVTYNTLMEGFFKVGDSNRATVIWGYMYKMGLQPDIISYNTIMK 647
Cdd:pfam13812   5 EMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
172-232 2.06e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 48.12  E-value: 2.06e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1063710346 172 LDWMWKEGFKPDVFSYSTVINDLAKAGKLDDALELFDEMSERGVAPDVTCYNILIDGFLKE 232
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
324-352 2.48e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 47.34  E-value: 2.48e-07
                          10        20
                  ....*....|....*....|....*....
gi 1063710346 324 DVVTYNTMLGGFCRCGKIKESLELWRIME 352
Cdd:pfam12854   6 DVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
532-580 2.67e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 47.36  E-value: 2.67e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063710346 532 PDLKTYSILLCGLCRDRKIDLALELWHQFLQSGLETDVMMHNILIHGLC 580
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
112-161 5.15e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.59  E-value: 5.15e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 112 PAIRSYNTLLNAFVEAKQWVKVESLFAYFETAGVAPNLQTYNVLIKMSCK 161
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
96-226 9.96e-07

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 50.09  E-value: 9.96e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  96 QALDVFKRM-REifGCEPAIRSYNTLLN------AFVEAKQWVKVESLFAYFE---TAGVAPNLQTYNVLIKMSCKKKEF 165
Cdd:pfam17177  29 GALALYDAAkAE--GVRLAQYHYNVLLYlcskaaDATDLKPQLAADRGFEVFEamkAQGVSPNEATYTAVARLAAAKGDG 106
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1063710346 166 EKARGFLDWMWKEGFKPDVFSYSTVINDLAKAGKLDDALELFDEMSERGVAPDVTCYNILI 226
Cdd:pfam17177 107 DLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALL 167
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
451-506 7.39e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.89  E-value: 7.39e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1063710346 451 LVKEMSKHGVELNSHVCNALIGGLIRDSRLGEASFFLREMGKNGCRPTVVSYNILI 506
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
191-326 9.41e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 47.01  E-value: 9.41e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 191 INDLAKAGKLDDALELFDEMSERGVAPDVTCYNILI---------DGFLKEKDHKTAMELWDRLLEDSsVYPNVKTHNIM 261
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLylcskaadaTDLKPQLAADRGFEVFEAMKAQG-VSPNEATYTAV 96
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1063710346 262 ISGLSKCGRVDDCLKIWERMKQNEREKDLYTYSSLIHGLCDAGNVDKAEsvfnELDERKASIDVV 326
Cdd:pfam17177  97 ARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAY----EVEEHMLAHGVE 157
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
185-219 1.24e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.44  E-value: 1.24e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063710346 185 FSYSTVINDLAKAGKLDDALELFDEMSERGVAPDV 219
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
564-596 1.61e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 41.95  E-value: 1.61e-05
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063710346 564 GLETDVMMHNILIHGLCSVGKLDDAMTVMANME 596
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
564-611 1.77e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.73  E-value: 1.77e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063710346 564 GLETDVMMHNILIHGLCSVGKLDDAMTVMANMEHRNCTANLVTYNTLM 611
Cdd:pfam13812  10 GIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
185-215 1.84e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 41.68  E-value: 1.84e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063710346 185 FSYSTVINDLAKAGKLDDALELFDEMSERGV 215
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
291-325 2.49e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 2.49e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063710346 291 YTYSSLIHGLCDAGNVDKAESVFNELDERKASIDV 325
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
500-529 3.68e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.91  E-value: 3.68e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063710346 500 VSYNILICGLCKAGKFGEASAFVKEMLENG 529
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
253-282 4.10e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 40.79  E-value: 4.10e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063710346 253 PNVKTHNIMISGLSKCGRVDDCLKIWERMK 282
Cdd:pfam12854   5 PDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
396-441 5.74e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.81  E-value: 5.74e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1063710346 396 TYGIFIHGLCVNGYVNKALGVMQEVESSGGHLDVYAYASIIDCLCK 441
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
151-184 5.80e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 5.80e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063710346 151 TYNVLIKMSCKKKEFEKARGFLDWMWKEGFKPDV 184
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
424-456 5.84e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 40.41  E-value: 5.84e-05
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063710346 424 GGHLDVYAYASIIDCLCKKKRLEEASNLVKEMS 456
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
430-464 5.86e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 5.86e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063710346 430 YAYASIIDCLCKKKRLEEASNLVKEMSKHGVELNS 464
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
77-247 7.17e-05

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 45.11  E-value: 7.17e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  77 DEDVALSVIKTYGKNSMPDQALDVFKRMREIFGCEPAIrsYNTLLNAFVEAKQWVKVEslfAYFETA-GVAPN-LQTYNV 154
Cdd:COG2956   109 DAEALRLLAEIYEQEGDWEKAIEVLERLLKLGPENAHA--YCELAELYLEQGDYDEAI---EALEKAlKLDPDcARALLL 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 155 LIKMSCKKKEFEKARGfldwMWKEGFKPDVfSYSTVINDLAKA----GKLDDALELFDEMSERgvAPDVTCYNILIDGFL 230
Cdd:COG2956   184 LAELYLEQGDYEEAIA----ALERALEQDP-DYLPALPRLAELyeklGDPEEALELLRKALEL--DPSDDLLLALADLLE 256
                         170
                  ....*....|....*..
gi 1063710346 231 KEKDHKTAMELWDRLLE 247
Cdd:COG2956   257 RKEGLEAALALLERQLR 273
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
94-349 8.35e-05

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 45.11  E-value: 8.35e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  94 PDQALDVFKRMREIfgcEP-AIRSYNTLLNAFVEAKQWvkvESLFAYFETAgVAPNLQTYNVLIK-----MscKKKEFEK 167
Cdd:COG2956    24 PDKAIDLLEEALEL---DPeTVEAHLALGNLYRRRGEY---DRAIRIHQKL-LERDPDRAEALLElaqdyL--KAGLLDR 94
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 168 ARGFLDWMWKEGFKpDVFSYSTVINDLAKAGKLDDALELFDEMSERGvAPDVTCYNILIDGFLKEKDHKTAMELWDRLLE 247
Cdd:COG2956    95 AEELLEKLLELDPD-DAEALRLLAEIYEQEGDWEKAIEVLERLLKLG-PENAHAYCELAELYLEQGDYDEAIEALEKALK 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 248 dssVYPN-VKTHNIMISGLSKCGRVDDCLKIWERMKQNEREkDLYTYSSLIHGLCDAGNVDKAESVFNELDERKASIDVV 326
Cdd:COG2956   173 ---LDPDcARALLLLAELYLEQGDYEEAIAALERALEQDPD-YLPALPRLAELYEKLGDPEEALELLRKALELDPSDDLL 248
                         250       260
                  ....*....|....*....|...
gi 1063710346 327 TYNTMLggFCRCGKIKESLELWR 349
Cdd:COG2956   249 LALADL--LERKEGLEAALALLE 269
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
326-355 1.26e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 39.37  E-value: 1.26e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063710346 326 VTYNTMLGGFCRCGKIKESLELWRIMEHKN 355
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
487-548 1.44e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.42  E-value: 1.44e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063710346 487 LREMGKNGCRPTVVSYNILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLcGLCRDR 548
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVIGGR 63
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
276-335 1.51e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.03  E-value: 1.51e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 276 KIWERMKQNEREKDLYTYSSLIHGLCDAGNVDKAESVFNELDERKASIDVVTYNTMLGGF 335
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
109-158 1.63e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.03  E-value: 1.63e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063710346 109 GCEPAIRSYNTLLNAFVEAKQWVKVESLFAYFETAGVAPNLQTYNVLIKM 158
Cdd:pfam13812  10 GIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGV 59
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
206-263 2.97e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 39.26  E-value: 2.97e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1063710346 206 LFDEMSERGVAPDVTCYNILIDGFLKEKDHKTAMELWDRLLEDsSVYPNVKTHNIMIS 263
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKK-GIKPTLDTYNAILG 58
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
531-559 3.60e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.10  E-value: 3.60e-04
                          10        20
                  ....*....|....*....|....*....
gi 1063710346 531 KPDLKTYSILLCGLCRDRKIDLALELWHQ 559
Cdd:pfam12854   4 KPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
360-393 3.64e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.21  E-value: 3.64e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063710346 360 VSYNILIKGLLENGKIDEATMIWRLMPAKGYAAD 393
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
633-664 3.70e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.10  E-value: 3.70e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063710346 633 MGLQPDIISYNTIMKGLCMCRGVSYAMEFFDD 664
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
291-319 4.16e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 4.16e-04
                          10        20
                  ....*....|....*....|....*....
gi 1063710346 291 YTYSSLIHGLCDAGNVDKAESVFNELDER 319
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEK 29
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
83-124 4.66e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.50  E-value: 4.66e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 1063710346  83 SVIKTYGKNSMPDQALDVFKRMREiFGCEPAIRSYNTLLNAF 124
Cdd:pfam13041   8 TLINGYCKKGKVEEAFKLFNEMKK-RGVKPNVYTYTILINGL 48
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
257-284 5.02e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 5.02e-04
                          10        20
                  ....*....|....*....|....*...
gi 1063710346 257 THNIMISGLSKCGRVDDCLKIWERMKQN 284
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEK 29
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
358-385 7.16e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 37.33  E-value: 7.16e-04
                          10        20
                  ....*....|....*....|....*...
gi 1063710346 358 NIVSYNILIKGLLENGKIDEATMIWRLM 385
Cdd:pfam12854   6 DVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
536-569 8.08e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 8.08e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063710346 536 TYSILLCGLCRDRKIDLALELWHQFLQSGLETDV 569
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
605-639 8.32e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 8.32e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063710346 605 VTYNTLMEGFFKVGDSNRATVIWGYMYKMGLQPDI 639
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
20-173 1.14e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 40.84  E-value: 1.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  20 EKNPRAAFALFDSATRHpGYAHSAVVYHHILRRLS---------ETRMVNHVSRIVELIRSQECKCDEDVALSVIKTYGK 90
Cdd:pfam17177  24 HADATGALALYDAAKAE-GVRLAQYHYNVLLYLCSkaadatdlkPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346  91 NSMPDQALDVFKRMREiFGCEPAIRSYNTLLNAFVEAKQWVKVESLFAYFETAGVAPNLQTYNVLIKMSCKKKEFEKARG 170
Cdd:pfam17177 103 KGDGDLAFDLVKEMEA-AGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYA 181

                  ...
gi 1063710346 171 FLD 173
Cdd:pfam17177 182 YLH 184
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
213-244 1.15e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 36.94  E-value: 1.15e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063710346 213 RGVAPDVTCYNILIDGFLKEKDHKTAMELWDR 244
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
144-246 1.56e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 40.46  E-value: 1.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 144 GVAPNLQTYNVLIKMSCK-------KKEFEKARGFLDW--MWKEGFKPDVFSYSTVINDLAKAGKLDDALELFDEMSERG 214
Cdd:pfam17177  41 GVRLAQYHYNVLLYLCSKaadatdlKPQLAADRGFEVFeaMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAG 120
                          90       100       110
                  ....*....|....*....|....*....|..
gi 1063710346 215 VAPDVTCYNILIDGFLKEKDHKTAMELWDRLL 246
Cdd:pfam17177 121 VSPRLRSYSPALHAYCEAGDADKAYEVEEHML 152
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
326-355 1.62e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.28  E-value: 1.62e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063710346 326 VTYNTMLGGFCRCGKIKESLELWRIMEHKN 355
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERG 30
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
630-687 2.64e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 36.57  E-value: 2.64e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1063710346 630 MYKMGLQPDIISYNTIMKGLCMCRGVSYAMEFFDDARNHGIFPTVYTWNILVRAVVNR 687
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
360-389 2.67e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.90  E-value: 2.67e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063710346 360 VSYNILIKGLLENGKIDEATMIWRLMPAKG 389
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
257-284 3.05e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 3.05e-03
                          10        20
                  ....*....|....*....|....*...
gi 1063710346 257 THNIMISGLSKCGRVDDCLKIWERMKQN 284
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKER 29
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
598-630 3.66e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.40  E-value: 3.66e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063710346 598 RNCTANLVTYNTLMEGFFKVGDSNRATVIWGYM 630
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
156-316 3.82e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 39.30  E-value: 3.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 156 IKMSCKKKEFEKARGFLDWMWKEGFKPDVFSYSTVINDLAKAG---------KLDDALELFDEMSERGVAPDVTCYNILI 226
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLYLCSKAAdatdlkpqlAADRGFEVFEAMKAQGVSPNEATYTAVA 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 227 DGFLKEKDHKTAMELWDRLlEDSSVYPNVKTHNIMISGLSKCGRVDDCLKIWERMKQNEREKDLYTYSSLIHGLCDAGNV 306
Cdd:pfam17177  98 RLAAAKGDGDLAFDLVKEM-EAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRA 176
                         170
                  ....*....|
gi 1063710346 307 DKAESVFNEL 316
Cdd:pfam17177 177 DKVYAYLHRL 186
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
441-566 4.41e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 39.30  E-value: 4.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063710346 441 KKKRLEEASNLVKEMSKHGVELNSHVCNALI-----GGLIRDSRLGEASFFLRE----MGKNGCRPTVVSYNILICGLCK 511
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLylcskAADATDLKPQLAADRGFEvfeaMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1063710346 512 AGKFGEASAFVKEMLENGWKPDLKTYSILLCGLCRDRKIDLALELWHQFLQSGLE 566
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVE 157
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
430-460 5.53e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 5.53e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063710346 430 YAYASIIDCLCKKKRLEEASNLVKEMSKHGV 460
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
570-603 6.77e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 34.74  E-value: 6.77e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063710346 570 MMHNILIHGLCSVGKLDDAMTVMANMEHRNCTAN 603
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
221-255 7.85e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 34.35  E-value: 7.85e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063710346 221 CYNILIDGFLKEKDHKTAMELWDRLLEdSSVYPNV 255
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKE-RGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
388-419 7.98e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 34.63  E-value: 7.98e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063710346 388 KGYAADKTTYGIFIHGLCVNGYVNKALGVMQE 419
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH