Conserved domains on [gi|347954490|gb|AEP33745|]

chloroplast biogenesis 19, partial [Arabis hirsuta]

Protein Classification

pentatricopeptide repeat-containing protein (domain architecture ID 1000585)

A pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner.

List of domain hits

Name Accession Description Interval E-value
PLN03077 super family cl33629
Protein ECB2; Provisional
2-420 1.52e-96

Protein ECB2; Provisional


The actual alignment was detected with superfamily member PLN03077:

Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 308.32  E-value: 1.52e-96
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490   2 FSGMRLAGVEPNHITFIALLSGWGDLlsGSEALGDLLHGYACKLGLDrAHVMVGTAILGMYSKRGRFRKARLVFDYMEEK 81
Cdd:PLN03077 276 FFTMRELSVDPDLMTITSVISACELL--GDERLGREMHGYVVKTGFA-VDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK 352
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  82 NSVTWNTMIDGYMRNG---------------------------------------------------------------- 97
Cdd:PLN03077 353 DAVSWTAMISGYEKNGlpdkaletyalmeqdnvspdeitiasvlsacaclgdldvgvklhelaerkglisyvvvanalie 432
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  98 ------QVDDAVKLFDEMPERDLISWTAMINGFVKKGFHEEALAWFREMQISgVKPDYVAIIAALAACTNLGALSFGLWI 171
Cdd:PLN03077 433 myskckCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEI 511
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 172 HRYVVSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKrTVVSWNSVIVGFAANGHAHESLVYFRKMQEEGFKPNAV 251
Cdd:PLN03077 512 HAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFNSHEK-DVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEV 590
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 252 TFTGALAACNHVGLVEEGLRYFQSMKRDYRISPRIEHYGCLVDLYSRAGRLEDALKVVQSMPMKPNEVVIGSLLAACRTQ 331
Cdd:PLN03077 591 TFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIH 670
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 332 gNNTVLAERVMKHLSDLNVKSHSNYVILSNMYAADGKWEGASKMRRKMKGLGLKKEPGFSSIEIDDCTHVFMAGDNTHVE 411
Cdd:PLN03077 671 -RHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKVHAFLTDDESHPQ 749

                 ....*....
gi 347954490 412 TTNIREVLE 420
Cdd:PLN03077 750 IKEINTVLE 758
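Each hit on this page carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal sketch of how such a line could be parsed into typed fields. The regular expression and the `parse_summary` helper are assumptions made for illustration, based only on the layout seen in this report; they are not part of any NCBI tool.

```python
import re

# Pattern matching the summary line layout seen in this report (assumed, not an NCBI spec).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "flag": m.group("flag"),  # e.g. "Multi-domain"; None when absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


hit = parse_summary(
    "Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 308.32  E-value: 1.52e-96"
)
print(hit)
```

Converting the E-value to a float makes it straightforward to sort or filter hits by significance.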
 
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
81-126 5.03e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 65.85  E-value: 5.03e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 347954490   81 KNSVTWNTMIDGYMRNGQVDDAVKLFDEMPER----DLISWTAMINGFVK 126
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGLCK 50
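The paired rows above can be compared column by column. The following is a rough sketch of a percent-identity calculation over the aligned columns, assuming that `-` marks a gap and that columns containing a gap in either row are skipped. The function name `alignment_identity` is hypothetical, and case is ignored as a simplification.

```python
def alignment_identity(query, subject, gap_chars="-"):
    """Fraction of identical residues over columns where neither row has a gap."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    matches = 0
    aligned = 0
    for q, s in zip(query.upper(), subject.upper()):
        if q in gap_chars or s in gap_chars:
            continue  # skip gapped columns
        aligned += 1
        if q == s:
            matches += 1
    return matches / aligned if aligned else 0.0


# Rows copied from the PPR_2 (pfam13041) alignment for residues 81-126.
query_row = "KNSVTWNTMIDGYMRNGQVDDAVKLFDEMPER----DLISWTAMINGFVK"
model_row = "PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGLCK"
identity = alignment_identity(query_row, model_row)
print(f"identity over aligned columns: {identity:.1%}")
```

Identity is only one view of the match; the Bit Score and E-value reported with each hit come from the profile scoring, which this sketch does not reproduce.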
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
84-114 4.69e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.91  E-value: 4.69e-07
                          10        20        30
                  ....*....|....*....|....*....|.
gi 347954490   84 VTWNTMIDGYMRNGQVDDAVKLFDEMPERDL 114
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGI 31
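The TIGR00756 model is 35 positions long (Cd Length: 35), and the alignment above spans model positions 1 to 31. A small sketch of how the covered fraction of a model could be computed follows; `model_coverage` is a hypothetical helper, and the inputs are read directly off the alignment rows in this report.

```python
def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the aligned region (1-based, inclusive)."""
    if not (1 <= model_start <= model_end <= cd_length):
        raise ValueError("expected 1 <= model_start <= model_end <= cd_length")
    return (model_end - model_start + 1) / cd_length


# TIGR00756: model positions 1-31 of 35.
ppr_cov = model_coverage(1, 31, 35)
# PLN03077: model positions 276-758 of 857.
ecb2_cov = model_coverage(276, 758, 857)
print(f"TIGR00756 coverage: {ppr_cov:.2f}")
print(f"PLN03077 coverage:  {ecb2_cov:.2f}")
```

A low coverage value for a long multi-domain model such as PLN03077 is consistent with the query being a partial sequence.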
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
62-312 1.41e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 43.56  E-value: 1.41e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  62 YSKRGRFRKARLVFDYMEEKNS---VTWNTMIDGYMRNGQVDDAVKLFDEMPERDLISWTAMIN-G--FVKKGFHEEALA 135
Cdd:COG2956   18 YLLNGQPDKAIDLLEEALELDPetvEAHLALGNLYRRRGEYDRAIRIHQKLLERDPDRAEALLElAqdYLKAGLLDRAEE 97
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 136 WFRemQISGVKPDYVAIIAALAACtnlgalsfglwihrYVVSQDFKNNVRVSNSLIDL-------YCRCGCVEFARQVFD 208
Cdd:COG2956   98 LLE--KLLELDPDDAEALRLLAEI--------------YEQEGDWEKAIEVLERLLKLgpenahaYCELAELYLEQGDYD 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 209 KMEK--RTVVSWNSVIVG--------FAANGHAHESLVYFRKMQEEgfKPNAVTFTGALAAC-NHVGLVEEGLRYFQSMk 277
Cdd:COG2956  162 EAIEalEKALKLDPDCARallllaelYLEQGDYEEAIAALERALEQ--DPDYLPALPRLAELyEKLGDPEEALELLRKA- 238
                        250       260       270
                 ....*....|....*....|....*....|....*
gi 347954490 278 rdYRISPRIEHYGCLVDLYSRAGRLEDALKVVQSM 312
Cdd:COG2956  239 --LELDPSDDLLLALADLLERKEGLEAALALLERQ 271
 
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
58-422 4.16e-76

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 250.56  E-value: 4.16e-76
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  58 ILGMYSKRGRFRKARLVFDYMEEKNSVTWNTMIDG--------------------------------------------- 92
Cdd:PLN03081 164 VLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGlvdagnyreafalfremwedgsdaeprtfvvmlrasaglgsarag 243
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  93 -------------------------YMRNGQVDDAVKLFDEMPERDLISWTAMINGFVKKGFHEEALAWFREMQISGVKP 147
Cdd:PLN03081 244 qqlhccvlktgvvgdtfvscalidmYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSI 323
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 148 DYVAIIAALAACTNLGALSFGLWIHRYVVSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKRTVVSWNSVIVGFAA 227
Cdd:PLN03081 324 DQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGN 403
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 228 NGHAHESLVYFRKMQEEGFKPNAVTFTGALAACNHVGLVEEGLRYFQSMKRDYRISPRIEHYGCLVDLYSRAGRLEDALK 307
Cdd:PLN03081 404 HGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSENHRIKPRAMHYACMIELLGREGLLDEAYA 483
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 308 VVQSMPMKPNEVVIGSLLAACRTQGnNTVLAERVMKHLSDLNVKSHSNYVILSNMYAADGKWEGASKMRRKMKGLGLKKE 387
Cdd:PLN03081 484 MIRRAPFKPTVNMWAALLTACRIHK-NLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMH 562
                        410       420       430
                 ....*....|....*....|....*....|....*.
gi 347954490 388 PGFSSIEIDDCTHVFMAGDNTHVETTNI-REVLELL 422
Cdd:PLN03081 563 PACTWIEVKKQDHSFFSGDRLHPQSREIyQKLDELM 598
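Long alignments such as the one above are wrapped into blocks of up to 80 columns, each with a ruler, a query row, and a model row. The following sketch stitches the wrapped rows back into one sequence per label and checks that the residue count agrees with the reported coordinates. The row pattern and the helpers `stitch_alignment` and `residue_count_ok` are assumptions based on the layout of this page only.

```python
import re

# Row layout assumed from this report: label, start, sequence, end.
ROW_RE = re.compile(
    r"^(?P<label>gi\s+\d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)\s*$"
)


def stitch_alignment(lines):
    """Concatenate wrapped alignment rows into one entry per label."""
    rows = {}
    for line in lines:
        m = ROW_RE.match(line.strip())
        if m is None:
            continue  # ruler lines and blank lines
        label = " ".join(m.group("label").split())
        start, end = int(m.group("start")), int(m.group("end"))
        entry = rows.setdefault(label, {"start": start, "end": end, "seq": ""})
        entry["end"] = end  # the last block seen sets the final coordinate
        entry["seq"] += m.group("seq")
    return rows


def residue_count_ok(entry, gap_chars="-"):
    """True if the number of non-gap residues equals end - start + 1."""
    residues = sum(1 for ch in entry["seq"] if ch not in gap_chars)
    return residues == entry["end"] - entry["start"] + 1


# Final block of the PLN03081 alignment, copied from this report.
block = [
    "                        410       420       430",
    "                 ....*....|....*....|....*....|....*.",
    "gi 347954490 388 PGFSSIEIDDCTHVFMAGDNTHVETTNI-REVLELL 422",
    "Cdd:PLN03081 563 PACTWIEVKKQDHSFFSGDRLHPQSREIyQKLDELM 598",
]
rows = stitch_alignment(block)
for label, entry in rows.items():
    print(label, entry["start"], entry["end"], residue_count_ok(entry))
```

The coordinate check is a cheap way to detect rows that were truncated or merged when the page was copied.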
PLN03077 PLN03077
Protein ECB2; Provisional
65-329 4.08e-42

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 158.86  E-value: 4.08e-42
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  65 RGRFRKARLV-FDYMEEKNSVtwNTMIDGYMRNGQVDDAVKLFDEMPERDLISWTAMINGFVKKGFHEEALAWFREMQIS 143
Cdd:PLN03077 205 RGREVHAHVVrFGFELDVDVV--NALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMREL 282
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 144 GVKPDYVAIIAALAACTNLGALSFGLWIHRYVVSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKRTVVSWNSVIV 223
Cdd:PLN03077 283 SVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMIS 362
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 224 GFAANGHAHESLVYFRKMQEEGFKPNAVTFTGALAACNHVGLVEEGLRYFQSMKRDYRISPRIEHyGCLVDLYSRAGRLE 303
Cdd:PLN03077 363 GYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVA-NALIEMYSKCKCID 441
                        250       260
                 ....*....|....*....|....*.
gi 347954490 304 DALKVVQSMPMKpNEVVIGSLLAACR 329
Cdd:PLN03077 442 KALEVFHNIPEK-DVISWTSIIAGLR 466
PLN03077 PLN03077
Protein ECB2; Provisional
87-364 6.86e-41

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 155.39  E-value: 6.86e-41
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  87 NTMIDGYMRNGQVDDAVKLFDEMPERDLISWTAMINGFVKKGFHEEALAWFREMQISGVKPDYVAIIAALAACTNLGALS 166
Cdd:PLN03077 125 NAMLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLA 204
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 167 FGLWIHRYVVSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKRTVVSWNSVIVGFAANGHAHESLVYFRKMQEEGF 246
Cdd:PLN03077 205 RGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSV 284
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 247 KPNAVTFTGALAACNHVGlvEEGL-RYFQS--MKRDYRISprIEHYGCLVDLYSRAGRLEDALKVVQSMPMK-------- 315
Cdd:PLN03077 285 DPDLMTITSVISACELLG--DERLgREMHGyvVKTGFAVD--VSVCNSLIQMYLSLGSWGEAEKVFSRMETKdavswtam 360
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 347954490 316 --------------------------PNEVVIGSLLAACRTQGNntvLAERVMKHLSDLNvKSHSNYVILSN----MYA 364
Cdd:PLN03077 361 isgyeknglpdkaletyalmeqdnvsPDEITIASVLSACACLGD---LDVGVKLHELAER-KGLISYVVVANalieMYS 435
PLN03077 PLN03077
Protein ECB2; Provisional
118-379 6.69e-15

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 77.20  E-value: 6.69e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 118 TAMINGFVKKGFHEEALAWFREMQISGVKPDYVAIIAALAACTNLGALSFGLWIHRYVVSQDFKNNVRVSNSLIDLYCRC 197
Cdd:PLN03077  55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRF 134
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 198 GCVEFARQVFDKMEKRTVVSWNSVIVGFAANGHAHESLVYFRKMQEEGFKPNAVTFTGALAACNHVGLVEEGLRYFQSMK 277
Cdd:PLN03077 135 GELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVV 214
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 278 RdYRISPRIEHYGCLVDLYSRAGRLEDALKVVQSMPMK----------------------------------PNEVVIGS 323
Cdd:PLN03077 215 R-FGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRdciswnamisgyfengecleglelfftmrelsvdPDLMTITS 293
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 347954490 324 LLAACRTQGNntvlaERVMKHLSDLNVKSH-----SNYVILSNMYAADGKWEGASKMRRKM 379
Cdd:PLN03077 294 VISACELLGD-----ERLGREMHGYVVKTGfavdvSVCNSLIQMYLSLGSWGEAEKVFSRM 349
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
333-394 2.20e-11

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 59.09  E-value: 2.20e-11
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 347954490  333 NNTVLAERVMKHLSDLNVKSHSNYVILSNMYAADGKWEGASKMRRKMKGLGLKKEPGFSSIE 394
Cdd:pfam20431   2 SNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
PLN03218 PLN03218
maturation of RBCL 1; Provisional
2-295 3.21e-11

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 65.67  E-value: 3.21e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490    2 FSGMRLAGVEPNHITFIALLSGWGDLLSGSEALGdllhGYAC----KLGLDRahvMVGTAILGMYSKRGRFRKArlvFDY 77
Cdd:PLN03218  495 FHEMVNAGVEANVHTFGALIDGCARAGQVAKAFG----AYGImrskNVKPDR---VVFNALISACGQSGAVDRA---FDV 564
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490   78 MEEKNS---------VTWNTMIDGYMRNGQVDDAVKLFDEMPERDLIS----WTAMINGFVKKGFHEEALAWFREMQISG 144
Cdd:PLN03218  565 LAEMKAethpidpdhITVGALMKACANAGQVDRAKEVYQMIHEYNIKGtpevYTIAVNSCSQKGDWDFALSIYDDMKKKG 644
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  145 VKPDYVAIIAALAACTNLGALSFGLWIHRYVVSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFDKME----KRTVVSWNS 220
Cdd:PLN03218  645 VKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKsiklRPTVSTMNA 724
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 347954490  221 VIVGFAANGHAHESLVYFRKMQEEGFKPNAVTFTGALAACNHVGLVEEGLRYFQSMKRDyRISPRIEHYGCLVDL 295
Cdd:PLN03218  725 LITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKED-GIKPNLVMCRCITGL 798
PLN03218 PLN03218
maturation of RBCL 1; Provisional
66-333 3.30e-11

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 65.67  E-value: 3.30e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490   66 GRFRKARLVFDYMEEKNSVTWNTMIDGYMRNGQVDDAVKLFDEMP----ERDLISWTAMINGFVKKGFHEEALAWFREMQ 141
Cdd:PLN03218  455 GALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVnagvEANVHTFGALIDGCARAGQVAKAFGAYGIMR 534
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  142 ISGVKPDYVAIIAALAACTNLGAL--SFGLWIHRYVVSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFdKMekrtvvswn 219
Cdd:PLN03218  535 SKNVKPDRVVFNALISACGQSGAVdrAFDVLAEMKAETHPIDPDHITVGALMKACANAGQVDRAKEVY-QM--------- 604
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  220 svivgfaanghaheslvyfrkMQEEGFKPNAVTFTGALAACNHVGLVEEGLRYFQSMKRDyRISPRIEHYGCLVDLYSRA 299
Cdd:PLN03218  605 ---------------------IHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKK-GVKPDEVFFSALVDVAGHA 662
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 347954490  300 GRLEDALKVVQSMP---MKPNEVVIGSLLAACRTQGN 333
Cdd:PLN03218  663 GDLDKAFEILQDARkqgIKLGTVSYSSLMGACSNAKN 699
PLN03218 PLN03218
maturation of RBCL 1; Provisional
76-404 1.02e-09

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 60.66  E-value: 1.02e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490   76 DYMEEKNSVTWNTM-----------IDGY---MRNGQVDDAVKLFDEMPERDL-----ISWTAMINGFVKKGFHEEALAW 136
Cdd:PLN03218  349 SDVEEENSLAAYNGgvsgkrkspeyIDAYnrlLRDGRIKDCIDLLEDMEKRGLldmdkIYHAKFFKACKKQRAVKEAFRF 428
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  137 FREMQisgvKPDYVAIIAALAACTNLGALSFGLWIHRYVVSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFDKME----K 212
Cdd:PLN03218  429 AKLIR----NPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVnagvE 504
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  213 RTVVSWNSVIVGFAANGHAHESLVYFRKMQEEGFKPNAVTFTGALAACNHVGLVEEGLRYFQSMKrdyrispriehygcl 292
Cdd:PLN03218  505 ANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMK--------------- 569
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  293 vdlysragrledalkvVQSMPMKPNEVVIGSLLAACrTQGNNTVLAERVMKHLSDLNVK-SHSNYVILSNMYAADGKWEG 371
Cdd:PLN03218  570 ----------------AETHPIDPDHITVGALMKAC-ANAGQVDRAKEVYQMIHEYNIKgTPEVYTIAVNSCSQKGDWDF 632
                         330       340       350
                  ....*....|....*....|....*....|...
gi 347954490  372 ASKMRRKMKGLGLKKEPGFSSIEIDDCTHVFMA 404
Cdd:PLN03218  633 ALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDL 665
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
82-109 1.15e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 53.12  E-value: 1.15e-09
                          10        20
                  ....*....|....*....|....*...
gi 347954490   82 NSVTWNTMIDGYMRNGQVDDAVKLFDEM 109
Cdd:pfam12854   6 DVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
84-114 2.29e-08

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 49.39  E-value: 2.29e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 347954490   84 VTWNTMIDGYMRNGQVDDAVKLFDEMPERDL 114
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
112-157 1.22e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 47.74  E-value: 1.22e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 347954490  112 RDLISWTAMINGFVKKGFHEEALAWFREMQISGVKPD---YVAIIAALA 157
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNvytYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
213-254 1.31e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 45.05  E-value: 1.31e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 347954490  213 RTVVSWNSVIVGFAANGHAHESLVYFRKMQEEGFKPNAVTFT 254
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYT 42
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
115-148 1.93e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.98  E-value: 1.93e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 347954490  115 ISWTAMINGFVKKGFHEEALAWFREMQISGVKPD 148
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
115-145 1.95e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 43.99  E-value: 1.95e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 347954490  115 ISWTAMINGFVKKGFHEEALAWFREMQISGV 145
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
56-95 8.44e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 42.74  E-value: 8.44e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 347954490   56 TAILGMYSKRGRFRKARLVFDYMEEK----NSVTWNTMIDGYMR 95
Cdd:pfam13041   7 NTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGLCK 50
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
62-312 1.41e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 43.56  E-value: 1.41e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  62 YSKRGRFRKARLVFDYMEEKNS---VTWNTMIDGYMRNGQVDDAVKLFDEMPERDLISWTAMIN-G--FVKKGFHEEALA 135
Cdd:COG2956   18 YLLNGQPDKAIDLLEEALELDPetvEAHLALGNLYRRRGEYDRAIRIHQKLLERDPDRAEALLElAqdYLKAGLLDRAEE 97
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 136 WFRemQISGVKPDYVAIIAALAACtnlgalsfglwihrYVVSQDFKNNVRVSNSLIDL-------YCRCGCVEFARQVFD 208
Cdd:COG2956   98 LLE--KLLELDPDDAEALRLLAEI--------------YEQEGDWEKAIEVLERLLKLgpenahaYCELAELYLEQGDYD 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 209 KMEK--RTVVSWNSVIVG--------FAANGHAHESLVYFRKMQEEgfKPNAVTFTGALAAC-NHVGLVEEGLRYFQSMk 277
Cdd:COG2956  162 EAIEalEKALKLDPDCARallllaelYLEQGDYEEAIAALERALEQ--DPDYLPALPRLAELyEKLGDPEEALELLRKA- 238
                        250       260       270
                 ....*....|....*....|....*....|....*
gi 347954490 278 rdYRISPRIEHYGCLVDLYSRAGRLEDALKVVQSM 312
Cdd:COG2956  239 --LELDPSDDLLLALADLLERKEGLEAALALLERQ 271
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
188-226 2.99e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.50  E-value: 2.99e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 347954490  188 NSLIDLYCRCGCVEFARQVFDKMEKR----TVVSWNSVIVGFA 226
Cdd:pfam13041   7 NTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGLC 49
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
216-250 3.86e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.82  E-value: 3.86e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 347954490  216 VSWNSVIVGFAANGHAHESLVYFRKMQEEGFKPNA 250
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
238-296 4.58e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in the N-terminal region of protein-only RNase P (PRORP) enzymes form the pentatricopeptide repeat (PPR) domain, which enhances pre-tRNA binding affinity. PRORP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.11  E-value: 4.58e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 347954490  238 FRKMQEEGFKPNAVTFTGALAACNHVGLVEEGLRYFQSMKRDyRISPRIEHYGCLVDLY 296
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKK-GIKPTLDTYNAILGVI 60
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
188-213 6.36e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 6.36e-04
                          10        20
                  ....*....|....*....|....*.
gi 347954490  188 NSLIDLYCRCGCVEFARQVFDKMEKR 213
Cdd:pfam01535   4 NSLISGYCKNGKLEEALELFKEMKEK 29
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
61-213 6.89e-04

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 41.15  E-value: 6.89e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490  61 MYSKRGRFRKARLVFDY---MEEKNSVTWNTMIDGYMRNGQVDDAVKLFD---EMPERDLISWTAMINGFVKKGFHEEAL 134
Cdd:COG0457   17 AYRRLGRYEEAIEDYEKaleLDPDDAEALYNLGLAYLRLGRYEEALADYEqalELDPDDAEALNNLGLALQALGRYEEAL 96
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 347954490 135 AWFRemQISGVKPDYVAIIAALA-ACTNLGALSFGLWIHRYVVSQDfKNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKR 213
Cdd:COG0457   97 EDYD--KALELDPDDAEALYNLGlALLELGRYDEAIEAYERALELD-PDDADALYNLGIALEKLGRYEEALELLEKLEAA 173
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
216-246 1.47e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.90  E-value: 1.47e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 347954490  216 VSWNSVIVGFAANGHAHESLVYFRKMQEEGF 246
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
188-213 2.08e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 2.08e-03
                          10        20
                  ....*....|....*....|....*.
gi 347954490  188 NSLIDLYCRCGCVEFARQVFDKMEKR 213
Cdd:TIGR00756   4 NTLIDGLCKAGRVEEALELFKEMKER 29
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
183-211 4.69e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 34.63  E-value: 4.69e-03
                          10        20
                  ....*....|....*....|....*....
gi 347954490  183 NVRVSNSLIDLYCRCGCVEFARQVFDKME 211
Cdd:pfam12854   6 DVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
206-260 8.36e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in the N-terminal region of protein-only RNase P (PRORP) enzymes form the pentatricopeptide repeat (PPR) domain, which enhances pre-tRNA binding affinity. PRORP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 34.64  E-value: 8.36e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 347954490  206 VFDKMEKR----TVVSWNSVIVGFAANGHAHESLVYFRKMQEEGFKPNAVTFTGALAAC 260
Cdd:pfam13812   2 ILREMVRDgiqlNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
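
The E-value threshold in the search parameters determines which domain hits appear in the list above. As a rough illustration only, the following Python sketch parses the per-hit "Pssm-ID" summary lines as they appear in this text listing and keeps the hits at or below a chosen threshold. The function names `parse_hit` and `hits_below_threshold` are hypothetical, the line layout is assumed from this page rather than from any documented export format, and treating the cutoff as inclusive is an assumption.

```python
import re

# Pattern for the per-hit summary lines as they appear in this listing.
# The layout is inferred from the page text, not from an official format.
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s+\[(?P<kind>[^\]]+)\]\s+"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = HIT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def hits_below_threshold(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (assumed inclusive)."""
    hits = [h for h in map(parse_hit, lines) if h is not None]
    return [h for h in hits if h["e_value"] <= threshold]


# Two summary lines copied from the hit list above.
sample = [
    "Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 45.05  E-value: 1.31e-06",
    "Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 34.64  E-value: 8.36e-03",
]

for hit in hits_below_threshold(sample):
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Passing a stricter value, for example `hits_below_threshold(sample, threshold=1e-4)`, would keep only the stronger of the two sample hits.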

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.