NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|15241651|ref|NP_196468|]
View 

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1000585)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03077 super family cl33629
Protein ECB2; Provisional
5-494 6.75e-114

Protein ECB2; Provisional


The actual alignment was detected with superfamily member PLN03077:

Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 356.47  E-value: 6.75e-114
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651    5 KQLHAHCLRTGVDETKDLLQRLLLI----PNLVYARKLFDHHQNSCTFLYNKLIQAYYVHHQPHESIVLYNLLSFDGLRP 80
Cdd:PLN03077 207 REVHAHVVRFGFELDVDVVNALITMyvkcGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDP 286
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651   81 SHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLGALCCARRVFDEMSKRDVPVWNAMITGYQR 160
Cdd:PLN03077 287 DLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEK 366
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  161 RG---------------------------------------------------------------DMKA-------AMEL 170
Cdd:PLN03077 367 NGlpdkaletyalmeqdnvspdeitiasvlsacaclgdldvgvklhelaerkglisyvvvanaliEMYSkckcidkALEV 446
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  171 FDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCMEKDksVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNI 250
Cdd:PLN03077 447 FHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT--LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDG 524
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  251 YVCNATIEMYSKCGMIDVAKRLFEElgNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACVHG 330
Cdd:PLN03077 525 FLPNALLDLYVRCGRMNYAWNQFNS--HEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRS 602
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  331 GMVVKGQELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASEA 410
Cdd:PLN03077 603 GMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQH 682
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  411 LFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYfVEVGVDVHKFTVEDKSHPRSYEIYQVLEEIF 490
Cdd:PLN03077 683 IFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSW-VEVKGKVHAFLTDDESHPQIKEINTVLEGFY 761

                 ....
gi 15241651  491 RRMK 494
Cdd:PLN03077 762 EKMK 765
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
5-494 6.75e-114

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 356.47  E-value: 6.75e-114
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651    5 KQLHAHCLRTGVDETKDLLQRLLLI----PNLVYARKLFDHHQNSCTFLYNKLIQAYYVHHQPHESIVLYNLLSFDGLRP 80
Cdd:PLN03077 207 REVHAHVVRFGFELDVDVVNALITMyvkcGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDP 286
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651   81 SHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLGALCCARRVFDEMSKRDVPVWNAMITGYQR 160
Cdd:PLN03077 287 DLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEK 366
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  161 RG---------------------------------------------------------------DMKA-------AMEL 170
Cdd:PLN03077 367 NGlpdkaletyalmeqdnvspdeitiasvlsacaclgdldvgvklhelaerkglisyvvvanaliEMYSkckcidkALEV 446
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  171 FDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCMEKDksVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNI 250
Cdd:PLN03077 447 FHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT--LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDG 524
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  251 YVCNATIEMYSKCGMIDVAKRLFEElgNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACVHG 330
Cdd:PLN03077 525 FLPNALLDLYVRCGRMNYAWNQFNS--HEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRS 602
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  331 GMVVKGQELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASEA 410
Cdd:PLN03077 603 GMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQH 682
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  411 LFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYfVEVGVDVHKFTVEDKSHPRSYEIYQVLEEIF 490
Cdd:PLN03077 683 IFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSW-VEVKGKVHAFLTDDESHPQIKEINTVLEGFY 761

                 ....
gi 15241651  491 RRMK 494
Cdd:PLN03077 762 EKMK 765
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
398-458 1.43e-14

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 68.34  E-value: 1.43e-14
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 15241651   398 HGNVEIAEIASEALFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSY 458
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSW 61
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
283-317 1.88e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 1.88e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15241651   283 CSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDA 317
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
5-494 6.75e-114

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 356.47  E-value: 6.75e-114
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651    5 KQLHAHCLRTGVDETKDLLQRLLLI----PNLVYARKLFDHHQNSCTFLYNKLIQAYYVHHQPHESIVLYNLLSFDGLRP 80
Cdd:PLN03077 207 REVHAHVVRFGFELDVDVVNALITMyvkcGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDP 286
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651   81 SHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLGALCCARRVFDEMSKRDVPVWNAMITGYQR 160
Cdd:PLN03077 287 DLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEK 366
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  161 RG---------------------------------------------------------------DMKA-------AMEL 170
Cdd:PLN03077 367 NGlpdkaletyalmeqdnvspdeitiasvlsacaclgdldvgvklhelaerkglisyvvvanaliEMYSkckcidkALEV 446
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  171 FDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCMEKDksVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNI 250
Cdd:PLN03077 447 FHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT--LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDG 524
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  251 YVCNATIEMYSKCGMIDVAKRLFEElgNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACVHG 330
Cdd:PLN03077 525 FLPNALLDLYVRCGRMNYAWNQFNS--HEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRS 602
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  331 GMVVKGQELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASEA 410
Cdd:PLN03077 603 GMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQH 682
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  411 LFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYfVEVGVDVHKFTVEDKSHPRSYEIYQVLEEIF 490
Cdd:PLN03077 683 IFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSW-VEVKGKVHAFLTDDESHPQIKEINTVLEGFY 761

                 ....
gi 15241651  491 RRMK 494
Cdd:PLN03077 762 EKMK 765
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
3-494 1.40e-83

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 272.90  E-value: 1.40e-83
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651    3 GIKQLHAHCLRTGVDETKDLLQRLLLIP----NLVYARKLFDHHQNSCTFLYNKLIQAYYVHHQPHESIVLYNLLSFDGL 78
Cdd:PLN03081 141 CVKAVYWHVESSGFEPDQYMMNRVLLMHvkcgMLIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGS 220
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651   79 RPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLGALCCARRVFDemskrdvpvwnamitgy 158
Cdd:PLN03081 221 DAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFD----------------- 283
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  159 qrrgdmkaamelfdSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCMeKDKSVKPNHITVVSVLPACANLGELEIGRRLE 238
Cdd:PLN03081 284 --------------GMPEKTTVAWNSMLAGYALHGYSEEALCLYYEM-RDSGVSIDQFTFSIMIRIFSRLALLEHAKQAH 348
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  239 GYARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELgNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAV 318
Cdd:PLN03081 349 AGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRM-PRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHV 427
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  319 TFVGLLLACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFH 398
Cdd:PLN03081 428 TFLAVLSACRYSGLSEQGWEIFQSMSENHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACRIH 507
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  399 GNVEIAEIASEALFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYfVEVGVDVHKFTVEDKSHPR 478
Cdd:PLN03081 508 KNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTW-IEVKKQDHSFFSGDRLHPQ 586
                        490
                 ....*....|....*.
gi 15241651  479 SYEIYQVLEEIFRRMK 494
Cdd:PLN03081 587 SREIYQKLDELMKEIS 602
PLN03077 PLN03077
Protein ECB2; Provisional
126-430 1.96e-26

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 113.79  E-value: 1.96e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  126 AYAKLGALCCARRVFDE-----------MSKRDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVISGFSQNGN 194
Cdd:PLN03077  88 AYVALFRLCEWKRAVEEgsrvcsralssHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVGGYAKAGY 167
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  195 YSEALKMFLCMEKdKSVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFE 274
Cdd:PLN03077 168 FDEALCLYHRMLW-AGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFD 246
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  275 ELGnQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACvhgGMVvkGQELFKsmEEVHKISPK-- 352
Cdd:PLN03077 247 RMP-RRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISAC---ELL--GDERLG--REMHGYVVKtg 318
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  353 ----LEHYGCMIDLLGRVGKLQEAYDLIKTMPMKpDAVVWGTLLGACSFHGNVEIAeIASEALFKLEPTNPGNCVIMSNI 428
Cdd:PLN03077 319 favdVSVCNSLIQMYLSLGSWGEAEKVFSRMETK-DAVSWTAMISGYEKNGLPDKA-LETYALMEQDNVSPDEITIASVL 396

                 ..
gi 15241651  429 YA 430
Cdd:PLN03077 397 SA 398
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
172-404 1.00e-25

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 111.12  E-value: 1.00e-25
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  172 DSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNIY 251
Cdd:PLN03081  80 DTQIRKSGVSLCSQIEKLVACGRHREALELFEILEAGCPFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQY 159
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  252 VCNATIEMYSKCGMIDVAKRLFEELgNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACVHGG 331
Cdd:PLN03081 160 MMNRVLLMHVKCGMLIDARRLFDEM-PERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLG 238
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 15241651  332 MVVKGQELfksmeevHKISPKLEHYG-----C-MIDLLGRVGKLQEAYDLIKTMPMKpDAVVWGTLLGACSFHGNVEIA 404
Cdd:PLN03081 239 SARAGQQL-------HCCVLKTGVVGdtfvsCaLIDMYSKCGDIEDARCVFDGMPEK-TTVAWNSMLAGYALHGYSEEA 309
PLN03218 PLN03218
maturation of RBCL 1; Provisional
99-404 1.90e-19

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 92.25  E-value: 1.90e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651    99 RPLRLLHSqffrSGFESDSFCCTTLITAYAKLGALCCARRVFDEMS----KRDVPVWNAMITGYQRRGDMKAAMELFDSM 174
Cdd:PLN03218  458 RVLRLVQE----AGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVnagvEANVHTFGALIDGCARAGQVAKAFGAYGIM 533
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651   175 PRKNVTS----WTTVISGFSQNGNYSEALKMFLCMEKD-KSVKPNHITVVSVLPACANLGelEIGRRLEGYA--RENGFF 247
Cdd:PLN03218  534 RSKNVKPdrvvFNALISACGQSGAVDRAFDVLAEMKAEtHPIDPDHITVGALMKACANAG--QVDRAKEVYQmiHEYNIK 611
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651   248 DNIYVCNATIEMYSKCGMIDVAKRLFEEL---GNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLL 324
Cdd:PLN03218  612 GTPEVYTIAVNSCSQKGDWDFALSIYDDMkkkGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLM 691
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651   325 LACVHGGMVVKGQELFKSMEEVhKISPKLEHYGCMIDLLGRVGKLQEAYDL---IKTMPMKPDAVVWGTLLGACSFHGNV 401
Cdd:PLN03218  692 GACSNAKNWKKALELYEDIKSI-KLRPTVSTMNALITALCEGNQLPKALEVlseMKRLGLCPNTITYSILLVASERKDDA 770

                  ...
gi 15241651   402 EIA 404
Cdd:PLN03218  771 DVG 773
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
398-458 1.43e-14

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 68.34  E-value: 1.43e-14
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 15241651   398 HGNVEIAEIASEALFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSY 458
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSW 61
PLN03077 PLN03077
Protein ECB2; Provisional
188-339 3.12e-11

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 66.03  E-value: 3.12e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651  188 GFSQNGNYSEALKMFLCMEKDKsVKPNHITVVSVLPACANLGELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMID 267
Cdd:PLN03077  60 ALCSHGQLEQALKLLESMQELR-VPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELV 138
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 15241651  268 VAKRLFEELGnQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACvhGGM--VVKGQEL 339
Cdd:PLN03077 139 HAWYVFGKMP-ERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTC--GGIpdLARGREV 209
PLN03218 PLN03218
maturation of RBCL 1; Provisional
273-410 2.88e-09

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 59.89  E-value: 2.88e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15241651   273 FEELGNQRNLCSWNsMIGSLATHGKH-DEALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEeVHKISP 351
Cdd:PLN03218  428 FAKLIRNPTLSTFN-MLMSVCASSQDiDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMV-NAGVEA 505
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 15241651   352 KLEHYGCMIDLLGRVGKLQEA---YDLIKTMPMKPDAVVWGTLLGACSFHGNVE-----IAEIASEA 410
Cdd:PLN03218  506 NVHTFGALIDGCARAGQVAKAfgaYGIMRSKNVKPDRVVFNALISACGQSGAVDrafdvLAEMKAET 572
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
280-329 1.21e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 50.82  E-value: 1.21e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15241651   280 RNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACVH 329
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
146-189 1.34e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.13  E-value: 1.34e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15241651   146 RDVPVWNAMITGYQRRGDMKAAMELFDSMPRK----NVTSWTTVISGF 189
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGL 48
Eplus_motif pfam20430
E+ motif; This is the E+ motif found in some plant pentatricopeptide repeat (PPR) proteins ...
466-493 2.72e-07

E+ motif; This is the E+ motif found in some plant pentatricopeptide repeat (PPR) proteins which contain a C-terminal DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E motif, precedes the DYW domain and, although their role is not clear, they are essential in th RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466579 [Multi-domain]  Cd Length: 28  Bit Score: 46.50  E-value: 2.72e-07
                          10        20
                  ....*....|....*....|....*...
gi 15241651   466 VHKFTVEDKSHPRSYEIYQVLEEIFRRM 493
Cdd:pfam20430   1 TYTFFAGDKSHPESKQIYEKLSDLTQRI 28
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
177-227 1.57e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 45.05  E-value: 1.57e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 15241651   177 KNVTSWTTVISGFSQNGNYSEALKMFLCMeKDKSVKPNHITVVSVLPACAN 227
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEM-KKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
283-317 1.88e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 1.88e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15241651   283 CSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDA 317
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
284-312 1.50e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.99  E-value: 1.50e-04
                          10        20
                  ....*....|....*....|....*....
gi 15241651   284 SWNSMIGSLATHGKHDEALTLFAQMLREG 312
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
150-179 1.68e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.60  E-value: 1.68e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 15241651   150 VWNAMITGYQRRGDMKAAMELFDSMPRKNV 179
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
181-208 1.73e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.60  E-value: 1.73e-04
                          10        20
                  ....*....|....*....|....*...
gi 15241651   181 SWTTVISGFSQNGNYSEALKMFLCMEKD 208
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEK 29
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
150-179 8.25e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.05  E-value: 8.25e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 15241651   150 VWNAMITGYQRRGDMKAAMELFDSMPRKNV 179
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
170-227 2.23e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 36.57  E-value: 2.23e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 15241651   170 LFDSMPRK----NVTSWTTVISGFSQNGNYSEALKMFLCMEKDKsVKPNHITVVSVLPACAN 227
Cdd:pfam13812   2 ILREMVRDgiqlNVNTYTHLLHAYANVGNLKLALEIFERMKKKG-IKPTLDTYNAILGVIGG 62
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
116-158 7.62e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 34.65  E-value: 7.62e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 15241651   116 DSFCCTTLITAYAKLGALCCARRVFDEMSKRDVPV----WNAMITGY 158
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPnvytYTILINGL 48
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH