NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|15226737|ref|NP_181604|]
View 

Tetratricopeptide repeat (TPR)-like superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 13804408)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03077 super family cl33629
Protein ECB2; Provisional
187-853 9.65e-170

Protein ECB2; Provisional


The actual alignment was detected with superfamily member PLN03077:

Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 514.01  E-value: 9.65e-170
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  187 EEGKQIHGFMLRNSLDTDSFLKTALIDMYFKFGLSIDAWRVFVEIEDKsNVVLWNVMIVGFGGSGICESSLDLYMLAKNN 266
Cdd:PLN03077 103 EEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMPER-DLFSWNVLVGGYAKAGYFDEALCLYHRMLWA 181
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  267 SVKLVSTSFTGALGACSQSENSGFGRQIHCDVVKMGLHNDPYVCTSLLSMYSKCGMVGEAETVFSCVVDKRLEIWNAMVA 346
Cdd:PLN03077 182 GVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMIS 261
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  347 AYAENDYGYSALDLFGFMRQKSVLPDSFTLSNVISCCSVLGLYNYGKSVHAELFKRPIQSTSTIESALLTLYSKCGCDPD 426
Cdd:PLN03077 262 GYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGE 341
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  427 AYLVFKSMEEKDMVAWGSLISGLCKNGKFKEALKVFGDMkdDDDSLKPDSDIMTSVTNACAGLEALRFGLQVHGSMIKTG 506
Cdd:PLN03077 342 AEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALM--EQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKG 419
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  507 LVLNVFVGSSLIDLYSKCGLPEMALKVFTSMSTENMVAWNSMISCYSRNNLPELSIDLFNLMLSQgIFPDSVSITSVLVA 586
Cdd:PLN03077 420 LISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSA 498
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  587 ISSTASLLKGKSLHGYTLRLGIPSDTHLKNALIDMYVKCGFSKYAENIFKkMQHKSLITWNLMIYGYGSHGDCITALSLF 666
Cdd:PLN03077 499 CARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFN-SHEKDVVSWNILLTGYVAHGKGSMAVELF 577
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  667 DEMKKAGESPDDVTFLSLISACNHSGFVEEGKNIFEFMKQDYGIEPNMEHYANMVDLLGRAGLLEEAYSFIKAMPIEADS 746
Cdd:PLN03077 578 NRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDP 657
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  747 SIWLCLLSASRTHHNVELGILSAEKLLRMEPERGSTYVQLINLYMEAGLKNEAAKLLGLMKEKGLHKQPGCSWIEVSDRT 826
Cdd:PLN03077 658 AVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKV 737
                        650       660
                 ....*....|....*....|....*..
gi 15226737  827 NVFFSGGSSSPMKAEIFNVLNRLKSNM 853
Cdd:PLN03077 738 HAFLTDDESHPQIKEINTVLEGFYEKM 764
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
131-180 1.44e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.51  E-value: 1.44e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15226737   131 RDVTVWNSMIDGYFKFRRFKEGVGCFRRMLVFGVRPDAFSLSIVVSVMCK 180
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
187-853 9.65e-170

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 514.01  E-value: 9.65e-170
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  187 EEGKQIHGFMLRNSLDTDSFLKTALIDMYFKFGLSIDAWRVFVEIEDKsNVVLWNVMIVGFGGSGICESSLDLYMLAKNN 266
Cdd:PLN03077 103 EEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMPER-DLFSWNVLVGGYAKAGYFDEALCLYHRMLWA 181
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  267 SVKLVSTSFTGALGACSQSENSGFGRQIHCDVVKMGLHNDPYVCTSLLSMYSKCGMVGEAETVFSCVVDKRLEIWNAMVA 346
Cdd:PLN03077 182 GVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMIS 261
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  347 AYAENDYGYSALDLFGFMRQKSVLPDSFTLSNVISCCSVLGLYNYGKSVHAELFKRPIQSTSTIESALLTLYSKCGCDPD 426
Cdd:PLN03077 262 GYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGE 341
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  427 AYLVFKSMEEKDMVAWGSLISGLCKNGKFKEALKVFGDMkdDDDSLKPDSDIMTSVTNACAGLEALRFGLQVHGSMIKTG 506
Cdd:PLN03077 342 AEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALM--EQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKG 419
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  507 LVLNVFVGSSLIDLYSKCGLPEMALKVFTSMSTENMVAWNSMISCYSRNNLPELSIDLFNLMLSQgIFPDSVSITSVLVA 586
Cdd:PLN03077 420 LISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSA 498
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  587 ISSTASLLKGKSLHGYTLRLGIPSDTHLKNALIDMYVKCGFSKYAENIFKkMQHKSLITWNLMIYGYGSHGDCITALSLF 666
Cdd:PLN03077 499 CARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFN-SHEKDVVSWNILLTGYVAHGKGSMAVELF 577
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  667 DEMKKAGESPDDVTFLSLISACNHSGFVEEGKNIFEFMKQDYGIEPNMEHYANMVDLLGRAGLLEEAYSFIKAMPIEADS 746
Cdd:PLN03077 578 NRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDP 657
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  747 SIWLCLLSASRTHHNVELGILSAEKLLRMEPERGSTYVQLINLYMEAGLKNEAAKLLGLMKEKGLHKQPGCSWIEVSDRT 826
Cdd:PLN03077 658 AVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKV 737
                        650       660
                 ....*....|....*....|....*..
gi 15226737  827 NVFFSGGSSSPMKAEIFNVLNRLKSNM 853
Cdd:PLN03077 738 HAFLTDDESHPQIKEINTVLEGFYEKM 764
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
759-821 1.87e-14

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 68.72  E-value: 1.87e-14
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 15226737   759 HHNVELGILSAEKLLRMEPERGSTYVQLINLYMEAGLKNEAAKLLGLMKEKGLHKQPGCSWIE 821
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
131-180 1.44e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.51  E-value: 1.44e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15226737   131 RDVTVWNSMIDGYFKFRRFKEGVGCFRRMLVFGVRPDAFSLSIVVSVMCK 180
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
543-577 5.43e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.60  E-value: 5.43e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15226737   543 VAWNSMISCYSRNNLPELSIDLFNLMLSQGIFPDS 577
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ...
713-815 8.71e-05

Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 46.14  E-value: 8.71e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737 713 NMEHYANMVDLLGRAGLLEEAY-SFIKAMPIEAD-SSIWLCLLSASRTHHNVELGILSAEKLLRMEPERGSTYVQLINLY 790
Cdd:COG3914 111 NAEALFNLGNLLLALGRLEEALaALRRALALNPDfAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNAL 190
                        90       100
                ....*....|....*....|....*
gi 15226737 791 MEAGLKNEAAKLLglmkEKGLHKQP 815
Cdd:COG3914 191 QDLGRLEEAIAAY----RRALELDP 211
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
135-168 9.64e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 9.64e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15226737   135 VWNSMIDGYFKFRRFKEGVGCFRRMLVFGVRPDA 168
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
187-853 9.65e-170

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 514.01  E-value: 9.65e-170
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  187 EEGKQIHGFMLRNSLDTDSFLKTALIDMYFKFGLSIDAWRVFVEIEDKsNVVLWNVMIVGFGGSGICESSLDLYMLAKNN 266
Cdd:PLN03077 103 EEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMPER-DLFSWNVLVGGYAKAGYFDEALCLYHRMLWA 181
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  267 SVKLVSTSFTGALGACSQSENSGFGRQIHCDVVKMGLHNDPYVCTSLLSMYSKCGMVGEAETVFSCVVDKRLEIWNAMVA 346
Cdd:PLN03077 182 GVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMIS 261
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  347 AYAENDYGYSALDLFGFMRQKSVLPDSFTLSNVISCCSVLGLYNYGKSVHAELFKRPIQSTSTIESALLTLYSKCGCDPD 426
Cdd:PLN03077 262 GYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGE 341
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  427 AYLVFKSMEEKDMVAWGSLISGLCKNGKFKEALKVFGDMkdDDDSLKPDSDIMTSVTNACAGLEALRFGLQVHGSMIKTG 506
Cdd:PLN03077 342 AEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALM--EQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKG 419
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  507 LVLNVFVGSSLIDLYSKCGLPEMALKVFTSMSTENMVAWNSMISCYSRNNLPELSIDLFNLMLSQgIFPDSVSITSVLVA 586
Cdd:PLN03077 420 LISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSA 498
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  587 ISSTASLLKGKSLHGYTLRLGIPSDTHLKNALIDMYVKCGFSKYAENIFKkMQHKSLITWNLMIYGYGSHGDCITALSLF 666
Cdd:PLN03077 499 CARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFN-SHEKDVVSWNILLTGYVAHGKGSMAVELF 577
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  667 DEMKKAGESPDDVTFLSLISACNHSGFVEEGKNIFEFMKQDYGIEPNMEHYANMVDLLGRAGLLEEAYSFIKAMPIEADS 746
Cdd:PLN03077 578 NRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDP 657
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  747 SIWLCLLSASRTHHNVELGILSAEKLLRMEPERGSTYVQLINLYMEAGLKNEAAKLLGLMKEKGLHKQPGCSWIEVSDRT 826
Cdd:PLN03077 658 AVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKV 737
                        650       660
                 ....*....|....*....|....*..
gi 15226737  827 NVFFSGGSSSPMKAEIFNVLNRLKSNM 853
Cdd:PLN03077 738 HAFLTDDESHPQIKEINTVLEGFYEKM 764
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
414-849 2.12e-119

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 377.67  E-value: 2.12e-119
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  414 LLTLYSKCGCDPDAYLVFKSMEEKDMVAWGSLISGLCKNGKFKEALKVFGDMKDDDDSlkPDSDIMTSVTNACAGLEALR 493
Cdd:PLN03081 164 VLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSD--AEPRTFVVMLRASAGLGSAR 241
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  494 FGLQVHGSMIKTGLVLNVFVGSSLIDLYSKCGLPEMALKVFTSMSTENMVAWNSMISCYSRNNLPELSIDLFNLMLSQGI 573
Cdd:PLN03081 242 AGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGV 321
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  574 FPDSVSITSVLVAISSTASLLKGKSLHGYTLRLGIPSDTHLKNALIDMYVKCGFSKYAENIFKKMQHKSLITWNLMIYGY 653
Cdd:PLN03081 322 SIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGY 401
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  654 GSHGDCITALSLFDEMKKAGESPDDVTFLSLISACNHSGFVEEGKNIFEFMKQDYGIEPNMEHYANMVDLLGRAGLLEEA 733
Cdd:PLN03081 402 GNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSENHRIKPRAMHYACMIELLGREGLLDEA 481
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  734 YSFIKAMPIEADSSIWLCLLSASRTHHNVELGILSAEKLLRMEPERGSTYVQLINLYMEAGLKNEAAKLLGLMKEKGLHK 813
Cdd:PLN03081 482 YAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSM 561
                        410       420       430
                 ....*....|....*....|....*....|....*.
gi 15226737  814 QPGCSWIEVSDRTNVFFSGGSSSPMKAEIFNVLNRL 849
Cdd:PLN03081 562 HPACTWIEVKKQDHSFFSGDRLHPQSREIYQKLDEL 597
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
23-395 1.10e-47

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 181.22  E-value: 1.10e-47
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   23 SPASINSGIRALIQKGEYLQALHLYSKHDGSSPFWTSVFTFPSLLKACSALTNLSYGKTIHGSVVVLGWRYDPFIATSLV 102
Cdd:PLN03081  86 SGVSLCSQIEKLVACGRHREALELFEILEAGCPFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVL 165
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  103 NMYVKCGFLDYAVQVFDGwsqsqsgVSARDVTVWNSMIDGYFKFRRFKEGVGCFRRMLvfGVRPDAFSLSIVVSVMCKEG 182
Cdd:PLN03081 166 LMHVKCGMLIDARRLFDE-------MPERNLASWGTIIGGLVDAGNYREAFALFREMW--EDGSDAEPRTFVVMLRASAG 236
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  183 NFRREEGKQIHGFMLRNSLDTDSFLKTALIDMYFKFGLSIDAWRVFVEIEDKSnVVLWNVMIVGFGGSGICESSLDLYML 262
Cdd:PLN03081 237 LGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKT-TVAWNSMLAGYALHGYSEEALCLYYE 315
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  263 AKNNSVKLVSTSFTGALGACSQSENSGFGRQIHCDVVKMGLHNDPYVCTSLLSMYSKCGMVGEAETVFSCVVDKRLEIWN 342
Cdd:PLN03081 316 MRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWN 395
                        330       340       350       360       370
                 ....*....|....*....|....*....|....*....|....*....|...
gi 15226737  343 AMVAAYAENDYGYSALDLFGFMRQKSVLPDSFTLSNVISCCSVLGLYNYGKSV 395
Cdd:PLN03081 396 ALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEI 448
PLN03077 PLN03077
Protein ECB2; Provisional
444-754 2.85e-35

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 144.61  E-value: 2.85e-35
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  444 SLISGLCKNGKFKEALKVFGDMKDDDDSlkPDSDIMTSVTNACAGLEALRFGLQVHGSMIKTGLVLNVFVGSSLIDLYSK 523
Cdd:PLN03077  56 SQLRALCSHGQLEQALKLLESMQELRVP--VDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVR 133
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  524 CGLPEMALKVFTSMSTENMVAWNSMISCYSRNNLPELSIDLFNLMLSQGIFPDSVSITSVLVAISSTASLLKGKSLHGYT 603
Cdd:PLN03077 134 FGELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHV 213
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737  604 LRLGIPSDTHLKNALIDMYVKCGFSKYAENIFKKMQHKSLITWNLMIYGYGSHGDCITALSLFDEMKKAGESPDDVTFLS 683
Cdd:PLN03077 214 VRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITS 293
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 15226737  684 LISACNHSGFVEEGKNIFEFMKQDyGIEPNMEHYANMVDLLGRAGLLEEAYSFIKAMPIEaDSSIWLCLLS 754
Cdd:PLN03077 294 VISACELLGDERLGREMHGYVVKT-GFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK-DAVSWTAMIS 362
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
759-821 1.87e-14

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 68.72  E-value: 1.87e-14
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 15226737   759 HHNVELGILSAEKLLRMEPERGSTYVQLINLYMEAGLKNEAAKLLGLMKEKGLHKQPGCSWIE 821
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
PLN03218 PLN03218
maturation of RBCL 1; Provisional
269-714 9.57e-12

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 69.14  E-value: 9.57e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   269 KLVSTS----FTGALGACSQSENSGFGRQIHCDVVKMGLHNDPYVCTSLLSMYSKCGMVgeaetvfscvvDKRLEIWNAM 344
Cdd:PLN03218  430 KLIRNPtlstFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKV-----------DAMFEVFHEM 498
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   345 VAAYAE-NDYGYSAL--------------DLFGFMRQKSVLPDSFTLSNVISCCSVLGLYNYGKSVHAELF--KRPIqst 407
Cdd:PLN03218  499 VNAGVEaNVHTFGALidgcaragqvakafGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKaeTHPI--- 575
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   408 stiesalltlyskcgcDPdaylvfksmeekDMVAWGSLISGLCKNGKFKEALKVFGDMKDDDdsLKPDSDIMTSVTNACA 487
Cdd:PLN03218  576 ----------------DP------------DHITVGALMKACANAGQVDRAKEVYQMIHEYN--IKGTPEVYTIAVNSCS 625
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   488 GLEALRFGLQVHGSMIKTGLVLNVFVGSSLIDLYSKCGLPEMALKVFTSMStenmvawnsmiscysrnnlpelsidlfnl 567
Cdd:PLN03218  626 QKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDAR----------------------------- 676
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   568 mlSQGIFPDSVSITSVLVAISSTASLLKGkslhgytlrlgipsdthlknalIDMYVKCGFSKYAENIfkkmqhkslITWN 647
Cdd:PLN03218  677 --KQGIKLGTVSYSSLMGACSNAKNWKKA----------------------LELYEDIKSIKLRPTV---------STMN 723
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 15226737   648 LMIYGYGSHGDCITALSLFDEMKKAGESPDDVTFLSLISACNHSGFVEEGKNIFEFMKQDyGIEPNM 714
Cdd:PLN03218  724 ALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKED-GIKPNL 789
PLN03218 PLN03218
maturation of RBCL 1; Provisional
425-755 2.61e-10

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 64.51  E-value: 2.61e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   425 PDAYLVFKSMEEKDMVAWGSLISGLCKNGKFKEALKVFGDMKDDddSLKPDSDIMTSVTNACAGLEALRFGLQVHGSMIK 504
Cdd:PLN03218  423 KEAFRFAKLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEA--GLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVN 500
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   505 TGLVLNVFVGSSLIDLYSKCGLPEMALKVFTSMSTENM----VAWNSMISCYSRNNLPELSIDLFNLMLSQG--IFPDSV 578
Cdd:PLN03218  501 AGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVkpdrVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHI 580
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   579 SItsvlvaisstasllkgkslhgytlrlgipsdthlkNALIDMYVKCGFSKYAENIFKkMQHKSLITWNLMIYGYGSH-- 656
Cdd:PLN03218  581 TV-----------------------------------GALMKACANAGQVDRAKEVYQ-MIHEYNIKGTPEVYTIAVNsc 624
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   657 ---GDCITALSLFDEMKKAGESPDDVTFLSLISACNHSGFVEEGKNIFEFMKQDyGIEPNMEHYANMVDLLGRAGLLEEA 733
Cdd:PLN03218  625 sqkGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQ-GIKLGTVSYSSLMGACSNAKNWKKA 703
                         330       340
                  ....*....|....*....|....*
gi 15226737   734 ---YSFIKAMPIEADSSIWLCLLSA 755
Cdd:PLN03218  704 lelYEDIKSIKLRPTVSTMNALITA 728
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
641-688 2.20e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 50.82  E-value: 2.20e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15226737   641 KSLITWNLMIYGYGSHGDCITALSLFDEMKKAGESPDDVTFLSLISAC 688
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
131-180 1.44e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.51  E-value: 1.44e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15226737   131 RDVTVWNSMIDGYFKFRRFKEGVGCFRRMLVFGVRPDAFSLSIVVSVMCK 180
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
541-584 4.32e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 47.36  E-value: 4.32e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 15226737   541 NMVAWNSMISCYSRNNLPELSIDLFNLMLSQGIFPDSVSITSVL 584
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILI 45
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
437-487 6.59e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.59  E-value: 6.59e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 15226737   437 KDMVAWGSLISGLCKNGKFKEALKVFGDMKddDDSLKPDSDIMTSVTNACA 487
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMK--KRGVKPNVYTYTILINGLC 49
PLN03218 PLN03218
maturation of RBCL 1; Provisional
515-755 3.23e-06

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 51.03  E-value: 3.23e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   515 SSLIDLYSKCGLPEMALKVFTSMSTENMVAWNSMISCYSR-----NNLPElSIDLFNLMLSQGIFPDSVSitsvLVAISS 589
Cdd:PLN03218  323 NSLSLDKKNNGVKDAELPGQSSGQAASDVEEENSLAAYNGgvsgkRKSPE-YIDAYNRLLRDGRIKDCID----LLEDME 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   590 TASLLKGKSLHgytlrlgipsdtHLKnalidMYVKCGFSKYAENIF---KKMQHKSLITWNLMIYGYGSHGDCITALSLF 666
Cdd:PLN03218  398 KRGLLDMDKIY------------HAK-----FFKACKKQRAVKEAFrfaKLIRNPTLSTFNMLMSVCASSQDIDGALRVL 460
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   667 DEMKKAGESPDDVTFLSLISACNHSGFVEEGKNIFEFMkQDYGIEPNMEHYANMVDLLGRAGLLEE---AYSFIKAMPIE 743
Cdd:PLN03218  461 RLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM-VNAGVEANVHTFGALIDGCARAGQVAKafgAYGIMRSKNVK 539
                         250
                  ....*....|..
gi 15226737   744 ADSSIWLCLLSA 755
Cdd:PLN03218  540 PDRVVFNALISA 551
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
438-466 3.84e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 44.26  E-value: 3.84e-06
                          10        20
                  ....*....|....*....|....*....
gi 15226737   438 DMVAWGSLISGLCKNGKFKEALKVFGDMK 466
Cdd:pfam12854   6 DVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
543-577 5.43e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.60  E-value: 5.43e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15226737   543 VAWNSMISCYSRNNLPELSIDLFNLMLSQGIFPDS 577
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
440-468 6.79e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 43.22  E-value: 6.79e-06
                          10        20
                  ....*....|....*....|....*....
gi 15226737   440 VAWGSLISGLCKNGKFKEALKVFGDMKDD 468
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEK 29
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
644-673 1.32e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.45  E-value: 1.32e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 15226737   644 ITWNLMIYGYGSHGDCITALSLFDEMKKAG 673
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PLN03218 PLN03218
maturation of RBCL 1; Provisional
449-740 1.80e-05

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 48.72  E-value: 1.80e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   449 LCKNGKFKEALKVFGDMkDDDDSLKPDSDIMTSVTNAC----AGLEALRFglqvhgsmikTGLVLNvfvgsslidlyskc 524
Cdd:PLN03218  380 LLRDGRIKDCIDLLEDM-EKRGLLDMDKIYHAKFFKACkkqrAVKEAFRF----------AKLIRN-------------- 434
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   525 glpemalkvfTSMSTENMVawnsMISCYSRNNLpELSIDLFNLMLSQGIFPDSVSITSVlvaISSTASllKGKS------ 598
Cdd:PLN03218  435 ----------PTLSTFNML----MSVCASSQDI-DGALRVLRLVQEAGLKADCKLYTTL---ISTCAK--SGKVdamfev 494
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737   599 LHGYTLRlGIPSDTHLKNALIDMYVKCG-----FSKYAENIFKKMQHKSLItWNLMIYGYGSHGDCITALSLFDEMKkaG 673
Cdd:PLN03218  495 FHEMVNA-GVEANVHTFGALIDGCARAGqvakaFGAYGIMRSKNVKPDRVV-FNALISACGQSGAVDRAFDVLAEMK--A 570
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 15226737   674 ES----PDDVTFLSLISACNHSGFVEEGKNIFEFMKQdYGIEPNMEHYANMVDLLGRAGLLEEAYSFIKAM 740
Cdd:PLN03218  571 EThpidPDHITVGALMKACANAGQVDRAKEVYQMIHE-YNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDM 640
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
440-476 3.15e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 3.15e-05
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 15226737   440 VAWGSLISGLCKNGKFKEALKVFGDMKDDDdsLKPDS 476
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERG--IEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
644-678 3.40e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.29  E-value: 3.40e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15226737   644 ITWNLMIYGYGSHGDCITALSLFDEMKKAGESPDD 678
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ...
713-815 8.71e-05

Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 46.14  E-value: 8.71e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737 713 NMEHYANMVDLLGRAGLLEEAY-SFIKAMPIEAD-SSIWLCLLSASRTHHNVELGILSAEKLLRMEPERGSTYVQLINLY 790
Cdd:COG3914 111 NAEALFNLGNLLLALGRLEEALaALRRALALNPDfAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNAL 190
                        90       100
                ....*....|....*....|....*
gi 15226737 791 MEAGLKNEAAKLLglmkEKGLHKQP 815
Cdd:COG3914 191 QDLGRLEEAIAAY----RRALELDP 211
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
543-573 8.79e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.14  E-value: 8.79e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15226737   543 VAWNSMISCYSRNNLPELSIDLFNLMLSQGI 573
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
135-168 9.64e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 9.64e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15226737   135 VWNSMIDGYFKFRRFKEGVGCFRRMLVFGVRPDA 168
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
664-717 3.10e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 39.65  E-value: 3.10e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 15226737   664 SLFDEMKKAGESPDDVTFLSLISACNHSGFVEEGKNIFEFMKQDyGIEPNMEHY 717
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKK-GIKPTLDTY 53
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
340-383 9.11e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 37.73  E-value: 9.11e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 15226737   340 IWNAMVAAYAENDYGYSALDLFGFMRQKSVLPDSFTLSNVISCC 383
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
135-164 1.32e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 1.32e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 15226737   135 VWNSMIDGYFKFRRFKEGVGCFRRMLVFGV 164
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
634-688 4.74e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 36.18  E-value: 4.74e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 15226737   634 IFKKMQHK----SLITWNLMIYGYGSHGDCITALSLFDEMKKAGESPDDVTFLSLISAC 688
Cdd:pfam13812   2 ILREMVRDgiqlNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ...
713-815 9.59e-03

Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 37.48  E-value: 9.59e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15226737 713 NMEHYANMVDLLGRAGLLEEAYSFI-KAMPIEADSS-IWLCLLSASRTHHNVELGILSAEKLLRMEPERGSTYVQLINLY 790
Cdd:COG4783  37 NPEAFALLGEILLQLGDLDEAIVLLhEALELDPDEPeARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAY 116
                        90       100
                ....*....|....*....|....*
gi 15226737 791 MEAGLKNEAAKLLglmkEKGLHKQP 815
Cdd:COG4783 117 RALGRPDEAIAAL----EKALELDP 137
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH