NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|15221304|ref|NP_177599|]
View 

pentatricopeptide (PPR) repeat-containing protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1004131)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03077 super family cl33629
Protein ECB2; Provisional
249-895 1.59e-131

Protein ECB2; Provisional


The actual alignment was detected with superfamily member PLN03077:

Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 415.40  E-value: 1.59e-131
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  249 PDSYTYSSVLAAC----ASLEKLRFGKVVQARVIKCGAEdvfVCTAIVDLYAKCGHMAEAMEVFSRIPNPSVVSWTVMLS 324
Cdd:PLN03077  84 VDEDAYVALFRLCewkrAVEEGSRVCSRALSSHPSLGVR---LGNAMLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVG 160
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  325 GYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACGRPSMVCEASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDL 404
Cdd:PLN03077 161 GYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVS 240
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  405 SEQVFedlDDIQRQNIV--NVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVLDCLN---LGKQVHGYTLKS 479
Cdd:PLN03077 241 ARLVF---DRMPRRDCIswNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGderLGREMHGYVVKT 317
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  480 GLVLDLTVGSSLFTLYSKCGSLEESYKLFQGIPFKDNACWASMISGFNEYGYLREAIGLFSEMLDDGTSPDESTLAAVLT 559
Cdd:PLN03077 318 GFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLS 397
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  560 VCSSHPSLPRGKEIHGYTLRAGIDKGMDLGSALVNMYSKCGSLKLARQVYDRLPELDPVSCSSLISGYSQHGLIQDGFLL 639
Cdd:PLN03077 398 ACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIF 477
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  640 FRDMVMSgFTMDSFAISSILKAAALSDESSLGAQVHAYITKIGLCTEPSVGSSLLTMYSKFGSIDDCCKAFSqINGPDLI 719
Cdd:PLN03077 478 FRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFN-SHEKDVV 555
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  720 AWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSACSHGGLVEESYFHLNSMVKDYGIEPENRHYVCMVDAL 799
Cdd:PLN03077 556 SWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLL 635
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  800 GRSGRLREAESFINNMHIKPDALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISLSNILAEVGEWDEVEETRK 879
Cdd:PLN03077 636 GRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRK 715
                        650
                 ....*....|....*.
gi 15221304  880 LMKGTGVQKEPGWSSV 895
Cdd:PLN03077 716 TMRENGLTVDPGCSWV 731
PLN03081 super family cl33631
pentatricopeptide (PPR) repeat-containing protein; Provisional
83-361 3.85e-38

pentatricopeptide (PPR) repeat-containing protein; Provisional


The actual alignment was detected with superfamily member PLN03081:

Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 152.72  E-value: 3.85e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   83 DVFLTKSLLSWYSNSGSMADAAKLFDTIPQPDVVSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISACSA 162
Cdd:PLN03081 157 DQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAG 236
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  163 L-QAPLFSELVCChTIKMGYFFYEVVESALIDVFSKNLRFEDAYKVFRDSLSANVYCWNTIIAG-ALRNQNYGAVfDLFH 240
Cdd:PLN03081 237 LgSARAGQQLHCC-VLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGyALHGYSEEAL-CLYY 314
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  241 EMCVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAE-DVFVCTAIVDLYAKCGHMAEAMEVFSRIPNPSVVSW 319
Cdd:PLN03081 315 EMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPlDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISW 394
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|..
gi 15221304  320 TVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISAC 361
Cdd:PLN03081 395 NALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSAC 436
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
249-895 1.59e-131

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 415.40  E-value: 1.59e-131
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  249 PDSYTYSSVLAAC----ASLEKLRFGKVVQARVIKCGAEdvfVCTAIVDLYAKCGHMAEAMEVFSRIPNPSVVSWTVMLS 324
Cdd:PLN03077  84 VDEDAYVALFRLCewkrAVEEGSRVCSRALSSHPSLGVR---LGNAMLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVG 160
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  325 GYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACGRPSMVCEASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDL 404
Cdd:PLN03077 161 GYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVS 240
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  405 SEQVFedlDDIQRQNIV--NVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVLDCLN---LGKQVHGYTLKS 479
Cdd:PLN03077 241 ARLVF---DRMPRRDCIswNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGderLGREMHGYVVKT 317
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  480 GLVLDLTVGSSLFTLYSKCGSLEESYKLFQGIPFKDNACWASMISGFNEYGYLREAIGLFSEMLDDGTSPDESTLAAVLT 559
Cdd:PLN03077 318 GFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLS 397
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  560 VCSSHPSLPRGKEIHGYTLRAGIDKGMDLGSALVNMYSKCGSLKLARQVYDRLPELDPVSCSSLISGYSQHGLIQDGFLL 639
Cdd:PLN03077 398 ACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIF 477
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  640 FRDMVMSgFTMDSFAISSILKAAALSDESSLGAQVHAYITKIGLCTEPSVGSSLLTMYSKFGSIDDCCKAFSqINGPDLI 719
Cdd:PLN03077 478 FRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFN-SHEKDVV 555
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  720 AWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSACSHGGLVEESYFHLNSMVKDYGIEPENRHYVCMVDAL 799
Cdd:PLN03077 556 SWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLL 635
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  800 GRSGRLREAESFINNMHIKPDALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISLSNILAEVGEWDEVEETRK 879
Cdd:PLN03077 636 GRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRK 715
                        650
                 ....*....|....*.
gi 15221304  880 LMKGTGVQKEPGWSSV 895
Cdd:PLN03077 716 TMRENGLTVDPGCSWV 731
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
83-361 3.85e-38

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 152.72  E-value: 3.85e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   83 DVFLTKSLLSWYSNSGSMADAAKLFDTIPQPDVVSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISACSA 162
Cdd:PLN03081 157 DQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAG 236
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  163 L-QAPLFSELVCChTIKMGYFFYEVVESALIDVFSKNLRFEDAYKVFRDSLSANVYCWNTIIAG-ALRNQNYGAVfDLFH 240
Cdd:PLN03081 237 LgSARAGQQLHCC-VLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGyALHGYSEEAL-CLYY 314
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  241 EMCVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAE-DVFVCTAIVDLYAKCGHMAEAMEVFSRIPNPSVVSW 319
Cdd:PLN03081 315 EMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPlDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISW 394
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|..
gi 15221304  320 TVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISAC 361
Cdd:PLN03081 395 NALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSAC 436
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
834-895 4.60e-17

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 76.04  E-value: 4.60e-17
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 15221304   834 HGEVELGKVAAKKAIELEPSDAGAYISLSNILAEVGEWDEVEETRKLMKGTGVQKEPGWSSV 895
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWI 62
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
113-160 1.66e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.21  E-value: 1.66e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15221304   113 PDVVSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISAC 160
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
719-752 3.36e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.37  E-value: 3.36e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15221304   719 IAWTALIASYAQHGKANEALQVYNLMKEKGFKPD 752
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
712-876 2.31e-05

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 46.92  E-value: 2.31e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 712 QINGPDLIAWTALIASYAQHGKANEALQVYNLMKEkgFKPDKVTFVGVL-SACSHGGLVEESYFHLNSMVKdygIEPEN- 789
Cdd:COG0457   2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE--LDPDDAEALYNLgLAYLRLGRYEEALADYEQALE---LDPDDa 76
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 790 RHYVCMVDALGRSGRLREAESFINN-MHIKP-DALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISLSNILAE 867
Cdd:COG0457  77 EALNNLGLALQALGRYEEALEDYDKaLELDPdDAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEK 156

                ....*....
gi 15221304 868 VGEWDEVEE 876
Cdd:COG0457 157 LGRYEEALE 165
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
116-150 8.05e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 8.05e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15221304   116 VSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANE 150
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
249-895 1.59e-131

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 415.40  E-value: 1.59e-131
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  249 PDSYTYSSVLAAC----ASLEKLRFGKVVQARVIKCGAEdvfVCTAIVDLYAKCGHMAEAMEVFSRIPNPSVVSWTVMLS 324
Cdd:PLN03077  84 VDEDAYVALFRLCewkrAVEEGSRVCSRALSSHPSLGVR---LGNAMLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVG 160
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  325 GYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACGRPSMVCEASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDL 404
Cdd:PLN03077 161 GYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVS 240
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  405 SEQVFedlDDIQRQNIV--NVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVLDCLN---LGKQVHGYTLKS 479
Cdd:PLN03077 241 ARLVF---DRMPRRDCIswNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGderLGREMHGYVVKT 317
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  480 GLVLDLTVGSSLFTLYSKCGSLEESYKLFQGIPFKDNACWASMISGFNEYGYLREAIGLFSEMLDDGTSPDESTLAAVLT 559
Cdd:PLN03077 318 GFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLS 397
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  560 VCSSHPSLPRGKEIHGYTLRAGIDKGMDLGSALVNMYSKCGSLKLARQVYDRLPELDPVSCSSLISGYSQHGLIQDGFLL 639
Cdd:PLN03077 398 ACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIF 477
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  640 FRDMVMSgFTMDSFAISSILKAAALSDESSLGAQVHAYITKIGLCTEPSVGSSLLTMYSKFGSIDDCCKAFSqINGPDLI 719
Cdd:PLN03077 478 FRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFN-SHEKDVV 555
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  720 AWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSACSHGGLVEESYFHLNSMVKDYGIEPENRHYVCMVDAL 799
Cdd:PLN03077 556 SWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLL 635
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  800 GRSGRLREAESFINNMHIKPDALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISLSNILAEVGEWDEVEETRK 879
Cdd:PLN03077 636 GRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRK 715
                        650
                 ....*....|....*.
gi 15221304  880 LMKGTGVQKEPGWSSV 895
Cdd:PLN03077 716 TMRENGLTVDPGCSWV 731
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
470-895 3.98e-90

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 301.02  E-value: 3.98e-90
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  470 KQVHGYTLKSGLVLDLTVGSSLFTLYSKCGSLEESYKLFQGIPFKDNACWASMISGFNEYGYLREAIGLFSEMLDDGTSP 549
Cdd:PLN03081 143 KAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDA 222
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  550 DESTLAAVLTVCSSHPSLPRGKEIHGYTLRAGIDKGMDLGSALVNMYSKCGSLKLARQVYDRLPELDPVSCSSLISGYSQ 629
Cdd:PLN03081 223 EPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYAL 302
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  630 HGLIQDGFLLFRDMVMSGFTMDSFAISSILKAAALSDESSLGAQVHAYITKIGLCTEPSVGSSLLTMYSKFGSIDDCCKA 709
Cdd:PLN03081 303 HGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNV 382
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  710 FSQINGPDLIAWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSACSHGGLVEESYFHLNSMVKDYGIEPEN 789
Cdd:PLN03081 383 FDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSENHRIKPRA 462
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  790 RHYVCMVDALGRSGRLREAESFINNMHIKPDALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISLSNILAEVG 869
Cdd:PLN03081 463 MHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSG 542
                        410       420
                 ....*....|....*....|....*...
gi 15221304  870 EWDEVEETRKLMKGTGVQKEPG--WSSV 895
Cdd:PLN03081 543 RQAEAAKVVETLKRKGLSMHPActWIEV 570
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
131-612 2.49e-45

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 174.29  E-value: 2.49e-45
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  131 FEESLRFFSKMHFLG-FEANEISYGSVISACSALQAPLFSELVCCHTIKMGYFFYEVVESALIDVFSKNLRFEDAYKVFR 209
Cdd:PLN03081 103 HREALELFEILEAGCpFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFD 182
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  210 DSLSANVYCWNTIIAGALRNQNYGAVFDLFHEMCVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGA-EDVFVC 288
Cdd:PLN03081 183 EMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVvGDTFVS 262
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  289 TAIVDLYAKCGHMAEAMEVFSRIPNPSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACGRPSMVC 368
Cdd:PLN03081 263 CALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLE 342
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  369 EASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDLSEQVFEDLddiQRQNIV--NVMITSFSQSKKPGKAIRLFTRMLQ 446
Cdd:PLN03081 343 HAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRM---PRKNLIswNALIAGYGNHGRGTKAVEMFERMIA 419
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  447 EGLRTDEfsvCSLLSVLDCLNLgkqvhgytlkSGLVldltvgsslftlyskcgslEESYKLFQGIP----FKDNAC-WAS 521
Cdd:PLN03081 420 EGVAPNH---VTFLAVLSACRY----------SGLS-------------------EQGWEIFQSMSenhrIKPRAMhYAC 467
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  522 MISGFNEYGYLREAIGLFSemlDDGTSPDESTLAAVLTVCSSHPSLPRGKEIHGYTLRAGIDKgmdLGS--ALVNMYSKC 599
Cdd:PLN03081 468 MIELLGREGLLDEAYAMIR---RAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGMGPEK---LNNyvVLLNLYNSS 541
                        490
                 ....*....|...
gi 15221304  600 GSLKLARQVYDRL 612
Cdd:PLN03081 542 GRQAEAAKVVETL 554
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
83-361 3.85e-38

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 152.72  E-value: 3.85e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   83 DVFLTKSLLSWYSNSGSMADAAKLFDTIPQPDVVSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISACSA 162
Cdd:PLN03081 157 DQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAG 236
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  163 L-QAPLFSELVCChTIKMGYFFYEVVESALIDVFSKNLRFEDAYKVFRDSLSANVYCWNTIIAG-ALRNQNYGAVfDLFH 240
Cdd:PLN03081 237 LgSARAGQQLHCC-VLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGyALHGYSEEAL-CLYY 314
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  241 EMCVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAE-DVFVCTAIVDLYAKCGHMAEAMEVFSRIPNPSVVSW 319
Cdd:PLN03081 315 EMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPlDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISW 394
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|..
gi 15221304  320 TVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISAC 361
Cdd:PLN03081 395 NALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSAC 436
PLN03077 PLN03077
Protein ECB2; Provisional
521-835 1.85e-30

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 129.20  E-value: 1.85e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  521 SMISGFNEYGYLREAIGLFSEMLDDGTSPDESTLAAVLTVCSSHPSLPRGKEIHGYTLRAGIDKGMDLGSALVNMYSKCG 600
Cdd:PLN03077  56 SQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFG 135
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  601 SLKLARQVYDRLPELDPVSCSSLISGYSQHGLIQDGFLLFRDMVMSGFTMDSFAISSILKAAALSDESSLGAQVHAYITK 680
Cdd:PLN03077 136 ELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVR 215
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  681 IGLCTEPSVGSSLLTMYSKFGSIDDCCKAFSQINGPDLIAWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVL 760
Cdd:PLN03077 216 FGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVI 295
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 15221304  761 SACshgGLVEESYF--HLNSMVKDYGIEPENRHYVCMVDALGRSGRLREAESFINNMHIKpDALVWGTLLAACKIHG 835
Cdd:PLN03077 296 SAC---ELLGDERLgrEMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK-DAVSWTAMISGYEKNG 368
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
834-895 4.60e-17

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 76.04  E-value: 4.60e-17
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 15221304   834 HGEVELGKVAAKKAIELEPSDAGAYISLSNILAEVGEWDEVEETRKLMKGTGVQKEPGWSSV 895
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWI 62
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
59-307 2.38e-12

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 70.67  E-value: 2.38e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   59 SRLCNLRTTKILQAHLLRRYLlPFDVFLTKSLLSWYSNSGSMADAAKLFDTIPQPDVVSCNIMISGYKQHRLFEESLRFF 138
Cdd:PLN03081 336 SRLALLEHAKQAHAGLIRTGF-PLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMF 414
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  139 SKMHFLGFEANEISYGSVISAC--SALQAP---LFSELVCCHTIKMGYFFYevveSALIDVFSKNLRFEDAYKVFRDsls 213
Cdd:PLN03081 415 ERMIAEGVAPNHVTFLAVLSACrySGLSEQgweIFQSMSENHRIKPRAMHY----ACMIELLGREGLLDEAYAMIRR--- 487
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304  214 anvycwntiiagalrnqnygAVFdlfhemcvgfqKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAEDVFVCTAIVD 293
Cdd:PLN03081 488 --------------------APF-----------KPTVNMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLN 536
                        250
                 ....*....|....
gi 15221304  294 LYAKCGHMAEAMEV 307
Cdd:PLN03081 537 LYNSSGRQAEAAKV 550
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
716-765 1.56e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 59.68  E-value: 1.56e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15221304   716 PDLIAWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSACSH 765
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
600-850 7.90e-11

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 66.05  E-value: 7.90e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   600 GSLKLARQVYDRLPELDPVSCSSLISGYSQHGLIQDGFLLFRDMVMSGFTmdsfaiSSILKAAALSDESSLGAQVH---- 675
Cdd:PLN03218  455 GALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVE------ANVHTFGALIDGCARAGQVAkafg 528
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   676 AYITKIGLCTEPS--VGSSLLTMYSKFGSIDDC------CKAFSQINGPDLIAWTALIASYAQHGKANEALQVYNLMKEK 747
Cdd:PLN03218  529 AYGIMRSKNVKPDrvVFNALISACGQSGAVDRAfdvlaeMKAETHPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEY 608
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   748 GFKPDKVTFVGVLSACSHGGLVEESYFHLNSMVKDyGIEPENRHYVCMVDALGRSGRLREAESFINNMH---IKPDALVW 824
Cdd:PLN03218  609 NIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKK-GVKPDEVFFSALVDVAGHAGDLDKAFEILQDARkqgIKLGTVSY 687
                         250       260
                  ....*....|....*....|....*.
gi 15221304   825 GTLLAACKIHGEvelgkvaAKKAIEL 850
Cdd:PLN03218  688 SSLMGACSNAKN-------WKKALEL 706
PLN03218 PLN03218
maturation of RBCL 1; Provisional
386-840 2.74e-09

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 61.05  E-value: 2.74e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   386 SSVAAALISMYSKSGDID----LSEQVFEDLDDIQRQNIVNVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLS 461
Cdd:PLN03218  318 VSSATNSLSLDKKNNGVKdaelPGQSSGQAASDVEEENSLAAYNGGVSGKRKSPEYIDAYNRLLRDGRIKDCIDLLEDME 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   462 VLDCLNLGKQVHGYTLKS----GLVLD------LTVGSSLFT---LYSKCGSLEES------YKLFQGIPFK-DNACWAS 521
Cdd:PLN03218  398 KRGLLDMDKIYHAKFFKAckkqRAVKEafrfakLIRNPTLSTfnmLMSVCASSQDIdgalrvLRLVQEAGLKaDCKLYTT 477
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   522 MISGFNEYGYLREAIGLFSEMLDDGTSPDESTLAAVLTVCSSHPSLPrgKEIHGYtlraGIdkgmdlgsalvnMYSKcgS 601
Cdd:PLN03218  478 LISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVA--KAFGAY----GI------------MRSK--N 537
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   602 LKLARQVYDrlpeldpvscsSLISGYSQHGLIQDGFLLFRDMVMSGFTM--DSFAISSILKAAALSDESSLGAQVHAYIT 679
Cdd:PLN03218  538 VKPDRVVFN-----------ALISACGQSGAVDRAFDVLAEMKAETHPIdpDHITVGALMKACANAGQVDRAKEVYQMIH 606
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   680 KIGLCTEPSVGSSLLTMYSKFGSIDDCCKAFSQIN----GPDLIAWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVT 755
Cdd:PLN03218  607 EYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKkkgvKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVS 686
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   756 FVGVLSACSHGGLVEESyFHLNSMVKDYGIEPENRHYVCMVDALGRSGRLREAESFINNMH---IKPDALVWGTLLAACK 832
Cdd:PLN03218  687 YSSLMGACSNAKNWKKA-LELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKrlgLCPNTITYSILLVASE 765

                  ....*...
gi 15221304   833 IHGEVELG 840
Cdd:PLN03218  766 RKDDADVG 773
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
314-363 1.21e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.60  E-value: 1.21e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15221304   314 PSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACGR 363
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
113-160 1.66e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.21  E-value: 1.66e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15221304   113 PDVVSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISAC 160
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
215-262 2.47e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 50.82  E-value: 2.47e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15221304   215 NVYCWNTIIAGALRNQNYGAVFDLFHEMCVGFQKPDSYTYSSVLAACA 262
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PLN03218 PLN03218
maturation of RBCL 1; Provisional
515-787 4.99e-07

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 53.73  E-value: 4.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   515 DNACWASMISGFNEYGYLREAIGLFSEMLDDGT--SPDESTLAAVLTVCSSHPSLPRGKEIHGYTLRAGIDKGMDLGSAL 592
Cdd:PLN03218  541 DRVVFNALISACGQSGAVDRAFDVLAEMKAETHpiDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIA 620
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   593 VNMYSKCGSLKLARQVYDRLPEL----DPVSCSSLISGYSQHGLIQDGFLLFRDMVMSGFTMDSFAISSILKAAALSDES 668
Cdd:PLN03218  621 VNSCSQKGDWDFALSIYDDMKKKgvkpDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNW 700
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   669 SLGAQVHAYITKIGLctEPSVgsslltmyskfgsiddcckafSQINgpdliawtALIASYAQHGKANEALQVYNLMKEKG 748
Cdd:PLN03218  701 KKALELYEDIKSIKL--RPTV---------------------STMN--------ALITALCEGNQLPKALEVLSEMKRLG 749
                         250       260       270
                  ....*....|....*....|....*....|....*....
gi 15221304   749 FKPDKVTFVGVLSACSHGGLVEESyFHLNSMVKDYGIEP 787
Cdd:PLN03218  750 LCPNTITYSILLVASERKDDADVG-LDLLSQAKEDGIKP 787
PLN03218 PLN03218
maturation of RBCL 1; Provisional
191-558 2.28e-06

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 51.80  E-value: 2.28e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   191 LIDVFSKNLRFEDAYKVFR----DSLSANVYCWNTIIAGALRNQNYGAVFDLFHEMCVGFQKPDSYTYSSVLAACA---S 263
Cdd:PLN03218  443 LMSVCASSQDIDGALRVLRlvqeAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCAragQ 522
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   264 LEK-------LRFGKVVQARVI------KCG-------AEDVF---------------VCTAIVDLYAKCGHMAEAMEVF 308
Cdd:PLN03218  523 VAKafgaygiMRSKNVKPDRVVfnalisACGqsgavdrAFDVLaemkaethpidpdhiTVGALMKACANAGQVDRAKEVY 602
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   309 SRIPN------PSVvsWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACGRPSMVCEASQVHAWVFKSGF 382
Cdd:PLN03218  603 QMIHEynikgtPEV--YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGI 680
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   383 YLDSSVAAALISMYSKSGDIDLSEQVFEDLDDIQRQNIV---NVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSL 459
Cdd:PLN03218  681 KLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVstmNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSIL 760
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304   460 LSVL---DCLNLGKQVHGYTLKSGLVLDLTVGSSLFTL----YSKCGSLEESYKLFQ-GIPFKDNAcWASMisgfneygy 531
Cdd:PLN03218  761 LVASerkDDADVGLDLLSQAKEDGIKPNLVMCRCITGLclrrFEKACALGEPVVSFDsGRPQIENK-WTSW--------- 830
                         410       420
                  ....*....|....*....|....*..
gi 15221304   532 lreAIGLFSEMLDDGTSPDESTLAAVL 558
Cdd:PLN03218  831 ---ALMVYRETISAGTLPTMEVLSQVL 854
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
719-752 3.36e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.37  E-value: 3.36e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15221304   719 IAWTALIASYAQHGKANEALQVYNLMKEKGFKPD 752
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
716-763 4.97e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 44.66  E-value: 4.97e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15221304   716 PDLIAWTALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSAC 763
Cdd:pfam13812  13 LNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
712-876 2.31e-05

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 46.92  E-value: 2.31e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 712 QINGPDLIAWTALIASYAQHGKANEALQVYNLMKEkgFKPDKVTFVGVL-SACSHGGLVEESYFHLNSMVKdygIEPEN- 789
Cdd:COG0457   2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE--LDPDDAEALYNLgLAYLRLGRYEEALADYEQALE---LDPDDa 76
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 790 RHYVCMVDALGRSGRLREAESFINN-MHIKP-DALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISLSNILAE 867
Cdd:COG0457  77 EALNNLGLALQALGRYEEALEDYDKaLELDPdDAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEK 156

                ....*....
gi 15221304 868 VGEWDEVEE 876
Cdd:COG0457 157 LGRYEEALE 165
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
719-749 2.63e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 41.68  E-value: 2.63e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15221304   719 IAWTALIASYAQHGKANEALQVYNLMKEKGF 749
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
317-347 3.74e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 41.30  E-value: 3.74e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15221304   317 VSWTVMLSGYTKSNDAFSALEIFKEMRHSGV 347
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
317-350 1.99e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.36  E-value: 1.99e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15221304   317 VSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEIN 350
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
284-328 2.72e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 39.27  E-value: 2.72e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 15221304   284 DVFVCTAIVDLYAKCGHMAEAMEVFSRIPN----PSVVSWTVMLSGYTK 328
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKrgvkPNVYTYTILINGLCK 50
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ...
798-877 3.40e-04

Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 41.87  E-value: 3.40e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 798 ALGRSGRLREAESFINN-MHIKPD-ALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISLSNILAEVGEWDEVE 875
Cdd:COG5010  63 LYNKLGDFEESLALLEQaLQLDPNnPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAK 142

                ..
gi 15221304 876 ET 877
Cdd:COG5010 143 AA 144
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
314-362 3.78e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 39.26  E-value: 3.78e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 15221304   314 PSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACG 362
Cdd:pfam13812  13 LNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIG 61
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
616-661 4.67e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.88  E-value: 4.67e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 15221304   616 DPVSCSSLISGYSQHGLIQDGFLLFRDMVMSGFTMDSFAISSILKA 661
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILING 47
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ...
798-882 7.41e-04

Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 40.56  E-value: 7.41e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 798 ALGRSGRLREAESFINN-MHIKPDALVWGTLLAACKIH-GEVELGKVAAKKAIELEPSDAGAYISLSNILAEVGEWDEVE 875
Cdd:COG4783  13 ALLLAGDYDEAEALLEKaLELDPDNPEAFALLGEILLQlGDLDEAIVLLHEALELDPDEPEARLNLGLALLKAGDYDEAL 92

                ....*...
gi 15221304 876 ET-RKLMK 882
Cdd:COG4783  93 ALlEKALK 100
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
116-150 8.05e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 8.05e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15221304   116 VSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANE 150
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ...
797-877 8.16e-04

Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 43.06  E-value: 8.16e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 797 DALGRSGRLREAESFINN-MHIKPD-ALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISLSNILAEVGEWDEV 874
Cdd:COG3914 120 NLLLALGRLEEALAALRRaLALNPDfAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNALQDLGRLEEA 199

                ...
gi 15221304 875 EET 877
Cdd:COG3914 200 IAA 202
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ...
785-876 8.31e-04

Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 40.56  E-value: 8.31e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 785 IEPENRH-YVCMVDALGRSGRLREAESFINN-MHIKPD-ALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISL 861
Cdd:COG4783  33 LDPDNPEaFALLGEILLQLGDLDEAIVLLHEaLELDPDePEARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRL 112
                        90
                ....*....|....*
gi 15221304 862 SNILAEVGEWDEVEE 876
Cdd:COG4783 113 ARAYRALGRPDEAIA 127
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
618-648 1.03e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 1.03e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15221304   618 VSCSSLISGYSQHGLIQDGFLLFRDMVMSGF 648
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
116-146 1.12e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 1.12e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 15221304   116 VSCNIMISGYKQHRLFEESLRFFSKMHFLGF 146
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
618-652 1.27e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.05  E-value: 1.27e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 15221304   618 VSCSSLISGYSQHGLIQDGFLLFRDMVMSGFTMDS 652
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
835-882 2.02e-03

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 38.23  E-value: 2.02e-03
                        10        20        30        40
                ....*....|....*....|....*....|....*....|....*...
gi 15221304 835 GEVELGKVAAKKAIELEPSDAGAYISLSNILAEVGEWDEVEETRKLMK 882
Cdd:COG3063   6 GDLEEAEEYYEKALELDPDNADALNNLGLLLLEQGRYDEAIALEKALK 53
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ...
785-882 2.22e-03

Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 41.52  E-value: 2.22e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 785 IEPEN-RHYVCMVDALGRSGRLREAESFINNM-HIKPD-ALVWGTLLAACKIHGEVELGKVAAKKAIELEPSDAGAYISL 861
Cdd:COG3914 141 LNPDFaEAYLNLGEALRRLGRLEEAIAALRRAlELDPDnAEALNNLGNALQDLGRLEEAIAAYRRALELDPDNADAHSNL 220
                        90       100
                ....*....|....*....|.
gi 15221304 862 SNILAEVGEWDEVEETRKLMK 882
Cdd:COG3914 221 LFALRQACDWEVYDRFEELLA 241
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
514-553 3.88e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 36.19  E-value: 3.88e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 15221304   514 KDNACWASMISGFNEYGYLREAIGLFSEMLDDGTSPDEST 553
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYT 40
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
518-551 4.55e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 4.55e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15221304   518 CWASMISGFNEYGYLREAIGLFSEMLDDGTSPDE 551
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
419-463 4.59e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 35.80  E-value: 4.59e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 15221304   419 NIV--NVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVL 463
Cdd:pfam13041   2 DVVtyNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
798-876 7.03e-03

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 36.69  E-value: 7.03e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15221304 798 ALGRSGRLREAESFINN-MHIKPD-ALVWGTLLAACKIHGEVELGKvAAKKAIELEPSDAGAYISLSNILAEVGEWDEVE 875
Cdd:COG3063   1 LYLKLGDLEEAEEYYEKaLELDPDnADALNNLGLLLLEQGRYDEAI-ALEKALKLDPNNAEALLNLAELLLELGDYDEAL 79

                .
gi 15221304 876 E 876
Cdd:COG3063  80 A 80
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH