NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1789572199|emb|CAA0403050|]
View 

unnamed protein product [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 13595575)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
144-500 6.18e-41

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 157.35  E-value: 6.18e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  144 KLGHEPSIVTFGSLLNGFCRGDRVYDALYMFDQMVGMGYKPNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVT 223
Cdd:PLN03218   430 KLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHT 509
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  224 YNSLISGLCSSGRWSDATRMVSCMTKREIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRS--LDPDIVTYSLLIYGL 301
Cdd:PLN03218   510 FGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKAC 589
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  302 CMYSRLDEAEEMFGFMVS---KGcFPDVvtYSILINGYCKSKKVEHGMKLFCEMSQRGVVRNTVTYTILIQGYCRAGKLN 378
Cdd:PLN03218   590 ANAGQVDRAKEVYQMIHEyniKG-TPEV--YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLD 666
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  379 VAEEIFRRMVFCGVHPNIITYNVLLhGLCDNGK-IEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSL 457
Cdd:PLN03218   667 KAFEILQDARKQGIKLGTVSYSSLM-GACSNAKnWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEM 745
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1789572199  458 NCQGLMPDIWTYTTMMLGLYKKGLRREADALFRKMKEDGILPN 500
Cdd:PLN03218   746 KRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPN 788
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
115-163 1.46e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 53.52  E-value: 1.46e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1789572199 115 NLCTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSIVTFGSLLNGFCR 163
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
79-128 2.75e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 35.80  E-value: 2.75e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1789572199  79 PSIADFSRLLSAISKMKKYDVVIYLWEQMQMLGIPHNLCTCNILLNCFCR 128
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
144-500 6.18e-41

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 157.35  E-value: 6.18e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  144 KLGHEPSIVTFGSLLNGFCRGDRVYDALYMFDQMVGMGYKPNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVT 223
Cdd:PLN03218   430 KLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHT 509
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  224 YNSLISGLCSSGRWSDATRMVSCMTKREIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRS--LDPDIVTYSLLIYGL 301
Cdd:PLN03218   510 FGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKAC 589
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  302 CMYSRLDEAEEMFGFMVS---KGcFPDVvtYSILINGYCKSKKVEHGMKLFCEMSQRGVVRNTVTYTILIQGYCRAGKLN 378
Cdd:PLN03218   590 ANAGQVDRAKEVYQMIHEyniKG-TPEV--YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLD 666
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  379 VAEEIFRRMVFCGVHPNIITYNVLLhGLCDNGK-IEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSL 457
Cdd:PLN03218   667 KAFEILQDARKQGIKLGTVSYSSLM-GACSNAKnWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEM 745
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1789572199  458 NCQGLMPDIWTYTTMMLGLYKKGLRREADALFRKMKEDGILPN 500
Cdd:PLN03218   746 KRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPN 788
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
184-232 3.53e-17

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 75.09  E-value: 3.53e-17
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1789572199 184 PNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVTYNSLISGLC 232
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
115-163 1.46e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 53.52  E-value: 1.46e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1789572199 115 NLCTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSIVTFGSLLNGFCR 163
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
362-396 1.17e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.75  E-value: 1.17e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 362 VTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNI 396
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
117-151 4.69e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 4.69e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 117 CTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSI 151
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
79-128 2.75e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 35.80  E-value: 2.75e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1789572199  79 PSIADFSRLLSAISKMKKYDVVIYLWEQMQMLGIPHNLCTCNILLNCFCR 128
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
144-500 6.18e-41

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 157.35  E-value: 6.18e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  144 KLGHEPSIVTFGSLLNGFCRGDRVYDALYMFDQMVGMGYKPNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVT 223
Cdd:PLN03218   430 KLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHT 509
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  224 YNSLISGLCSSGRWSDATRMVSCMTKREIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRS--LDPDIVTYSLLIYGL 301
Cdd:PLN03218   510 FGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKAC 589
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  302 CMYSRLDEAEEMFGFMVS---KGcFPDVvtYSILINGYCKSKKVEHGMKLFCEMSQRGVVRNTVTYTILIQGYCRAGKLN 378
Cdd:PLN03218   590 ANAGQVDRAKEVYQMIHEyniKG-TPEV--YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLD 666
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  379 VAEEIFRRMVFCGVHPNIITYNVLLhGLCDNGK-IEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSL 457
Cdd:PLN03218   667 KAFEILQDARKQGIKLGTVSYSSLM-GACSNAKnWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEM 745
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1789572199  458 NCQGLMPDIWTYTTMMLGLYKKGLRREADALFRKMKEDGILPN 500
Cdd:PLN03218   746 KRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPN 788
PLN03218 PLN03218
maturation of RBCL 1; Provisional
84-416 1.14e-29

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 123.45  E-value: 1.14e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199   84 FSRLLSAISKMKKYDVVIYLWEQMQMLGIPHNLCTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSIVTFGSLLNGFCR 163
Cdd:PLN03218   475 YTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQ 554
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  164 GDRVYDAlymFDQMVGMG-----YKPNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVTYNSLISGLCSSGRWS 238
Cdd:PLN03218   555 SGAVDRA---FDVLAEMKaethpIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWD 631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  239 DATRMVSCMTKREIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDIVTYSLLIYGLCMYSRLDEAEEMFGFMV 318
Cdd:PLN03218   632 FALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIK 711
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  319 SKGCFPDVVTYSILINGYCKSKKVEHGMKLFCEMSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIIT 398
Cdd:PLN03218   712 SIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVM 791
                          330
                   ....*....|....*...
gi 1789572199  399 YNVLLhGLCDNgKIEKAL 416
Cdd:PLN03218   792 CRCIT-GLCLR-RFEKAC 807
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
85-491 4.06e-26

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 112.27  E-value: 4.06e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  85 SRLLSAISKM---KKYDVVIYLWEQMQmLGIPHNL--CTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSIVTFGSLLN 159
Cdd:PLN03081   88 VSLCSQIEKLvacGRHREALELFEILE-AGCPFTLpaSTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLL 166
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 160 GFCRGDRVYDALYMFDQMVgmgyKPNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVTYNSLISGLCSSGRWSD 239
Cdd:PLN03081  167 MHVKCGMLIDARRLFDEMP----ERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARA 242
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 240 ATRMVSCMTKREIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSldpdIVTYSLLIYGLCMYSRLDEAEEMFGFMVS 319
Cdd:PLN03081  243 GQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKT----TVAWNSMLAGYALHGYSEEALCLYYEMRD 318
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 320 KGCFPDVVTYSILINGYCKSKKVEHGMKLFCEMSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVfcgvHPNIITY 399
Cdd:PLN03081  319 SGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMP----RKNLISW 394
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 400 NVLLHGLCDNGKIEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSLN-CQGLMPDIWTYTTMMLGLYK 478
Cdd:PLN03081  395 NALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSeNHRIKPRAMHYACMIELLGR 474
                         410
                  ....*....|...
gi 1789572199 479 KGLRREADALFRK 491
Cdd:PLN03081  475 EGLLDEAYAMIRR 487
PLN03077 PLN03077
Protein ECB2; Provisional
120-500 1.09e-22

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 101.85  E-value: 1.09e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 120 NILLNCFCRCSQLSLALSFLGKMIKLghepSIVTFGSLLNGFCRGDRVYDALYMFDQMVGMGYKPNVVIYNTIIDGLCKS 199
Cdd:PLN03077  226 NALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELL 301
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 200 KQVDNALDLLNRMEKDGIGPDVVTYNSLISGLCSSGRWSDATRMVSCMTKReiypDVFTFNALIDACVKEGRVSEAEEFY 279
Cdd:PLN03077  302 GDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAMISGYEKNGLPDKALETY 377
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 280 EEMIRRSLDPDIVTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLFCEMSQrgvvR 359
Cdd:PLN03077  378 ALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPE----K 453
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 360 NTVTYTILIQGYCRAGKLNVAEEIFRRMVFcGVHPNIITY---------------------NVLLHGLCDNGKIEKALVI 418
Cdd:PLN03077  454 DVISWTSIIAGLRLNNRCFEALIFFRQMLL-TLKPNSVTLiaalsacarigalmcgkeihaHVLRTGIGFDGFLPNALLD 532
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 419 L---------ADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSLNCQGLMPDIWTYTTMMLGLYKKGLRREADALF 489
Cdd:PLN03077  533 LyvrcgrmnyAWNQFNSHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYF 612
                         410
                  ....*....|..
gi 1789572199 490 RKMKED-GILPN 500
Cdd:PLN03077  613 HSMEEKySITPN 624
PLN03218 PLN03218
maturation of RBCL 1; Provisional
224-503 4.13e-22

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 100.34  E-value: 4.13e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  224 YNSLISglcsSGRWSDATRMVSCMTKR------EIYPDVFtfnalIDACVKEGRVSEAEEFYEEMirrsLDPDIVTYSLL 297
Cdd:PLN03218   377 YNRLLR----DGRIKDCIDLLEDMEKRglldmdKIYHAKF-----FKACKKQRAVKEAFRFAKLI----RNPTLSTFNML 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  298 IYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLFCEMSQRGVVRNTVTYTILIQGYCRAGKL 377
Cdd:PLN03218   444 MSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQV 523
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  378 NVAEEIFRRMVFCGVHPNIITYNVLLHGLCDNGKIEKALVILADM--QKNGMDADIVTYNIIIRGMCKAGEVADAWDIYC 455
Cdd:PLN03218   524 AKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMkaETHPIDPDHITVGALMKACANAGQVDRAKEVYQ 603
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1789572199  456 SLNCQGLMPDIWTYTTMMLGLYKKGLRREADALFRKMKEDGILPNECY 503
Cdd:PLN03218   604 MIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVF 651
PLN03077 PLN03077
Protein ECB2; Provisional
63-406 1.33e-18

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 89.14  E-value: 1.33e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  63 DDSLDLFFHMVQCRPLPSIADFSRLLSAISKMKKYDVVIYLWEQMQMLGIPHNLCTCNILLNCFCRCSQLSLALSFLGKM 142
Cdd:PLN03077  371 DKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNI 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 143 iklgHEPSIVTFGSLLNGFCRGDRVYDALYMFDQMVgMGYKPNVVIY--------------------------------- 189
Cdd:PLN03077  451 ----PEKDVISWTSIIAGLRLNNRCFEALIFFRQML-LTLKPNSVTLiaalsacarigalmcgkeihahvlrtgigfdgf 525
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 190 --NTIIDGLCKSKQVDNALDLLNRMEKdgigpDVVTYNSLISGLCSSGRWSDATRMVSCMTKREIYPDVFTFNALIDACV 267
Cdd:PLN03077  526 lpNALLDLYVRCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACS 600
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 268 KEGRVSEAEEFYEEM-IRRSLDPDIVTYSLLIYGLCMYSRLDEAEEMFGFMVSKgcfPDVVTYSILINGYCKSKKVEHGm 346
Cdd:PLN03077  601 RSGMVTQGLEYFHSMeEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPIT---PDPAVWGALLNACRIHRHVELG- 676
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 347 klfcEMSQRGVVR---NTVTYTILIQG-YCRAGKLNVAEEIFRRMVFCGV------------------------HPNIIT 398
Cdd:PLN03077  677 ----ELAAQHIFEldpNSVGYYILLCNlYADAGKWDEVARVRKTMRENGLtvdpgcswvevkgkvhafltddesHPQIKE 752

                  ....*...
gi 1789572199 399 YNVLLHGL 406
Cdd:PLN03077  753 INTVLEGF 760
PLN03077 PLN03077
Protein ECB2; Provisional
150-500 3.26e-17

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 84.90  E-value: 3.26e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 150 SIVTFG-----SLLNGFCRGDRVYDALYMFDQMVgmgyKPNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVTY 224
Cdd:PLN03077  115 SHPSLGvrlgnAMLSMFVRFGELVHAWYVFGKMP----ERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTF 190
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 225 -----------------------------------NSLISGLCSSGRWSDATRMVSCMTKReiypDVFTFNALIDACVKE 269
Cdd:PLN03077  191 pcvlrtcggipdlargrevhahvvrfgfeldvdvvNALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFEN 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 270 GRVSEAEEFYEEMIRRSLDPDIVTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLF 349
Cdd:PLN03077  267 GECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVF 346
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 350 CEMSqrgvVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLHGLCDNGKIEKALVILADMQKNGMDA 429
Cdd:PLN03077  347 SRME----TKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLIS 422
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1789572199 430 DIVTYNIIIRGMCKAGEVADAWDIYcslncqGLMP--DIWTYTTMMLGLYKKGLRREADALFRKMKEDgILPN 500
Cdd:PLN03077  423 YVVVANALIEMYSKCKCIDKALEVF------HNIPekDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPN 488
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
184-232 3.53e-17

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 75.09  E-value: 3.53e-17
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1789572199 184 PNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVTYNSLISGLC 232
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
324-373 4.55e-17

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 74.71  E-value: 4.55e-17
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1789572199 324 PDVVTYSILINGYCKSKKVEHGMKLFCEMSQRGVVRNTVTYTILIQGYCR 373
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
190-501 1.23e-16

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 82.98  E-value: 1.23e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 190 NTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVTY-----------------------------------NSLISGLCSS 234
Cdd:PLN03077   55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYvalfrlcewkraveegsrvcsralsshpslgvrlgNAMLSMFVRF 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 235 GRWSDATRMVSCMTKReiypDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDIVTYSLLIYGLCMYSRLDEAEEMF 314
Cdd:PLN03077  135 GELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVH 210
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 315 GFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLFCEMSqrgvVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHP 394
Cdd:PLN03077  211 AHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMP----RRDCISWNAMISGYFENGECLEGLELFFTMRELSVDP 286
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 395 NIITYNVLLHGLCDNGKIEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSLNCQglmpDIWTYTTMML 474
Cdd:PLN03077  287 DLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAMIS 362
                         330       340
                  ....*....|....*....|....*..
gi 1789572199 475 GLYKKGLRREADALFRKMKEDGILPNE 501
Cdd:PLN03077  363 GYEKNGLPDKALETYALMEQDNVSPDE 389
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
394-443 7.23e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 71.24  E-value: 7.23e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1789572199 394 PNIITYNVLLHGLCDNGKIEKALVILADMQKNGMDADIVTYNIIIRGMCK 443
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
140-452 1.39e-15

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 79.89  E-value: 1.39e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 140 GKMIKLGHEPSIVTFGSLLNGFCRGDRVYDALYMFDQMvgmgYKPNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGP 219
Cdd:PLN03077  312 GYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRM----ETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSP 387
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 220 DVVTYNSLISGLCSSGRWSDATRMVSCMTKREIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRsldpDIVTYSLLIY 299
Cdd:PLN03077  388 DEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEK----DVISWTSIIA 463
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 300 GLCMYSRLDEAEEMFGFMVSKgCFPDVVTYSILIN-----GYCKSKKVEHGMKLFCEMSQRGVVRNT------------- 361
Cdd:PLN03077  464 GLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSacariGALMCGKEIHAHVLRTGIGFDGFLPNAlldlyvrcgrmny 542
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 362 ------------VTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLHGLCDNGKIEKALVILADMQKN-GMD 428
Cdd:PLN03077  543 awnqfnshekdvVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKySIT 622
                         330       340
                  ....*....|....*....|....
gi 1789572199 429 ADIVTYNIIIRGMCKAGEVADAWD 452
Cdd:PLN03077  623 PNLKHYACVVDLLGRAGKLTEAYN 646
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
254-302 2.84e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 69.70  E-value: 2.84e-15
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1789572199 254 PDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDIVTYSLLIYGLC 302
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
193-492 5.94e-15

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 77.60  E-value: 5.94e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 193 IDGLCKSKQVDNALDLLNRMEKDG-IGPDVVTYNSLISGLCSSGRWSDATRMVSCMTKREIYPDVFTFNALIDACVKEGR 271
Cdd:PLN03081   94 IEKLVACGRHREALELFEILEAGCpFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGM 173
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 272 VSEAEEFYEEMIRRSLdpdiVTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLFCE 351
Cdd:PLN03081  174 LIDARRLFDEMPERNL----ASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCC 249
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 352 MSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVfcgvHPNIITYNVLLHGLCDNGKIEKALVILADMQKNGMDADI 431
Cdd:PLN03081  250 VLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMP----EKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQ 325
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1789572199 432 VTYNIIIRGMCKAGEVADAWDIYCSLNCQGLMPDIwTYTTMMLGLYKK-GLRREADALFRKM 492
Cdd:PLN03081  326 FTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDI-VANTALVDLYSKwGRMEDARNVFDRM 386
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
359-408 7.83e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 68.54  E-value: 7.83e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1789572199 359 RNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLHGLCD 408
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
289-338 2.63e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.00  E-value: 2.63e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1789572199 289 PDIVTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCK 338
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
219-268 2.44e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 61.23  E-value: 2.44e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1789572199 219 PDVVTYNSLISGLCSSGRWSDATRMVSCMTKREIYPDVFTFNALIDACVK 268
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
430-478 1.19e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 59.30  E-value: 1.19e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1789572199 430 DIVTYNIIIRGMCKAGEVADAWDIYCSLNCQGLMPDIWTYTTMMLGLYK 478
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
103-357 2.35e-11

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 66.05  E-value: 2.35e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 103 LWEQMQMLGIPHNLCTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSIVTFGSLLNGFCRGDRVYDALYMFDQMVgmgy 182
Cdd:PLN03081  312 LYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMP---- 387
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 183 KPNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVTYNSLISGLCSSGRWSDATRMVSCMTK-REIYPDVFTFNA 261
Cdd:PLN03081  388 RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSEnHRIKPRAMHYAC 467
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 262 LIDACVKEGRVSEAeefyEEMIRRS-LDPDIVTYSLLIYGLCMYSRLD----EAEEMFGFMVSKgcfpdVVTYSILINGY 336
Cdd:PLN03081  468 MIELLGREGLLDEA----YAMIRRApFKPTVNMWAALLTACRIHKNLElgrlAAEKLYGMGPEK-----LNNYVVLLNLY 538
                         250       260
                  ....*....|....*....|.
gi 1789572199 337 CKSKKVEHGMKLFCEMSQRGV 357
Cdd:PLN03081  539 NSSGRQAEAAKVVETLKRKGL 559
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
149-198 1.65e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 56.22  E-value: 1.65e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1789572199 149 PSIVTFGSLLNGFCRGDRVYDALYMFDQMVGMGYKPNVVIYNTIIDGLCK 198
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
320-353 2.38e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 55.43  E-value: 2.38e-10
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1789572199 320 KGCFPDVVTYSILINGYCKSKKVEHGMKLFCEMS 353
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PLN03218 PLN03218
maturation of RBCL 1; Provisional
79-244 2.84e-10

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 62.97  E-value: 2.84e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199   79 PSIADFSRLLSAISKMKKYDVVIYLWEQMQMLGIPHNLCTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSIVTFGSLL 158
Cdd:PLN03218   682 LGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILL 761
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199  159 NGFCRGDRVYDALYMFDQMVGMGYKPNVVIYNTIIdGLCKsKQVDNALDLlnrmekdgiGPDVVTYNSliSGLCSSGRWS 238
Cdd:PLN03218   762 VASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLCL-RRFEKACAL---------GEPVVSFDS--GRPQIENKWT 828

                   ....*.
gi 1789572199  239 DATRMV 244
Cdd:PLN03218   829 SWALMV 834
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
355-388 4.24e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 54.66  E-value: 4.24e-10
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1789572199 355 RGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMV 388
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
180-213 8.49e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 53.89  E-value: 8.49e-10
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1789572199 180 MGYKPNVVIYNTIIDGLCKSKQVDNALDLLNRME 213
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
115-163 1.46e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 53.52  E-value: 1.46e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1789572199 115 NLCTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSIVTFGSLLNGFCR 163
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
263-445 2.67e-09

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 57.02  E-value: 2.67e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 263 IDACVKEGRVSEAEEFYEEMIRRSLDPDIVTYSLLIYgLCmySRLDEAEEMFgfmvskgcfpdvvtysilingycKSKKV 342
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLY-LC--SKAADATDLK-----------------------PQLAA 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 343 EHGMKLFCEMSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLHGLCDNGKIEKALVILADM 422
Cdd:pfam17177  72 DRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHM 151
                         170       180
                  ....*....|....*....|...
gi 1789572199 423 QKNGMDADIVTYNIIIRGMCKAG 445
Cdd:pfam17177 152 LAHGVELEEPELAALLKVSAKAG 174
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
347-409 3.59e-09

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 52.75  E-value: 3.59e-09
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1789572199 347 KLFCEMSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLhGLCDN 409
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVIGG 62
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
383-442 2.18e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 50.82  E-value: 2.18e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 383 IFRRMVFCGVHPNIITYNVLLHGLCDNGKIEKALVILADMQKNGMDADIVTYNIIIrGMC 442
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVI 60
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
215-248 2.55e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 49.65  E-value: 2.55e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1789572199 215 DGIGPDVVTYNSLISGLCSSGRWSDATRMVSCMT 248
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
208-269 5.00e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 49.66  E-value: 5.00e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1789572199 208 LLNRMEKDGIGPDVVTYNSLISGLCSSGRWSDATRMVSCMTKREIYPDVFTFNALIDACVKE 269
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
173-229 6.03e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 49.28  E-value: 6.03e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1789572199 173 MFDQMVGMGYKPNVVIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDVVTYNSLIS 229
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
250-282 4.63e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 46.18  E-value: 4.63e-07
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1789572199 250 REIYPDVFTFNALIDACVKEGRVSEAEEFYEEM 282
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
391-422 7.13e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 45.41  E-value: 7.13e-07
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1789572199 391 GVHPNIITYNVLLHGLCDNGKIEKALVILADM 422
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
362-396 1.17e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.75  E-value: 1.17e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 362 VTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNI 396
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
257-291 1.34e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.75  E-value: 1.34e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 257 FTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDI 291
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
187-221 1.35e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.75  E-value: 1.35e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 187 VIYNTIIDGLCKSKQVDNALDLLNRMEKDGIGPDV 221
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
327-361 2.79e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.98  E-value: 2.79e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 327 VTYSILINGYCKSKKVEHGMKLFCEMSQRGVVRNT 361
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
146-178 3.52e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.49  E-value: 3.52e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1789572199 146 GHEPSIVTFGSLLNGFCRGDRVYDALYMFDQMV 178
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
327-357 4.26e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 43.22  E-value: 4.26e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1789572199 327 VTYSILINGYCKSKKVEHGMKLFCEMSQRGV 357
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
168-286 5.01e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 47.39  E-value: 5.01e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 168 YDALYMFDQMVGMGYKPNVVIYNTIIDgLCKSKQ-VDNALDLLNRMEKDGIGPDVVTYNSLISGLCSSGRWSDATRMVSC 246
Cdd:pfam17177  72 DRGFEVFEAMKAQGVSPNEATYTAVAR-LAAAKGdGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEH 150
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 1789572199 247 MTKREIYPDVFTFNALIDACVKEGRvseAEEFYEEM--IRRS 286
Cdd:pfam17177 151 MLAHGVELEEPELAALLKVSAKAGR---ADKVYAYLhrLRDA 189
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
280-336 5.70e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.89  E-value: 5.70e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1789572199 280 EEMIRRSLDPDIVTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGY 336
Cdd:pfam13812   4 REMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
425-450 6.22e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 42.72  E-value: 6.22e-06
                          10        20
                  ....*....|....*....|....*.
gi 1789572199 425 NGMDADIVTYNIIIRGMCKAGEVADA 450
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEA 26
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
285-318 8.19e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 42.33  E-value: 8.19e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1789572199 285 RSLDPDIVTYSLLIYGLCMYSRLDEAEEMFGFMV 318
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
464-501 9.06e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 42.74  E-value: 9.06e-06
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1789572199 464 PDIWTYTTMMLGLYKKGLRREADALFRKMKEDGILPNE 501
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNV 38
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
257-287 1.08e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.07  E-value: 1.08e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1789572199 257 FTFNALIDACVKEGRVSEAEEFYEEMIRRSL 287
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
362-392 1.11e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.07  E-value: 1.11e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1789572199 362 VTYTILIQGYCRAGKLNVAEEIFRRMVFCGV 392
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
187-217 1.70e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 41.68  E-value: 1.70e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1789572199 187 VIYNTIIDGLCKSKQVDNALDLLNRMEKDGI 217
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
292-326 2.13e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.29  E-value: 2.13e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 292 VTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDV 326
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
198-333 3.28e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 45.08  E-value: 3.28e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 198 KSKQVDNALDLLNRMEKDGIGPDVVTYNSLISgLCSSGRWSDATRMVSCMTK-REIY---------PDVFTFNALIDACV 267
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKAADATDLKPQLAADRgFEVFeamkaqgvsPNEATYTAVARLAA 101
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1789572199 268 KEGRVSEAEEFYEEMIRRSLDPDIVTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILI 333
Cdd:pfam17177 102 AKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALL 167
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
398-496 3.91e-05

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 46.40  E-value: 3.91e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 398 TYNVLLHGLCDNGKIEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSlncqglMPD--IWTYTTMMLG 475
Cdd:PLN03081  125 TYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDE------MPErnLASWGTIIGG 198
                          90       100
                  ....*....|....*....|.
gi 1789572199 476 LYKKGLRREADALFRKMKEDG 496
Cdd:PLN03081  199 LVDAGNYREAFALFREMWEDG 219
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
397-431 6.14e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 6.14e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 397 ITYNVLLHGLCDNGKIEKALVILADMQKNGMDADI 431
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
170-282 6.20e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 44.31  E-value: 6.20e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1789572199 170 ALYMFDQMVGMGYKPNVVIYNTIIDgLC----------KSKQVDNALDLLNRMEKDGIGPDVVTYNSLISgLCSSGRWSD 239
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYNVLLY-LCskaadatdlkPQLAADRGFEVFEAMKAQGVSPNEATYTAVAR-LAAAKGDGD 107
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 1789572199 240 -ATRMVSCMTKREIYPDVFTFNALIDACVKEGRVSEAEEFYEEM 282
Cdd:pfam17177 108 lAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHM 151
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
152-186 9.18e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.75  E-value: 9.18e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 152 VTFGSLLNGFCRGDRVYDALYMFDQMVGMGYKPNV 186
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
137-193 1.28e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.03  E-value: 1.28e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1789572199 137 SFLGKMIKLGHEPSIVTFGSLLNGFCRGDRVYDALYMFDQMVGMGYKPNVVIYNTII 193
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
432-466 1.83e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.59  E-value: 1.83e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 432 VTYNIIIRGMCKAGEVADAWDIYCSLNCQGLMPDI 466
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
247-298 2.99e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.88  E-value: 2.99e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1789572199 247 MTKREIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDIVTYSLLI 298
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
397-427 3.49e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 3.49e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1789572199 397 ITYNVLLHGLCDNGKIEKALVILADMQKNGM 427
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
117-151 4.69e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 4.69e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 117 CTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSI 151
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
103-161 7.91e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 37.72  E-value: 7.91e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1789572199 103 LWEQMQMLGIPHNLCTCNILLNCFCRCSQLSLALSFLGKMIKLGHEPSIVTFGSLLNGF 161
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
292-322 1.20e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 36.29  E-value: 1.20e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1789572199 292 VTYSLLIYGLCMYSRLDEAEEMFGFMVSKGC 322
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
222-252 1.65e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.90  E-value: 1.65e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1789572199 222 VTYNSLISGLCSSGRWSDATRMVSCMTKREI 252
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
222-256 1.88e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.89  E-value: 1.88e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1789572199 222 VTYNSLISGLCSSGRWSDATRMVSCMTKREIYPDV 256
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
457-503 1.88e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 36.57  E-value: 1.88e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1789572199 457 LNCQGLMPDIWTYTTMMLGLYKKGLRREADALFRKMKEDGILPNECY 503
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDT 52
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
467-497 2.11e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.52  E-value: 2.11e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1789572199 467 WTYTTMMLGLYKKGLRREADALFRKMKEDGI 497
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
79-128 2.75e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 35.80  E-value: 2.75e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1789572199  79 PSIADFSRLLSAISKMKKYDVVIYLWEQMQMLGIPHNLCTCNILLNCFCR 128
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
468-501 2.90e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 2.90e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1789572199 468 TYTTMMLGLYKKGLRREADALFRKMKEDGILPNE 501
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
461-493 3.15e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.40  E-value: 3.15e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1789572199 461 GLMPDIWTYTTMMLGLYKKGLRREADALFRKMK 493
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH