NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1063732230|ref|NP_001331040|]
View 

Pentatricopeptide repeat (PPR-like) superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1001960)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
188-500 2.29e-23

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 104.57  E-value: 2.29e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  188 TPNIFTCNLLVKALCKKNDIESAYKVLDEIPSMGLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDATTYTVL 267
Cdd:PLN03218   434 NPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGAL 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  268 MDGYCKLGRFSEAA-----------------------------------TVMDDM--EKNEIEPNEVTYGVMIRALCKEK 310
Cdd:PLN03218   514 IDGCARAGQVAKAFgaygimrsknvkpdrvvfnalisacgqsgavdrafDVLAEMkaETHPIDPDHITVGALMKACANAG 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  311 KSGEARNMFDEMLERSFMPDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTEARKLFD 390
Cdd:PLN03218   594 QVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQ 673
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  391 EFEK-GSIPSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPN 469
Cdd:PLN03218   674 DARKqGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPN 753
                          330       340       350
                   ....*....|....*....|....*....|.
gi 1063732230  470 KTTFLILFEGLQKLGKEEDAMKIVSMAVMNG 500
Cdd:PLN03218   754 TITYSILLVASERKDDADVGLDLLSQAKEDG 784
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
153-203 2.05e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.88  E-value: 2.05e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1063732230 153 RSVRSLNTLLNVLIQNQRFDLVHAMFKNSKESfGITPNIFTCNLLVKALCK 203
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKR-GVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
188-500 2.29e-23

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 104.57  E-value: 2.29e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  188 TPNIFTCNLLVKALCKKNDIESAYKVLDEIPSMGLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDATTYTVL 267
Cdd:PLN03218   434 NPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGAL 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  268 MDGYCKLGRFSEAA-----------------------------------TVMDDM--EKNEIEPNEVTYGVMIRALCKEK 310
Cdd:PLN03218   514 IDGCARAGQVAKAFgaygimrsknvkpdrvvfnalisacgqsgavdrafDVLAEMkaETHPIDPDHITVGALMKACANAG 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  311 KSGEARNMFDEMLERSFMPDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTEARKLFD 390
Cdd:PLN03218   594 QVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQ 673
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  391 EFEK-GSIPSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPN 469
Cdd:PLN03218   674 DARKqGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPN 753
                          330       340       350
                   ....*....|....*....|....*....|.
gi 1063732230  470 KTTFLILFEGLQKLGKEEDAMKIVSMAVMNG 500
Cdd:PLN03218   754 TITYSILLVASERKDDADVGLDLLSQAKEDG 784
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
224-273 1.91e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 70.47  E-value: 1.91e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063732230 224 PNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDATTYTVLMDGYCK 273
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
189-463 3.55e-07

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 51.65  E-value: 3.55e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 189 PNIFTCNLLVKALC--KKNDIESAYKVLDEIpsMGLVPNLVTYTTILGG-YVARGDMESAKRVLEEMLDRgwYPDATTYT 265
Cdd:COG2956     4 PVAAALGWYFKGLNylLNGQPDKAIDLLEEA--LELDPETVEAHLALGNlYRRRGEYDRAIRIHQKLLER--DPDRAEAL 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 266 VLM-DGYCKLGRFSEAATVMDDMEKNEIEpNEVTYGVMIRALCKEKKSGEARNMFDEMLERSfmPDSS-LCCKVIDALCE 343
Cdd:COG2956    80 LELaQDYLKAGLLDRAEELLEKLLELDPD-DAEALRLLAEIYEQEGDWEKAIEVLERLLKLG--PENAhAYCELAELYLE 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 344 DHKVDEACGLWRKMLKNNcmPDNALLST-LIHWLCKEGRVTEARKLFDEFEKGSIPSLLTYNTLIAGMCEKGELTEAGRL 422
Cdd:COG2956   157 QGDYDEAIEALEKALKLD--PDCARALLlLAELYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPEEALEL 234
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|.
gi 1063732230 423 WDDMYERkcKPNAFTYNVLIEGLSKNGNVKEGVRVLEEMLE 463
Cdd:COG2956   235 LRKALEL--DPSDDLLLALADLLERKEGLEAALALLERQLR 273
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
263-296 4.97e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.91  E-value: 4.97e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063732230 263 TYTVLMDGYCKLGRFSEAATVMDDMEKNEIEPNE 296
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
153-203 2.05e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.88  E-value: 2.05e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1063732230 153 RSVRSLNTLLNVLIQNQRFDLVHAMFKNSKESfGITPNIFTCNLLVKALCK 203
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKR-GVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
188-500 2.29e-23

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 104.57  E-value: 2.29e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  188 TPNIFTCNLLVKALCKKNDIESAYKVLDEIPSMGLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDATTYTVL 267
Cdd:PLN03218   434 NPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGAL 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  268 MDGYCKLGRFSEAA-----------------------------------TVMDDM--EKNEIEPNEVTYGVMIRALCKEK 310
Cdd:PLN03218   514 IDGCARAGQVAKAFgaygimrsknvkpdrvvfnalisacgqsgavdrafDVLAEMkaETHPIDPDHITVGALMKACANAG 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  311 KSGEARNMFDEMLERSFMPDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTEARKLFD 390
Cdd:PLN03218   594 QVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQ 673
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  391 EFEK-GSIPSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPN 469
Cdd:PLN03218   674 DARKqGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPN 753
                          330       340       350
                   ....*....|....*....|....*....|.
gi 1063732230  470 KTTFLILFEGLQKLGKEEDAMKIVSMAVMNG 500
Cdd:PLN03218   754 TITYSILLVASERKDDADVGLDLLSQAKEDG 784
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
132-463 1.53e-16

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 83.00  E-value: 1.53e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 132 LAGRYESSMRIF--LRIpDFGVKRSVRSLNTLLNVLIQNQRFDLVHAMFkNSKESFGITPNIFTCNLLVKALCKKNDIES 209
Cdd:PLN03081   99 ACGRHREALELFeiLEA-GCPFTLPASTYDALVEACIALKSIRCVKAVY-WHVESSGFEPDQYMMNRVLLMHVKCGMLID 176
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 210 AYKVLDEIPSMglvpNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDATTYTV----------------------- 266
Cdd:PLN03081  177 ARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVmlrasaglgsaragqqlhccvlk 252
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 267 ------------LMDGYCKLGRFSEAATVMDDMEkneiEPNEVTYGVMIRALCKEKKSGEARNMFDEMLERSFMPDS--- 331
Cdd:PLN03081  253 tgvvgdtfvscaLIDMYSKCGDIEDARCVFDGMP----EKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQftf 328
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 332 SLCCKVIDALCE-DHKVDEACGLWRKMLKNNCMPDNALlstlIHWLCKEGRVTEARKLFDefeKGSIPSLLTYNTLIAGM 410
Cdd:PLN03081  329 SIMIRIFSRLALlEHAKQAHAGLIRTGFPLDIVANTAL----VDLYSKWGRMEDARNVFD---RMPRKNLISWNALIAGY 401
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1063732230 411 CEKGELTEAGRLWDDMYERKCKPNAFTYNVLIEGLSKNGNVKEGVRVLEEMLE 463
Cdd:PLN03081  402 GNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSE 454
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
145-486 1.58e-16

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 83.00  E-value: 1.58e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 145 RIPDFGVKRSVRSLNTLLNVLIQNQRFDLVHAMFKNSKESFGITPNIFTCNLLVKALCKKNDIESAYKVLDEIPSMGLVP 224
Cdd:PLN03081   77 RLDDTQIRKSGVSLCSQIEKLVACGRHREALELFEILEAGCPFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEP 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 225 NLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYpdatTYTVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYGVMIR 304
Cdd:PLN03081  157 DQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLA----SWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLR 232
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 305 A--------------LCKEK------------------KSG---EARNMFDEMLERSFMPDSSlcckVIDALCEDHKVDE 349
Cdd:PLN03081  233 AsaglgsaragqqlhCCVLKtgvvgdtfvscalidmysKCGdieDARCVFDGMPEKTTVAWNS----MLAGYALHGYSEE 308
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 350 ACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTEARKLFDEFEKGSIPSLLTYNT-LIAGMCEKGELTEAGRLWDDMYE 428
Cdd:PLN03081  309 ALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTaLVDLYSKWGRMEDARNVFDRMPR 388
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063732230 429 RkckpNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPNKTTFLIL-----FEGLQKLGKE 486
Cdd:PLN03081  389 K----NLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVlsacrYSGLSEQGWE 447
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
224-273 1.91e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 70.47  E-value: 1.91e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063732230 224 PNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDATTYTVLMDGYCK 273
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
398-447 2.36e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 70.08  E-value: 2.36e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063732230 398 PSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIEGLSK 447
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
259-308 5.63e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 68.93  E-value: 5.63e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063732230 259 PDATTYTVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYGVMIRALCK 308
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
159-411 2.86e-13

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 72.60  E-value: 2.86e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  159 NTLLNVLIQN---QR-FDLVHAMfknSKESFGITPNIFTCNLLVKALCKKNDIESAYKVLDEIPSMGLVPNLVTYTTILG 234
Cdd:PLN03218   546 NALISACGQSgavDRaFDVLAEM---KAETHPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVN 622
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  235 GYVARGDMESAKRVLEEMLDRGWYPDATTYTVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYGVMIRALCKEKKSGE 314
Cdd:PLN03218   623 SCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKK 702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230  315 ARNMFDEMLERSFMPDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTEARKLFDEFEK 394
Cdd:PLN03218   703 ALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKE 782
                          250
                   ....*....|....*...
gi 1063732230  395 GSI-PSLLTYNTLIaGMC 411
Cdd:PLN03218   783 DGIkPNLVMCRCIT-GLC 799
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
189-236 1.49e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 59.30  E-value: 1.49e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063732230 189 PNIFTCNLLVKALCKKNDIESAYKVLDEIPSMGLVPNLVTYTTILGGY 236
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
433-480 1.63e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 59.30  E-value: 1.63e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063732230 433 PNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPNKTTFLILFEGL 480
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
294-342 1.09e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.21  E-value: 1.09e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063732230 294 PNEVTYGVMIRALCKEKKSGEARNMFDEMLERSFMPDSSLCCKVIDALC 342
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PLN03077 PLN03077
Protein ECB2; Provisional
126-493 1.22e-08

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 57.94  E-value: 1.22e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 126 LLRNYGLAGRYESSMRIFLRIPdfgvKRSVRSLNTLLNVLIQN----QRFDLVHAMFKNSkesfgITPNIFTCNLLVKAL 201
Cdd:PLN03077  228 LITMYVKCGDVVSARLVFDRMP----RRDCISWNAMISGYFENgeclEGLELFFTMRELS-----VDPDLMTITSVISAC 298
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 202 CKKNDIESAYKVLDEIPSMGLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRgwypDATTYTVLMDGYCKLGRFSEAA 281
Cdd:PLN03077  299 ELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAMISGYEKNGLPDKAL 374
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 282 TVMDDMEKNEIEPNEVTY---------------GVMIRALCKEKK-------SGEARNMF------DEMLER-SFMPDSS 332
Cdd:PLN03077  375 ETYALMEQDNVSPDEITIasvlsacaclgdldvGVKLHELAERKGlisyvvvANALIEMYskckciDKALEVfHNIPEKD 454
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 333 LC--CKVIDALCEDHKVDEACGLWRKM---LKNN----------CMPDNALLS---------------------TLIHWL 376
Cdd:PLN03077  455 VIswTSIIAGLRLNNRCFEALIFFRQMlltLKPNsvtliaalsaCARIGALMCgkeihahvlrtgigfdgflpnALLDLY 534
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 377 CKEGRVTEARKLFDEFEKgsipSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIEGLSKNGNVKEGVR 456
Cdd:PLN03077  535 VRCGRMNYAWNQFNSHEK----DVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLE 610
                         410       420       430
                  ....*....|....*....|....*....|....*...
gi 1063732230 457 VLEEM-LEIGCFPNKTTFLILFEGLQKLGKEEDAMKIV 493
Cdd:PLN03077  611 YFHSMeEKYSITPNLKHYACVVDLLGRAGKLTEAYNFI 648
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
371-411 1.58e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 50.82  E-value: 1.58e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 1063732230 371 TLIHWLCKEGRVTEARKLFDEFEKGSI-PSLLTYNTLIAGMC 411
Cdd:pfam13041   8 TLINGYCKKGKVEEAFKLFNEMKKRGVkPNVYTYTILINGLC 49
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
431-461 1.90e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.04  E-value: 1.90e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063732230 431 CKPNAFTYNVLIEGLSKNGNVKEGVRVLEEM 461
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PLN03077 PLN03077
Protein ECB2; Provisional
126-484 2.23e-08

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 57.17  E-value: 2.23e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 126 LLRNYGLAGRYESSMRIFLRIPDFGVKRSVRSLNTLLNV---LIQNQRFDLVHAMFKNskesFGITPNIFTCNLLVKALC 202
Cdd:PLN03077  158 LVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTcggIPDLARGREVHAHVVR----FGFELDVDVVNALITMYV 233
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 203 KKNDIESAYKVLDEIPSMGLV-------------------------------PNLVTYTTILGGYVARGDMESAKRVLEE 251
Cdd:PLN03077  234 KCGDVVSARLVFDRMPRRDCIswnamisgyfengecleglelfftmrelsvdPDLMTITSVISACELLGDERLGREMHGY 313
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 252 MLDRGWYPDATTYTVLMDGYCKLGRFSEAATVMDDMEKNEIepneVTYGVMIRALCKEKKSGEARNMFDEMLERSFMPD- 330
Cdd:PLN03077  314 VVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDA----VSWTAMISGYEKNGLPDKALETYALMEQDNVSPDe 389
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 331 ----SSL----CCKVIDALCEDHKVDEACGLWRKMLKNN--------------------CMPDNALLS--TLIHWLCKEG 380
Cdd:PLN03077  390 itiaSVLsacaCLGDLDVGVKLHELAERKGLISYVVVANaliemyskckcidkalevfhNIPEKDVISwtSIIAGLRLNN 469
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 381 RVTEARKLFDEFEKGSIPSLLTYNTLIAG-------MCEK---GELTEAGRLWD--------DMYER------------K 430
Cdd:PLN03077  470 RCFEALIFFRQMLLTLKPNSVTLIAALSAcarigalMCGKeihAHVLRTGIGFDgflpnallDLYVRcgrmnyawnqfnS 549
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1063732230 431 CKPNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPNKTTFLILFEGLQKLG 484
Cdd:PLN03077  550 HEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSG 603
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
189-463 3.55e-07

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 51.65  E-value: 3.55e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 189 PNIFTCNLLVKALC--KKNDIESAYKVLDEIpsMGLVPNLVTYTTILGG-YVARGDMESAKRVLEEMLDRgwYPDATTYT 265
Cdd:COG2956     4 PVAAALGWYFKGLNylLNGQPDKAIDLLEEA--LELDPETVEAHLALGNlYRRRGEYDRAIRIHQKLLER--DPDRAEAL 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 266 VLM-DGYCKLGRFSEAATVMDDMEKNEIEpNEVTYGVMIRALCKEKKSGEARNMFDEMLERSfmPDSS-LCCKVIDALCE 343
Cdd:COG2956    80 LELaQDYLKAGLLDRAEELLEKLLELDPD-DAEALRLLAEIYEQEGDWEKAIEVLERLLKLG--PENAhAYCELAELYLE 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 344 DHKVDEACGLWRKMLKNNcmPDNALLST-LIHWLCKEGRVTEARKLFDEFEKGSIPSLLTYNTLIAGMCEKGELTEAGRL 422
Cdd:COG2956   157 QGDYDEAIEALEKALKLD--PDCARALLlLAELYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPEEALEL 234
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|.
gi 1063732230 423 WDDMYERkcKPNAFTYNVLIEGLSKNGNVKEGVRVLEEMLE 463
Cdd:COG2956   235 LRKALEL--DPSDDLLLALADLLERKEGLEAALALLERQLR 273
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
263-296 4.97e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.91  E-value: 4.97e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063732230 263 TYTVLMDGYCKLGRFSEAATVMDDMEKNEIEPNE 296
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
329-378 6.35e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.20  E-value: 6.35e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063732230 329 PDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCK 378
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
220-252 2.27e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 44.26  E-value: 2.27e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063732230 220 MGLVPNLVTYTTILGGYVARGDMESAKRVLEEM 252
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
297-331 2.36e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.98  E-value: 2.36e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063732230 297 VTYGVMIRALCKEKKSGEARNMFDEMLERSFMPDS 331
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
402-435 2.84e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.98  E-value: 2.84e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063732230 402 TYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNA 435
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
227-261 2.87e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.98  E-value: 2.87e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063732230 227 VTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDA 261
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
436-466 4.69e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 43.22  E-value: 4.69e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063732230 436 FTYNVLIEGLSKNGNVKEGVRVLEEMLEIGC 466
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
169-305 4.89e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 47.78  E-value: 4.89e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 169 QRFDLVHAM--FKNSK-ESFGITPNIFTC--NLLVKA-----LCKKNDIESAYKVLDEIPSMGLVPNLVTYTTILGGYVA 238
Cdd:pfam17177  23 KHADATGALalYDAAKaEGVRLAQYHYNVllYLCSKAadatdLKPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1063732230 239 RGDMESAKRVLEEMLDRGWYPDATTYTVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYGVMIRA 305
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKV 169
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
394-426 5.22e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.10  E-value: 5.22e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063732230 394 KGSIPSLLTYNTLIAGMCEKGELTEAGRLWDDM 426
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
247-309 7.60e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.50  E-value: 7.60e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063732230 247 RVLEEMLDRGWYPDATTYTVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYGVMIRALCKE 309
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
436-469 8.96e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.44  E-value: 8.96e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063732230 436 FTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPN 469
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
212-271 1.32e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.73  E-value: 1.32e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 212 KVLDEIPSMGLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDATTYTVLMDGY 271
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
259-288 1.63e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 41.56  E-value: 1.63e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063732230 259 PDATTYTVLMDGYCKLGRFSEAATVMDDME 288
Cdd:pfam12854   5 PDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
303-489 1.91e-05

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 47.56  E-value: 1.91e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 303 IRALCKEKKSGEARNMFdEMLER--SFMPDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEG 380
Cdd:PLN03081   94 IEKLVACGRHREALELF-EILEAgcPFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCG 172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 381 RVTEARKLFDEfekgsIP--SLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIEGLSKNGNVKEGVRVL 458
Cdd:PLN03081  173 MLIDARRLFDE-----MPerNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLH 247
                         170       180       190
                  ....*....|....*....|....*....|.
gi 1063732230 459 EEMLEIGCFPNKTTFLILFEGLQKLGKEEDA 489
Cdd:PLN03081  248 CCVLKTGVVGDTFVSCALIDMYSKCGDIEDA 278
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
290-322 1.94e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 41.56  E-value: 1.94e-05
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063732230 290 NEIEPNEVTYGVMIRALCKEKKSGEARNMFDEM 322
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
185-216 2.00e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 41.56  E-value: 2.00e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063732230 185 FGITPNIFTCNLLVKALCKKNDIESAYKVLDE 216
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
421-473 2.15e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.35  E-value: 2.15e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1063732230 421 RLWDDMYERKCKPNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPNKTTF 473
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTY 53
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
360-391 2.71e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 41.18  E-value: 2.71e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063732230 360 NNCMPDNALLSTLIHWLCKEGRVTEARKLFDE 391
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
297-327 5.95e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.14  E-value: 5.95e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063732230 297 VTYGVMIRALCKEKKSGEARNMFDEMLERSF 327
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
186-239 7.27e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.80  E-value: 7.27e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1063732230 186 GITPNIFTCNLLVKALCKKNDIESAYKVLDEIPSMGLVPNLVTYTTILGGYVAR 239
Cdd:pfam13812  10 GIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
192-225 7.84e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.75  E-value: 7.84e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063732230 192 FTCNLLVKALCKKNDIESAYKVLDEIPSMGLVPN 225
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
263-292 9.17e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 39.76  E-value: 9.17e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063732230 263 TYTVLMDGYCKLGRFSEAATVMDDMEKNEI 292
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PLN03077 PLN03077
Protein ECB2; Provisional
340-473 1.73e-04

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 44.46  E-value: 1.73e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 340 ALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHwLCKEGRVTEARKLFDEFEKGSIPSLLTY--NTLIAGMCEKGELT 417
Cdd:PLN03077   60 ALCSHGQLEQALKLLESMQELRVPVDEDAYVALFR-LCEWKRAVEEGSRVCSRALSSHPSLGVRlgNAMLSMFVRFGELV 138
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1063732230 418 EAGRLWDDMYERkckpNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPNKTTF 473
Cdd:PLN03077  139 HAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTF 190
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
227-256 1.84e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.60  E-value: 1.84e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063732230 227 VTYTTILGGYVARGDMESAKRVLEEMLDRG 256
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
153-203 2.05e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.88  E-value: 2.05e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1063732230 153 RSVRSLNTLLNVLIQNQRFDLVHAMFKNSKESfGITPNIFTCNLLVKALCK 203
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKR-GVKPNVYTYTILINGLCK 50
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
150-312 2.72e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 42.38  E-value: 2.72e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 150 GVKRSVRSLNTLLNV---------LIQNQRFDLVHAMFKNSKESfGITPNIFTCNLLVKALCKKNDIESAYKVLDEIPSM 220
Cdd:pfam17177  41 GVRLAQYHYNVLLYLcskaadatdLKPQLAADRGFEVFEAMKAQ-GVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAA 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 221 GLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDATTYTVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYG 300
Cdd:pfam17177 120 GVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAYLHRLRDAVRQVSESTAG 199
                         170
                  ....*....|..
gi 1063732230 301 VMIRALCKEKKS 312
Cdd:pfam17177 200 VLEEWFRSEAAA 211
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
387-442 3.54e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.88  E-value: 3.54e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1063732230 387 KLFDEFE-KGSIPSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLI 442
Cdd:pfam13812   1 SILREMVrDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PLN03077 PLN03077
Protein ECB2; Provisional
159-476 3.59e-04

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 43.30  E-value: 3.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 159 NTLLNVLIQNQRFDLVHAMFKNSKESfgitpNIFTCNLLVKALCKKNDIESAYKVLDEIpSMGLVPNLVTYTTILGGYVA 238
Cdd:PLN03077  428 NALIEMYSKCKCIDKALEVFHNIPEK-----DVISWTSIIAGLRLNNRCFEALIFFRQM-LLTLKPNSVTLIAALSACAR 501
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 239 RGDMESAKRVLEEMLDRGWYPDATTYTVLMDGYCKLGRFSEAATVMDDMEKneiepNEVTYGVMIRALCKEKKSGEARNM 318
Cdd:PLN03077  502 IGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSMAVEL 576
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 319 FDEMLERSFMPDSslcCKVIDALCedhkvdeACGlwrkmlknncmpdnallstlihwlcKEGRVTEARKLFDEFEK--GS 396
Cdd:PLN03077  577 FNRMVESGVNPDE---VTFISLLC-------ACS-------------------------RSGMVTQGLEYFHSMEEkySI 621
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 397 IPSLLTYNTLIAGMCEKGELTEAGRLWDDMyerKCKPNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGcfPNKTTFLIL 476
Cdd:PLN03077  622 TPNLKHYACVVDLLGRAGKLTEAYNFINKM---PITPDPAVWGALLNACRIHRHVELGELAAQHIFELD--PNSVGYYIL 696
PLN03077 PLN03077
Protein ECB2; Provisional
195-500 5.28e-04

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 42.91  E-value: 5.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 195 NLLVKALCKKNDIESAYKVLDEIPSMglvpNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWYPDATTYTvlmdgyCKL 274
Cdd:PLN03077  125 NAMLSMFVRFGELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFP------CVL 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 275 grfSEAATVMDDMEKNEIEPNEVTYG-----------VMIRALCKEKKSgeARNMFDEMLERSFMPDSSLcckvIDALCE 343
Cdd:PLN03077  195 ---RTCGGIPDLARGREVHAHVVRFGfeldvdvvnalITMYVKCGDVVS--ARLVFDRMPRRDCISWNAM----ISGYFE 265
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 344 DHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTEARKLFD-EFEKGSIPSLLTYNTLIAGMCEKGELTEAGRL 422
Cdd:PLN03077  266 NGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGyVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKV 345
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1063732230 423 WDDMyERKckpNAFTYNVLIEGLSKNGNVKEGVRVLEEMLEIGCFPNKTTFLILFEGLQKLGKEEDAMKIVSMAVMNG 500
Cdd:PLN03077  346 FSRM-ETK---DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKG 419
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
334-365 3.67e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.12  E-value: 3.67e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063732230 334 CCKVIDALCEDHKVDEACGLWRKMLKNNCMPD 365
Cdd:TIGR00756   3 YNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
402-431 4.11e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 4.11e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063732230 402 TYNTLIAGMCEKGELTEAGRLWDDMYERKC 431
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ...
199-326 4.28e-03

Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 37.86  E-value: 4.28e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 199 KALCKKNDIESAYKVLDEIpsMGLVPNLVTYTTILGG-YVARGDMESAKRVLEEMLDRgwYPD-ATTYTVLMDGYCKLGR 276
Cdd:COG4783    12 QALLLAGDYDEAEALLEKA--LELDPDNPEAFALLGEiLLQLGDLDEAIVLLHEALEL--DPDePEARLNLGLALLKAGD 87
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1063732230 277 FSEAATVMDD-MEKNEIEPNevTYGVMIRALCKEKKSGEARNMFDEMLERS 326
Cdd:COG4783    88 YDEALALLEKaLKLDPEHPE--AYLRLARAYRALGRPDEAIAALEKALELD 136
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
236-326 5.14e-03

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 36.30  E-value: 5.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063732230 236 YVARGDMESAKRVLEEMLDRGWYpDATTYTVLMDGYCKLGRFSEAAtvmdDMEK-NEIEPNEV-TYGVMIRALCKEKKSG 313
Cdd:COG3063     2 YLKLGDLEEAEEYYEKALELDPD-NADALNNLGLLLLEQGRYDEAI----ALEKaLKLDPNNAeALLNLAELLLELGDYD 76
                          90
                  ....*....|...
gi 1063732230 314 EARNMFDEMLERS 326
Cdd:COG3063    77 EALAYLERALELD 89
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH