|
Name |
Accession |
Description |
Interval |
E-value |
| PLN02993 |
PLN02993 |
lupeol synthase |
1-763 |
0e+00 |
|
lupeol synthase
Pssm-ID: 215537 [Multi-domain] Cd Length: 763 Bit Score: 1680.85 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 1 MWKLKIGEGNGEDPYLFSSNNFVGRQTWEFDPKAGTPEERAAVEDARRNYLDNRPRVKGCSDLLWRMQFLKEAKFEQVIP 80
Cdd:PLN02993 1 MWKLKIGEGNGEDPYLFSSNNFVGRQTWEFDPKAGTPEERAAVEEARRSFLDNRSRVKGCSDLLWRMQFLKEAKFEQVIP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 81 PVKIDDGEGITYKNATDALRRAVSFYSALQSSDGHWPAEITGTLFFLPPLVFCFYITGHLEKIFDAEHRKEMLRHIYCHQ 160
Cdd:PLN02993 81 PVKIDRGEEITYETATNALRRGVSFFSALQASDGHWPGEITGPLFFLPPLVFCLYITGHLEEVFDAEHRKEMLRHIYCHQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 161 NEDGGWGLHIEGKSVMFCTVLNYICLRMLGEGPNGGRNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMP 240
Cdd:PLN02993 161 NEDGGWGLHIESKSVMFCTVLNYICLRMLGEGPNGGRENACKRARQWILDHGGVTYIPSWGKFWLSILGIYDWSGTNPMP 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 241 PEIWLLPSFFPIHLGKTLCYTRMVYMPMSYLYGKRFVGPLTPLIMLLRKELHLQPYEEINWNKARRLCAKEDMIYPHPLV 320
Cdd:PLN02993 241 PEIWLLPSFLPIHLGKTLCYTRMVYMPMSYLYGKRFVGPITPLIMLLREELHLQPYEEINWNKARRLCAKEDMYYPHPLV 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 321 QDLLWDTLHNFVEPILTNWPLKKLVREKALRVAMEHIHYEDENSHYITIGCVEKVLCMLACWIENPNGDHFKKHLARIPD 400
Cdd:PLN02993 321 QDLIWDTLHNFVEPFLTRWPLNKLVREKALQVAMKHIHYEDENSHYITIGCVEKVLCMLACWIENPNGDHFKKHLARIPD 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 401 FMWVAEDGLKMQSFGSQLWDTVFAIQALLACDLSDETDDVLRKGHSFIKKSQVRENPSGDFKSMYRHISKGAWTLSDRDH 480
Cdd:PLN02993 401 YMWVAEDGMKMQSFGSQLWDTGFAIQALLASDLSDETDDVLRRGHNYIKKSQVRENPSGDFKSMYRHISKGAWTLSDRDH 480
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 481 GWQVSDCTAEALKCCMLLSLMPAEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRAQEWLELLNPTDFFTCVMAER 560
Cdd:PLN02993 481 GWQVSDCTAEALKCCMLLSMMPADVVGQKIDPEQLYDSVNLLLSLQSENGGVTAWEPVRAYKWLELLNPTDFFANTMVER 560
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 561 EYVECTSAVIQALVLFKQLYPDHRTKEIIKSIEKGVQFIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSCLA 640
Cdd:PLN02993 561 EYVECTSAVIQALVLFKQLYPDHRTKEIIKSIEKAVQFIESKQTPDGSWYGNWGICFIYATWFALGGLAAAGKTYNDCLA 640
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 641 VRKGVDFLLAIQEEDGGWGESHLSCPEQRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDPTPLHCAAKLIITSQLENGDF 720
Cdd:PLN02993 641 MRKGVHFLLTIQRDDGGWGESYLSCPEQRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDLIPLHRAAKLIITSQLENGDF 720
|
730 740 750 760
....*....|....*....|....*....|....*....|...
gi 1901008554 721 PQQEILGVFMNTCMLHYATYRNIFPLWALAEYRKAAFATHQDL 763
Cdd:PLN02993 721 PQQEILGAFMNTCMLHYATYRNTFPLWALAEYRKAAFITHADL 763
|
|
| PLN03012 |
PLN03012 |
Camelliol C synthase |
1-754 |
0e+00 |
|
Camelliol C synthase
Pssm-ID: 166653 [Multi-domain] Cd Length: 759 Bit Score: 1416.32 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 1 MWKLKIGEGNGEDPYLFSSNNFVGRQTWEFDPKAGTPEERAAVEDARRNYLDNRPRVKGCSDLLWRMQFLKEAKFEQVIP 80
Cdd:PLN03012 1 MWKLKIAEGNGDDPYLFSTNNFAGRQTWEFDPDAGSPEELAAVEEARRIFYDDRFHVKASSDLIWRMQFLKEKKFEQRIA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 81 PVKIDDGEGITYKNATDALRRAVSFYSALQSSDGHWPAEITGTLFFLPPLVFCFYITGHLEKIFDAEHRKEMLRHIYCHQ 160
Cdd:PLN03012 81 PAKVEDAEKITFEIATNALRKGIHFFSALQASDGHWPAENAGPLFFLPPLVFCLYITGHLDEIFTQDHRKEILRYIYCHQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 161 NEDGGWGLHIEGKSVMFCTVLNYICLRMLGEGPNGGRNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMP 240
Cdd:PLN03012 161 KEDGGWGLHIEGHSTMFCTTLNYICMRILGEGPDGGHDNACGRARDWILDHGGATYIPSWGKTWLSILGVFDWSGSNPMP 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 241 PEIWLLPSFFPIHLGKTLCYTRMVYMPMSYLYGKRFVGPLTPLIMLLRKELHLQPYEEINWNKARRLCAKEDMIYPHPLV 320
Cdd:PLN03012 241 PEFWILPSFFPIHPAKMWCYCRLVYLPMSYLYGKRFVGPISPLILQLREEIYLQPYAEINWMKARHLCAKEDAYCPHPLI 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 321 QDLLWDTLHNFVEPILTNWPLKKLVREKALRVAMEHIHYEDENSHYITIGCVEKVLCMLACWIENPNGDHFKKHLARIPD 400
Cdd:PLN03012 321 QDLIWDCLYIFAEPFLACWPFNKLLREKALGLAMKHIHYEDENSRYITIGCVEKALCMLACWVEDPNGDHFKKHLLRISD 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 401 FMWVAEDGLKMQSFGSQLWDTVFAIQALLACDLSDETDDVLRKGHSFIKKSQVRENPSGDFKSMYRHISKGAWTLSDRDH 480
Cdd:PLN03012 401 YLWIAEDGMKMQSFGSQLWDSGFALQALLASNLSNEIPDVLRRGHDFIKNSQVGENPSGDFKNMYRHISKGAWTFSDRDH 480
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 481 GWQVSDCTAEALKCCMLLSLMPAEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRAQEWLELLNPTDFFTCVMAER 560
Cdd:PLN03012 481 GWQASDCTAEGFKCCLLFSMIAPDIVGPKMDPEQLHDAVNILLSLQSKNGGMTAWEPAGAPEWLELLNPTEMFADIVIEH 560
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 561 EYVECTSAVIQALVLFKQLYPDHRTKEIIKSIEKGVQFIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSCLA 640
Cdd:PLN03012 561 EYNECTSSAIQALILFKQLYPDHRTEEINAFIKKAAEYIENIQMLDGSWYGNWGICFTYGTWFALAGLAAAGKTFNDCEA 640
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 641 VRKGVDFLLAIQEEDGGWGESHLSCPEQRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDPTPLHCAAKLIITSQLENGDF 720
Cdd:PLN03012 641 IRKGVHFLLAAQKDNGGWGESYLSCPKKIYIAQEGEISNLVQTAWALMGLIHAGQAERDPIPLHRAAKLIINSQLENGDF 720
|
730 740 750
....*....|....*....|....*....|....
gi 1901008554 721 PQQEILGVFMNTCMLHYATYRNIFPLWALAEYRK 754
Cdd:PLN03012 721 PQQEATGAFLKNCLLHYAAYRNIFPLWALAEYRA 754
|
|
| SQCY_1 |
cd02892 |
Squalene cyclase (SQCY) domain subgroup 1; found in class II terpene cyclases that have an ... |
99-752 |
0e+00 |
|
Squalene cyclase (SQCY) domain subgroup 1; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. This group contains bacterial SQCY which catalyzes the convertion of squalene to hopene or diplopterol and eukaryotic OSQCY which transforms the 2,3-epoxide of squalene to compounds such as, lanosterol in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain.
Pssm-ID: 239222 [Multi-domain] Cd Length: 634 Bit Score: 945.48 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 99 LRRAVSFYSALQSSDGHWPAEITGTLFFLPPLVFCFYITGHlekIFDAEHRKEMLRHIYCHQNEDGGWGLHIEGKSVMFC 178
Cdd:cd02892 1 IRRALEFLLSLQAPDGHWPGELEGPLFITAEYILLLYILGI---PIDPEHRKEIARYLRNHQNPDGGWGLHHEGPSTMFG 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 179 TVLNYICLRMLGEGPNGGrnnACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMPPEIWLLPSFFPIHLGKTL 258
Cdd:cd02892 78 TVLNYVALRLLGVSPDDP---HMVKARNWILSHGGAARIPVWGKIWLALLGVYPWEGVPPLPPELWLLPSWLPFHPYKFW 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 259 CYTRMVYMPMSYLYGKRFVGPLTPLIMLLRKELHLQPYEEINWNKARRlcAKEDMIYPHPLVQDLLWDTLHnFVEPILTN 338
Cdd:cd02892 155 CWARTVYVPMSYLYGKRPVAPITPLVLSLRDELYVEPYEKINWYKHRN--DLYDYRPPWQRLFDALDRLLH-WYEPLPPK 231
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 339 WplkklVREKALRVAMEHIHYEDENSHYITIGCVEKVLCMLACWIENPNGDH-FKKHLARIPDFMWVAEDGLKM-QSFGS 416
Cdd:cd02892 232 P-----LRRKALRKAYEWILYRDENTGYLGIIPPPKANNMLALWVLGYPDSPaFKRHLERIDDFLWLGPEGMKMcQTNGS 306
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 417 QLWDTVFAIQALLACDLSDETDDVLRKGHSFIKKSQVRENPsGDFKSMYRHISKGAWTLSDRDHGWQVSDCTAEALKCCM 496
Cdd:cd02892 307 QVWDTALAVQALLEAGLAPEFDPALKKALDWLLESQILDNP-GDWKVKYRHLRKGGWAFSTANQGYPDSDDTAEALKALL 385
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 497 LLSLMPAEvvGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRAQEWLELLNPTDFFTCVMAEREYVECTSAVIQALVLF 576
Cdd:cd02892 386 RLQELPPF--GEKVSRERLYDAVDWLLGMQNSNGGFAAFEPDNTYHWLENLNPFEDFGDIMIDPPYVECTGSVLEALGLF 463
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 577 KQLYPDHRtKEIIKSIEKGVQFIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSCLAVRKGVDFLLAIQEEDG 656
Cdd:cd02892 464 GKLYPGHR-REIDPAIRRAVKYLLREQEPDGSWYGRWGVCYIYGTWFALEALAAAGEDYENSPYIRKACDFLLSKQNPDG 542
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 657 GWGESHLSCPEQRYipLEGNRSNLVQTAWAMMGLIHAGQAerDPTPLHCAAKLIITSQLENGDFPQQEILGVFMNTCMLH 736
Cdd:cd02892 543 GWGESYLSYEDKSY--AGGGRSTVVQTAWALLALMAAGEP--DSEAVERGIKYLLNTQLPDGDWPQEEITGVGFPNFYIR 618
|
650
....*....|....*.
gi 1901008554 737 YATYRNIFPLWALAEY 752
Cdd:cd02892 619 YHNYRNYFPLWALGRY 634
|
|
| squalene_cyclas |
TIGR01787 |
squalene/oxidosqualene cyclases; This family of enzymes catalyzes the cyclization of the ... |
98-754 |
0e+00 |
|
squalene/oxidosqualene cyclases; This family of enzymes catalyzes the cyclization of the triterpenes squalene or 2-3-oxidosqualene to a variety of products including hopene, lanosterol, cycloartenol, amyrin, lupeol, and isomultiflorenol.
Pssm-ID: 273809 [Multi-domain] Cd Length: 621 Bit Score: 791.64 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 98 ALRRAVSFYSALQSSDGHWPAEITGTLFFLPPLVFCFYITGhlekIFDAEHRKEMLRHIYCHQNEDGGWGLHIEGKSVMF 177
Cdd:TIGR01787 1 TARRAVEFLLSLQAPDGYWWGELEGPLTLLAEYVLLCHIAD----TPLPGYREKIVRYLRHHQNEDGGWGLHIGGKSTVF 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 178 CTVLNYICLRMLGEGPNggrNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMPPEIWLLPSFFPIHLGKT 257
Cdd:TIGR01787 77 GTVLAYVALKILGMSPD---DPAMVRARNFILKQGGAVASPVFTKFWLALLGVYPWEGVPPLPPEIMLLPKWLPIHPSKS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 258 LCYTRMVYMPMSYLYGKRFVGPLTPlimllRKELHLQPYEEinwNKARRLCAKEDMIYPHPLVQDLLWDTLHNFVEPILT 337
Cdd:TIGR01787 154 WCRCRMVYLPMSYCYGERLSAPIDP-----REELYVEDDSI---RAQRNNVAKEDLYTPHSWLLRALYGLLNLFYHPFLR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 338 NWplkklVREKALRVAMEHIHYEDenshyiTIGCVEKVLCMLACWIEN-PNGDHFKKHLARIPDFMWVAEDGLKMQSFGS 416
Cdd:TIGR01787 226 KA-----LRKRALQWLYEHIAADG------SIGPISKAMAMLALWFLDgPNSPAFQKHLQRIDDYLWLQLDGMKMQGTGS 294
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 417 QLWDTVFAIQALLACDLS--DETDDVLRKGHSFIKKSQVRENPSGDFKsMYRHISK-GAWTLSDRDHGW-QVSDCTAEAL 492
Cdd:TIGR01787 295 QVWDTAFAIQALRESGDHrlPEFHPALVKAHEWLLLSQIPDNPPGDWK-VYRHNLKpGGWAFSFLNCGYpDVDDTAVVAL 373
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 493 KCCMLLSLMpaevvgQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRAQEWLELLNPTDFFTCVMAEREYVECTSAVIQA 572
Cdd:TIGR01787 374 KAVLLLQED------EHVKRDRLRDAVNWILGMQSSNGGFAAYDPDNTGEWLELLNPSEVFGDIMIDPPYVDVTARVIQA 447
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 573 LVLFKqlypdHRTKEIIKSIEKGVQFIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSCLAVRKGVDFLLAIQ 652
Cdd:TIGR01787 448 LGAFG-----HRADEIRNVLERALEYLRREQRADGSWFGRWGVNYTYGTGFVLSALAAAGRTYRNCPEVQKACDWLLSRQ 522
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 653 EEDGGWGESHLSCPEQRYIPLEGnrSNLVQTAWAMMGLIHAGQAERDptPLHCAAKLIITSQLENGDFPQQEILGVFMN- 731
Cdd:TIGR01787 523 MPDGGWGEDCFSYEDPSYVGSGG--STPSQTGWALMALIAAGEADSE--AIERGVKYLLETQRPDGDWPQEYITGVGFPk 598
|
650 660
....*....|....*....|...
gi 1901008554 732 TCMLHYATYRNIFPLWALAEYRK 754
Cdd:TIGR01787 599 NFYLKYTNYRNIFPLWALGRYRQ 621
|
|
| osq_cycl |
TIGR03463 |
2,3-oxidosqualene cyclase; This model identifies 2,3-oxidosqualene cyclases from Stigmatella ... |
108-752 |
0e+00 |
|
2,3-oxidosqualene cyclase; This model identifies 2,3-oxidosqualene cyclases from Stigmatella aurantiaca which produces cycloartenol, and Gemmata obscuriglobus and Methylococcus capsulatus, which each produce the closely related sterol, lanosterol.
Pssm-ID: 274591 [Multi-domain] Cd Length: 634 Bit Score: 535.34 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 108 ALQSSDGHWPAEITGTLFFLPPLVFCFYITGhleKIFDAEHRKEMLRHIYCHQNEDGGWGLHIEGKSVMFCTVLNYICLR 187
Cdd:TIGR03463 3 ALQDSAGDWEGDMGGCQFIIAIAVAGLHVMG---RPPDAEERAAIIAHFELHQLADGAWGLDPEAPGQVFFSVLAYVALR 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 188 MLGEGPNggrNNACKRARQWILDH-GGVTYIPSWGKIWLSILGIYDWSGTNPMPPEIWLLPSFFPIHLGKTLCYTRMVYM 266
Cdd:TIGR03463 80 LLGLGKD---DAGLARARAWFHAQpEGPKASGAWGKFILALLGLYEREGLNAVPPELFLLPESLPFHPSRFWCHCRLIYL 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 267 PMSYLYGKRFVGPLT-PLIMLLRKELHLQPYEEINWNKARRLCAKEDMIYPHPLVQDLLWDTLHNFvepilTNWPLKKLv 345
Cdd:TIGR03463 157 GIAWLSGRGARAPESdPLLAAIRQEIFAEGYEQVDFGAARERVAPTDLFTPISFVLKAANDLLAGY-----ERLAGKAL- 230
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 346 REKALRVAMEHIHYEDENSHYITIGCVEKVLCMLACWIENPNGDHFKKHLARIPDFMWVAED-GLKMQSFGS-QLWDTVF 423
Cdd:TIGR03463 231 RARALDFAFEQILAEDEATHYICIGPINGLLNCLAIFAHDPDGPDLAAHLEGLEAWFWEDDAeGLRMNGANSnALWDTAF 310
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 424 AIQALLAC-DLSDETDDVLRKGHSFIKKSQVRENpSGDFKSMYRHISKGAWTLSDRDHGWQVSDCTAEALKCCMLLSLMP 502
Cdd:TIGR03463 311 AVQALAALgELDEEAKHALEEAAAFIDAAQMLAD-LADPQEAFRDPAKGGWCFSDGDHCWPISDCAAEALKALFALEELG 389
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 503 AEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRAQEWLELLNPTDFFTCVMAEREYVECTSAVIQALVLFKQLYPD 582
Cdd:TIGR03463 390 DNRISEALGAARLQDAVEFILSMQNADGGFATYELQRGGKLLELLNPSDMFGQCMTDLSYVECTAACLGALAAWLKHHPD 469
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 583 HRTKEIIKSIEKGVQFIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKScLAVRKGVDFLLAIQEEDGGWGESH 662
Cdd:TIGR03463 470 LPDAKIDAAIRKAEEFIRRRQLDDGSFMGFWGICFTYATFFGAKGLIAAGAEPAD-MALQAAAAFLLEKQRADGAWGEHV 548
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 663 LSCPEQRYIplEGNRSNLVQTAWAMMGLIHAGQAERDPTplHCAAKLIITSQLENGDFPQQEILGVFMNTCMLHYATYRN 742
Cdd:TIGR03463 549 ESCLEARWV--EGKHGHAVMTAWALLALAAAGEAAHDAA--ERGIAWLCEQQGEDGGWPPEGIAGIFFGAAAIDYDAYLR 624
|
650
....*....|
gi 1901008554 743 IFPLWALAEY 752
Cdd:TIGR03463 625 IFPTWALARC 634
|
|
| SqhC |
COG1657 |
Terpene cyclase SqhC [Lipid transport and metabolism]; |
97-756 |
2.15e-168 |
|
Terpene cyclase SqhC [Lipid transport and metabolism];
Pssm-ID: 441263 [Multi-domain] Cd Length: 644 Bit Score: 499.73 E-value: 2.15e-168
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 97 DALRRAVSFYSALQSSDGHWPAEITGTlfFLPPlvfCFYITGH-LEKIFDAEHRKEMLRHIYCHQNEDGGWGLHIEGKSV 175
Cdd:COG1657 23 AAIAAAQALLLQQQDDGGWWGGELEAD--VTIA---AEYILLHhFLGPDDEELEAKIARYLRRQQNDDGGWPLYHGGPGD 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 176 MFCTVLNYICLRMLGEGPNggrNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMPPEIWLLPSFFPIHLG 255
Cdd:COG1657 98 LSTTVKAYFALKLLGDDPD---APHMVRAREFILARGGAARANVFTKIWLALFGQYPWRGVPALPPEIMLLPRWFPFHIY 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 256 KTLCYTRMVYMPMSYLYGKRFVGPLTPLImlLRKELHLQPYEEI-NWNKARRlcakedmiYPHPLVQDLLW-DTLHNFVE 333
Cdd:COG1657 175 KFSYWARTVIVPLLILYARKPVAPLPPGV--GIDELFVEPPEQVdYYFPAPR--------DRSPWSRFFLAlDRLLRAYE 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 334 PiltnWPLKKLvREKALRVAMEHIHYEDENS-HYITIG-CVEKVLCMLACWIENPNGDHFKKHLARIPDFMWVAEDGLKM 411
Cdd:COG1657 245 R----LPPKPL-RRRALRKAEDWILERLEGDgGLGGIFpAMVNSLMALLALGYPPDHPVVRRALEALEKLLVETEDGARC 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 412 QSFGSQLWDTVFAIQALLACDLsDETDDVLRKGHSFIKKSQVREnpSGDFKSMYRHISKGAWTLSDRDHGWQVSDCTAEA 491
Cdd:COG1657 320 QPCVSPVWDTALAVQALQEAGL-PEDHPALERAADWLLSKQILV--KGDWAVKRPDVEPGGWAFQFANDHYPDVDDTAVV 396
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 492 LKCCMLLSLMPAEVVGQKIDpeqlyDSVNLLLSLQGEKGGLTAWEPVRAQEWLELLNPTDFftCVMAEREYVECTSAVIQ 571
Cdd:COG1657 397 LMALLRLRLPDEPRYREAIE-----RAVEWILGMQSRDGGWGAFDKDNTKEWLNKIPFADH--GALLDPPTADVTARCLE 469
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 572 ALVLFKQlypdhrtKEIIKSIEKGVQFIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTyKSCLAVRKGVDFLLAI 651
Cdd:COG1657 470 MLGQLGL-------TEDHPAIRRAVAYLRREQEPDGSWFGRWGVNYIYGTWSVLTGLNAAGVD-PDDPAIRRAVAWLLSI 541
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 652 QEEDGGWGESHLSCPEQRYIPLEgnRSNLVQTAWAMMGLIHAGQAERDptPLHCAAKLIITSQLENGDFPQQEILGV-FM 730
Cdd:COG1657 542 QNADGGWGEDCRSYEDPRYVGLG--PSTASQTAWALLALLAAGEADSP--AVARGIAYLLSTQREDGSWDEEYFTGTgFP 617
|
650 660
....*....|....*....|....*.
gi 1901008554 731 NTCMLHYATYRNIFPLWALAEYRKAA 756
Cdd:COG1657 618 RVFYLRYHLYRQYFPLWALARYRNLR 643
|
|
| SQCY |
cd02889 |
Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - ... |
399-752 |
2.16e-160 |
|
Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. Bacterial SQCY catalyzes the convertion of squalene to hopene or diplopterol. Eukaryotic OSQCY transforms the 2,3-epoxide of squalene to compounds such as, lanosterol (a metabolic precursor of cholesterol and steroid hormones) in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain. This group also contains SQCY-like archael sequences and some bacterial SQCY's which lack this minor domain.
Pssm-ID: 239219 [Multi-domain] Cd Length: 348 Bit Score: 468.24 E-value: 2.16e-160
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 399 PDFMWVAEdglkmqsfGSQLWDTVFAIQALLACDLSDETDDVLRKGHSFIKKSQVRENPsGDFKSMYRHISKGAWTLSDR 478
Cdd:cd02889 14 PDGHWPGE--------YSQVWDTALALQALLEAGLAPEFDPALKKALEWLLKSQIRDNP-DDWKVKYRHLRKGGWAFSTA 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 479 DHGWQVSDCTAEALKCCMLLSLMPAEvvGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRAQEWLELlnPTDFFTCVMA 558
Cdd:cd02889 85 NQGYPDSDDTAEALKALLRLQKKPPD--GKKVSRERLYDAVDWLLSMQNSNGGFAAFEPDNTYKYLEL--IPEVDGDIMI 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 559 EREYVECTSAVIQALVLFKQLYPDHRtKEIIKSIEKGVQFIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTyKSC 638
Cdd:cd02889 161 DPPYVECTGSVLEALGLFGKLYPEHR-REIDPAIRRAVKYLEREQEPDGSWYGRWGVCFIYGTWFALEALAAAGED-ENS 238
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 639 LAVRKGVDFLLAIQEEDGGWGESHLSCPEQRYipLEGNRSNLVQTAWAMMGLIHAGQAerDPTPLHCAAKLIITSQLENG 718
Cdd:cd02889 239 PYVRKACDWLLSKQNPDGGWGESYESYEDPSY--AGGGRSTVVQTAWALLALMAAGEP--DSEAVKRGVKYLLNTQQEDG 314
|
330 340 350
....*....|....*....|....*....|....
gi 1901008554 719 DFPQQEILGVFMNTCMLHYATYRNIFPLWALAEY 752
Cdd:cd02889 315 DWPQEEITGVFFKNFYIRYHNYRNYFPLWALGRY 348
|
|
| hopene_cyclase |
TIGR01507 |
squalene-hopene cyclase; SHC is an essential prokaryotic gene in hopanoid (triterpenoid) ... |
92-758 |
6.68e-58 |
|
squalene-hopene cyclase; SHC is an essential prokaryotic gene in hopanoid (triterpenoid) biosynthesis. Squalene hopene cyclase, an integral membrane protein, directly cyclizes squalene into hopanoid products. [Fatty acid and phospholipid metabolism, Other]
Pssm-ID: 273661 [Multi-domain] Cd Length: 635 Bit Score: 208.59 E-value: 6.68e-58
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 92 YKNATDALRRAVSFYSALQSSDGHWPAEITGTLFFLPPLVFCFYITGH-----LEKIfdaehrKEMLRHiycHQNEDGGW 166
Cdd:TIGR01507 11 TARTVEAIDRAVDYLLSCQKDEGYWWGELESNVTIEAEYVLLCHILDRvdrdrMEKI------RNYLLH---EQREDGTW 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 167 GLHIEGKSVMFCTVLNYICLRMLGEGPNggrNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMPPEIWLL 246
Cdd:TIGR01507 82 ALYPGGPGDLSTTVEAYVALKYIGMSRD---EPPMQKALRFIQSQGGIESSRVFTRMWLALVGEYPWRGVPMVPPEIMLL 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 247 PSFFPIHLGKTLCYTRMVYMPMSYLYGKRFVGPLTplimllrKELHLQPYEEINWNKARRLCAKEDMiyphPLVQDLLWD 326
Cdd:TIGR01507 159 PKRFPFNIYEFSSWARATVVPLSIVCSRKPVFPLP-------ERARVPELYETDVPKPRRRGAKGGT----GWGIFDALD 227
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 327 TLHNFVEPILTnwplkKLVREKALRVAMEHIHYEDENS------------HYITIgcveKVLCMlacwieNPNGDHFKKH 394
Cdd:TIGR01507 228 RALHGYEKLSV-----HPFRRAAEIRALDWLLERQAGDgswggiqpamfnALIAL----KILGM------TQHDAFIKLG 292
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 395 LARIPDFMWVAEDGLKMQSFGSQLWDTVFAIQALLACDLSDEtDDVLRKGHSFIKKSQVreNPSGDFKSMYRHISKGAWT 474
Cdd:TIGR01507 293 WEGIDLYGVELDDSWMFQACVSPVWDTALAVLALREAGLPAD-HDALVKAGEWLLDKQI--TVPGDWAVKRPNLEPGGWA 369
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 475 LS-DRDHGWQVSDctaealKCCMLLSLMPAEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRAQEWLELLNPTDFf 553
Cdd:TIGR01507 370 FQfDNVYYPDVDD------TAVVVWALNGLRLPDERRRRDAMTKAFRWIAGMQSSNGGWGAFDVDNTSDLLNHIPFCDF- 442
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 554 tcvmaeREYVECTSAVIQALVLfkQLYPDHRTKEIIKSIEKGVQFIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGK 633
Cdd:TIGR01507 443 ------GAVTDPPTADVTARVL--ECLGSFGYDDAWPVIERAVEYLKREQEPDGSWFGRWGVNYLYGTGAVLSALKAVGI 514
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 634 TYKSClAVRKGVDFLLAIQEEDGGWGESHLSCPEQRYIPlEGNrSNLVQTAWAMMGLIHAGQAErdPTPLHCAAKLIITS 713
Cdd:TIGR01507 515 DTREP-YIQKALAWLESHQNPDGGWGEDCRSYEDPAYAG-KGA-STASQTAWALIALIAAGRAE--SEAARRGVQYLVET 589
|
650 660 670 680
....*....|....*....|....*....|....*....|....*.
gi 1901008554 714 QLENGDFPQQEILGV-FMNTCMLHYATYRNIFPLWALAEYRKAAFA 758
Cdd:TIGR01507 590 QRPDGGWDEPYYTGTgFPGDFYLGYHMYRHVFPLLALARYKQAIER 635
|
|
| ISOPREN_C2_like |
cd00688 |
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two ... |
393-752 |
5.73e-52 |
|
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY), these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II) which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and involved in the immobilization and entrapment of proteases. PZP is a pregnancy associated protein. Alpha (2)-M and PZP are known to bind to and, may modulate, the activity of placental protein-14 in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system.
Pssm-ID: 238362 [Multi-domain] Cd Length: 300 Bit Score: 182.75 E-value: 5.73e-52
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 393 KHLARIPDfmwvAEDGLKMQSFGSQLWDTVFAIQALLACDLS----DETDDVLRKGHSFIKKSQvrenpsgdfksmyrhI 468
Cdd:cd00688 6 KYLLRYPY----GDGHWYQSLCGEQTWSTAWPLLALLLLLAAtgirDKADENIEKGIQRLLSYQ---------------L 66
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 469 SKGAWTLSDRdHGWQVSDCTAEALKCCMLLSLMPAevvgqkIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRAQEwlelln 548
Cdd:cd00688 67 SDGGFSGWGG-NDYPSLWLTAYALKALLLAGDYIA------VDRIDLARALNWLLSLQNEDGGFREDGPGNHRI------ 133
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 549 ptdfftcvMAEREYVECTSAVIQALVLFKQLYPDhrtkeiiKSIEKGVQFIESKQTPDGSWhGNWGICFIYATWFALSGL 628
Cdd:cd00688 134 --------GGDESDVRLTAYALIALALLGKLDPD-------PLIEKALDYLLSCQNYDGGF-GPGGESHGYGTACAAAAL 197
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 629 AAAGKTykSCLAVRKGVDFLLAIQEEDGGWGESHLscpeqryipLEGNRSNLVQTAWAMMGLIHAGQAeRDPTPLHCAAK 708
Cdd:cd00688 198 ALLGDL--DSPDAKKALRWLLSRQRPDGGWGEGRD---------RTNKLSDSCYTEWAAYALLALGKL-GDLEDAEKLVK 265
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1901008554 709 LIITSQLENGDFPQQEILgvfmntcmlHYATYRNIFPLWALAEY 752
Cdd:cd00688 266 WLLSQQNEDGGFSSKPGK---------SYDTQHTVFALLALSLY 300
|
|
| SQHop_cyclase_N |
pfam13249 |
Squalene-hopene cyclase N-terminal domain; Squalene-hopene cyclase, EC:5.4.99.17, catalyzes ... |
99-357 |
7.07e-42 |
|
Squalene-hopene cyclase N-terminal domain; Squalene-hopene cyclase, EC:5.4.99.17, catalyzes the cyclization of squalene into hopene in bacteria. This reaction is part of a cationic cyclization cascade, which is homologous to a key step in cholesterol biosynthesis. This family is the N-terminal domain.
Pssm-ID: 433061 [Multi-domain] Cd Length: 290 Bit Score: 154.56 E-value: 7.07e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 99 LRRAVSFYSALQSSDGHWPAE----ITGT------LFFLPPLvfcfyitghlekifDAEHRKEMLRHIYCHQNEDGGWGL 168
Cdd:pfam13249 1 IARAQDALLSLQHPDGHWVGEleanVTITaeyillRHFLGPD--------------DPELEAKIARYLRSQQREDGGWPL 66
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 169 HIEGKSVMFCTVLNYICLRMLGEGPNggrNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSGTNPMPPEIWLLPS 248
Cdd:pfam13249 67 FHGGPGDLSTTVEAYFALKLLGDSPD---APHMVRAREFILARGGAAKANVFTRIWLALFGQYPWRGVPSMPPEIMLLPR 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 249 FFPIHLGKTLCYTRMVYMPMSYLYGKRFVGPLTPLIMLlrKELHLQPYEEInwnkaRRLCAKEDMIYPHPLVQDLlwDTL 328
Cdd:pfam13249 144 WFPFNIYKFSSWARTTIVPLLILSALKPVAPLPPGIGL--DELFVEPPENV-----RYYPRPHRLFSWTNLFLGL--DRV 214
|
250 260
....*....|....*....|....*....
gi 1901008554 329 HNFVEPiltnWPLKKLvREKALRVAMEHI 357
Cdd:pfam13249 215 LKLYER----LPPKPL-RRRALRKAEEWI 238
|
|
| SQHop_cyclase_C |
pfam13243 |
Squalene-hopene cyclase C-terminal domain; Squalene-hopene cyclase, EC:5.4.99.17, catalyzes ... |
416-754 |
1.03e-40 |
|
Squalene-hopene cyclase C-terminal domain; Squalene-hopene cyclase, EC:5.4.99.17, catalyzes the cyclization of squalene into hopene in bacteria. This reaction is part of a cationic cyclization cascade, which is homologous to a key step in cholesterol biosynthesis. This family is the C-terminal half of the molecule.
Pssm-ID: 433057 [Multi-domain] Cd Length: 319 Bit Score: 152.10 E-value: 1.03e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 416 SQLWDTVFAIQALLACDLSDEtDDVLRKGHSFIKKSQVREnpSGDFKSMYRHISKGAWTLS-DRDHGWQVSDCTAealkc 494
Cdd:pfam13243 2 SPVWDTALALHALLEAGVPAD-HPALVKAAQWLLDRQVLV--KGDWAVKRPDLEPGGWAFQfANDHYPDVDDTAV----- 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 495 cMLLSLMPAEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEPVRAQEWLELLNPTDFFTcvMAEREYVECTSAVIQALV 574
Cdd:pfam13243 74 -VVLALDRVRLPDERRRDDAIARGIEWILGMQSKNGGWGAFDKDNTKYYLNKIPFADHGA--LLDPPTADVTARVLEMLG 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 575 LFKqlYPDHRtkeiiKSIEKGVQFIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSClAVRKGVDFLLAIQEE 654
Cdd:pfam13243 151 QLG--YPDDH-----PVAARALEYLKKEQEPDGSWFGRWGVNYIYGTWSVLCGLAAVGEDHNRP-YIRKAVDWLKSRQNP 222
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 655 DGGWGESHLSCPEQRYipLEGNRSNLVQTAWAMMGLIHAGQAERDptplhcAAK----LIITSQLENGDFPQQEILGV-F 729
Cdd:pfam13243 223 DGGWGEDCESYKDPKL--AGRGPSTASQTAWALLALMAAGEVDSP------AVRrgiqYLLETQKPDGTWDEPYFTGTgF 294
|
330 340
....*....|....*....|....*
gi 1901008554 730 MNTCMLHYATYRNIFPLWALAEYRK 754
Cdd:pfam13243 295 PRVFYLKYHGYRNYFPLWALARYRN 319
|
|
| squa_tetra_cyc |
TIGR04277 |
squalene--tetrahymanol cyclase; This enzyme, also called squalene--tetrahymanol cyclase, ... |
160-754 |
4.81e-13 |
|
squalene--tetrahymanol cyclase; This enzyme, also called squalene--tetrahymanol cyclase, occurs a small number of eukaryotes, some of them anaerobic. The pathway can occur under anaerobic conditions, and the product is thought to replace sterols, letting organisms with this compound build membrane suitable for performing phagocytosis.
Pssm-ID: 212000 [Multi-domain] Cd Length: 624 Bit Score: 72.74 E-value: 4.81e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 160 QNEDGGWgLHIEGKSV----MFCTVLNYICLRMLGEGPNggRNNACKRARQWILDHGGVTYIPSWGKIWLSILGIYDWSg 235
Cdd:TIGR04277 81 QFEDGSW-EQVEDANIetgqLDATIFNYWYLKAIGIDIH--IDAALKKAQEWIKANGGIEAAQTMTKFKLAAFGQYPWE- 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 236 tnpmppEIWLLPSFfpihlgktLCYTRMVYmpmSYLYGK----RFVGPLTPLIMLLRkelhlqpYEEINWNKArrlcake 311
Cdd:TIGR04277 157 ------DLFKIPLF--------IFKKKGIF---KPLYIKditaQWVYPHLTALAYLQ-------NQRIIFNVA------- 205
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 312 dmiyphplVQDL--LWDtlhNFVEPILTNWPLKK--LVREKALRVAMEHIHYEDENSHYITIGCVEKVLCMLAcwIENPN 387
Cdd:TIGR04277 206 --------VADIreLWI---NKAKKGIKHQKKERpsFFIDNDLLILMDEIFKLKQPLGSFGAYTISTLLSLLA--FKDFQ 272
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 388 GDHFKKHLARIPDFMwvaEDGLKMQSF----------GS----QLWDTVFAIQALLAcdlSDETDDVLRKghsfIKKSQV 453
Cdd:TIGR04277 273 GKHPHKHKNEIQDAL---EDGLDFVEFnyfnfreayhGSlddgRWWDTILISWAMLE---SGEDKEKIFP----IVENML 342
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 454 RE--NPSGDFKSMYrhiskgawtlsdrDHGWqvsdcTAEALKCCMLLSLMP--AEVVGQKIDpeqlyDSVNLLLSLQGEK 529
Cdd:TIGR04277 343 KEglQPKGGIEYGY-------------DFEY-----APDADDTGLLLQVLSyyGEAFADAID-----EGAEFLFSMQNDD 399
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 530 GGLTAWEPVRAQEwlellNPTDFFTCVMAereyvectsAVIQALVLFKQLYPD---HRTKEIIKS--------IEKGVQF 598
Cdd:TIGR04277 400 GGFPAFDKGKMED-----NLLFKFAFKIA---------GIADSAEIFDPSCPDitaHILEGLGEFgdqanhdqIQKMIKY 465
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 599 IESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSCLaVRKGVDFLLAIQEEDGGWGESHLSCPEqryiPLEGN-- 676
Cdd:TIGR04277 466 FMDTQEKFGSWEARWGINYIMAAGAVLPALAKMNYDLNEGW-AKNAINWLLNKQNADGGFGECTLSYND----PEKWNgi 540
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 677 -RSNLVQTAWAMMGLIHA-GQAERDPTPLHCAAK-LIITSQLENGDFPQQEILGVFMNTCM-LHYATYRNIFPLWALAEY 752
Cdd:TIGR04277 541 gKSTVTQTSWGLLALLAVeDHNDQIKEAADKAAQyLLDQFKRDDGEFKDHSTIGTGHRGLLyLQYPSYAQSFPLIALGRF 620
|
..
gi 1901008554 753 RK 754
Cdd:TIGR04277 621 LD 622
|
|
| Prenyltrans |
pfam00432 |
Prenyltransferase and squalene oxidase repeat; |
592-632 |
1.37e-08 |
|
Prenyltransferase and squalene oxidase repeat;
Pssm-ID: 395346 [Multi-domain] Cd Length: 44 Bit Score: 50.97 E-value: 1.37e-08
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 1901008554 592 IEKGVQFIESKQTPDGSWHGNWGIC-FIYATWFALSGLAAAG 632
Cdd:pfam00432 3 KEKLVDYLLSCQNEDGGFGGRPGGEsDTYYTYCALAALALLG 44
|
|
| CAL1 |
COG5029 |
Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones, ... |
598-720 |
1.63e-05 |
|
Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones, Lipid transport and metabolism];
Pssm-ID: 444045 [Multi-domain] Cd Length: 259 Bit Score: 47.01 E-value: 1.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 598 FIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKsclaVR-KGVDFLLAIQEEDGGWGeshlSCPEQRYIplegn 676
Cdd:COG5029 27 YLRASQNPDGGFAGRSGPSDLYSTYYAVRTLALLGESPK----WRdRVADLLSSLRVEDGGFA----KAPEGGAG----- 93
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1901008554 677 rsNLVQTAWAMMGLIHAGQAERDPTPLhcaAKLIITSQLENGDF 720
Cdd:COG5029 94 --STYHTYLATLLAELLGRPPPDPDRL---VRFLISQQNDDGGF 132
|
|
| A2M_like |
cd02891 |
Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier ... |
569-701 |
5.50e-05 |
|
Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor: pregnancy zone protein (PZP). PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement. The vertebrate complement is an effector of both the acquired and innate immune systems The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.
Pssm-ID: 239221 [Multi-domain] Cd Length: 282 Bit Score: 45.84 E-value: 5.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 569 VIQALVLFKQLYPDHRTKeIIKSIEKGVQFIESKQTPDGS---WHGNWGicfiYATW---FALSGLAAAGK-TYKSCLAV 641
Cdd:cd02891 29 VLKYLDATGQLTPEIREK-ALEYIRKGYQRLLTYQRSDGSfsaWGNSDS----GSTWltaYVVKFLSQARKyIDVDENVL 103
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 642 RKGVDFLLAIQEEDGGWGESHlscPEQRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDPT 701
Cdd:cd02891 104 ARALGWLVPQQKEDGSFRELG---PVIHREMKGGVDDSVSLTAYVLIALAEAGKACDASI 160
|
|
| Prenyltrans |
pfam00432 |
Prenyltransferase and squalene oxidase repeat; |
147-190 |
1.27e-04 |
|
Prenyltransferase and squalene oxidase repeat;
Pssm-ID: 395346 [Multi-domain] Cd Length: 44 Bit Score: 39.80 E-value: 1.27e-04
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 1901008554 147 EHRKEMLRHIYCHQNEDGGWGLHIEGKSVMFCTVLNYICLRMLG 190
Cdd:pfam00432 1 IDKEKLVDYLLSCQNEDGGFGGRPGGESDTYYTYCALAALALLG 44
|
|
| SQCY |
cd02889 |
Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - ... |
99-209 |
1.34e-03 |
|
Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. Bacterial SQCY catalyzes the convertion of squalene to hopene or diplopterol. Eukaryotic OSQCY transforms the 2,3-epoxide of squalene to compounds such as, lanosterol (a metabolic precursor of cholesterol and steroid hormones) in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain. This group also contains SQCY-like archael sequences and some bacterial SQCY's which lack this minor domain.
Pssm-ID: 239219 [Multi-domain] Cd Length: 348 Bit Score: 41.82 E-value: 1.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 99 LRRAVSFYSALQSSDGHWPAEitgtlfFLPPLVFCFYITGHLEKIFDAEHRKEMLR-----------------HIYCHQN 161
Cdd:cd02889 1 IRRALDFLLSLQAPDGHWPGE------YSQVWDTALALQALLEAGLAPEFDPALKKalewllksqirdnpddwKVKYRHL 74
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 1901008554 162 EDGGWGLHIEGKS--VMFCTVLNYICLRMLGEGPNGGRNN---ACKRARQWIL 209
Cdd:cd02889 75 RKGGWAFSTANQGypDSDDTAEALKALLRLQKKPPDGKKVsreRLYDAVDWLL 127
|
|
| Prenyltrans |
pfam00432 |
Prenyltransferase and squalene oxidase repeat; |
641-689 |
2.31e-03 |
|
Prenyltransferase and squalene oxidase repeat;
Pssm-ID: 395346 [Multi-domain] Cd Length: 44 Bit Score: 36.34 E-value: 2.31e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 1901008554 641 VRKGVDFLLAIQEEDGGWGESHLSCPEQRYiplegnrSNLVQTAWAMMG 689
Cdd:pfam00432 3 KEKLVDYLLSCQNEDGGFGGRPGGESDTYY-------TYCALAALALLG 44
|
|
| ISOPREN_C2_like |
cd00688 |
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two ... |
99-211 |
3.19e-03 |
|
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY), these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II) which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and involved in the immobilization and entrapment of proteases. PZP is a pregnancy associated protein. Alpha (2)-M and PZP are known to bind to and, may modulate, the activity of placental protein-14 in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system.
Pssm-ID: 238362 [Multi-domain] Cd Length: 300 Bit Score: 40.23 E-value: 3.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 99 LRRAVSFYSALQSSDGHWPAEITG---TLFFLPPLVFCFYITGHLEKIFDAEHRKEMLRHIYCHQNEDGGWGLHIEGK-S 174
Cdd:cd00688 1 IEKHLKYLLRYPYGDGHWYQSLCGeqtWSTAWPLLALLLLLAATGIRDKADENIEKGIQRLLSYQLSDGGFSGWGGNDyP 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 1901008554 175 VMFCTVLNYICLRMLGEGPNGGRNNACkRARQWILDH 211
Cdd:cd00688 81 SLWLTAYALKALLLAGDYIAVDRIDLA-RALNWLLSL 116
|
|
| SQCY |
cd02889 |
Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - ... |
641-719 |
4.46e-03 |
|
Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. Bacterial SQCY catalyzes the convertion of squalene to hopene or diplopterol. Eukaryotic OSQCY transforms the 2,3-epoxide of squalene to compounds such as, lanosterol (a metabolic precursor of cholesterol and steroid hormones) in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain. This group also contains SQCY-like archael sequences and some bacterial SQCY's which lack this minor domain.
Pssm-ID: 239219 [Multi-domain] Cd Length: 348 Bit Score: 39.90 E-value: 4.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 641 VRKGVDFLLAIQEEDGGWGeshlscpeqryipleGNRSNLVQTAWAMMGLIHAGQAERDPTPLHCAAKLIITSQ-LENGD 719
Cdd:cd02889 1 IRRALDFLLSLQAPDGHWP---------------GEYSQVWDTALALQALLEAGLAPEFDPALKKALEWLLKSQiRDNPD 65
|
|
| GGTase-II |
cd02894 |
Geranylgeranyltransferase type II (GGTase-II)_like proteins containing the protein ... |
570-662 |
4.93e-03 |
|
Geranylgeranyltransferase type II (GGTase-II)_like proteins containing the protein prenyltransferase (PTase) domain, beta subunit (alpha 6 - alpha 6 barrel fold). GGTase-IIs are a subgroup of the protein prenyltransferase family of lipid-modifying enzymes. PTases catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Prenyltransferases employ a Zn2+ ion to alkylate a thiol group catalyzing the formation of thioether linkages between cysteine residues at or near the C-terminus of protein acceptors and the C1 atom of isoprenoid lipids (geranylgeranyl (20-carbon) in the case of GGTase-II ). GGTase-II catalyzes alkylation of both cysteine residues in Rab proteins containing carboxy-terminal "CC", "CXCX" or "CXC" motifs. PTases are heterodimeric with both alpha and beta subunits required for catalytic activity. In contrast to other prenyltransferases, GGTas-II requires an escort protein to bring the substrate protein to the catalytic heterodimer and to escort the geryanylgeranylated product to the membrane.
Pssm-ID: 239224 [Multi-domain] Cd Length: 287 Bit Score: 39.56 E-value: 4.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 570 IQALVLFKQLYpdhrtkEIIKSIEKGVQFIESKQTPDGSWHGN-WG---ICFIYAtwfALSGLAAAGKTYKSClaVRKGV 645
Cdd:cd02894 86 IQILALYDLLN------KIDENKEKIAKFIKGLQNEDGSFSGDkWGevdTRFSYC---AVLCLTLLGKLDLID--VDKAV 154
|
90 100
....*....|....*....|..
gi 1901008554 646 DFLLAIQEEDGGWG-----ESH 662
Cdd:cd02894 155 DYLLSCYNFDGGFGcrpgaESH 176
|
|
| ISOPREN_C2_like |
cd00688 |
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two ... |
91-211 |
6.17e-03 |
|
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY), these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II) which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and involved in the immobilization and entrapment of proteases. PZP is a pregnancy associated protein. Alpha (2)-M and PZP are known to bind to and, may modulate, the activity of placental protein-14 in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system.
Pssm-ID: 238362 [Multi-domain] Cd Length: 300 Bit Score: 39.46 E-value: 6.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1901008554 91 TYKNATDALRRAVSFYSALQSSDGHWPAEITG-------TLFFLppLVFCfyITGHLEKIfDAEHRKEMLRHIYCHQNED 163
Cdd:cd00688 46 IRDKADENIEKGIQRLLSYQLSDGGFSGWGGNdypslwlTAYAL--KALL--LAGDYIAV-DRIDLARALNWLLSLQNED 120
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1901008554 164 GGWGLHIEGKSVMFC----TVLNYICLRMLGEGPNGGRNNACKRARQWILDH 211
Cdd:cd00688 121 GGFREDGPGNHRIGGdesdVRLTAYALIALALLGKLDPDPLIEKALDYLLSC 172
|
|
|