| Name | Accession | Description | Interval | E-value |
| Sterol-sensing | pfam12349 | Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins ... | 308-452 | 1.47e-57 |
|
Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins (SREBPs) are membrane-bound transcription factors that promote lipid synthesis in animal cells. They are embedded in the membranes of the endoplasmic reticulum (ER) in a helical hairpin orientation and are released from the ER by a two-step proteolytic process. Proteolysis begins when the SREBPs are cleaved at Site-1, which is located at a leucine residue in the middle of the hydrophobic loop in the lumen of the ER. Upon proteolytic processing, SREBP can activate the expression of genes involved in cholesterol biosynthesis and uptake. SCAP stimulates cleavage of SREBPs via fusion of their two C-termini. This domain is the transmembrane region that traverses the membrane eight times and is the sterol-sensing domain of the cleavage protein. WD40 domains are found towards the C-terminus. :
Pssm-ID: 463544 [Multi-domain] Cd Length: 153 Bit Score: 195.49 E-value: 1.47e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 308 MVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSS 387
Cdd:pfam12349 1 MVKSKFGLGLAGVIIVLASVASSLGLCAYFGLPLTLIISEVIPFLVLAIGVDNIFLLVKAVVRTPRSLDVSERIAEALGE 80
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568963771 388 ESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMELADL 452
Cdd:pfam12349 81 VGPSITLTSLTEILAFLLGALTDMPAVQEFCLFAAVAVLFDFLLQMTFFVAVLSLDIRRLESNRL 145
|
|
| WD40 super family | cl29593 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1065-1232 | 9.88e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 108.58 E-value: 9.88e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1065 CHRTHTvpcAHQKPITALRAAA--GRLVTGSQDHTLRVFRLDDSCCLFTLKGHSGAITAV-YIDQTMVLASGGQDGAICL 1141
Cdd:cd00200 1 LRRTLK---GHTGGVTCVAFSPdgKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVaASADGTYLASGSSDKTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1142 WDVLTGSRVSQTFAHRGDVTSLTCTASC--VISSGLDDFISIWDRSTGIKLYSIQQDLGCGASLGVISDNLLVTGGQ--G 1217
Cdd:cd00200 78 WDLETGECVRTLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSqdG 157
|
170
....*....|....*
gi 568963771 1218 CVSFWDLNYGDLLQT 1232
Cdd:cd00200 158 TIKLWDLRTGKCVAT 172
|
|
| 2A060601 super family | cl36767 | Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in ... | 11-466 | 5.09e-22 |
|
Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in eukaryotes. The defective protein has been associated with Niemann-Pick disease, which is described in humans as an autosomal recessive lipidosis. It is characterized by the lysosomal accumulation of unesterified cholesterol. It is an integral membrane protein, which indicates that this protein is most likely involved in cholesterol transport or acts as some component of cholesterol homeostasis. [Transport and binding proteins, Other] The actual alignment was detected with superfamily member TIGR00917:
Pssm-ID: 273337 [Multi-domain] Cd Length: 1205 Bit Score: 103.45 E-value: 5.09e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 11 ISQAFYNHGLLCASYPIPIILFTGLCILACCYPLLKLPLPgtgpvefSTPVKGYSPPpadsDHKQGEPSEQPEWYVGaPV 90
Cdd:TIGR00917 309 LARFFGKYGIWVARHPTLVICLSVSVVLLLCVGLIRFKVE-------TRPVKLWVAP----GSRAALEKQYFDTHFG-PF 376
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 91 AYIQQIFVKSSVSPWHRNllavdvFRSPLSraFQLVEEIRNHVLRDSSGTKSLEEVCLQVTDLLpglrkLRSLLPeHGCL 170
Cdd:TIGR00917 377 YRIEQLIIATVQTSSHEK------APEILT--DDNLKLLFDIQKKVSQLFANYEGELITLDSPC-----FKPNHP-YNCF 442
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 171 LLSPGNFWQNDWERF---HADPDIIGTIHQH-EPKTLQTSATLKDllFGVP-------GKYSGVSLYTRKRMVsytITLV 239
Cdd:TIGR00917 443 IYSTCKKLQNMYSKLkpeNYDDYGGVDYVKYcFEHFTSPESCLSA--FGGPvdpttvlGGFSGNNFSEASAFV---VTFP 517
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 240 FQRYHAK-------------FLSSLRARLmLLHPSPNCSLRAENLVHVHFKEEiGIAELIPLVTTYIILFAYIY----FS 302
Cdd:TIGR00917 518 VNNFVNKtnktekavawekaFIQLAKDEL-LPMVQATISFSAERSIEDELKRE-STADVITIAISYLVMFAYISltlgDS 595
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 303 TR-KIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSV-----------VS 370
Cdd:TIGR00917 596 PRlKSLYVTSKVLLGLSGILIVMLSVLGSVGVFSAVGLKSTLIIMEVIPFLVLAVGVDNIFILVFFYfyleyfyrqvgVD 675
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 371 TPVDLEVKLRIAQGLSSESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMEla 450
Cdd:TIGR00917 676 NEQELTLERRLSRALMEVGPSITLASLSEILAFALGALIKMPAVRVFSMFAVLAVFLDFLLQITAFVALLVLDFKRTE-- 753
|
490
....*....|....*.
gi 568963771 451 dlNKRLppeSCLPSAK 466
Cdd:TIGR00917 754 --DKRV---DCFPCIK 764
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 952-1144 | 6.78e-16 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 81.50 E-value: 6.78e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 952 KGSPPLAWTPSTAGSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVFL--DRRIVAARLNGSLD 1027
Cdd:COG2319 234 ATGKLLRTLTGHSGSVRSVAFspDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSpdGKLLASGSDDGTVR 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1028 FFSLETHTSLSPLQfrgtpgrgsspsssvysssntvtchrthtvpcAHQKPITALRAAA--GRLVTGSQDHTLRVFRLDD 1105
Cdd:COG2319 314 LWDLATGKLLRTLT--------------------------------GHTGAVRSVAFSPdgKTLASGSDDGTVRLWDLAT 361
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 568963771 1106 SCCLFTLKGHSGAITAVYI--DQTMvLASGGQDGAICLWDV 1144
Cdd:COG2319 362 GELLRTLTGHTGAVTSVAFspDGRT-LASGSADGTVRLWDL 401
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 771-802 | 2.01e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain. :
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 2.01e-03
10 20 30
....*....|....*....|....*....|....
gi 568963771 771 VLRGHLMDIECLA--SDGMLLVSCCLAGQVCVWD 802
Cdd:smart00320 7 TLKGHTGPVTSVAfsPDGKYLASGSDDGTIKLWD 40
|
|
|
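The cd00200 description in the table above characterises a WD40 repeat as a GH dipeptide, a conserved core, and a closing WD dipeptide. As a rough illustration only, the sketch below scans an ungapped sequence for GH...WD candidates. The function name `find_gh_wd_spans` and the core-length bounds (23 to 41 residues) are assumptions made for this example and are not taken from the results above. Real WD40 detection relies on profile models such as those listed in the table, since many genuine repeats lack a strict GH or WD.

```python
import re


def find_gh_wd_spans(seq, min_core=23, max_core=41):
    """Return half-open (start, end) spans of GH...WD candidates in seq.

    min_core/max_core bound the number of residues between the GH and WD
    dipeptides; the defaults are illustrative assumptions. The sequence is
    expected to be ungapped (remove '-' characters first).
    """
    # A lookahead lets candidates that start inside an earlier match be found;
    # the lazy quantifier stops at the nearest WD within the allowed range.
    pattern = re.compile(r"(?=(GH[A-Z]{%d,%d}?WD))" % (min_core, max_core))
    return [(m.start(1), m.end(1)) for m in pattern.finditer(seq.upper())]


# smart00320 consensus row as shown in the alignment above
consensus = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"
print(find_gh_wd_spans(consensus))
```

A plain dipeptide scan like this is only a sanity check on a single repeat; it says nothing about statistical significance, which is what the E-values in the table report.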
| Name | Accession | Description | Interval | E-value |
| Sterol-sensing | pfam12349 | Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins ... | 308-452 | 1.47e-57 |
|
Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins (SREBPs) are membrane-bound transcription factors that promote lipid synthesis in animal cells. They are embedded in the membranes of the endoplasmic reticulum (ER) in a helical hairpin orientation and are released from the ER by a two-step proteolytic process. Proteolysis begins when the SREBPs are cleaved at Site-1, which is located at a leucine residue in the middle of the hydrophobic loop in the lumen of the ER. Upon proteolytic processing, SREBP can activate the expression of genes involved in cholesterol biosynthesis and uptake. SCAP stimulates cleavage of SREBPs via fusion of their two C-termini. This domain is the transmembrane region that traverses the membrane eight times and is the sterol-sensing domain of the cleavage protein. WD40 domains are found towards the C-terminus.
Pssm-ID: 463544 [Multi-domain] Cd Length: 153 Bit Score: 195.49 E-value: 1.47e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 308 MVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSS 387
Cdd:pfam12349 1 MVKSKFGLGLAGVIIVLASVASSLGLCAYFGLPLTLIISEVIPFLVLAIGVDNIFLLVKAVVRTPRSLDVSERIAEALGE 80
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568963771 388 ESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMELADL 452
Cdd:pfam12349 81 VGPSITLTSLTEILAFLLGALTDMPAVQEFCLFAAVAVLFDFLLQMTFFVAVLSLDIRRLESNRL 145
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1065-1232 | 9.88e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 108.58 E-value: 9.88e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1065 CHRTHTvpcAHQKPITALRAAA--GRLVTGSQDHTLRVFRLDDSCCLFTLKGHSGAITAV-YIDQTMVLASGGQDGAICL 1141
Cdd:cd00200 1 LRRTLK---GHTGGVTCVAFSPdgKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVaASADGTYLASGSSDKTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1142 WDVLTGSRVSQTFAHRGDVTSLTCTASC--VISSGLDDFISIWDRSTGIKLYSIQQDLGCGASLGVISDNLLVTGGQ--G 1217
Cdd:cd00200 78 WDLETGECVRTLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSqdG 157
|
170
....*....|....*
gi 568963771 1218 CVSFWDLNYGDLLQT 1232
Cdd:cd00200 158 TIKLWDLRTGKCVAT 172
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 952-1232 | 1.68e-22 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 101.53 E-value: 1.68e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 952 KGSPPLAWTPSTAGSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVFL--DRRIVAARLNGSLD 1027
Cdd:COG2319 108 ATGLLLRTLTGHTGAVRSVAFspDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSpdGKLLASGSDDGTVR 187
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1028 FFSLETHTSLSPLQfrgtpgrgsspsssvysssntvtchrthtvpcAHQKPITALR-AAAGR-LVTGSQDHTLRVFRLDD 1105
Cdd:COG2319 188 LWDLATGKLLRTLT--------------------------------GHTGAVRSVAfSPDGKlLASGSADGTVRLWDLAT 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1106 SCCLFTLKGHSGAITAVYI--DQTMvLASGGQDGAICLWDVLTGSRVSQTFAHRGDVTSLTCTA--SCVISSGLDDFISI 1181
Cdd:COG2319 236 GKLLRTLTGHSGSVRSVAFspDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRL 314
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 568963771 1182 WDRSTGIKLYSIQQDLGCGASLGVISD-NLLVTGGQ-GCVSFWDLNYGDLLQT 1232
Cdd:COG2319 315 WDLATGKLLRTLTGHTGAVRSVAFSPDgKTLASGSDdGTVRLWDLATGELLRT 367
|
|
| 2A060601 | TIGR00917 | Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in ... | 11-466 | 5.09e-22 |
|
Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in eukaryotes. The defective protein has been associated with Niemann-Pick disease, which is described in humans as an autosomal recessive lipidosis. It is characterized by the lysosomal accumulation of unesterified cholesterol. It is an integral membrane protein, which indicates that this protein is most likely involved in cholesterol transport or acts as some component of cholesterol homeostasis. [Transport and binding proteins, Other]
Pssm-ID: 273337 [Multi-domain] Cd Length: 1205 Bit Score: 103.45 E-value: 5.09e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 11 ISQAFYNHGLLCASYPIPIILFTGLCILACCYPLLKLPLPgtgpvefSTPVKGYSPPpadsDHKQGEPSEQPEWYVGaPV 90
Cdd:TIGR00917 309 LARFFGKYGIWVARHPTLVICLSVSVVLLLCVGLIRFKVE-------TRPVKLWVAP----GSRAALEKQYFDTHFG-PF 376
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 91 AYIQQIFVKSSVSPWHRNllavdvFRSPLSraFQLVEEIRNHVLRDSSGTKSLEEVCLQVTDLLpglrkLRSLLPeHGCL 170
Cdd:TIGR00917 377 YRIEQLIIATVQTSSHEK------APEILT--DDNLKLLFDIQKKVSQLFANYEGELITLDSPC-----FKPNHP-YNCF 442
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 171 LLSPGNFWQNDWERF---HADPDIIGTIHQH-EPKTLQTSATLKDllFGVP-------GKYSGVSLYTRKRMVsytITLV 239
Cdd:TIGR00917 443 IYSTCKKLQNMYSKLkpeNYDDYGGVDYVKYcFEHFTSPESCLSA--FGGPvdpttvlGGFSGNNFSEASAFV---VTFP 517
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 240 FQRYHAK-------------FLSSLRARLmLLHPSPNCSLRAENLVHVHFKEEiGIAELIPLVTTYIILFAYIY----FS 302
Cdd:TIGR00917 518 VNNFVNKtnktekavawekaFIQLAKDEL-LPMVQATISFSAERSIEDELKRE-STADVITIAISYLVMFAYISltlgDS 595
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 303 TR-KIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSV-----------VS 370
Cdd:TIGR00917 596 PRlKSLYVTSKVLLGLSGILIVMLSVLGSVGVFSAVGLKSTLIIMEVIPFLVLAVGVDNIFILVFFYfyleyfyrqvgVD 675
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 371 TPVDLEVKLRIAQGLSSESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMEla 450
Cdd:TIGR00917 676 NEQELTLERRLSRALMEVGPSITLASLSEILAFALGALIKMPAVRVFSMFAVLAVFLDFLLQITAFVALLVLDFKRTE-- 753
|
490
....*....|....*.
gi 568963771 451 dlNKRLppeSCLPSAK 466
Cdd:TIGR00917 754 --DKRV---DCFPCIK 764
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 952-1144 | 6.78e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 81.50 E-value: 6.78e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 952 KGSPPLAWTPSTAGSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVFL--DRRIVAARLNGSLD 1027
Cdd:COG2319 234 ATGKLLRTLTGHSGSVRSVAFspDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSpdGKLLASGSDDGTVR 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1028 FFSLETHTSLSPLQfrgtpgrgsspsssvysssntvtchrthtvpcAHQKPITALRAAA--GRLVTGSQDHTLRVFRLDD 1105
Cdd:COG2319 314 LWDLATGKLLRTLT--------------------------------GHTGAVRSVAFSPdgKTLASGSDDGTVRLWDLAT 361
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 568963771 1106 SCCLFTLKGHSGAITAVYI--DQTMvLASGGQDGAICLWDV 1144
Cdd:COG2319 362 GELLRTLTGHTGAVTSVAFspDGRT-LASGSADGTVRLWDL 401
|
|
| 2A060605 | TIGR00920 | 3-hydroxy-3-methylglutaryl-coenzyme A reductase; [Transport and binding proteins, ... | 276-442 | 2.77e-14 |
|
3-hydroxy-3-methylglutaryl-coenzyme A reductase; [Transport and binding proteins, Carbohydrates, organic alcohols, and acids]
Pssm-ID: 273339 [Multi-domain] Cd Length: 886 Bit Score: 77.97 E-value: 2.77e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 276 FKEEIGIAELIPLVTTYIILFAYIYFSTRKIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPT-LNggEIFPYLVV 354
Cdd:TIGR00920 53 FEEEYLSSDVIVMTITRCIAVLYIYYQFCNLRQLGSKYILGIAGLFTIFSSFVFSTAVIHFLGSELTgLN--EALPFFLL 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 355 VIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSSESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQML 434
Cdd:TIGR00920 131 LIDLSKASALAKFALSSNSQDEVRDNIARGMAILGPTITLDTVVETLVIGVGTMSGVRRLEVLCCFGCMSVLANYFVFMT 210
|
....*...
gi 568963771 435 FFTTVLSI 442
Cdd:TIGR00920 211 FFPACLSL 218
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 1105-1143 | 4.72e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 44.61 E-value: 4.72e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 568963771 1105 DSCCLFTLKGHSGAITAVYIDQT-MVLASGGQDGAICLWD 1143
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDgKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 1108-1143 | 2.56e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 42.33 E-value: 2.56e-05
10 20 30
....*....|....*....|....*....|....*..
gi 568963771 1108 CLFTLKGHSGAITAVYIDQT-MVLASGGQDGAICLWD 1143
Cdd:pfam00400 3 LLKTLEGHTGSVTSLAFSPDgKLLASGSDDGTVKVWD 39
|
|
| MMPL | COG1033 | Predicted exporter protein, RND superfamily [General function prediction only]; | 282-442 | 3.72e-04 |
|
Predicted exporter protein, RND superfamily [General function prediction only];
Pssm-ID: 440656 [Multi-domain] Cd Length: 767 Bit Score: 44.85 E-value: 3.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 282 IAELIPLVTTYIILFAYIYFstrkidmvKSKWGLALAaVVTVLSSLLMSVGLCTLFG--LTPTLNggeIFPYLVVVIGLE 359
Cdd:COG1033 220 LAIFFPLALLLILLLLFLFF--------RSLRGVLLP-LLVVLLAVIWTLGLMGLLGipLSPLTI---LVPPLLLAIGID 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 360 NVL-VLTKsvvstpvdleVKLRIAQGLSSESwsIMKNAATELGI-IL-------IGYFTL----VPAIQEFCLFAVVGLV 426
Cdd:COG1033 288 YGIhLLNR----------YREERRKGLDKRE--ALREALRKLGPpVLltslttaIGFLSLlfsdIPPIRDFGIVAAIGVL 355
|
170
....*....|....*.
gi 568963771 427 SDFFLQMLFFTTVLSI 442
Cdd:COG1033 356 LAFLTSLTLLPALLSL 371
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 771-802 | 2.01e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 2.01e-03
10 20 30
....*....|....*....|....*....|....
gi 568963771 771 VLRGHLMDIECLA--SDGMLLVSCCLAGQVCVWD 802
Cdd:smart00320 7 TLKGHTGPVTSVAfsPDGKYLASGSDDGTIKLWD 40
|
|
|
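Every hit above carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. For anyone post-processing a saved copy of these results, a minimal parsing sketch follows. The names `SUMMARY_RE` and `parse_summary` are hypothetical and the regular expression is only assumed to cover the layout seen on this page; it is not an official CD-Search format definition.

```python
import re

# Pattern for the per-hit summary line as it appears in this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<flag>[^\]]+)\]\s*)?"          # optional tag such as [Multi-domain]
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.]+(?:e[+-]?\d+)?)",
    re.IGNORECASE,
)


def parse_summary(line):
    """Parse one summary line into a dict of typed fields."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("not a recognised summary line: %r" % line)
    return {
        "pssm_id": int(m.group("pssm_id")),
        "flag": m.group("flag"),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


line = "Pssm-ID: 463544 [Multi-domain] Cd Length: 153 Bit Score: 195.49 E-value: 1.47e-57"
print(parse_summary(line))
```

Keeping the E-value as a float makes it straightforward to sort or filter hits, for example to separate the strong sterol-sensing hit from the marginal single-repeat WD40 hits.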
| Name | Accession | Description | Interval | E-value |
| Sterol-sensing | pfam12349 | Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins ... | 308-452 | 1.47e-57 |
|
Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins (SREBPs) are membrane-bound transcription factors that promote lipid synthesis in animal cells. They are embedded in the membranes of the endoplasmic reticulum (ER) in a helical hairpin orientation and are released from the ER by a two-step proteolytic process. Proteolysis begins when the SREBPs are cleaved at Site-1, which is located at a leucine residue in the middle of the hydrophobic loop in the lumen of the ER. Upon proteolytic processing, SREBP can activate the expression of genes involved in cholesterol biosynthesis and uptake. SCAP stimulates cleavage of SREBPs via fusion of their two C-termini. This domain is the transmembrane region that traverses the membrane eight times and is the sterol-sensing domain of the cleavage protein. WD40 domains are found towards the C-terminus.
Pssm-ID: 463544 [Multi-domain] Cd Length: 153 Bit Score: 195.49 E-value: 1.47e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 308 MVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSS 387
Cdd:pfam12349 1 MVKSKFGLGLAGVIIVLASVASSLGLCAYFGLPLTLIISEVIPFLVLAIGVDNIFLLVKAVVRTPRSLDVSERIAEALGE 80
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568963771 388 ESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMELADL 452
Cdd:pfam12349 81 VGPSITLTSLTEILAFLLGALTDMPAVQEFCLFAAVAVLFDFLLQMTFFVAVLSLDIRRLESNRL 145
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1065-1232 | 9.88e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 108.58 E-value: 9.88e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1065 CHRTHTvpcAHQKPITALRAAA--GRLVTGSQDHTLRVFRLDDSCCLFTLKGHSGAITAV-YIDQTMVLASGGQDGAICL 1141
Cdd:cd00200 1 LRRTLK---GHTGGVTCVAFSPdgKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVaASADGTYLASGSSDKTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1142 WDVLTGSRVSQTFAHRGDVTSLTCTASC--VISSGLDDFISIWDRSTGIKLYSIQQDLGCGASLGVISDNLLVTGGQ--G 1217
Cdd:cd00200 78 WDLETGECVRTLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSqdG 157
|
170
....*....|....*
gi 568963771 1218 CVSFWDLNYGDLLQT 1232
Cdd:cd00200 158 TIKLWDLRTGKCVAT 172
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 965-1232 | 2.62e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 107.42 E-value: 2.62e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 965 GSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVFL--DRRIVAARLNGSLDFFSLEthtslspl 1040
Cdd:cd00200 10 GGVTCVAFspDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLE-------- 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1041 qfrgtpgrgsspsssvysssnTVTCHRTHTvpcAHQKPITALRAAAGR--LVTGSQDHTLRVFRLDDSCCLFTLKGHSGA 1118
Cdd:cd00200 82 ---------------------TGECVRTLT---GHTSYVSSVAFSPDGriLSSSSRDKTIKVWDVETGKCLTTLRGHTDW 137
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1119 ITAVYIDQT-MVLASGGQDGAICLWDVLTGSRVsQTF-AHRGDVTSLTC--TASCVISSGLDDFISIWDRSTGIKLYSIQ 1194
Cdd:cd00200 138 VNSVAFSPDgTFVASSSQDGTIKLWDLRTGKCV-ATLtGHTGEVNSVAFspDGEKLLSSSSDGTIKLWDLSTGKCLGTLR 216
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 568963771 1195 QDLGCGASLGVISDNLLVTGG--QGCVSFWDLNYGDLLQT 1232
Cdd:cd00200 217 GHENGVNSVAFSPDGYLLASGseDGTIRVWDLRTGECVQT 256
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 952-1232 | 1.68e-22 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 101.53 E-value: 1.68e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 952 KGSPPLAWTPSTAGSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVFL--DRRIVAARLNGSLD 1027
Cdd:COG2319 108 ATGLLLRTLTGHTGAVRSVAFspDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSpdGKLLASGSDDGTVR 187
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1028 FFSLETHTSLSPLQfrgtpgrgsspsssvysssntvtchrthtvpcAHQKPITALR-AAAGR-LVTGSQDHTLRVFRLDD 1105
Cdd:COG2319 188 LWDLATGKLLRTLT--------------------------------GHTGAVRSVAfSPDGKlLASGSADGTVRLWDLAT 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1106 SCCLFTLKGHSGAITAVYI--DQTMvLASGGQDGAICLWDVLTGSRVSQTFAHRGDVTSLTCTA--SCVISSGLDDFISI 1181
Cdd:COG2319 236 GKLLRTLTGHSGSVRSVAFspDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRL 314
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 568963771 1182 WDRSTGIKLYSIQQDLGCGASLGVISD-NLLVTGGQ-GCVSFWDLNYGDLLQT 1232
Cdd:COG2319 315 WDLATGKLLRTLTGHTGAVRSVAFSPDgKTLASGSDdGTVRLWDLATGELLRT 367
|
|
| 2A060601 | TIGR00917 | Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in ... | 11-466 | 5.09e-22 |
|
Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in eukaryotes. The defective protein has been associated with Niemann-Pick disease, which is described in humans as an autosomal recessive lipidosis. It is characterized by the lysosomal accumulation of unesterified cholesterol. It is an integral membrane protein, which indicates that this protein is most likely involved in cholesterol transport or acts as some component of cholesterol homeostasis. [Transport and binding proteins, Other]
Pssm-ID: 273337 [Multi-domain] Cd Length: 1205 Bit Score: 103.45 E-value: 5.09e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 11 ISQAFYNHGLLCASYPIPIILFTGLCILACCYPLLKLPLPgtgpvefSTPVKGYSPPpadsDHKQGEPSEQPEWYVGaPV 90
Cdd:TIGR00917 309 LARFFGKYGIWVARHPTLVICLSVSVVLLLCVGLIRFKVE-------TRPVKLWVAP----GSRAALEKQYFDTHFG-PF 376
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 91 AYIQQIFVKSSVSPWHRNllavdvFRSPLSraFQLVEEIRNHVLRDSSGTKSLEEVCLQVTDLLpglrkLRSLLPeHGCL 170
Cdd:TIGR00917 377 YRIEQLIIATVQTSSHEK------APEILT--DDNLKLLFDIQKKVSQLFANYEGELITLDSPC-----FKPNHP-YNCF 442
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 171 LLSPGNFWQNDWERF---HADPDIIGTIHQH-EPKTLQTSATLKDllFGVP-------GKYSGVSLYTRKRMVsytITLV 239
Cdd:TIGR00917 443 IYSTCKKLQNMYSKLkpeNYDDYGGVDYVKYcFEHFTSPESCLSA--FGGPvdpttvlGGFSGNNFSEASAFV---VTFP 517
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 240 FQRYHAK-------------FLSSLRARLmLLHPSPNCSLRAENLVHVHFKEEiGIAELIPLVTTYIILFAYIY----FS 302
Cdd:TIGR00917 518 VNNFVNKtnktekavawekaFIQLAKDEL-LPMVQATISFSAERSIEDELKRE-STADVITIAISYLVMFAYISltlgDS 595
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 303 TR-KIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSV-----------VS 370
Cdd:TIGR00917 596 PRlKSLYVTSKVLLGLSGILIVMLSVLGSVGVFSAVGLKSTLIIMEVIPFLVLAVGVDNIFILVFFYfyleyfyrqvgVD 675
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 371 TPVDLEVKLRIAQGLSSESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMEla 450
Cdd:TIGR00917 676 NEQELTLERRLSRALMEVGPSITLASLSEILAFALGALIKMPAVRVFSMFAVLAVFLDFLLQITAFVALLVLDFKRTE-- 753
|
490
....*....|....*.
gi 568963771 451 dlNKRLppeSCLPSAK 466
Cdd:TIGR00917 754 --DKRV---DCFPCIK 764
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 962-1232 | 1.97e-21 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 98.44 E-value: 1.97e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 962 STAGSIWSLELQGNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVF--LDRRIVAARLNGSLDFFSLETHTSLSP 1039
Cdd:COG2319 36 AAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFspDGRLLASASADGTVRLWDLATGLLLRT 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1040 LQfrgtpgrgsspsssvysssntvtchrthtvpcAHQKPITALRAAA--GRLVTGSQDHTLRVFRLDDSCCLFTLKGHSG 1117
Cdd:COG2319 116 LT--------------------------------GHTGAVRSVAFSPdgKTLASGSADGTVRLWDLATGKLLRTLTGHSG 163
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1118 AITAVYI--DQTMvLASGGQDGAICLWDVLTGSRVSQTFAHRGDVTSLTCTA--SCVISSGLDDFISIWDRSTGIKLYSI 1193
Cdd:COG2319 164 AVTSVAFspDGKL-LASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPdgKLLASGSADGTVRLWDLATGKLLRTL 242
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 568963771 1194 QQDLGCGASLGVISDN-LLVTGGQ-GCVSFWDLNYGDLLQT 1232
Cdd:COG2319 243 TGHSGSVRSVAFSPDGrLLASGSAdGTVRLWDLATGELLRT 283
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
964-1225 |
1.92e-20 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 95.36 E-value: 1.92e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 964 AGSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVFL--DRRIVAARLNGSLDFFSLETHTSLSP 1039
Cdd:COG2319 162 SGAVTSVAFspDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLLRT 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1040 LQfrgtpgrgsspsssvysssntvtchrthtvpcAHQKPITALRAAAG--RLVTGSQDHTLRVFRLDDSCCLFTLKGHSG 1117
Cdd:COG2319 242 LT--------------------------------GHSGSVRSVAFSPDgrLLASGSADGTVRLWDLATGELLRTLTGHSG 289
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1118 AITAVYI--DQTMvLASGGQDGAICLWDVLTGSRVSQTFAHRGDVTSLTCTA--SCVISSGLDDFISIWDRSTGIKLYSI 1193
Cdd:COG2319 290 GVNSVAFspDGKL-LASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPdgKTLASGSDDGTVRLWDLATGELLRTL 368
|
250 260 270
....*....|....*....|....*....|....
gi 568963771 1194 QQDLGCGASLGVISD-NLLVTGGQ-GCVSFWDLN 1225
Cdd:COG2319 369 TGHTGAVTSVAFSPDgRTLASGSAdGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
965-1186 |
8.99e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 90.36 E-value: 8.99e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 965 GSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVFL--DRRIVAARLNGSLDFFSLETHTSLSPL 1040
Cdd:COG2319 205 GAVRSVAFspDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSpdGRLLASGSADGTVRLWDLATGELLRTL 284
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1041 QfrgtpgrgsspsssvysssntvtchrthtvpcAHQKPITALRAAA-GR-LVTGSQDHTLRVFRLDDSCCLFTLKGHSGA 1118
Cdd:COG2319 285 T--------------------------------GHSGGVNSVAFSPdGKlLASGSDDGTVRLWDLATGKLLRTLTGHTGA 332
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568963771 1119 ITAVYI---DQTmvLASGGQDGAICLWDVLTGSRVSQTFAHRGDVTSLTCTA--SCVISSGLDDFISIWDRST 1186
Cdd:COG2319 333 VRSVAFspdGKT--LASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPdgRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
965-1183 |
2.36e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 83.92 E-value: 2.36e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 965 GSIWSLELQ--GNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVFL--DRRIVAARLNGSLDFFSLETHTslspl 1040
Cdd:cd00200 94 SYVSSVAFSpdGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSpdGTFVASSSQDGTIKLWDLRTGK----- 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1041 qfrgtpgrgsspsssvysssntvtCHRTHTvpcAHQKPITALRAAA--GRLVTGSQDHTLRVFRLDDSCCLFTLKGHSGA 1118
Cdd:cd00200 169 ------------------------CVATLT---GHTGEVNSVAFSPdgEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENG 221
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568963771 1119 ITAV-YIDQTMVLASGGQDGAICLWDVLTGSRVSQTFAHRGDVTSLTC--TASCVISSGLDDFISIWD 1183
Cdd:cd00200 222 VNSVaFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWspDGKRLASGSADGTIRIWD 289
|
|
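The cd00200 description above characterizes a WD40 repeat by a GH dipeptide near its N-terminus and a WD dipeptide at its C-terminus. The sketch below is a rough illustration of that description only: it scans a sequence for GH ... WD spans with a simple regular expression. The function name `find_gh_wd_candidates` and the gap bounds `min_gap` and `max_gap` are hypothetical choices made for this example, not CDD parameters. The hits reported in this table come from profile-based scoring, which a dipeptide scan does not reproduce.

```python
import re


def find_gh_wd_candidates(seq, min_gap=10, max_gap=30):
    """Return (start, end, fragment) for each GH...WD span in seq.

    start is 0-based and end is exclusive. Alignment gap characters ('-')
    are removed first. min_gap and max_gap bound the number of residues
    between the GH and WD dipeptides; both are illustrative assumptions.
    """
    seq = seq.replace("-", "").upper()
    # The lookahead lets overlapping candidates be reported; the lazy
    # quantifier stops at the nearest WD inside the allowed gap range.
    pattern = re.compile(r"(?=(GH.{%d,%d}?WD))" % (min_gap, max_gap))
    return [(m.start(1), m.end(1), m.group(1)) for m in pattern.finditer(seq)]


if __name__ == "__main__":
    # Query row of the smart00320 hit (residues 1105-1143), gap included.
    query = "DSCCLFTLKGHSGAITAVYIDQT-MVLASGGQDGAICLWD"
    for start, end, fragment in find_gh_wd_candidates(query):
        print(start, end, fragment)
```

A scan like this over-reports and under-reports: many real WD40 repeats lack an exact GH or WD dipeptide, so it is at most a quick sanity check next to the E-values listed in the table.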
| Patched |
pfam02460 |
Patched family; The transmembrane protein Patched is a receptor for the morphogen Sonic ... |
284-533 |
2.42e-17 |
|
Patched family; The transmembrane protein Patched is a receptor for the morphogen Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals.
Pssm-ID: 308203 [Multi-domain] Cd Length: 793 Bit Score: 87.80 E-value: 2.42e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 284 ELIP-LVTTYIILFAY-----IYFSTRKIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLtPTLNGGEIFPYLVVVIG 357
Cdd:pfam02460 214 TLTPfFVIGFFLLLTFsiivsVTLSSYTIDWVRSKPILAALGLLSPVMAIVSSFGLLFWMGF-PFNSIVCVTPFLVLAIG 292
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 358 LENVLVLTKSVVSTPVDLEVKLRIAQGLSSESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFT 437
Cdd:pfam02460 293 VDDMFLMVAAWQRTTATLSVKKRMGEALSEAGVSITITSLTDVLSFGIGTYTPTPAIQLFCAYTAVAIFFDFIYQITFFA 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 438 TVLSIdirrMELADLNKRLPPESCLPSakpvgRPARYERQQAVRpstphtitlQPSSFRNLRLPKRLRVIY--FLARTRL 515
Cdd:pfam02460 373 AIMAI----CAKPEAEGRHCLFVWATS-----SPQRIDSEGSEP---------DKSHNIEQLKSRFFLDIYcpFLLNPSV 434
|
250
....*....|....*....
gi 568963771 516 aqRLIMAGT-VVWIGILVY 533
Cdd:pfam02460 435 --RVCMLVLfVVYIAIAIY 451
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
952-1144 |
6.78e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 81.50 E-value: 6.78e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 952 KGSPPLAWTPSTAGSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSNEEISSGITALVFL--DRRIVAARLNGSLD 1027
Cdd:COG2319 234 ATGKLLRTLTGHSGSVRSVAFspDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSpdGKLLASGSDDGTVR 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1028 FFSLETHTSLSPLQfrgtpgrgsspsssvysssntvtchrthtvpcAHQKPITALRAAA--GRLVTGSQDHTLRVFRLDD 1105
Cdd:COG2319 314 LWDLATGKLLRTLT--------------------------------GHTGAVRSVAFSPdgKTLASGSDDGTVRLWDLAT 361
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 568963771 1106 SCCLFTLKGHSGAITAVYI--DQTMvLASGGQDGAICLWDV 1144
Cdd:COG2319 362 GELLRTLTGHTGAVTSVAFspDGRT-LASGSADGTVRLWDL 401
|
|
| 2A060605 |
TIGR00920 |
3-hydroxy-3-methylglutaryl-coenzyme A reductase; [Transport and binding proteins, ... |
276-442 |
2.77e-14 |
|
3-hydroxy-3-methylglutaryl-coenzyme A reductase; [Transport and binding proteins, Carbohydrates, organic alcohols, and acids]
Pssm-ID: 273339 [Multi-domain] Cd Length: 886 Bit Score: 77.97 E-value: 2.77e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 276 FKEEIGIAELIPLVTTYIILFAYIYFSTRKIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPT-LNggEIFPYLVV 354
Cdd:TIGR00920 53 FEEEYLSSDVIVMTITRCIAVLYIYYQFCNLRQLGSKYILGIAGLFTIFSSFVFSTAVIHFLGSELTgLN--EALPFFLL 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 355 VIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSSESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQML 434
Cdd:TIGR00920 131 LIDLSKASALAKFALSSNSQDEVRDNIARGMAILGPTITLDTVVETLVIGVGTMSGVRRLEVLCCFGCMSVLANYFVFMT 210
|
....*...
gi 568963771 435 FFTTVLSI 442
Cdd:TIGR00920 211 FFPACLSL 218
|
|
| 2A060602 |
TIGR00918 |
The Eukaryotic (Putative) Sterol Transporter (EST) Family; |
286-448 |
7.91e-14 |
|
The Eukaryotic (Putative) Sterol Transporter (EST) Family;
Pssm-ID: 273338 [Multi-domain] Cd Length: 1145 Bit Score: 76.84 E-value: 7.91e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 286 IPLVTTYIILFAYIYFSTRKIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLT 365
Cdd:TIGR00918 400 IRIVSGYLLMLAYACLTMLRWDCAKSQGSVGLAGVLLVALSVAAGLGLCALLGISFNAATTQVLPFLALGVGVDDVFLLA 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 366 KSVVSTPVDLEVKLRIAQGLSSESWSIMKNAATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIR 445
Cdd:TIGR00918 480 HAFSETGQNIPFEERTGECLKRTGASVVLTSISNVTAFFMAALIPIPALRAFSLQAAIVVVFNFAAVLLVFPAILSLDLR 559
|
...
gi 568963771 446 RME 448
Cdd:TIGR00918 560 RRE 562
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1081-1233 |
3.72e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 73.02 E-value: 3.72e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 1081 ALRAAAGRLVTGSQDHTLRVFRLDDSCCLFTLKGHSGAITAV-YIDQTMVLASGGQDGAICLWDVLTGSRVSQTFAHRGD 1159
Cdd:COG2319 43 AASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVaFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGA 122
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568963771 1160 VTSLTCTA--SCVISSGLDDFISIWDRSTGIKLYSIQQDLGCGASLGVISD-NLLVTGGQ-GCVSFWDLNYGDLLQTV 1233
Cdd:COG2319 123 VRSVAFSPdgKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDgKLLASGSDdGTVRLWDLATGKLLRTL 200
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1105-1143 |
4.72e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 44.61 E-value: 4.72e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 568963771 1105 DSCCLFTLKGHSGAITAVYIDQT-MVLASGGQDGAICLWD 1143
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDgKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1108-1143 |
2.56e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 42.33 E-value: 2.56e-05
10 20 30
....*....|....*....|....*....|....*..
gi 568963771 1108 CLFTLKGHSGAITAVYIDQT-MVLASGGQDGAICLWD 1143
Cdd:pfam00400 3 LLKTLEGHTGSVTSLAFSPDgKLLASGSDDGTVKVWD 39
|
|
| MMPL |
COG1033 |
Predicted exporter protein, RND superfamily [General function prediction only]; |
282-442 |
3.72e-04 |
|
Predicted exporter protein, RND superfamily [General function prediction only];
Pssm-ID: 440656 [Multi-domain] Cd Length: 767 Bit Score: 44.85 E-value: 3.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 282 IAELIPLVTTYIILFAYIYFstrkidmvKSKWGLALAaVVTVLSSLLMSVGLCTLFG--LTPTLNggeIFPYLVVVIGLE 359
Cdd:COG1033 220 LAIFFPLALLLILLLLFLFF--------RSLRGVLLP-LLVVLLAVIWTLGLMGLLGipLSPLTI---LVPPLLLAIGID 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568963771 360 NVL-VLTKsvvstpvdleVKLRIAQGLSSESwsIMKNAATELGI-IL-------IGYFTL----VPAIQEFCLFAVVGLV 426
Cdd:COG1033 288 YGIhLLNR----------YREERRKGLDKRE--ALREALRKLGPpVLltslttaIGFLSLlfsdIPPIRDFGIVAAIGVL 355
|
170
....*....|....*.
gi 568963771 427 SDFFLQMLFFTTVLSI 442
Cdd:COG1033 356 LAFLTSLTLLPALLSL 371
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
771-802 |
2.01e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 2.01e-03
10 20 30
....*....|....*....|....*....|....
gi 568963771 771 VLRGHLMDIECLA--SDGMLLVSCCLAGQVCVWD 802
Cdd:smart00320 7 TLKGHTGPVTSVAfsPDGKYLASGSDDGTIKLWD 40
|
|
|