| Name | Accession | Description | Interval | E-value |
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 361-662 | 1.58e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 137.47 E-value: 1.58e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 361 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 440
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 441 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQQAGGGFTSkrgTFGsiGKLETMMCVSYGRMEDLVFSGA 518
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVETGKCLT---TLR--GHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 519 ATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLLL 591
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYLL 234
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622855130 592 ednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 662
Cdd:cd00200 235 ----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
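The cd00200 description above characterizes a WD40 repeat by a GH dipeptide near its N-terminus and a WD dipeptide at its C-terminus. As a rough illustration only, the sketch below scans the first aligned query segment (residues 361-440) for that strict pattern. The 20-40 residue spacing between the two dipeptides, the constant names, and the helper `find_gh_wd_spans` are assumptions made for this example; CDD itself scores hits against position-specific scoring matrices, not a regular expression.

```python
import re

# Query residues 361-440, copied from the first alignment block above.
# Lowercase letters are residues shown as insertions relative to cd00200.
QUERY_SEGMENT = (
    "GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKG"
    "qHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK"
)
SEGMENT_START = 361  # residue number of the first character

# Assumption: allow 20-40 residues between the GH and WD dipeptides.
# This window is illustrative and is not a value taken from CDD.
GH_WD_PATTERN = re.compile(r"GH.{20,40}?WD")


def find_gh_wd_spans(segment, first_residue):
    """Return (start, end) residue numbers of strict GH...WD matches."""
    seq = segment.upper()
    return [
        (first_residue + m.start(), first_residue + m.end() - 1)
        for m in GH_WD_PATTERN.finditer(seq)
    ]


spans = find_gh_wd_spans(QUERY_SEGMENT, SEGMENT_START)
print(spans)
```

On this segment the strict pattern reports only the span 361-393. The second repeat in the same block reads `GqH` rather than `GH`, so a literal dipeptide search misses it, which is one reason profile-based alignment is preferred for degenerate repeats like these.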
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 344-776 | 2.61e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 137.35 E-value: 2.61e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 344 AVAVVYNRQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCL 423
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTL 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 424 VSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPhhvD--KLVTVGI-KHIKFWQqagggftskrgtfgsigkle 500
Cdd:COG2319 136 ASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWD-------------------- 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 501 tmmcvsygrmedlvfsgAATGDifiwkdilLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaik 578
Cdd:COG2319 191 -----------------LATGK--------LLRTLTGHTGAVRSVaFSPDgKLLASGSADGTVRLWD------------- 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 579 rsalstsskglllednpsiraitlghghilvgTKNGEILeidksgpmtLLVQGHmEGEVWGLAAHP---LLpicATVSDD 655
Cdd:COG2319 233 --------------------------------LATGKLL---------RTLTGH-SGSVRSVAFSPdgrLL---ASGSAD 267
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 656 KTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLA 735
Cdd:COG2319 268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPD-GKTLA 346
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1622855130 736 VASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLL 776
Cdd:COG2319 347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
|
|
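Every hit in this listing carries a summary line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`, with an optional `[Multi-domain]` tag. A minimal sketch of reading those fields into a dictionary follows; the function name `parse_summary` and the returned keys are hypothetical, chosen for this example, and the pattern is written only against the line layout seen in this listing.

```python
import re

# Pattern written against the summary lines as they appear in this listing.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Parse one summary line; return None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["multi"] is not None,
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


hit = parse_summary(
    "Pssm-ID: 441893 [Multi-domain] Cd Length: 403 "
    "Bit Score: 137.35 E-value: 2.61e-34"
)
print(hit)
```

Lines without the `[Multi-domain]` tag, such as the pfam03451 (HELP) summary lines, parse the same way with `multi_domain` set to `False`.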
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1115-1600 | 1.77e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 131.96 E-value: 1.77e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1115 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1194
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1195 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1271
Cdd:COG2319 80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1272 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1351
Cdd:COG2319 147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1352 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1429
Cdd:COG2319 183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1430 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteamviekitwasw 1508
Cdd:COG2319 262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1509 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1588
Cdd:COG2319 323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
490
....*....|..
gi 1622855130 1589 tGGDDCSVFVWR 1600
Cdd:COG2319 390 -GSADGTVRLWD 400
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 516-902 | 1.51e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.24 E-value: 1.51e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 516 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 593
Cdd:COG2319 55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 594 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 667
Cdd:COG2319 120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 668 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 747
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 748 VLTSKRVGICKGASSYITHIDWDSRGKLLqvnsgakeqlffeAPRGKRHIIRpseiekiQWDTWTcvlgPTCEGIWPAHS 827
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-------------ASGSDDGTVR-------LWDLAT----GKLLRTLTGHT 330
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622855130 828 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVANVRWLHNDSVLLTvGGADTALMIW 902
Cdd:COG2319 331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 4-480 | 1.07e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.46 E-value: 1.07e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 4 AVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGEC 83
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLAT-----GKL 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 84 SKSLS----FITHIDWSLDSKYLQTNDGAGERLFYKMPSGKSLTSkeeikgipwaswtcVKGPEVSgiwpkytevtdINS 159
Cdd:COG2319 155 LRTLTghsgAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT--------------LTGHTGA-----------VRS 209
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 160 VDANYSSSVLVSGDDFGLVKLFRfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRfipegvsngml 239
Cdd:COG2319 210 VAFSPDGKLLASGSADGTVRLWD---LATGKLLRTLTGHSGSVRSVAFSPDGRLLAS-GSADGTVRLWD----------- 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 240 etapqeggtdsyseesdsdlsdvpeldsdieqetqinydrqvykedlpqlkqqskeknhvvpflkrekapedslklqfih 319
Cdd:COG2319 --------------------------------------------------------------------------------
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 320 gyrgydcrnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKC 399
Cdd:COG2319 275 -------------LATGELL-----------------RTLTGHSGGVNSVAFSPDGKLLASG--SDDGTVRLWDLATGKL 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 400 LSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKF 478
Cdd:COG2319 323 LRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRLWDLATGELLRTLTGHTGAVTSVAFSPDG-RTLASGSAdGTVRL 398
|
..
gi 1622855130 479 WQ 480
Cdd:COG2319 399 WD 400
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1057-1407 | 6.35e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.79 E-value: 6.35e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1057 HTDDILCLTVNQHPKYrnVVATSQIGTtpsIHVWDamtKHTLSMLRCF--HSKGVNYVNFSATGKLLVSVGVDpeHTITV 1134
Cdd:cd00200 8 HTGGVTCVAFSPDGKL--LATGSGDGT---IKVWD---LETGELLRTLkgHTGPVRDVAASADGTYLASGSSD--KTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1135 WRWQEGAKVASRGGHLERIFVVEFRPDSdtQFVSVGVKH--MKFWTLAgsallykKGVIGSLGAAKMQTMLSVAF-GANN 1211
Cdd:cd00200 78 WDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVE-------TGKCLTTLRGHTDWVNSVAFsPDGT 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1212 LTFTGAINGDVYVW--KDHFLIRlVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRCRAfqletgql 1289
Cdd:cd00200 149 FVASSSQDGTIKLWdlRTGKCVA-TLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLG-------- 213
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1290 vecvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKV 1369
Cdd:cd00200 214 -----------------------------------TLRGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL 257
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 1622855130 1370 SlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNSLKVW 1407
Cdd:cd00200 258 S-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 304-351 | 4.64e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 4.64e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 1622855130 304 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 351
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 531-857 | 9.83e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 88.55 E-value: 9.83e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 531 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 601
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 602 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRC 681
Cdd:cd00200 77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 682 CAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGAS 761
Cdd:cd00200 141 VAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD-GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHE 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 762 SYITHIDWDSRGKLLQvnSGAKEQlffeaprgkrhIIRpseiekiQWDTWTCVLGPTCEGiwpaHSdiTDVNAASLTKDC 841
Cdd:cd00200 220 NGVNSVAFSPDGYLLA--SGSEDG-----------TIR-------VWDLRTGECVQTLSG----HT--NSVTSLAWSPDG 273
|
330
....*....|....*.
gi 1622855130 842 SLLATGDDFGFVKLFS 857
Cdd:cd00200 274 KRLASGSADGTIRIWD 289
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 157-462 | 1.82e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 75.83 E-value: 1.82e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 157 INSVDANYSSSVLVSGDDFGLVKLFRfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 236
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 237 gmletapqEGGTDSYS-EESDSDLSDVpeldsDIEQETQI----NYDRQVYKEDLPQLKQQSKEKNHvvpflkrekapED 311
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 312 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 388
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622855130 389 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 462
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 994-1045 | 6.43e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 6.43e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1622855130 994 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGtDIIFHTAAAGIVQN 1045
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 1326-1359 | 2.09e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 2.09e-04
10 20 30
....*....|....*....|....*....|....
gi 1622855130 1326 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1359
Cdd:smart00320 8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 1326-1359 | 3.37e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 39.64 E-value: 3.37e-04
10 20 30
....*....|....*....|....*....|....
gi 1622855130 1326 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1359
Cdd:pfam00400 7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 396-438 | 9.71e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 9.71e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 1622855130 396 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 438
Cdd:smart00320 1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
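Many of the WD40 hits listed above cover overlapping stretches of the query. One simple way to summarize them is to merge overlapping Interval values into contiguous regions. The sketch below does this for the WD40 intervals copied from the hit list above. The function name `merge_intervals` is hypothetical, and merging is only a bookkeeping step for this example, not the procedure CDD uses to define domain boundaries.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) residue intervals, inclusive ends."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# Interval column of the WD40 hits (cd00200, COG2319, smart00320, pfam00400)
# copied from the hit list above.
wd40_hits = [
    (361, 662), (344, 776), (1115, 1600), (516, 902), (4, 480),
    (1057, 1407), (531, 857), (157, 462), (1326, 1359), (1326, 1359),
    (396, 438),
]

regions = merge_intervals(wd40_hits)
print(regions)
```

For these intervals the merge yields two regions, 4-902 and 1057-1600. The two HELP motif hits (304-351 and 994-1045) are not part of the input list.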
|
| Name | Accession | Description | Interval | E-value |
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 361-662 | 1.58e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 137.47 E-value: 1.58e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 361 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 440
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 441 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQQAGGGFTSkrgTFGsiGKLETMMCVSYGRMEDLVFSGA 518
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVETGKCLT---TLR--GHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 519 ATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLLL 591
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYLL 234
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622855130 592 ednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 662
Cdd:cd00200 235 ----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 344-776 | 2.61e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 137.35 E-value: 2.61e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 344 AVAVVYNRQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCL 423
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTL 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 424 VSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPhhvD--KLVTVGI-KHIKFWQqagggftskrgtfgsigkle 500
Cdd:COG2319 136 ASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWD-------------------- 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 501 tmmcvsygrmedlvfsgAATGDifiwkdilLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaik 578
Cdd:COG2319 191 -----------------LATGK--------LLRTLTGHTGAVRSVaFSPDgKLLASGSADGTVRLWD------------- 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 579 rsalstsskglllednpsiraitlghghilvgTKNGEILeidksgpmtLLVQGHmEGEVWGLAAHP---LLpicATVSDD 655
Cdd:COG2319 233 --------------------------------LATGKLL---------RTLTGH-SGSVRSVAFSPdgrLL---ASGSAD 267
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 656 KTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLA 735
Cdd:COG2319 268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPD-GKTLA 346
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1622855130 736 VASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLL 776
Cdd:COG2319 347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1115-1600 | 1.77e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 131.96 E-value: 1.77e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1115 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1194
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1195 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1271
Cdd:COG2319 80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1272 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1351
Cdd:COG2319 147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1352 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1429
Cdd:COG2319 183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1430 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteamviekitwasw 1508
Cdd:COG2319 262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1509 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1588
Cdd:COG2319 323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
490
....*....|..
gi 1622855130 1589 tGGDDCSVFVWR 1600
Cdd:COG2319 390 -GSADGTVRLWD 400
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1064-1444 | 8.27e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 130.03 E-value: 8.27e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1064 LTVNQHPKYRNVVATSQIGTT-------PSIHVWDAMTKHTLSMLRcFHSKGVNYVNFSATGKLLVSVGVDpeHTITVWR 1136
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLlasasadGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1137 WQEGAKVASRGGHLERIFVVEFRPDSdTQFVSVGV-KHMKFWTLAGSALLYKkgVIGSLGAAkmqtmLSVAFGANNLTF- 1214
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLRT--LTGHTGAV-----RSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1215 TGAINGDVYVW--KDHFLIRLVaKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWDqemkrcrafqLETGQLVE 1291
Cdd:COG2319 221 SGSADGTVRLWdlATGKLLRTL-TGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGELLR 282
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1292 cvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKVSl 1371
Cdd:COG2319 283 ---------------------------------TLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT- 327
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622855130 1372 GHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFLAVGSSEHTVDFYDLT 1444
Cdd:COG2319 328 GHTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 354-707 | 2.27e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 125.41 E-value: 2.27e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 354 HSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHS 433
Cdd:COG2319 111 LLLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 434 IVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKFWQQAGGGFtskRGTFGsiGKLETMMCVSY---GR 509
Cdd:COG2319 186 VRLWDLATGKLLRTLTGHTGAVRSVAFSPDG-KLLASGSAdGTVRLWDLATGKL---LRTLT--GHSGSVRSVAFspdGR 259
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 510 medLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaikrsalsts 585
Cdd:COG2319 260 ---LLASGSADGTVRLWdlATGELLRTLTGHSGGVNSVaFSPDgKLLASGSDDGTVRLWD-------------------- 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 586 skglllednpsiraitlghghilvgTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSA 665
Cdd:COG2319 317 -------------------------LATGKLLRT---------LTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLAT 361
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 1622855130 666 QHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADT 707
Cdd:COG2319 362 GELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 516-902 | 1.51e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.24 E-value: 1.51e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 516 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 593
Cdd:COG2319 55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 594 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 667
Cdd:COG2319 120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 668 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 747
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 748 VLTSKRVGICKGASSYITHIDWDSRGKLLqvnsgakeqlffeAPRGKRHIIRpseiekiQWDTWTcvlgPTCEGIWPAHS 827
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-------------ASGSDDGTVR-------LWDLAT----GKLLRTLTGHT 330
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622855130 828 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVANVRWLHNDSVLLTvGGADTALMIW 902
Cdd:COG2319 331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 579-902 | 7.82e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.85 E-value: 7.82e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 579 RSALSTSSKGLLLEDNPSIRAITLGHGHILVGTKNGEILEIDKSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTL 658
Cdd:COG2319 24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 659 RIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVAS 738
Cdd:COG2319 103 RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPD-GKLLASGS 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 739 HDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLLQVNSGAKEqlffeaprgkrhiIRpseiekiQWDtwtcVLGPT 818
Cdd:COG2319 182 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT-------------VR-------LWD----LATGK 237
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 819 CEGIWPAHSDitDVNAASLTKDCSLLATGDDFGFVKLFSyPVKGQHARFKKyvGHSAHVANVRWLHNDSVLLTvGGADTA 898
Cdd:COG2319 238 LLRTLTGHSG--SVRSVAFSPDGRLLASGSADGTVRLWD-LATGELLRTLT--GHSGGVNSVAFSPDGKLLAS-GSDDGT 311
|
....
gi 1622855130 899 LMIW 902
Cdd:COG2319 312 VRLW 315
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 4-480 | 1.07e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.46 E-value: 1.07e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 4 AVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGEC 83
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLAT-----GKL 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 84 SKSLS----FITHIDWSLDSKYLQTNDGAGERLFYKMPSGKSLTSkeeikgipwaswtcVKGPEVSgiwpkytevtdINS 159
Cdd:COG2319 155 LRTLTghsgAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT--------------LTGHTGA-----------VRS 209
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 160 VDANYSSSVLVSGDDFGLVKLFRfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRfipegvsngml 239
Cdd:COG2319 210 VAFSPDGKLLASGSADGTVRLWD---LATGKLLRTLTGHSGSVRSVAFSPDGRLLAS-GSADGTVRLWD----------- 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 240 etapqeggtdsyseesdsdlsdvpeldsdieqetqinydrqvykedlpqlkqqskeknhvvpflkrekapedslklqfih 319
Cdd:COG2319 --------------------------------------------------------------------------------
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 320 gyrgydcrnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKC 399
Cdd:COG2319 275 -------------LATGELL-----------------RTLTGHSGGVNSVAFSPDGKLLASG--SDDGTVRLWDLATGKL 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 400 LSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKF 478
Cdd:COG2319 323 LRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRLWDLATGELLRTLTGHTGAVTSVAFSPDG-RTLASGSAdGTVRL 398
|
..
gi 1622855130 479 WQ 480
Cdd:COG2319 399 WD 400
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1045-1362 | 2.26e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 107.69 E-value: 2.26e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1045 NLSTGSQSFYLE-HTDDILCLTVnqHPKYRNVVATSQIGTtpsIHVWDAMTKHTLSMLRCfHSKGVNYVNFSATGKLLVS 1123
Cdd:COG2319 106 DLATGLLLRTLTgHTGAVRSVAF--SPDGKTLASGSADGT---VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLAS 179
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1124 VGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTqFVSVGV-KHMKFWTLAGSALLYKKGviGSLGAAkmqtm 1202
Cdd:COG2319 180 GSDD--GTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKL-LASGSAdGTVRLWDLATGKLLRTLT--GHSGSV----- 249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1203 LSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDqemkrcr 1280
Cdd:COG2319 250 RSVAFSPDGRLLaSGSADGTVRLWDlATGELLRTLTGHSGGVNSVAFSPDGKLLASGS------DDGTVRLWD------- 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1281 afqLETGQLV-------ECVRSVC-RGKGKILV-GTKDGEI----IEVGEKNAAsnilIDGHmEGEIWGLATHPSKDLFI 1347
Cdd:COG2319 317 ---LATGKLLrtltghtGAVRSVAfSPDGKTLAsGSDDGTVrlwdLATGELLRT----LTGH-TGAVTSVAFSPDGRTLA 388
|
330
....*....|....*
gi 1622855130 1348 SASNDGTARIWDLAD 1362
Cdd:COG2319 389 SGSADGTVRLWDLAT 403
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 359-565 | 5.88e-23 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 100.87 E-value: 5.88e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 359 YLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 438
Cdd:cd00200 89 LTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQD--GTIKLWD 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 439 WKKGEKIATTRGHKDKIFVVKCNPHHVDKLVTVGIKHIKFWQQAGGgftSKRGTFgsIGKLETMMCVSYGRMEDLVFSGA 518
Cdd:cd00200 164 LRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG---KCLGTL--RGHENGVNSVAFSPDGYLLASGS 238
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 1622855130 519 ATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD 565
Cdd:cd00200 239 EDGTIRVWdlRTGECVQTLSGHTNSVTSLAWSPDGkrLASGSADGTIRIWD 289
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1057-1407 | 6.35e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.79 E-value: 6.35e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1057 HTDDILCLTVNQHPKYrnVVATSQIGTtpsIHVWDamtKHTLSMLRCF--HSKGVNYVNFSATGKLLVSVGVDpeHTITV 1134
Cdd:cd00200 8 HTGGVTCVAFSPDGKL--LATGSGDGT---IKVWD---LETGELLRTLkgHTGPVRDVAASADGTYLASGSSD--KTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1135 WRWQEGAKVASRGGHLERIFVVEFRPDSdtQFVSVGVKH--MKFWTLAgsallykKGVIGSLGAAKMQTMLSVAF-GANN 1211
Cdd:cd00200 78 WDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVE-------TGKCLTTLRGHTDWVNSVAFsPDGT 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1212 LTFTGAINGDVYVW--KDHFLIRlVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRCRAfqletgql 1289
Cdd:cd00200 149 FVASSSQDGTIKLWdlRTGKCVA-TLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLG-------- 213
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1290 vecvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKV 1369
Cdd:cd00200 214 -----------------------------------TLRGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL 257
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 1622855130 1370 SlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNSLKVW 1407
Cdd:cd00200 258 S-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 2-438 | 1.05e-21 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 99.60 E-value: 1.05e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 2 EEAVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiG 81
Cdd:COG2319 120 TGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT-----G 194
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 82 ECSKSL----SFITHIDWSLDSKYLQTNDGAGE-RLFykmpsgkSLTSKEEIKGIPWASWTcvkgpevsgiwpkytevtd 156
Cdd:COG2319 195 KLLRTLtghtGAVRSVAFSPDGKLLASGSADGTvRLW-------DLATGKLLRTLTGHSGS------------------- 248
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 157 INSVDANYSSSVLVSGDDFGLVKLFRfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWrfipegvsn 236
Cdd:COG2319 249 VRSVAFSPDGRLLASGSADGTVRLWD---LATGELLRTLTGHSGGVNSVAFSPDGKLLAS-GSDDGTVRLW--------- 315
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 237 gmletapqeggtdsyseesdsDLSDvpeldsdieqetqinydrqvykedlpqlkqqskeknhvvpflkrekapedslklq 316
Cdd:COG2319 316 ---------------------DLAT------------------------------------------------------- 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 317 fihgyrgydcrnnlfytqaGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQT 396
Cdd:COG2319 320 -------------------GKLL-----------------RTLTGHTGAVRSVAFSPDGKTLASG--SDDGTVRLWDLAT 361
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 1622855130 397 LKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 438
Cdd:COG2319 362 GELLRTLTG-HTGAVTSVAFSPDGRTLASGSAD--GTVRLWD 400
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 157-748 | 6.32e-21 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 97.29 E-value: 6.32e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 157 INSVDANYSSSVLVSGDDFGLVKLFRfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRfipegvsn 236
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWD---LATGLLLRTLTGHTGAVRSVAFSPDGKTLAS-GSADGTVRLWD-------- 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 237 gmletapqeggtdsyseesdsdlsdvpeldsdieqetqinydrqvykedlpqlkqqskeknhvvpflkrekapedslklq 316
Cdd:COG2319 --------------------------------------------------------------------------------
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 317 fihgyrgydcrnnlfyTQAGEVVYHIAavavvynrqqhsqrlylGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQT 396
Cdd:COG2319 149 ----------------LATGKLLRTLT-----------------GHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLAT 193
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 397 LKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFvvkcnphhvdklvtvgikhi 476
Cdd:COG2319 194 GKLLRTLTG-HTGAVRSVAFSPDGKLLASGSAD--GTVRLWDLATGKLLRTLTGHSGSVR-------------------- 250
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 477 kfwqqagggftskrgtfgsigkletmmcvsygrmeDLVFSgaatgdifiwkdilllktvkaHDGPVFAmyaldkgfvTGG 556
Cdd:COG2319 251 -----------------------------------SVAFS---------------------PDGRLLA---------SGS 265
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 557 KDGIVELWDdmferclktyaikrsalstsskglllednpsiraitlghghilvgTKNGEILEidksgpmtlLVQGHmEGE 636
Cdd:COG2319 266 ADGTVRLWD---------------------------------------------LATGELLR---------TLTGH-SGG 290
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 637 VWGLAAHP---LLpicATVSDDKTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVS 713
Cdd:COG2319 291 VNSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRT 367
|
570 580 590
....*....|....*....|....*....|....*
gi 1622855130 714 FHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNV 748
Cdd:COG2319 368 LTGHTGAVTSVAFSPD-GRTLASGSADGTVRLWDL 401
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 304-351 | 4.64e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 4.64e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 1622855130 304 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 351
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1105-1442 | 5.77e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 92.01 E-value: 5.77e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1105 HSKGVNYVNFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSdTQFVSVGVKHM-KFWTLAGSA 1183
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSDKTiRLWDLETGE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1184 LLYkkgvigSLGAAKmQTMLSVAFGANNLTFTGAI-NGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDGLIVTGGKEr 1261
Cdd:cd00200 85 CVR------TLTGHT-SYVSSVAFSPDGRILSSSSrDKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVASSSQ- 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1262 ptkeGGAVKLWDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHP 1341
Cdd:cd00200 156 ----DGTIKLWD----------LRTGKCVA---------------------------------TLTGH-TGEVNSVAFSP 187
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1342 SKDLFISASNDGTARIWDLADKKLLnKVSLGH--AARCAAYSPDGEMVAIGMKNGefvillvnSLKVW-GKKRDRK---- 1414
Cdd:cd00200 188 DGEKLLSSSSDGTIKLWDLSTGKCL-GTLRGHenGVNSVAFSPDGYLLASGSEDG--------TIRVWdLRTGECVqtls 258
|
330 340 350
....*....|....*....|....*....|.
gi 1622855130 1415 ---SAIQDIRISPDNRFLAVGSSEHTVDFYD 1442
Cdd:cd00200 259 ghtNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1236-1600 | 1.87e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 90.47 E-value: 1.87e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1236 KAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRC-RAFQLETGQlVECVRSVCRGKgKILVGTKDGEIIE 1314
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGS------GDGTIKVWDLETGELlRTLKGHTGP-VRDVAASADGT-YLASGSSDKTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1315 VGEKNAASNILIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLnKVSLGH--AARCAAYSPDGEMVAIGMK 1392
Cdd:cd00200 78 WDLETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCL-TTLRGHtdWVNSVAFSPDGTFVASSSQ 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1393 NGefvillvnSLKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEHTVDFYDLTQGTNLNRIGYCKDipsFVIQ 1464
Cdd:cd00200 156 DG--------TIKLWdlrtGKCVATltghTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN---GVNS 224
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1465 MDFSADGKYIqvstgaykrqvhevplgkqvteamviekitwaswTSVLGDEVIGIWprnadkadvncacvthaglNIVTG 1544
Cdd:cd00200 225 VAFSPDGYLL----------------------------------ASGSEDGTIRVW-------------------DLRTG 251
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1622855130 1545 DdfglvklfdfpctekfaKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVWR 1600
Cdd:cd00200 252 E-----------------CVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 531-857 | 9.83e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 88.55 E-value: 9.83e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 531 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 601
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 602 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRC 681
Cdd:cd00200 77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 682 CAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGAS 761
Cdd:cd00200 141 VAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD-GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHE 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 762 SYITHIDWDSRGKLLQvnSGAKEQlffeaprgkrhIIRpseiekiQWDTWTCVLGPTCEGiwpaHSdiTDVNAASLTKDC 841
Cdd:cd00200 220 NGVNSVAFSPDGYLLA--SGSEDG-----------TIR-------VWDLRTGECVQTLSG----HT--NSVTSLAWSPDG 273
|
330
....*....|....*.
gi 1622855130 842 SLLATGDDFGFVKLFS 857
Cdd:cd00200 274 KRLASGSADGTIRIWD 289
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1326-1600 | 6.97e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 85.85 E-value: 6.97e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1326 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNgefvillvNS 1403
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLK-GHTGpvRDVAASADGTYLASGSSD--------KT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1404 LKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEHTVDFYDLTQGTNLNRIGYCKDipsFVIQMDFSADGKYIq 1475
Cdd:cd00200 75 IRLWdletGECVRTltghTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTFV- 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1476 vstgaykrqvhevplgkqvteamviekitwaswTSVLGDEVIGIWprNAD-----------KADVNCACVTHAGLNIVTG 1544
Cdd:cd00200 151 ---------------------------------ASSSQDGTIKLW--DLRtgkcvatltghTGEVNSVAFSPDGEKLLSS 195
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*.
gi 1622855130 1545 DDFGLVKLFDFpctEKFAKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVWR 1600
Cdd:cd00200 196 SSDGTIKLWDL---STGKCLGTLRGHENGVNSVAFSPDGYLLAS-GSEDGTIRVWD 247
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 4-565 | 8.98e-17 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 84.58 E-value: 8.98e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 4 AVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQRyKKIGEC 83
Cdd:COG2319 38 AVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATG-LLLRTL 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 84 SKSLSFITHIDWSLDSKYLqtndgagerlfykmpsgksltskeeikgipwaswtcvkgpevsgiwpkytevtdinsvdan 163
Cdd:COG2319 117 TGHTGAVRSVAFSPDGKTL------------------------------------------------------------- 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 164 ysssvlVSGDDFGLVKLFRfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWvLSTGGADHSVFQWRfipegvsngmletap 243
Cdd:COG2319 136 ------ASGSADGTVRLWD---LATGKLLRTLTGHSGAVTSVAFSPDGKL-LASGSDDGTVRLWD--------------- 190
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 244 qeggtdsyseesdsdlsdvpeldsdieqetqinydrqvykedlpqlkqqskeknhvvpflkrekapedslklqfihgyrg 323
Cdd:COG2319 --------------------------------------------------------------------------------
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 324 ydcrnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLL 403
Cdd:COG2319 191 ---------LATGKLL-----------------RTLTGHTGAVRSVAFSPDGKLLASG--SADGTVRLWDLATGKLLRTL 242
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 404 KGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPhhvD--KLVTVGI-KHIKFWQ 480
Cdd:COG2319 243 TG-HSGSVRSVAFSPDGRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSP---DgkLLASGSDdGTVRLWD 316
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 481 QAGGgftSKRGTFGsiGKLETMMCVSYGRMEDLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGG 556
Cdd:COG2319 317 LATG---KLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGTVRLWdlATGELLRTLTGHTGAVTSVAFSPDGrtLASGS 391
|
....*....
gi 1622855130 557 KDGIVELWD 565
Cdd:COG2319 392 ADGTVRLWD 400
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1204-1480 |
8.70e-16 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 79.69 E-value: 8.70e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1204 SVAFGA-NNLTFTGAINGDVYVWK---DHFLIRLvaKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRC 1279
Cdd:cd00200 14 CVAFSPdGKLLATGSGDGTIKVWDletGELLRTL--KGHTGPVRDVAASADGTYLASGS------SDKTIRLWDLETGEC 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1280 -RAFQLETGQlvecVRSVC-RGKGKILVGT-KDGEIIEVGEKNAASNILIDGHmEGEIWGLATHPSKDLFISASNDGTAR 1356
Cdd:cd00200 86 vRTLTGHTSY----VSSVAfSPDGRILSSSsRDKTIKVWDVETGKCLTTLRGH-TDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1357 IWDLADKKLLnKVSLGH--AARCAAYSPDGEMVAIGMKNGefvillvnSLKVWGKKRDR--------KSAIQDIRISPDN 1426
Cdd:cd00200 161 LWDLRTGKCV-ATLTGHtgEVNSVAFSPDGEKLLSSSSDG--------TIKLWDLSTGKclgtlrghENGVNSVAFSPDG 231
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1622855130 1427 RFLAVGSSEHTVDFYDLTQGTNLNRI-GYckdiPSFVIQMDFSADGKYIqVSTGA 1480
Cdd:cd00200 232 YLLASGSEDGTIRVWDLRTGECVQTLsGH----TNSVTSLAWSPDGKRL-ASGSA 281
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
157-462 |
1.82e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 75.83 E-value: 1.82e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 157 INSVDANYSSSVLVSGDDFGLVKLFRfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 236
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 237 gmletapqEGGTDSYS-EESDSDLSDVpeldsDIEQETQI----NYDRQVYKEDLPQLKQQSKEKNHvvpflkrekapED 311
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 312 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 388
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622855130 389 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 462
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1011-1273 |
4.59e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 74.68 E-value: 4.59e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1011 HVFGYRGFDCRNNLHYL----NDGTDIIFhtaaagivqNLSTGSQSFYLE-HTDDILCLTVNQHPKYrnVVATSQIGTtp 1085
Cdd:cd00200 50 HTGPVRDVAASADGTYLasgsSDKTIRLW---------DLETGECVRTLTgHTSYVSSVAFSPDGRI--LSSSSRDKT-- 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1086 sIHVWDAMTKHTLSMLRCfHSKGVNYVNFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQ 1165
Cdd:cd00200 117 -IKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQD--GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKL 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1166 FVSVGVKHMKFWTLAGSALLYkkgvigsLGAAKMQTMLSVAFGANNLTFTGA-INGDVYVWK-DHFLIRLVAKAHTGPVF 1243
Cdd:cd00200 193 LSSSSDGTIKLWDLSTGKCLG-------TLRGHENGVNSVAFSPDGYLLASGsEDGTIRVWDlRTGECVQTLSGHTNSVT 265
|
250 260 270
....*....|....*....|....*....|
gi 1622855130 1244 TMYTTLRDGLIVTGGkerptkEGGAVKLWD 1273
Cdd:cd00200 266 SLAWSPDGKRLASGS------ADGTIRIWD 289
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
994-1045 |
6.43e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 6.43e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1622855130 994 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGtDIIFHTAAAGIVQN 1045
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
4-74 |
3.72e-09 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 60.70 E-value: 3.72e-09
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622855130 4 AVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA 74
Cdd:COG2319 332 AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
2-228 |
5.41e-09 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 59.27 E-value: 5.41e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 2 EEAVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVaqrykKIG 81
Cdd:cd00200 93 TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDL-----RTG 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 82 ECSKSL----SFITHIDWSLDSKYLQTNDGAGERLFYKMPSGK---SLTSKEEIkgipwaswtcvkgpevsgiwpkytev 154
Cdd:cd00200 168 KCVATLtghtGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKclgTLRGHENG-------------------------- 221
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622855130 155 tdINSVDANYSSSVLVSGDDFGLVKLFRfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWR 228
Cdd:cd00200 222 --VNSVAFSPDGYLLASGSEDGTIRVWD---LRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1343-1474 |
3.20e-05 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 47.38 E-value: 3.20e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1343 KDLFISASNDGTARIWDLADKKLLNKVSLGHAARCAAYSPDGEMVAI-GMKNGEFVILLVNSLKVWGKKRDRKSAiQDIR 1421
Cdd:COG3391 80 RRLYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLYVaDSGNGRVSVIDTATGKVVATIPVGAGP-HGIA 158
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 1622855130 1422 ISPDNRFLAVGSSE-HTVDFY----DLTQGTNLNRIgyckDIPSFVIQMDFSADGKYI 1474
Cdd:COG3391 159 VDPDGKRLYVANSGsNTVSVIvsviDTATGKVVATI----PVGGGPVGVAVSPDGRRL 212
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
4-102 |
5.25e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.11 E-value: 5.25e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 4 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 78
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100
....*....|....*....|....
gi 1622855130 79 KIGEcSKSLSFITHIDWSLDSKYL 102
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWL 446
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1335-1446 |
1.80e-04 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 45.07 E-value: 1.80e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622855130 1335 WGLATHPSKD-LFISASNDGTARIWDLADKKLLNKVSLGHAARCAAYSPDGEMVAIGMKNGEFVILLV-----NSLKVWg 1408
Cdd:COG3391 113 RGLAVDPDGGrLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVSVIVsvidtATGKVV- 191
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 1622855130 1409 KKRDRKSAIQDIRISPDNRFLAV--------GSSEHTVDFYDLTQG 1446
Cdd:COG3391 192 ATIPVGGGPVGVAVSPDGRRLYVanrgsntsNGGSNTVSVIDLATL 237
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1326-1359 |
2.09e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 2.09e-04
10 20 30
....*....|....*....|....*....|....
gi 1622855130 1326 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1359
Cdd:smart00320 8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1326-1359 |
3.37e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 39.64 E-value: 3.37e-04
10 20 30
....*....|....*....|....*....|....
gi 1622855130 1326 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1359
Cdd:pfam00400 7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
396-438 |
9.71e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 9.71e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 1622855130 396 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 438
Cdd:smart00320 1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
1378-1442 |
1.77e-03 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 39.18 E-value: 1.77e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622855130 1378 AAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRD-RKSAIQDIRISPDNRFLAVGSSEHTVDFYD 1442
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDkEDLEVTSLAWRPDGKLLAVGYSDGTVRLLD 66
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1565-1599 |
2.39e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 37.32 E-value: 2.39e-03
10 20 30
....*....|....*....|....*....|....*
gi 1622855130 1565 KRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVW 1599
Cdd:pfam00400 5 KTLEGHTGSVTSLAFSPDGKLLAS-GSDDGTVKVW 38
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1560-1599 |
4.51e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 4.51e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1622855130 1560 KFAKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVW 1599
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLAS-GSDDGTIKLW 39
|
|
|
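Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch for readers who want to tabulate these hits, the summary line can be parsed with a regular expression. The names `HitSummary` and `parse_summary` are hypothetical and are not part of any NCBI tool; the pattern assumes the field order and labels are exactly as they appear in this listing, with the "[Multi-domain]" tag optional.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    """Fields of one summary line (hypothetical container, for illustration)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the field order seen in this listing; "[Multi-domain]" may be absent.
_SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*([\d.eE+-]+)"
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed fields, or None if the line is not a summary line."""
    m = _SUMMARY.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


if __name__ == "__main__":
    # Summary lines copied from the hits above.
    lines = [
        "Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 79.69 E-value: 8.70e-16",
        "Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 6.43e-13",
    ]
    for hit in sorted(filter(None, map(parse_summary, lines)), key=lambda h: h.e_value):
        print(hit)
```

Sorting by `e_value` reproduces the order in which the hits are listed here, from most to least significant. Alignment rows and description text are deliberately ignored by this sketch.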