| Name | Accession | Description | Interval | E-value |
| WD40 super family | cl29593 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 725-1063 | 8.50e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 135.93 E-value: 8.50e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 725 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 805 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 882
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 883 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 958
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 959 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 1038
Cdd:cd00200 216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
|
330 340
....*....|....*....|....*
gi 2462539124 1039 LKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:cd00200 260 HTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 58-438 | 2.55e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.88 E-value: 2.55e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 58 RGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDW 137
Cdd:COG2319 75 LGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 138 KRGKMLSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALN 217
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDD 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 218 GDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR 292
Cdd:COG2319 184 GTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 293 --GDHILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPI 369
Cdd:COG2319 256 pdGRLLASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAV 333
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462539124 370 RCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 438
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1420-1807 | 3.31e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.49 E-value: 3.31e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1420 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1499
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1500 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEdARMQTMLAIAFGANNLTF- 1577
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLT-GHTGAVRSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1578 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1655
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1656 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1735
Cdd:COG2319 281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462539124 1736 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1807
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 895-1266 | 6.73e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.53 E-value: 6.73e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 967
Cdd:COG2319 70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 968 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 1043
Cdd:COG2319 134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1044 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1123
Cdd:COG2319 208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1124 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1203
Cdd:COG2319 287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462539124 1204 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1266
Cdd:COG2319 341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 2-49 | 7.14e-19 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 82.60 E-value: 7.14e-19
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2462539124 2 AARSAPSCHLRLEWVYGYRGHQCRNNLYYTAAKEIVYFVAGVGVVYSP 49
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 672-715 | 9.17e-18 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 79.13 E-value: 9.17e-18
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2462539124 672 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 715
Cdd:pfam03451 29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 1355-1402 | 1.35e-14 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 70.27 E-value: 1.35e-14
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2462539124 1355 KKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1402
Cdd:pfam03451 26 KKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
|
|
| WD40 super family | cl29593 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1689-1945 | 4.10e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 71.98 E-value: 4.10e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1689 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1766
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1767 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1837
Cdd:cd00200 75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1838 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1915
Cdd:cd00200 150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
|
250 260 270
....*....|....*....|....*....|....
gi 2462539124 1916 FDFPCPEKFVSLC----FVYYYQFTPNFDVLSSA 1945
Cdd:cd00200 204 WDLSTGKCLGTLRghenGVNSVAFSPDGYLLASG 237
|
|
|
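The per-hit summary lines and alignment rows above follow a regular layout, so they can be read programmatically. The following is a minimal Python sketch, not an official NCBI parser: the function names `parse_summary` and `parse_row` and both regular expressions are assumptions made for illustration, based only on the line shapes seen in this listing. It extracts the fields of a `Pssm-ID: ...` line and checks that the number of non-gap residues in an alignment row matches its start and end coordinates.

```python
import re

# Assumed layout: "Pssm-ID: N [flag] Cd Length: N Bit Score: X E-value: Y"
# (the bracketed flag, e.g. "Multi-domain", is optional).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

# Assumed layout: "<label> <start> <sequence with '-' gaps> <end>"
ROW_RE = re.compile(
    r"^(?P<label>.+?)\s+(?P<start>\d+)\s+(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)$"
)


def parse_summary(line):
    """Return the fields of a 'Pssm-ID: ...' summary line, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def parse_row(line):
    """Return label, coordinates and sequence of one alignment row, or None."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    start, end = int(m["start"]), int(m["end"])
    # Upper- and lower-case letters are both residues; only '-' is a gap.
    residues = sum(1 for c in m["seq"] if c != "-")
    return {
        "label": m["label"],
        "start": start,
        "end": end,
        "seq": m["seq"],
        "consistent": residues == end - start + 1,
    }
```

For example, `parse_row` applied to a `gi 2462539124 ...` or `Cdd:...` row returns its coordinates together with a `consistent` flag, which can help spot rows that were truncated or merged during extraction.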
| Name | Accession | Description | Interval | E-value |
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 725-1063 | 8.50e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 135.93 E-value: 8.50e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 725 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 805 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 882
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 883 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 958
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 959 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 1038
Cdd:cd00200 216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
|
330 340
....*....|....*....|....*
gi 2462539124 1039 LKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:cd00200 260 HTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 720-1112 | 1.58e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 138.12 E-value: 1.58e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 720 QRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVV 799
Cdd:COG2319 71 LATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGA-VRSVAFSPDGKTLASGSAD--GTVR 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 800 LWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIgrkgyigtlgkndtmmcavygwtee 876
Cdd:COG2319 146 LWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWDLATGKLL------------------------- 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 877 mafsgtstgdvciwrdiflvKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAikraalapgskgll 954
Cdd:COG2319 198 --------------------RTLTGHTGAVRSVAFSPDGklLASGSADGTVRLWDLATGKLLRTLT-------------- 243
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 955 lEDNPSIRAISLGH-GHILV-GTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLS 1028
Cdd:COG2319 244 -GHSGSVRSVAFSPdGRLLAsGSADGTVRLWDlATGELLRTLTGH-SGGVNSVAFSPdgkLL---ASGSDDGTVRLWDLA 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1029 PSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFID 1108
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVR 397
|
....
gi 2462539124 1109 IYNV 1112
Cdd:COG2319 398 LWDL 401
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 58-438 | 2.55e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.88 E-value: 2.55e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 58 RGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDW 137
Cdd:COG2319 75 LGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 138 KRGKMLSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALN 217
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDD 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 218 GDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR 292
Cdd:COG2319 184 GTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 293 --GDHILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPI 369
Cdd:COG2319 256 pdGRLLASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAV 333
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462539124 370 RCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 438
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1420-1807 | 3.31e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.49 E-value: 3.31e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1420 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1499
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1500 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEdARMQTMLAIAFGANNLTF- 1577
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLT-GHTGAVRSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1578 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1655
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1656 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1735
Cdd:COG2319 281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462539124 1736 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1807
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 57-353 | 2.32e-30 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 122.83 E-value: 2.32e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 57 YRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSVGLDskNAVCVWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 137 WKRGKMLSMAPGHTDRIFDISWDlyqPNK--LVSCGV-KHIKFWSLcgNALTPKRGVFGKTGDlqtILCLACARDE-LTY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFS---PDGriLSSSSRdKTIKVWDV--ETGKCLTTLRGHTDW---VNSVAFSPDGtFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 213 SGALNGDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDlretdqgYKGLSVR 287
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLsTGKCLGTLR-------GHENGVN 223
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 288 SVCW--RGDHILVGTQDS--EIFEIVVQERNKPFlimQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:cd00200 224 SVAFspDGYLLASGSEDGtiRVWDLRTGECVQTL---SGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 895-1266 | 6.73e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.53 E-value: 6.73e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 967
Cdd:COG2319 70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 968 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 1043
Cdd:COG2319 134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1044 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1123
Cdd:COG2319 208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1124 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1203
Cdd:COG2319 287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462539124 1204 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1266
Cdd:COG2319 341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1468-1770 | 3.96e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 3.96e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1468 HSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLagra 1546
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSdKTIRLWDL---- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1547 llSKKGLLSTLEDARmQTMLAIAFGANNLTFTGTIS-GDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDGLIVTGgker 1624
Cdd:cd00200 81 --ETGECVRTLTGHT-SYVSSVAFSPDGRILSSSSRdKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVAS---- 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1625 pSKEGGAVKLWD-QELRRCRAFRLETGQatdcVRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLA 1701
Cdd:cd00200 153 -SSQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH-ENGVNSVA 226
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462539124 1702 THPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHAAR--TVCYSPEGDMVAIGMKNGefiillvsSLKIW 1770
Cdd:cd00200 227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSADG--------TIRIW 288
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 895-1140 | 3.26e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 93.17 E-value: 3.26e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAIKRAALApgskglllednpsiRAISLGHGH-I 971
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDLETGELLRTLKGHTGPVR--------------DVAASADGTyL 66
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 972 LVGTKNGEILEVDKSGP---ITLlvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRKLKKGGRCCCF 1048
Cdd:cd00200 67 ASGSSDKTIRLWDLETGecvRTL--TGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAF 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1049 SPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKGATSYI 1128
Cdd:cd00200 144 SPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGV 222
|
250
....*....|..
gi 2462539124 1129 THIDWDIRGKLL 1140
Cdd:cd00200 223 NSVAFSPDGYLL 234
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-49 |
7.14e-19 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 82.60 E-value: 7.14e-19
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2462539124 2 AARSAPSCHLRLEWVYGYRGHQCRNNLYYTAAKEIVYFVAGVGVVYSP 49
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
672-715 |
9.17e-18 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 79.13 E-value: 9.17e-18
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2462539124 672 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 715
Cdd:pfam03451 29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1355-1402 |
1.35e-14 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 70.27 E-value: 1.35e-14
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2462539124 1355 KKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1402
Cdd:pfam03451 26 KKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1689-1945 |
4.10e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 71.98 E-value: 4.10e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1689 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1766
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1767 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1837
Cdd:cd00200 75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1838 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1915
Cdd:cd00200 150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
|
250 260 270
....*....|....*....|....*....|....
gi 2462539124 1916 FDFPCPEKFVSLC----FVYYYQFTPNFDVLSSA 1945
Cdd:cd00200 204 WDLSTGKCLGTLRghenGVNSVAFSPDGYLLASG 237
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
1741-1823 |
1.41e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 45.35 E-value: 1.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1741 VCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRR-CAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCK 1819
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDKEdLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80
|
....
gi 2462539124 1820 DIPS 1823
Cdd:pfam12894 81 DLIT 84
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1701-1917 |
1.86e-04 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 46.06 E-value: 1.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1701 ATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVC-YSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCA 1779
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLaASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1780 IHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKH-- 1856
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL---TGHTGAVRSVAFSPDGKTL-ASGSADGTvRLWDLATGKLlr 156
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462539124 1857 -LMDHA-AIDRITWA----TWTSILGDEVLGIWSRHAEK---------ADVNCACVSHSGISLVTGDDFGMVKLFD 1917
Cdd:COG2319 157 tLTGHSgAVTSVAFSpdgkLLASGSDDGTVRLWDLATGKllrtltghtGAVRSVAFSPDGKLLASGSADGTVRLWD 232
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
760-802 |
3.71e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 3.71e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2462539124 760 TIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:smart00320 1 SGELLKTLKGHTGP-VTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1690-1722 |
3.98e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 3.98e-04
10 20 30
....*....|....*....|....*....|...
gi 2462539124 1690 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1722
Cdd:smart00320 9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
314-353 |
1.18e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 1.18e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 2462539124 314 NKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:pfam00400 1 GKLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
315-353 |
2.10e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.68 E-value: 2.10e-03
10 20 30
....*....|....*....|....*....|....*....
gi 2462539124 315 KPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
762-802 |
4.54e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 4.54e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2462539124 762 KPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:pfam00400 2 KLLKTLEGHTG-SVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
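Each hit above carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...` and an interval such as `895-1140`. The following is a minimal sketch of how those two fields might be read programmatically. The names `HitSummary`, `parse_summary`, and `interval_span` are hypothetical and are not part of any NCBI tool; the pattern assumes only the layout visible in this listing.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern assumed from the summary lines shown in this listing;
# the '[Multi-domain]' tag is optional (HELP hits do not carry it).
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s*Cd Length:\s*(?P<length>\d+)"
    r"\s*Bit Score:\s*(?P<bits>[\d.]+)"
    r"\s*E-value:\s*(?P<evalue>[\d.]+(?:e[+-]?\d+)?)",
    re.IGNORECASE,
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed summary, or None if the line does not match."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m.group("pssm")),
        multi_domain=m.group("multi") is not None,
        cd_length=int(m.group("length")),
        bit_score=float(m.group("bits")),
        e_value=float(m.group("evalue")),
    )


def interval_span(interval: str) -> int:
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 238121 [Multi-domain] Cd Length: 289 "
        "Bit Score: 93.17 E-value: 3.26e-20"
    )
    print(hit)
    print(interval_span("895-1140"))
```

Comparing `interval_span` with `cd_length` gives a rough sense of how much of the domain model a hit covers, though gaps in the alignment mean the two numbers are not directly equivalent.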
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1063 |
8.50e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 135.93 E-value: 8.50e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 725 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 805 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 882
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 883 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 958
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 959 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 1038
Cdd:cd00200 216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
|
330 340
....*....|....*....|....*
gi 2462539124 1039 LKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:cd00200 260 HTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
720-1112 |
1.58e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 138.12 E-value: 1.58e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 720 QRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVV 799
Cdd:COG2319 71 LATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGA-VRSVAFSPDGKTLASGSAD--GTVR 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 800 LWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIgrkgyigtlgkndtmmcavygwtee 876
Cdd:COG2319 146 LWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWDLATGKLL------------------------- 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 877 mafsgtstgdvciwrdiflvKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAikraalapgskgll 954
Cdd:COG2319 198 --------------------RTLTGHTGAVRSVAFSPDGklLASGSADGTVRLWDLATGKLLRTLT-------------- 243
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 955 lEDNPSIRAISLGH-GHILV-GTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLS 1028
Cdd:COG2319 244 -GHSGSVRSVAFSPdGRLLAsGSADGTVRLWDlATGELLRTLTGH-SGGVNSVAFSPdgkLL---ASGSDDGTVRLWDLA 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1029 PSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFID 1108
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVR 397
|
....
gi 2462539124 1109 IYNV 1112
Cdd:COG2319 398 LWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
58-438 |
2.55e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.88 E-value: 2.55e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 58 RGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDW 137
Cdd:COG2319 75 LGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 138 KRGKMLSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALN 217
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDD 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 218 GDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR 292
Cdd:COG2319 184 GTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 293 --GDHILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPI 369
Cdd:COG2319 256 pdGRLLASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAV 333
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462539124 370 RCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 438
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1420-1807 |
3.31e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.49 E-value: 3.31e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1420 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1499
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1500 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEdARMQTMLAIAFGANNLTF- 1577
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLT-GHTGAVRSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1578 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1655
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1656 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1735
Cdd:COG2319 281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462539124 1736 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1807
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
52-354 |
1.30e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.56 E-value: 1.30e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 52 HRQKFYRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNA 131
Cdd:COG2319 111 LLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 132 VCVWDWKRGKMLSMAPGHTDRIFDISWDlyqPN--KLVSCGV-KHIKFWSLcgnaltpKRGVFGKT--GDLQTILCLACA 206
Cdd:COG2319 186 VRLWDLATGKLLRTLTGHTGAVRSVAFS---PDgkLLASGSAdGTVRLWDL-------ATGKLLRTltGHSGSVRSVAFS 255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 207 RDELT-YSGALNGDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIdlretdQG 280
Cdd:COG2319 256 PDGRLlASGSADGTVRLWdlATGELLRTLTG-HSGGVNSVAFSPDGklLASGSDDGTVRLWDLaTGKLLRTL------TG 328
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462539124 281 YKGlSVRSVCWR--GDHILVGTQDSEI--FEIvvqERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSL 354
Cdd:COG2319 329 HTG-AVRSVAFSpdGKTLASGSDDGTVrlWDL---ATGELLRTLTGH-TGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
719-1063 |
1.69e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.18 E-value: 1.69e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 719 TQRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTV 798
Cdd:COG2319 112 LLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTGHSG-AVTSVAFSPDGKLLASGSDD--GTV 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 799 VLWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIGrkgyigTLGKNDtmmcavyGWTE 875
Cdd:COG2319 187 RLWDLATGKLLRTLTGHTGAVRSVAFSP---DgkLLASGSAdGTVRLWDLATGKLLR------TLTGHS-------GSVR 250
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 876 EMAFS--------GTSTGDVCIWR--DIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikr 943
Cdd:COG2319 251 SVAFSpdgrllasGSADGTVRLWDlaTGELLRTLTGHSGGVNSVAFSPDGklLASGSDDGTVRLWDLATGKLLRTL---- 326
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 944 aalapgskglllednpsiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLR 1023
Cdd:COG2319 327 --------------------------------------------------TGH-TGAVRSVAFSPDGKTLASGSDDGTVR 355
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 2462539124 1024 IWDLSPSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:COG2319 356 LWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGT 395
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
57-353 |
2.32e-30 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 122.83 E-value: 2.32e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 57 YRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSVGLDskNAVCVWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 137 WKRGKMLSMAPGHTDRIFDISWDlyqPNK--LVSCGV-KHIKFWSLcgNALTPKRGVFGKTGDlqtILCLACARDE-LTY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFS---PDGriLSSSSRdKTIKVWDV--ETGKCLTTLRGHTDW---VNSVAFSPDGtFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 213 SGALNGDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDlretdqgYKGLSVR 287
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLsTGKCLGTLR-------GHENGVN 223
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 288 SVCW--RGDHILVGTQDS--EIFEIVVQERNKPFlimQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:cd00200 224 SVAFspDGYLLASGSEDGtiRVWDLRTGECVQTL---SGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
260-802 |
3.82e-28 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 119.25 E-value: 3.82e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 260 RLWDLTFKPITVIDLRETDQGYKGLSVRSVCWRGDHILVGTQDSEIFEIVVQERNKPFLIMQGHcEGELWALAVHPTKPL 339
Cdd:COG2319 14 ADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 340 AVTGSDDRSVRIWSLVDHALIARCNM-EEPIRCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSP 418
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 419 DGTYLAVGCNDSSVDIYGVAQrykkvGECLGSL----SFITHLDWSSDSRYLQTNDGNGK-RLfYRMPGGKEVTSTEEik 493
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLLASGSADGTvRL-WDLATGKLLRTLTG-- 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 494 gvHWASWTCVSglevngiWpkysdindinSVDGnyigQVLVTADDYGIIKLFRypcLRKGAKFRKYIGHSAHVTNVRWSH 573
Cdd:COG2319 245 --HSGSVRSVA-------F----------SPDG----RLLASGSADGTVRLWD---LATGELLRTLTGHSGGVNSVAFSP 298
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 574 DYQWVISiGGADHSVFQWkfiperklkdavhiapqesladshsdesdsdlsdvpeldseieqetqltyrrqvykedlpql 653
Cdd:COG2319 299 DGKLLAS-GSDDGTVRLW-------------------------------------------------------------- 315
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 654 keqckekqksatskrrERAPGNSIRLHfvhgyrgydcrsnlfytqigeivyhvaavgviynrqqntqrfyLGHDDDILCL 733
Cdd:COG2319 316 ----------------DLATGKLLRTL-------------------------------------------TGHTGAVRSV 336
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462539124 734 TIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:COG2319 337 AFSPDGKTLASG--SDDGTVRLWDLATGELLRTLTGHTG-AVTSVAFSPDGRTLASGSAD--GTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
251-591 |
7.30e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.39 E-value: 7.30e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 251 ATGGRDGCIRLWDLTFKPITVIDLRETDqgykglSVRSVCWR--GDHILVGTQDSEI--FEIvvqERNKPFLIMQGHcEG 326
Cdd:COG2319 94 ASASADGTVRLWDLATGLLLRTLTGHTG------AVRSVAFSpdGKTLASGSADGTVrlWDL---ATGKLLRTLTGH-SG 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 327 ELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIK 405
Cdd:COG2319 164 AVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT 243
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 406 DRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVAQrykkvGECLGSL----SFITHLDWSSDSRYLQTNDGNGKrlfyrmp 481
Cdd:COG2319 244 GHSGSVRSVAFSPDGRLLASGSADGTVRLWDLAT-----GELLRTLtghsGGVNSVAFSPDGKLLASGSDDGT------- 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 482 ggkevtsteeIKGVHWASWTCVSGLEVNGIWpkysdindINSVDGNYIGQVLVTADDYGIIKLFRypcLRKGAKFRKYIG 561
Cdd:COG2319 312 ----------VRLWDLATGKLLRTLTGHTGA--------VRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTG 370
|
330 340 350
....*....|....*....|....*....|
gi 2462539124 562 HSAHVTNVRWSHDYQWVISiGGADHSVFQW 591
Cdd:COG2319 371 HTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
148-435 |
2.39e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 111.27 E-value: 2.39e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 148 GHTDRIFDISWDlYQPNKLVSCGV-KHIKFWSLCGNalTPKRGVFGKTGDLQTilCLACARDELTYSGALNGDIYVW--K 224
Cdd:cd00200 7 GHTGGVTCVAFS-PDGKLLATGSGdGTIKVWDLETG--ELLRTLKGHTGPVRD--VAASADGTYLASGSSDKTIRLWdlE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 225 GINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIdlretdQGYKGlSVRSVCWRGDHILV--G 299
Cdd:cd00200 82 TGECVRTLTG-HTSYVSSVAFSPDGriLSSSSRDKTIKVWDVeTGKCLTTL------RGHTD-WVNSVAFSPDGTFVasS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 300 TQDSEIFEIVVQErNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAVNADG 378
Cdd:cd00200 154 SQDGTIKLWDLRT-GKCVATLTGH-TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDG 231
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 2462539124 379 IHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIY 435
Cdd:cd00200 232 YLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
|
|
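The per-hit summary line in each record (Pssm-ID, Cd Length, Bit Score, E-value) and the Interval cell are regular enough to read programmatically. The following is a minimal sketch, assuming the line keeps exactly the layout shown in these records. The names `STAT_LINE`, `parse_stat_line` and `interval_length` are hypothetical helpers introduced for illustration and are not part of any NCBI tool.

```python
import re

# Hypothetical pattern for the summary line as it appears in this listing, e.g.
# "Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 111.27 E-value: 2.39e-26"
# The bracketed flag is optional (the HELP hits in this listing omit it).
STAT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stat_line(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    match = STAT_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def interval_length(interval):
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1
```

Comparing `interval_length` against `cd_length` gives a rough sense of how many gaps an alignment contains: a query interval much longer than the model length means the alignment carries long insertions in the query.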
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1449-1725 |
3.55e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 110.39 E-value: 3.55e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1449 SIHIWDAMNKQTLSILRCyHSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTq 1528
Cdd:COG2319 143 TVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKL- 218
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1529 FVSVGV-KHVKFWTLAGRALLSkkgllsTLEDARmQTMLAIAFGANNLTF-TGTISGDVCVWK-DHILCRIVARAHNGPV 1605
Cdd:COG2319 219 LASGSAdGTVRLWDLATGKLLR------TLTGHS-GSVRSVAFSPDGRLLaSGSADGTVRLWDlATGELLRTLTGHSGGV 291
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1606 FAMyTTLRDG-LIVTGGkerpskEGGAVKLWDQELRRC-RAFRLETGQatdcVRSVC-RGKGKILVGTRNAEIIEVGE-K 1681
Cdd:COG2319 292 NSV-AFSPDGkLLASGS------DDGTVRLWDLATGKLlRTLTGHTGA----VRSVAfSPDGKTLASGSDDGTVRLWDlA 360
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 2462539124 1682 NAACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIAD 1725
Cdd:COG2319 361 TGELLRTLTGH-TGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1420-1917 |
2.36e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.07 E-value: 2.36e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1420 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1499
Cdd:COG2319 30 LLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLL-GHTAAVLSVAFSPDGRLLASASAD--GTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1500 WQEGAKIASRAGHNQRIFVAEFRPDSDTqFVSVGV-KHVKFWTLAGRALLSkkgllstledarmqtmlaiafgannlTFT 1578
Cdd:COG2319 107 LATGLLLRTLTGHTGAVRSVAFSPDGKT-LASGSAdGTVRLWDLATGKLLR--------------------------TLT 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1579 GtisgdvcvwkdhilcrivaraHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdCVR 1657
Cdd:COG2319 160 G---------------------HSGAVTSV-AFSPDGkLLASGS------DDGTVRLWD----------LATGK---LLR 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1658 SVcrgkgkilvgtrnaeiievgeknaacnilvNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHA 1737
Cdd:COG2319 199 TL------------------------------TGH-TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT-GHS 246
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1738 A--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRI 1815
Cdd:COG2319 247 GsvRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTL 326
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1816 sycKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGKhlmdhaaidritwatwtsilgdeVLGIWSRHAekADVNCA 1895
Cdd:COG2319 327 ---TGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE-----------------------LLRTLTGHT--GAVTSV 378
|
490 500
....*....|....*....|..
gi 2462539124 1896 CVSHSGISLVTGDDFGMVKLFD 1917
Cdd:COG2319 379 AFSPDGRTLASGSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
895-1266 |
6.73e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.53 E-value: 6.73e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 967
Cdd:COG2319 70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 968 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 1043
Cdd:COG2319 134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1044 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1123
Cdd:COG2319 208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1124 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1203
Cdd:COG2319 287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462539124 1204 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1266
Cdd:COG2319 341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
712-929 |
1.20e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.41 E-value: 1.20e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 712 IYNRQQNTQRFYL-GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASV 790
Cdd:cd00200 77 LWDLETGECVRTLtGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRGHTDW-VNSVAFSPDGTFVASS 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 791 GIDdsHTVVLWDWKKGEKLSIARGSKDKIFVVKmnpYVPD--KLITAGI-KHMKFWRKAGGGLigrkgyIGTL-GKNDTM 866
Cdd:cd00200 154 SQD--GTIKLWDLRTGKCVATLTGHTGEVNSVA---FSPDgeKLLSSSSdGTIKLWDLSTGKC------LGTLrGHENGV 222
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462539124 867 MCAVYGWTEEMAFSGTSTGDVCIW--RDIFLVKTVKAHDGPVFSM--HALEKGFVTGGKDGIVALWD 929
Cdd:cd00200 223 NSVAFSPDGYLLASGSEDGTIRVWdlRTGECVQTLSGHTNSVTSLawSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1468-1770 |
3.96e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 3.96e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1468 HSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLagra 1546
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSdKTIRLWDL---- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1547 llSKKGLLSTLEDARmQTMLAIAFGANNLTFTGTIS-GDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDGLIVTGgker 1624
Cdd:cd00200 81 --ETGECVRTLTGHT-SYVSSVAFSPDGRILSSSSRdKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVAS---- 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1625 pSKEGGAVKLWD-QELRRCRAFRLETGQatdcVRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLA 1701
Cdd:cd00200 153 -SSQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH-ENGVNSVA 226
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462539124 1702 THPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHAAR--TVCYSPEGDMVAIGMKNGefiillvsSLKIW 1770
Cdd:cd00200 227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSADG--------TIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
228-592 |
1.69e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 93.94 E-value: 1.69e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 228 LIRTIQGaHAAGIFSM--NACEEGFATGGRDGCIRLWDLTFKpitviDLRETDQGYKGlSVRSVCWRGDH--ILVGTQDS 303
Cdd:cd00200 1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADGtyLASGSSDK 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 304 EIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCNM-EEPIRCAAVNADGIHLA 382
Cdd:cd00200 74 TIR-LWDLETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 383 LGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVAQRyKKVGECLGSLSFITHLDWSSD 462
Cdd:cd00200 152 SSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSPD 230
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 463 SRYLqtndgngkrlfyrmpggkevtsteeikgvhwaswtcvsglevngiwpkysdindinsvdgnyigqvlVTADDYGII 542
Cdd:cd00200 231 GYLL-------------------------------------------------------------------ASGSEDGTI 243
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 2462539124 543 KLFRypcLRKGAKFRKYIGHSAHVTNVRWSHDYQWVISiGGADHSVFQWK 592
Cdd:cd00200 244 RVWD---LRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
895-1140 |
3.26e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 93.17 E-value: 3.26e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAIKRAALApgskglllednpsiRAISLGHGH-I 971
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDLETGELLRTLKGHTGPVR--------------DVAASADGTyL 66
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 972 LVGTKNGEILEVDKSGP---ITLlvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRKLKKGGRCCCF 1048
Cdd:cd00200 67 ASGSSDKTIRLWDLETGecvRTL--TGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAF 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1049 SPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKGATSYI 1128
Cdd:cd00200 144 SPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGV 222
|
250
....*....|..
gi 2462539124 1129 THIDWDIRGKLL 1140
Cdd:cd00200 223 NSVAFSPDGYLL 234
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-49 |
7.14e-19 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 82.60 E-value: 7.14e-19
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2462539124 2 AARSAPSCHLRLEWVYGYRGHQCRNNLYYTAAKEIVYFVAGVGVVYSP 49
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
672-715 |
9.17e-18 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 79.13 E-value: 9.17e-18
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2462539124 672 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 715
Cdd:pfam03451 29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1410-1722 |
1.13e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 85.85 E-value: 1.13e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1410 YQEHNDDILCLTVNQHPKFInivATGqvgdSADmsataPSIHIWDaMNKQTLSILRCYHSKGVCSVSFSATGKLLLSVGL 1489
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLL---ATG----SGD-----GTIKVWD-LETGELLRTLKGHTGPVRDVAASADGTYLASGSS 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1490 DpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdtQFVSVGVKH--VKFWTLAgrallsKKGLLSTLEDARMQTMlA 1567
Cdd:cd00200 72 D--KTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVE------TGKCLTTLRGHTDWVN-S 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1568 IAF-GANNLTFTGTISGDVCVWKDHIL-CRIVARAHNGPVFAMYTTLRDGLIVTGGkerpskEGGAVKLWDQELRRCRA- 1644
Cdd:cd00200 141 VAFsPDGTFVASSSQDGTIKLWDLRTGkCVATLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLGt 214
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462539124 1645 FRLETGQATDCVRSvcrGKGKILVGTRNAEIIEVGE-KNAACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1722
Cdd:cd00200 215 LRGHENGVNSVAFS---PDGYLLASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1355-1402 |
1.35e-14 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 70.27 E-value: 1.35e-14
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2462539124 1355 KKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1402
Cdd:pfam03451 26 KKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1656-1917 |
2.80e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 72.75 E-value: 2.80e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1656 VRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVN 1733
Cdd:cd00200 12 VTCVAfsPDGKLLATGSGDGTIKVWDLETGELLRTLKGH-TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1734 lGHAA--RTVCYSPEGDMVAIGMKNGefiillvsSLKIW----GKK----RDRRCAIHDIRFSPDSRYLAVGSSENSVDF 1803
Cdd:cd00200 91 -GHTSyvSSVAFSPDGRILSSSSRDK--------TIKVWdvetGKClttlRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1804 YDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGKHLMD-HAAIDRITWATW-------TSIL 1875
Cdd:cd00200 162 WDLRTGKCVATL---TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTlRGHENGVNSVAFspdgyllASGS 238
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 2462539124 1876 GDEVLGIWSRHAEK---------ADVNCACVSHSGISLVTGDDFGMVKLFD 1917
Cdd:cd00200 239 EDGTIRVWDLRTGEcvqtlsghtNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1689-1945 |
4.10e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 71.98 E-value: 4.10e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1689 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1766
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1767 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1837
Cdd:cd00200 75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1838 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1915
Cdd:cd00200 150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
|
250 260 270
....*....|....*....|....*....|....
gi 2462539124 1916 FDFPCPEKFVSLC----FVYYYQFTPNFDVLSSA 1945
Cdd:cd00200 204 WDLSTGKCLGTLRghenGVNSVAFSPDGYLLASG 237
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
46-178 |
3.48e-12 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 69.29 E-value: 3.48e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 46 VYSPREHR-QKFYRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSV 124
Cdd:cd00200 161 LWDLRTGKcVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASG 237
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 2462539124 125 GLDSKnaVCVWDWKRGKMLSMAPGHTDRIFDISWDlYQPNKLVSCGV-KHIKFWS 178
Cdd:cd00200 238 SEDGT--IRVWDLRTGECVQTLSGHTNSVTSLAWS-PDGKRLASGSAdGTIRIWD 289
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
1741-1823 |
1.41e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 45.35 E-value: 1.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1741 VCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRR-CAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCK 1819
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDKEdLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80
|
....
gi 2462539124 1820 DIPS 1823
Cdd:pfam12894 81 DLIT 84
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1701-1917 |
1.86e-04 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 46.06 E-value: 1.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1701 ATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVC-YSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCA 1779
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLaASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1780 IHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKH-- 1856
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL---TGHTGAVRSVAFSPDGKTL-ASGSADGTvRLWDLATGKLlr 156
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462539124 1857 -LMDHA-AIDRITWA----TWTSILGDEVLGIWSRHAEK---------ADVNCACVSHSGISLVTGDDFGMVKLFD 1917
Cdd:COG2319 157 tLTGHSgAVTSVAFSpdgkLLASGSDDGTVRLWDLATGKllrtltghtGAVRSVAFSPDGKLLASGSADGTVRLWD 232
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1690-1722 |
3.00e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.02 E-value: 3.00e-04
10 20 30
....*....|....*....|....*....|...
gi 2462539124 1690 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1722
Cdd:pfam00400 8 EGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
760-802 |
3.71e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 3.71e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2462539124 760 TIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:smart00320 1 SGELLKTLKGHTGP-VTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1690-1722 |
3.98e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 3.98e-04
10 20 30
....*....|....*....|....*....|...
gi 2462539124 1690 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1722
Cdd:smart00320 9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
314-353 |
1.18e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 1.18e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 2462539124 314 NKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:pfam00400 1 GKLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WDR74 |
cd22857 |
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ... |
250-354 |
1.91e-03 |
|
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage step of the pre-rRNA processing pathway. NVL2 is a type II double-ring AAA-ATPase that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness by inducing nuclear beta-catenin accumulation and driving expression of downstream Wnt-responsive genes. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain and is required for nucleolar localization.
Pssm-ID: 439303 [Multi-domain] Cd Length: 325 Bit Score: 42.60 E-value: 1.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 250 FATGGRDGCIRLWDLTF--KPITVIDLRETdqGYKGLSVRSvcwRGDHILVGTQDSEIFEIVVQErNKPFLIMQGHCEGE 327
Cdd:cd22857 195 IVTGTGYHQVRLYDTRAqrRPVVSVDFGET--PIKAVAEDP---DGHTVYVGDTSGDLASIDLRT-GKLLGCFKGKCGGS 268
|
90 100
....*....|....*....|....*..
gi 2462539124 328 LWALAVHPTKPLAVTGSDDRSVRIWSL 354
Cdd:cd22857 269 IRSIARHPELPLIASCGLDRYLRIWDT 295
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
315-353 |
2.10e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.68 E-value: 2.10e-03
10 20 30
....*....|....*....|....*....|....*....
gi 2462539124 315 KPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
994-1026 |
3.05e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 3.05e-03
10 20 30
....*....|....*....|....*....|...
gi 2462539124 994 QGHmEGEVWGLATHPYLPICATVSDDKTLRIWD 1026
Cdd:smart00320 9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
762-802 |
4.54e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 4.54e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2462539124 762 KPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:pfam00400 2 KLLKTLEGHTG-SVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1698-1808 |
9.74e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 40.06 E-value: 9.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462539124 1698 WGLATHPSRDF-FLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVCYSPEGDMVAIGMKNGEFIILLVS-----SLKIWg 1771
Cdd:COG3391 113 RGLAVDPDGGRlYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVSVIVSvidtaTGKVV- 191
|
90 100 110
....*....|....*....|....*....|....*...
gi 2462539124 1772 KKRDRRCAIHDIRFSPDSRYLAVGSSE-NSVDFYDLTL 1808
Cdd:COG3391 192 ATIPVGGGPVGVAVSPDGRRLYVANRGsNTSNGGSNTV 229
|
|
|