Conserved domains on  [gi|2462573201|ref|XP_054197999|]
echinoderm microtubule-associated protein-like 6 isoform X8 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
57-397 8.74e-40

WD40 repeat [General function prediction only];



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 152.76  E-value: 8.74e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319    116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319    191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319    225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319    298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
                          330       340       350
                   ....*....|....*....|....*....|
gi 2462573201  368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319    374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
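Each hit carries a one-line statistics record of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. As a minimal sketch, assuming the record always appears on a single line in that field order, it could be parsed as follows. The names `HitStats` and `parse_hit_stats` are hypothetical and are not part of any NCBI tool.

```python
import re
from typing import NamedTuple


class HitStats(NamedTuple):
    """Fields of one 'Pssm-ID: ...' statistics line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumption: fields always appear in this order on a single line.
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.]+(?:e[+-]?\d+)?)",
    re.IGNORECASE,
)


def parse_hit_stats(line: str) -> HitStats:
    """Parse one statistics line; raise ValueError if it does not match."""
    m = _STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a Pssm-ID statistics line: {line!r}")
    return HitStats(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["bits"]),
        e_value=float(m["evalue"]),
    )


if __name__ == "__main__":
    line = ("Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  "
            "Bit Score: 152.76  E-value: 8.74e-40")
    print(parse_hit_stats(line))
```

Keeping the `[Multi-domain]` tag optional lets the same pattern handle records such as the HELP hits, which lack it.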
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
725-1026 3.20e-36

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 139.01  E-value: 3.20e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200      7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200     82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200    154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462573201  955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200    234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
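The WD40 description for this hit says a repeat typically has a GH dipeptide near its N-terminus and a WD dipeptide at its C-terminus. The sketch below is a crude illustration of that statement only: it scans a sequence for a GH followed by a WD within a configurable gap. The function name `find_gh_wd_candidates` and the default gap bounds are assumptions chosen for illustration, and a plain pattern scan like this is not how CD-Search detects domains, since CD-Search scores position-specific matrices.

```python
import re
from typing import List, Tuple


def find_gh_wd_candidates(
    seq: str, min_gap: int = 10, max_gap: int = 35
) -> List[Tuple[int, int]]:
    """Return 0-based half-open (start, end) spans that begin with 'GH'
    and end with the nearest 'WD' lying min_gap..max_gap residues later.

    The gap bounds are hypothetical defaults, not values taken from CDD.
    """
    seq = seq.upper()  # alignment displays use lowercase for unaligned residues
    # A lookahead lets overlapping candidates be reported; the lazy
    # quantifier picks the closest 'WD' after each 'GH'.
    pattern = re.compile(r"(?=(GH[A-Z]{%d,%d}?WD))" % (min_gap, max_gap))
    return [(m.start(1), m.end(1)) for m in pattern.finditer(seq)]


if __name__ == "__main__":
    # Query residues 57-95 from the first COG2319 alignment block above.
    segment = "FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNV"
    for start, end in find_gh_wd_candidates(segment):
        print(start, end, segment.upper()[start:end])
```

Many genuine repeats lack a strict GH or WD, so an empty result from this scan says little about whether a WD40 domain is present.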
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
2-48 3.00e-20

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.



Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 3.00e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2462573201    2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
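Percent identity is not reported in the hit statistics, but it can be estimated from a pairwise block such as the HELP alignment above. The sketch below counts columns where both rows carry an uppercase residue and then counts how many of those match. It assumes the display convention seen in these alignments, where lowercase letters and `-` mark unaligned or gapped positions. The function name `alignment_identity` is hypothetical.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Only columns where both rows hold an uppercase letter are treated as
    aligned; lowercase letters and '-' are skipped (assumed convention).
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Rows copied from the pfam03451 alignment for query residues 2-48.
    query = "ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN"
    subject = "QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD"
    identical, aligned = alignment_identity(query, subject)
    print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

For a hit that spans several display blocks, concatenate the query rows and the subject rows before calling the function.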
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
668-715 3.12e-20

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.



Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 3.12e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2462573201  668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
521-826 5.88e-15

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 76.60  E-value: 5.88e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200     12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200     81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200    137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462573201  753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200    201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
COG4946 super family cl27624
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
368-488 2.86e-05

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


The actual alignment was detected with superfamily member COG4946:

Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 48.50  E-value: 2.86e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946    344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462573201  443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946    424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
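Several intervals in the hit list above overlap, for example 521-826 and 725-1026. As a minimal sketch, the `Interval E-value` rows can be parsed and checked pairwise for overlap. It assumes each row has the form `start-end e-value` as shown in this listing. The names `Hit`, `parse_interval_row` and `overlapping_pairs` are hypothetical.

```python
import re
from itertools import combinations
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    start: int      # first query residue of the hit (1-based, inclusive)
    end: int        # last query residue of the hit (inclusive)
    e_value: float


# Assumption: a row looks like "57-397 8.74e-40".
_ROW_RE = re.compile(r"^\s*(\d+)-(\d+)\s+([\d.]+e[+-]?\d+)\s*$", re.IGNORECASE)


def parse_interval_row(row: str) -> Hit:
    """Parse one 'start-end e-value' row; raise ValueError on mismatch."""
    m = _ROW_RE.match(row)
    if m is None:
        raise ValueError(f"not an interval row: {row!r}")
    return Hit(int(m.group(1)), int(m.group(2)), float(m.group(3)))


def overlapping_pairs(hits: List[Hit]) -> List[Tuple[Hit, Hit]]:
    """Return every pair of hits whose inclusive intervals share a residue."""
    return [
        (a, b)
        for a, b in combinations(hits, 2)
        if a.start <= b.end and b.start <= a.end
    ]


if __name__ == "__main__":
    rows = [
        "57-397 8.74e-40",
        "725-1026 3.20e-36",
        "2-48 3.00e-20",
        "668-715 3.12e-20",
        "521-826 5.88e-15",
        "368-488 2.86e-05",
    ]
    hits = [parse_interval_row(r) for r in rows]
    for a, b in overlapping_pairs(hits):
        print(f"{a.start}-{a.end} overlaps {b.start}-{b.end}")
```

Overlap between hits from different models is expected in this kind of listing, so the check is a way to organize the rows rather than a sign of error.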
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
57-353 2.50e-36

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 139.39  E-value: 2.50e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200     80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200    152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462573201  289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200    225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
725-1026 3.20e-36

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 139.01  E-value: 3.20e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200      7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200     82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200    154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462573201  955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200    234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
718-1028 4.98e-28

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 118.09  E-value: 4.98e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  718 HSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHS 797
Cdd:COG2319    111 LLLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  798 IVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKFWQQAGGGF-TSKRGTFGSVgkletmMCVSY---G 872
Cdd:COG2319    186 VRLWDLATGKLLRTLTGHTGAVRSVAFSPDG-KLLASGSAdGTVRLWDLATGKLlRTLTGHSGSV------RSVAFspdG 258
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  873 RmedLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDDMFERCLKTYA---IKRSA 945
Cdd:COG2319    259 R---LLASGSADGTVRLWdlATGELLRTLTGHSGGVNSVaFSPDgKLLASGSDDGTVRLWDLATGKLLRTLTghtGAVRS 335
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  946 LSTSSKGlllednpsiraitlghGHILVGTKNGEILEIDKSGPMTLLV-QGHmEGEVWGLAAHP---LLpicATVSDDKT 1021
Cdd:COG2319    336 VAFSPDG----------------KTLASGSDDGTVRLWDLATGELLRTlTGH-TGAVTSVAFSPdgrTL---ASGSADGT 395

                   ....*..
gi 2462573201 1022 LRIWELS 1028
Cdd:COG2319    396 VRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
521-826 5.88e-15

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 76.60  E-value: 5.88e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200     12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200     81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200    137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462573201  753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200    201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
368-488 2.86e-05

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 48.50  E-value: 2.86e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946    344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462573201  443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946    424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
105-136 1.59e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 1.59e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2462573201   105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320   11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
315-353 3.80e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.87  E-value: 3.80e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2462573201  315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400    2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
760-802 5.90e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.45  E-value: 5.90e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 2462573201   760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320    1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
762-802 8.29e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 35.01  E-value: 8.29e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2462573201  762 KCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:pfam00400    2 KLLKTLEG-HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
57-466 1.23e-39

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 152.37  E-value: 1.23e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319     74 LLGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgkTGDLqtilcLAcakeditySGAL 216
Cdd:COG2319    149 LATGKLLRTLTGHSGAVTSVAFSP---------------------------------DGKL-----LA--------SGSD 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpitkidlreteqgykglsirsvcwk 292
Cdd:COG2319    183 DGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA--------------------------- 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  293 adrllagtqdseifevivreRDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEE-AVRS 371
Cdd:COG2319    235 --------------------TGKLLRTLTGH-SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSgGVNS 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  372 VAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGECSKSL 451
Cdd:COG2319    294 VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAT-----GELLRTL 368
                          410
                   ....*....|....*....
gi 2462573201  452 ----SFITHIDWSLDSKYL 466
Cdd:COG2319    369 tghtGAVTSVAFSPDGRTL 387
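Every alignment row in these blocks has the shape `label start residues end`. One quick sanity check on a copied row is that the number of residues (gaps excluded, lowercase included) equals `end - start + 1`. The sketch below illustrates that check in Python; `parse_row` and `row_is_consistent` are hypothetical names, and the pattern assumes labels of the form `gi <number>` or `Cdd:<accession>` as they appear here.

```python
import re
from typing import Tuple

_ROW_RE = re.compile(
    r"^\s*(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+"
    r"(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_row(line: str) -> Tuple[str, int, str, int]:
    """Split one alignment row into (label, start, residues, end)."""
    m = _ROW_RE.match(line)
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return m["label"], int(m["start"]), m["seq"], int(m["end"])


def row_is_consistent(line: str) -> bool:
    """True when the non-gap residue count matches the printed coordinates."""
    _, start, seq, end = parse_row(line)
    residues = sum(1 for ch in seq if ch != "-")
    return residues == end - start + 1


if __name__ == "__main__":
    # Final block of the COG2319 hit covering query residues 57-466.
    for row in (
        "gi 2462573201  452 ----SFITHIDWSLDSKYL 466",
        "Cdd:COG2319    369 tghtGAVTSVAFSPDGRTL 387",
    ):
        print(parse_row(row)[0], row_is_consistent(row))
```

A row that fails this check has usually been truncated or wrapped during copying, which is worth ruling out before computing anything from the alignment.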
WD40 COG2319
WD40 repeat [General function prediction only];
57-438 3.47e-37

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 145.05  E-value: 3.47e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319     32 LLGLAAAVASLAASPDGARLAAGAGDLT--LLLLDAAAGALLATLLG-HTAAVLSVAFSPDGRLLASASAD--GTVRLWD 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  137 WRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWtlcgNALTAKRgIFGKTGDLQTILCLAC-AKEDITYSG 214
Cdd:COG2319    107 LATGLLLRTLTGHTGAVRSVAFSP-DGKTLASGSAdGTVRLW----DLATGKL-LRTLTGHSGAVTSVAFsPDGKLLASG 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  215 ALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpiTKiDLRETEQGYKGlSIRSVC 290
Cdd:COG2319    181 SDDGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA----TG-KLLRTLTGHSG-SVRSVA 253
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  291 WKAD--RLLAGTQDSEIfEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEA 368
Cdd:COG2319    254 FSPDgrLLASGSADGTV-RLWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTG 331
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462573201  369 -VRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA 438
Cdd:COG2319    332 aVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
57-353 2.50e-36

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 139.39  E-value: 2.50e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200     80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200    152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462573201  289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200    225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
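The cd00200 description characterizes a WD40 repeat by a GH dipeptide, a conserved core, and a closing WD dipeptide. As a rough illustration only, the Python sketch below scans a sequence for GH...WD spans. The core-length bounds `min_core` and `max_core` are assumptions chosen for this example and do not come from the description, and `find_gh_wd_cores` is a hypothetical helper. Real repeat detection relies on profile models such as the ones listed here, not on exact dipeptides.

```python
import re
from typing import List, Tuple


def find_gh_wd_cores(
    sequence: str, min_core: int = 20, max_core: int = 40
) -> List[Tuple[int, int, int]]:
    """Return (start, end, core_length) for each GH...WD span.

    Positions are 0-based with an exclusive end and refer to the sequence
    after gap characters have been removed and letters uppercased.
    """
    seq = sequence.replace("-", "").upper()
    # Lazy quantifier: stop at the first WD that gives an allowed core length.
    pattern = re.compile(r"GH(.{%d,%d}?)WD" % (min_core, max_core))
    return [(m.start(), m.end(), len(m.group(1))) for m in pattern.finditer(seq)]


if __name__ == "__main__":
    # pfam00400 model row and the matching query row (residues 315-353).
    model = "KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD"
    query = "KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS"
    print("model:", find_gh_wd_cores(model))
    print("query:", find_gh_wd_cores(query))
```

The query segment ends in WS rather than WD, so an exact-dipeptide scan finds nothing there even though the profile search reports a hit. That gap is the main reason to treat this pattern as a teaching aid and not as a detector.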
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
725-1026 3.20e-36

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 139.01  E-value: 3.20e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200      7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200     82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200    154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462573201  955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200    234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
274-844 9.74e-30

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 123.10  E-value: 9.74e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  274 LRETEQGYKGLSIRSVCWKADRLLAGTQDSEIFEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:COG2319     28 LLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTVRLWD 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  354 LADHALIARCNM-EEAVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPV 432
Cdd:COG2319    107 LATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTV 186
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  433 DVYAVAQrykkiGECSKSL----SFITHIDWSLDSKYLqtndgagerlfyrmpsgkpltskeeikgipwaswtcvkgpev 508
Cdd:COG2319    187 RLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLL------------------------------------------ 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  509 sgiwpkytevtdinsvdanynssvlVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSV 588
Cdd:COG2319    220 -------------------------ASGSADGTVRLWD---LATGKLLRTLTGHSGSVRSVAFSPDGRLLAS-GSADGTV 270
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  589 FQWRfipegvsngmletapqeggadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflk 668
Cdd:COG2319    271 RLWD---------------------------------------------------------------------------- 274
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  669 rekapedslklqfihgyrgydcrnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvG 748
Cdd:COG2319    275 ----------------------------LATGELL-----------------RTLTGHSGGVNSVAFSPDGKLLASG--S 307
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  749 RDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPHH 828
Cdd:COG2319    308 DDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRLWDLATGELLRTLTGHTGAVTSVAFSPDG 384
                          570
                   ....*....|....*..
gi 2462573201  829 vDKLVTVGI-KHIKFWQ 844
Cdd:COG2319    385 -RTLASGSAdGTVRLWD 400
WD40 COG2319
WD40 repeat [General function prediction only];
718-1028 4.98e-28

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 118.09  E-value: 4.98e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  718 HSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHS 797
Cdd:COG2319    111 LLLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  798 IVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKFWQQAGGGF-TSKRGTFGSVgkletmMCVSY---G 872
Cdd:COG2319    186 VRLWDLATGKLLRTLTGHTGAVRSVAFSPDG-KLLASGSAdGTVRLWDLATGKLlRTLTGHSGSV------RSVAFspdG 258
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  873 RmedLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDDMFERCLKTYA---IKRSA 945
Cdd:COG2319    259 R---LLASGSADGTVRLWdlATGELLRTLTGHSGGVNSVaFSPDgKLLASGSDDGTVRLWDLATGKLLRTLTghtGAVRS 335
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  946 LSTSSKGlllednpsiraitlghGHILVGTKNGEILEIDKSGPMTLLV-QGHmEGEVWGLAAHP---LLpicATVSDDKT 1021
Cdd:COG2319    336 VAFSPDG----------------KTLASGSDDGTVRLWDLATGELLRTlTGH-TGAVTSVAFSPdgrTL---ASGSADGT 395

                   ....*..
gi 2462573201 1022 LRIWELS 1028
Cdd:COG2319    396 VRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
723-929 2.49e-23

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 101.26  E-value: 2.49e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  723 YLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:cd00200     89 LTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQD--GTIKLWD 163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  803 WKKGEKIATTRGHKDKIFVVKCNPHHVDKLVTVGIKHIKFWQQAGGgftSKRGTFgsVGKLETMMCVSYGRMEDLVFSGA 882
Cdd:cd00200    164 LRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG---KCLGTL--RGHENGVNSVAFSPDGYLLASGS 238
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462573201  883 ATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD 929
Cdd:cd00200    239 EDGTIRVWdlRTGECVQTLSGHTNSVTSLAWSPDGkrLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
228-466 3.23e-21

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 95.09  E-value: 3.23e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  228 LVRTIQGaHSAGIFSM--YACEEGFATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWKAD--RLLAGTQDS 303
Cdd:cd00200      1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADgtYLASGSSDK 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  304 EIFevIVRERDKPML-ILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EEAVRSVAFSPDGSQL 381
Cdd:cd00200     74 TIR--LWDLETGECVrTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFV 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  382 ALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQRyKKIGECSKSLSFITHIDWSL 461
Cdd:cd00200    151 ASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSP 229

                   ....*
gi 2462573201  462 DSKYL 466
Cdd:cd00200    230 DGYLL 234
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
2-48 3.00e-20

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved among metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 3.00e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2462573201    2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
668-715 3.12e-20

Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 3.12e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2462573201  668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
WD40 COG2319
WD40 repeat [General function prediction only];
521-929 2.69e-19

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 91.51  E-value: 2.69e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWrfipegvsn 600
Cdd:COG2319     81 VLSVAFSPDGRLLASASADGTVRLWD---LATGLLLRTLTGHTGAVRSVAFSPDGKTLAS-GSADGTVRLW--------- 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  601 gmletapqeggadsyseesdsDLSDVPELDSDIEQEAQINydrqvykedlpqlkqqskeknhAVPFlkrekAPEdslklq 680
Cdd:COG2319    148 ---------------------DLATGKLLRTLTGHSGAVT----------------------SVAF-----SPD------ 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  681 fihGYR----GYDCRNNLFYTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVW 756
Cdd:COG2319    174 ---GKLlasgSDDGTVRLWDLATGKLL-----------------RTLTGHTGAVRSVAFSPDGKLLASG--SADGTVRLW 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  757 DTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPhhvD--KLVT 834
Cdd:COG2319    232 DLATGKLLRTLTG-HSGSVRSVAFSPDGRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSP---DgkLLAS 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  835 VGI-KHIKFWQQAGGgftSKRGTFGsvGKLETMMCVSYGRMEDLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAMYA 911
Cdd:COG2319    306 GSDdGTVRLWDLATG---KLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGTVRLWdlATGELLRTLTGHTGAVTSVAF 380
                          410       420
                   ....*....|....*....|
gi 2462573201  912 LDKG--FVTGGKDGIVELWD 929
Cdd:COG2319    381 SPDGrtLASGSADGTVRLWD 400
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
46-177 1.57e-16

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 81.23  E-value: 1.57e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201   46 VYNTREHS-QKFFLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASV 124
Cdd:cd00200    161 LWDLRTGKcVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASG 237
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462573201  125 GLDakNTVCIWDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFW 177
Cdd:cd00200    238 SED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP-DGKRLASGSAdGTIRIW 288
WD40 COG2319
WD40 repeat [General function prediction only];
55-179 2.23e-16

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 82.65  E-value: 2.23e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201   55 KFFLGHNDDIISLALHPDKTLVATGqvGKEPYICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCI 134
Cdd:COG2319    282 RTLTGHSGGVNSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRL 356
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2462573201  135 WDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWTL 179
Cdd:COG2319    357 WDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSAdGTVRLWDL 401
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
521-826 5.88e-15

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 76.60  E-value: 5.88e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200     12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200     81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200    137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462573201  753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200    201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
WD40 COG2319
WD40 repeat [General function prediction only];
743-1028 1.26e-10

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 64.93  E-value: 1.26e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  743 ATGQVGRDAAIHVWDTQTLKCLSLLKGQHQRGVcALDFSADGKCLVSVGLDDfhSIVFWDWKKGEKIATTRGHKDKIFVV 822
Cdd:COG2319      8 ALAAASADLALALLAAALGALLLLLLGLAAAVA-SLAASPDGARLAAGAGDL--TLLLLDAAAGALLATLLGHTAAVLSV 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  823 KCNPHHVDKLVTVGIKHIKFWQQAGGGF-TSKRGTFGSVgkletmMCVSY---GRMedlVFSGAATGDIFIW--KDILLL 896
Cdd:COG2319     85 AFSPDGRLLASASADGTVRLWDLATGLLlRTLTGHTGAV------RSVAFspdGKT---LASGSADGTVRLWdlATGKLL 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  897 KTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDDMFERCLKTyaikrsalstsskglLLEDNPSIRAITLGH-GHILV 973
Cdd:COG2319    156 RTLTGHSGAVTSVaFSPDgKLLASGSDDGTVRLWDLATGKLLRT---------------LTGHTGAVRSVAFSPdGKLLA 220
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  974 -GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELS 1028
Cdd:COG2319    221 sGSADGTVRLWDlATGKLLRTLTGH-SGSVRSVAFSPdgrLL---ASGSADGTVRLWDLA 276
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
895-1027 1.56e-05

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 48.10  E-value: 1.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  895 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 965
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462573201  966 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWEL 1027
Cdd:cd00200     77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDV 122
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
368-488 2.86e-05

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 48.50  E-value: 2.86e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946    344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462573201  443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946    424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
105-136 1.59e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 1.59e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2462573201   105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320   11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
315-353 3.80e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.87  E-value: 3.80e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2462573201  315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400    2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
760-802 5.90e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.45  E-value: 5.90e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 2462573201   760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320    1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
315-353 8.92e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.06  E-value: 8.92e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 2462573201   315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:smart00320    3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
105-136 9.68e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 37.71  E-value: 9.68e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2462573201  105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:pfam00400   10 HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
339-430 1.17e-03

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 41.99  E-value: 1.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  339 LAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQL-ALGMKDGSFIVLRVRDMTEVVHIKDRKEViHEMKFS 417
Cdd:COG3391     82 LYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLyVADSGNGRVSVIDTATGKVVATIPVGAGP-HGIAVD 160
                           90
                   ....*....|...
gi 2462573201  418 PDGSYLAVGSNDG 430
Cdd:COG3391    161 PDGKRLYVANSGS 173
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
329-425 1.42e-03

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 41.60  E-value: 1.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201  329 WALALHPK-KPLAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQLALGMKDGSFI-----VLRVRDMTEVV 402
Cdd:COG3391    113 RGLAVDPDgGRLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVsvivsVIDTATGKVVA 192
                           90       100
                   ....*....|....*....|...
gi 2462573201  403 HIkDRKEVIHEMKFSPDGSYLAV 425
Cdd:COG3391    193 TI-PVGGGPVGVAVSPDGRRLYV 214
WD40 pfam00400
WD domain, G-beta repeat;
762-802 8.29e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 35.01  E-value: 8.29e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2462573201  762 KCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:pfam00400    2 KLLKTLEG-HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
227-263 8.68e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 34.98  E-value: 8.68e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 2462573201   227 NLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWD 263
Cdd:smart00320    3 ELLKTLKG-HTGPVTSVAFSPDGkyLASGSDDGTIKLWD 40
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
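
The E-value threshold listed in the search parameters can be applied to the per-hit summary lines when post-processing a saved copy of this listing. What follows is a minimal sketch, assuming the summary lines keep the "Pssm-ID / Cd Length / Bit Score / E-value" layout shown above. The names `parse_summary` and `filter_hits` are hypothetical helpers, not part of any NCBI tool, and treating the threshold as inclusive is also an assumption.

```python
import re

# Layout of the per-hit summary line as it appears in this listing.
# Assumption: the field order and labels are stable across hits.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def filter_hits(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (inclusive by assumption)."""
    hits = [h for h in map(parse_summary, lines) if h is not None]
    return [h for h in hits if h["e_value"] <= threshold]


# Two summary lines copied from the listing above.
lines = [
    "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 76.60  E-value: 5.88e-15",
    "Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 34.98  E-value: 8.68e-03",
]
kept = filter_hits(lines)
strict = filter_hits(lines, threshold=1e-10)
```

With the default threshold of 0.01 both example lines are kept. Passing a stricter threshold, as in `strict`, leaves only the cd00200 hit.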
