|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
8.74e-40 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.76 E-value: 8.74e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 2462573201 368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1026 |
3.20e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 139.01 E-value: 3.20e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200 154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462573201 955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200 234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
3.00e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 3.00e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2462573201 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
3.12e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 3.12e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2462573201 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-826 |
5.88e-15 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 76.60 E-value: 5.88e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462573201 753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| COG4946 super family |
cl27624 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-488 |
2.86e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown]; The actual alignment was detected with superfamily member COG4946:
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.50 E-value: 2.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 2462573201 443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
8.74e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.76 E-value: 8.74e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 2462573201 368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
57-353 |
2.50e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.39 E-value: 2.50e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462573201 289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200 225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1026 |
3.20e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.01 E-value: 3.20e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200 154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462573201 955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200 234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
718-1028 |
4.98e-28 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 118.09 E-value: 4.98e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 718 HSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHS 797
Cdd:COG2319 111 LLLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 798 IVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKFWQQAGGGF-TSKRGTFGSVgkletmMCVSY---G 872
Cdd:COG2319 186 VRLWDLATGKLLRTLTGHTGAVRSVAFSPDG-KLLASGSAdGTVRLWDLATGKLlRTLTGHSGSV------RSVAFspdG 258
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 873 RmedLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDDMFERCLKTYA---IKRSA 945
Cdd:COG2319 259 R---LLASGSADGTVRLWdlATGELLRTLTGHSGGVNSVaFSPDgKLLASGSDDGTVRLWDLATGKLLRTLTghtGAVRS 335
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 946 LSTSSKGlllednpsiraitlghGHILVGTKNGEILEIDKSGPMTLLV-QGHmEGEVWGLAAHP---LLpicATVSDDKT 1021
Cdd:COG2319 336 VAFSPDG----------------KTLASGSDDGTVRLWDLATGELLRTlTGH-TGAVTSVAFSPdgrTL---ASGSADGT 395
|
....*..
gi 2462573201 1022 LRIWELS 1028
Cdd:COG2319 396 VRLWDLA 402
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
3.00e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 3.00e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2462573201 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
3.12e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 3.12e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2462573201 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-826 |
5.88e-15 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 76.60 E-value: 5.88e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462573201 753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-488 |
2.86e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.50 E-value: 2.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 2462573201 443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
105-136 |
1.59e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 1.59e-04
10 20 30
....*....|....*....|....*....|..
gi 2462573201 105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320 11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
315-353 |
3.80e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.87 E-value: 3.80e-04
10 20 30
....*....|....*....|....*....|....*....
gi 2462573201 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400 2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
760-802 |
5.90e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 5.90e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2462573201 760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320 1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
762-802 |
8.29e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 35.01 E-value: 8.29e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2462573201 762 KCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:pfam00400 2 KLLKTLEG-HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
8.74e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.76 E-value: 8.74e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 2462573201 368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-466 |
1.23e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.37 E-value: 1.23e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 74 LLGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgkTGDLqtilcLAcakeditySGAL 216
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSP---------------------------------DGKL-----LA--------SGSD 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpitkidlreteqgykglsirsvcwk 292
Cdd:COG2319 183 DGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA--------------------------- 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 293 adrllagtqdseifevivreRDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEE-AVRS 371
Cdd:COG2319 235 --------------------TGKLLRTLTGH-SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSgGVNS 293
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 372 VAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGECSKSL 451
Cdd:COG2319 294 VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAT-----GELLRTL 368
|
410
....*....|....*....
gi 2462573201 452 ----SFITHIDWSLDSKYL 466
Cdd:COG2319 369 tghtGAVTSVAFSPDGRTL 387
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-438 |
3.47e-37 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 145.05 E-value: 3.47e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 32 LLGLAAAVASLAASPDGARLAAGAGDLT--LLLLDAAAGALLATLLG-HTAAVLSVAFSPDGRLLASASAD--GTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 137 WRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWtlcgNALTAKRgIFGKTGDLQTILCLAC-AKEDITYSG 214
Cdd:COG2319 107 LATGLLLRTLTGHTGAVRSVAFSP-DGKTLASGSAdGTVRLW----DLATGKL-LRTLTGHSGAVTSVAFsPDGKLLASG 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 215 ALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpiTKiDLRETEQGYKGlSIRSVC 290
Cdd:COG2319 181 SDDGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA----TG-KLLRTLTGHSG-SVRSVA 253
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 291 WKAD--RLLAGTQDSEIfEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEA 368
Cdd:COG2319 254 FSPDgrLLASGSADGTV-RLWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTG 331
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462573201 369 -VRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA 438
Cdd:COG2319 332 aVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
57-353 |
2.50e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.39 E-value: 2.50e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462573201 289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200 225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1026 |
3.20e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.01 E-value: 3.20e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200 154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462573201 955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200 234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
274-844 |
9.74e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 123.10 E-value: 9.74e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 274 LRETEQGYKGLSIRSVCWKADRLLAGTQDSEIFEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:COG2319 28 LLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 354 LADHALIARCNM-EEAVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPV 432
Cdd:COG2319 107 LATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTV 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 433 DVYAVAQrykkiGECSKSL----SFITHIDWSLDSKYLqtndgagerlfyrmpsgkpltskeeikgipwaswtcvkgpev 508
Cdd:COG2319 187 RLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLL------------------------------------------ 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 509 sgiwpkytevtdinsvdanynssvlVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSV 588
Cdd:COG2319 220 -------------------------ASGSADGTVRLWD---LATGKLLRTLTGHSGSVRSVAFSPDGRLLAS-GSADGTV 270
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 589 FQWRfipegvsngmletapqeggadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflk 668
Cdd:COG2319 271 RLWD---------------------------------------------------------------------------- 274
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 669 rekapedslklqfihgyrgydcrnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvG 748
Cdd:COG2319 275 ----------------------------LATGELL-----------------RTLTGHSGGVNSVAFSPDGKLLASG--S 307
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 749 RDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPHH 828
Cdd:COG2319 308 DDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRLWDLATGELLRTLTGHTGAVTSVAFSPDG 384
|
570
....*....|....*..
gi 2462573201 829 vDKLVTVGI-KHIKFWQ 844
Cdd:COG2319 385 -RTLASGSAdGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
718-1028 |
4.98e-28 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 118.09 E-value: 4.98e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 718 HSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHS 797
Cdd:COG2319 111 LLLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 798 IVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKFWQQAGGGF-TSKRGTFGSVgkletmMCVSY---G 872
Cdd:COG2319 186 VRLWDLATGKLLRTLTGHTGAVRSVAFSPDG-KLLASGSAdGTVRLWDLATGKLlRTLTGHSGSV------RSVAFspdG 258
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 873 RmedLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDDMFERCLKTYA---IKRSA 945
Cdd:COG2319 259 R---LLASGSADGTVRLWdlATGELLRTLTGHSGGVNSVaFSPDgKLLASGSDDGTVRLWDLATGKLLRTLTghtGAVRS 335
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 946 LSTSSKGlllednpsiraitlghGHILVGTKNGEILEIDKSGPMTLLV-QGHmEGEVWGLAAHP---LLpicATVSDDKT 1021
Cdd:COG2319 336 VAFSPDG----------------KTLASGSDDGTVRLWDLATGELLRTlTGH-TGAVTSVAFSPdgrTL---ASGSADGT 395
|
....*..
gi 2462573201 1022 LRIWELS 1028
Cdd:COG2319 396 VRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
723-929 |
2.49e-23 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 101.26 E-value: 2.49e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 723 YLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:cd00200 89 LTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQD--GTIKLWD 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 803 WKKGEKIATTRGHKDKIFVVKCNPHHVDKLVTVGIKHIKFWQQAGGgftSKRGTFgsVGKLETMMCVSYGRMEDLVFSGA 882
Cdd:cd00200 164 LRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG---KCLGTL--RGHENGVNSVAFSPDGYLLASGS 238
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 2462573201 883 ATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD 929
Cdd:cd00200 239 EDGTIRVWdlRTGECVQTLSGHTNSVTSLAWSPDGkrLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
228-466 |
3.23e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.09 E-value: 3.23e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 228 LVRTIQGaHSAGIFSM--YACEEGFATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWKAD--RLLAGTQDS 303
Cdd:cd00200 1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADgtYLASGSSDK 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 304 EIFevIVRERDKPML-ILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EEAVRSVAFSPDGSQL 381
Cdd:cd00200 74 TIR--LWDLETGECVrTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFV 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 382 ALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQRyKKIGECSKSLSFITHIDWSL 461
Cdd:cd00200 151 ASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSP 229
|
....*
gi 2462573201 462 DSKYL 466
Cdd:cd00200 230 DGYLL 234
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
3.00e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 3.00e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2462573201 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
3.12e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 3.12e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2462573201 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
521-929 |
2.69e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 91.51 E-value: 2.69e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWrfipegvsn 600
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWD---LATGLLLRTLTGHTGAVRSVAFSPDGKTLAS-GSADGTVRLW--------- 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 601 gmletapqeggadsyseesdsDLSDVPELDSDIEQEAQINydrqvykedlpqlkqqskeknhAVPFlkrekAPEdslklq 680
Cdd:COG2319 148 ---------------------DLATGKLLRTLTGHSGAVT----------------------SVAF-----SPD------ 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 681 fihGYR----GYDCRNNLFYTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVW 756
Cdd:COG2319 174 ---GKLlasgSDDGTVRLWDLATGKLL-----------------RTLTGHTGAVRSVAFSPDGKLLASG--SADGTVRLW 231
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 757 DTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPhhvD--KLVT 834
Cdd:COG2319 232 DLATGKLLRTLTG-HSGSVRSVAFSPDGRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSP---DgkLLAS 305
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 835 VGI-KHIKFWQQAGGgftSKRGTFGsvGKLETMMCVSYGRMEDLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAMYA 911
Cdd:COG2319 306 GSDdGTVRLWDLATG---KLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGTVRLWdlATGELLRTLTGHTGAVTSVAF 380
|
410 420
....*....|....*....|
gi 2462573201 912 LDKG--FVTGGKDGIVELWD 929
Cdd:COG2319 381 SPDGrtLASGSADGTVRLWD 400
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
46-177 |
1.57e-16 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 81.23 E-value: 1.57e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 46 VYNTREHS-QKFFLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASV 124
Cdd:cd00200 161 LWDLRTGKcVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASG 237
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 2462573201 125 GLDakNTVCIWDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFW 177
Cdd:cd00200 238 SED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP-DGKRLASGSAdGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
55-179 |
2.23e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 82.65 E-value: 2.23e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 55 KFFLGHNDDIISLALHPDKTLVATGqvGKEPYICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCI 134
Cdd:COG2319 282 RTLTGHSGGVNSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRL 356
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 2462573201 135 WDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWTL 179
Cdd:COG2319 357 WDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSAdGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-826 |
5.88e-15 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 76.60 E-value: 5.88e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462573201 753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
743-1028 |
1.26e-10 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 64.93 E-value: 1.26e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 743 ATGQVGRDAAIHVWDTQTLKCLSLLKGQHQRGVcALDFSADGKCLVSVGLDDfhSIVFWDWKKGEKIATTRGHKDKIFVV 822
Cdd:COG2319 8 ALAAASADLALALLAAALGALLLLLLGLAAAVA-SLAASPDGARLAAGAGDL--TLLLLDAAAGALLATLLGHTAAVLSV 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 823 KCNPHHVDKLVTVGIKHIKFWQQAGGGF-TSKRGTFGSVgkletmMCVSY---GRMedlVFSGAATGDIFIW--KDILLL 896
Cdd:COG2319 85 AFSPDGRLLASASADGTVRLWDLATGLLlRTLTGHTGAV------RSVAFspdGKT---LASGSADGTVRLWdlATGKLL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 897 KTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDDMFERCLKTyaikrsalstsskglLLEDNPSIRAITLGH-GHILV 973
Cdd:COG2319 156 RTLTGHSGAVTSVaFSPDgKLLASGSDDGTVRLWDLATGKLLRT---------------LTGHTGAVRSVAFSPdGKLLA 220
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 974 -GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELS 1028
Cdd:COG2319 221 sGSADGTVRLWDlATGKLLRTLTGH-SGSVRSVAFSPdgrLL---ASGSADGTVRLWDLA 276
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
895-1027 |
1.56e-05 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 48.10 E-value: 1.56e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 895 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 965
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462573201 966 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWEL 1027
Cdd:cd00200 77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDV 122
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-488 |
2.86e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.50 E-value: 2.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 2462573201 443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
105-136 |
1.59e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 1.59e-04
10 20 30
....*....|....*....|....*....|..
gi 2462573201 105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320 11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
315-353 |
3.80e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.87 E-value: 3.80e-04
10 20 30
....*....|....*....|....*....|....*....
gi 2462573201 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400 2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
760-802 |
5.90e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 5.90e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2462573201 760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320 1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
315-353 |
8.92e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.06 E-value: 8.92e-04
10 20 30
....*....|....*....|....*....|....*....
gi 2462573201 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
105-136 |
9.68e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 37.71 E-value: 9.68e-04
10 20 30
....*....|....*....|....*....|..
gi 2462573201 105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:pfam00400 10 HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
339-430 |
1.17e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.99 E-value: 1.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 339 LAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQL-ALGMKDGSFIVLRVRDMTEVVHIKDRKEViHEMKFS 417
Cdd:COG3391 82 LYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLyVADSGNGRVSVIDTATGKVVATIPVGAGP-HGIAVD 160
|
90
....*....|...
gi 2462573201 418 PDGSYLAVGSNDG 430
Cdd:COG3391 161 PDGKRLYVANSGS 173
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
329-425 |
1.42e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.60 E-value: 1.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462573201 329 WALALHPK-KPLAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQLALGMKDGSFI-----VLRVRDMTEVV 402
Cdd:COG3391 113 RGLAVDPDgGRLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVsvivsVIDTATGKVVA 192
|
90 100
....*....|....*....|...
gi 2462573201 403 HIkDRKEVIHEMKFSPDGSYLAV 425
Cdd:COG3391 193 TI-PVGGGPVGVAVSPDGRRLYV 214
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
762-802 |
8.29e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 35.01 E-value: 8.29e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2462573201 762 KCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:pfam00400 2 KLLKTLEG-HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
227-263 |
8.68e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 34.98 E-value: 8.68e-03
10 20 30
....*....|....*....|....*....|....*....
gi 2462573201 227 NLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWD 263
Cdd:smart00320 3 ELLKTLKG-HTGPVTSVAFSPDGkyLASGSDDGTIKLWD 40
|
|
|