|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
24-109 |
6.44e-68 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 219.22 E-value: 6.44e-68
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 24 FKFTISESCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLNAICAQVIPFLS 103
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
....*.
gi 544224626 104 QEQQLQ 109
Cdd:pfam03920 81 QEHQQQ 86
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
461-746 |
2.17e-44 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 164.70 E-value: 2.17e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 461 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAa 539
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA- 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 540 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 619
Cdd:COG2319 193 -TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 620 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPDK-YQLHLHESCVLSLKFAHCGKWFVSTGKD 697
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 544224626 698 NLLNAWRTPYGASIFQSKE-SSSVLSCDISVDDKYIVTGSGDKKATVYEV 746
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
461-745 |
2.79e-39 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 147.10 E-value: 2.79e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 461 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCRLLPDGRTLIVGGEASTLSIWDLaa 539
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 540 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 619
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 620 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPD-KYQLHLHESCVLSLKFAHCGKWFVSTGKD 697
Cdd:cd00200 161 LWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSED 240
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 544224626 698 NLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 745
Cdd:cd00200 241 GTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
496-744 |
2.73e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 54.32 E-value: 2.73e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 496 KSPVSQLDCLNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAAPtprIKAELTSSAPACYALAISPDSKVCF------- 568
Cdd:PLN00181 471 KADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRSKLSGICWnsyiksq 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 569 --SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-C 644
Cdd:PLN00181 548 vaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 645 PTGEWLAVGMENSNVEVLHVTKPdkyQLHL-----HESCVLSLKFAHCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSS 719
Cdd:PLN00181 628 ESGRSLAFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHS 703
|
250 260 270
....*....|....*....|....*....|..
gi 544224626 720 VLS-------CDISVDDKYIVTGSGDKKATVY 744
Cdd:PLN00181 704 FMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
583-622 |
3.97e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.92 E-value: 3.97e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 544224626 583 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 622
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
585-622 |
4.77e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.87 E-value: 4.77e-06
10 20 30
....*....|....*....|....*....|....*...
gi 544224626 585 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 622
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
24-109 |
6.44e-68 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 219.22 E-value: 6.44e-68
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 24 FKFTISESCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLNAICAQVIPFLS 103
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
....*.
gi 544224626 104 QEQQLQ 109
Cdd:pfam03920 81 QEHQQQ 86
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
461-746 |
2.17e-44 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 164.70 E-value: 2.17e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 461 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAa 539
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA- 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 540 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 619
Cdd:COG2319 193 -TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 620 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPDK-YQLHLHESCVLSLKFAHCGKWFVSTGKD 697
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 544224626 698 NLLNAWRTPYGASIFQSKE-SSSVLSCDISVDDKYIVTGSGDKKATVYEV 746
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
428-746 |
3.30e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 155.84 E-value: 3.30e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 428 VSADGQMQPVPFPPDALIGPGIPRHARQINTLNHGEVVCAVTISNPTRHVYTGGKGCVKVWDISHPGNKSPVSQLdclnR 507
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLG----H 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 508 DNYIRSCRLLPDGRTLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLV 587
Cdd:COG2319 78 TAAVLSVAFSPDGRLLASASADGTVRLWDLA--TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 588 RQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ---HdfTSQIFSLGYCPTGEWLAVGMENSNVEVLHV 664
Cdd:COG2319 156 RTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTltgH--TGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 665 -TKPDKYQLHLHESCVLSLKFAHCGKWFVSTGKDNLLNAWRTPYGASI-FQSKESSSVLSCDISVDDKYIVTGSGDKKAT 742
Cdd:COG2319 234 aTGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
|
....
gi 544224626 743 VYEV 746
Cdd:COG2319 314 LWDL 317
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
454-706 |
2.82e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 153.14 E-value: 2.82e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 454 RQINTLN-HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEAST 531
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 532 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWT 611
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 612 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMENSNVEVLHV-TKPDKYQLHLHESCVLSLKFAHCGK 689
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLaTGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 544224626 690 WFVSTGKDNLLNAWRTP 706
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
461-745 |
2.79e-39 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 147.10 E-value: 2.79e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 461 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCRLLPDGRTLIVGGEASTLSIWDLaa 539
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 540 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 619
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 620 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEVLHVTKPD-KYQLHLHESCVLSLKFAHCGKWFVSTGKD 697
Cdd:cd00200 161 LWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSED 240
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 544224626 698 NLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 745
Cdd:cd00200 241 GTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
509-746 |
8.14e-30 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 119.75 E-value: 8.14e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 509 NYIRSCRLLPDGRTLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVR 588
Cdd:cd00200 10 GGVTCVAFSPDGKLLATGSGDGTIKVWDLE--TGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 589 QFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMENSNVEV--LHVT 665
Cdd:cd00200 88 TLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTlRGHTDWVNSVAFSPDGTFVASSSQDGTIKLwdLRTG 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 666 KPdKYQLHLHESCVLSLKFAHCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISVDDKYIVTGSGDKKATV 743
Cdd:cd00200 168 KC-VATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLgtLRGHE-NGVNSVAFSPDGYLLASGSEDGTIRV 245
|
...
gi 544224626 744 YEV 746
Cdd:cd00200 246 WDL 248
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
586-746 |
2.48e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.09 E-value: 2.48e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 586 LVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMENSNVEVL 662
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 663 HVTKPDK-YQLHLHESCVLSLKFAHCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISVDDKYIVTGSGDK 739
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 544224626 740 KATVYEV 746
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
454-583 |
1.21e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 73.41 E-value: 1.21e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 454 RQINTLN-HGEVVCAVTISNPTRHVYTGGKGC-VKVWDIShpgNKSPVSQLDclNRDNYIRSCRLLPDGRTLIVGGEAST 531
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 544224626 532 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHN 583
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
496-744 |
2.73e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 54.32 E-value: 2.73e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 496 KSPVSQLDCLNRDNYIRSCRLLPDGRTLIVGGEASTLSIWDLAAPtprIKAELTSSAPACYALAISPDSKVCF------- 568
Cdd:PLN00181 471 KADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRSKLSGICWnsyiksq 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 569 --SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-C 644
Cdd:PLN00181 548 vaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 645 PTGEWLAVGMENSNVEVLHVTKPdkyQLHL-----HESCVLSLKFAHCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSS 719
Cdd:PLN00181 628 ESGRSLAFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHS 703
|
250 260 270
....*....|....*....|....*....|..
gi 544224626 720 VLS-------CDISVDDKYIVTGSGDKKATVY 744
Cdd:PLN00181 704 FMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
583-622 |
3.97e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.92 E-value: 3.97e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 544224626 583 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 622
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
585-622 |
4.77e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.87 E-value: 4.77e-06
10 20 30
....*....|....*....|....*....|....*...
gi 544224626 585 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 622
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
541-580 |
9.58e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 9.58e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 544224626 541 TPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWD 580
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
546-627 |
1.01e-03 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 41.98 E-value: 1.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 546 AELTSSAPACYALAISPDSKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTV 618
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 544224626 619 RSWDLREGR 627
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
| COG5276 |
COG5276 |
Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function ... |
476-622 |
6.59e-03 |
|
Uncharacterized secreted protein, contains LVIVD repeats, choice-of-anchor domain [Function unknown];
Pssm-ID: 444087 [Multi-domain] Cd Length: 320 Bit Score: 39.54 E-value: 6.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 544224626 476 HVYTG-GKGCVKVWDISHPGNKSPVSQLDCLNRDNYirscRLLPDGRTLIVGGEAST-LSIWDLAAPT-PRIKAELTSSA 552
Cdd:COG5276 31 YAYVAgGSNGLAIVDVSDPANPVLVGSLPTPGGTWR----DVKVSGDYLYVASEGSEgLQIFDISDPAnPKLVGRYDTGG 106
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 544224626 553 PACYALAISpDSKVCFSCCSDGNIAVWDLHNQT---LVRQFQgHTDGASCIDISNDGTKLWTGGLDNTVRSWD 622
Cdd:COG5276 107 SGAHNIAVD-GNYAYVAGGSDNGLVIVDISDPTnpvLVGRYS-LPGQAYLHDVQVVGDYAYVADWEDGLVIVD 177
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
556-580 |
9.94e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 34.63 E-value: 9.94e-03
|
|