Conserved domains on [gi|2274520717|gb|UTT89837|]

hypothetical protein NDA17_007514 [Ustilago hordei]

List of domain hits

Name Accession Description Interval E-value
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
8-397 1.70e-79

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
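As a rough illustration of the repeat signature described above (a GH dipeptide followed, some distance later, by a WD dipeptide), the following Python sketch scans a sequence for GH...WD candidates. The function name `find_gh_wd_spans` and the gap bounds of 13 to 30 residues are assumptions chosen for illustration; they are not CDD parameters. Real WD40 detection relies on profile models such as those listed in this report, because many genuine repeats lack an exact GH or WD.

```python
import re


def find_gh_wd_spans(seq, min_gap=13, max_gap=30):
    """Return 1-based inclusive (start, end) spans of GH...WD candidates.

    min_gap and max_gap bound the number of residues between the GH and
    WD dipeptides. Both defaults are illustrative assumptions.
    """
    # A lookahead lets candidates that overlap each other all be reported.
    # The lazy quantifier picks the nearest WD within the allowed gap.
    pattern = re.compile(r"(?=(GH.{%d,%d}?WD))" % (min_gap, max_gap))
    seq = seq.upper()
    spans = []
    for match in pattern.finditer(seq):
        start = match.start() + 1               # convert to 1-based
        end = match.start() + len(match.group(1))
        spans.append((start, end))
    return spans


# Consensus segment of the smart00320 model as displayed in this report,
# with the single gap character removed.
consensus = "LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"
print(find_gh_wd_spans(consensus))
```

The query segment aligned to smart00320 in this report (residues 76-109) contains GH but ends in WK rather than WD, so a literal scan like this would miss it. That is why such a pattern is only a first-pass heuristic next to the profile-based hits.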


The actual alignment was detected with superfamily member PTZ00421:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 493  Bit Score: 257.51  E-value: 1.70e-79
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717   8 SKYRHVYPNVAKKEACYENVKVSNNAWD-TNLISANGTYISINWNASGGGAfaVLPINRPGKLPDIYPLCRGHTAAVLDT 86
Cdd:PTZ00421    4 SRFRHTQGVPARPDRHFLNVTPSTALWDcSNTIACNDRFIAVPWQQLGSTA--VLKHTDYGKLASNPPILLGQEGPIIDV 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717  87 ALNPFQDNVVASASDDGTIGLWKIEDCNYDQlEWSDkerernggvkdfePLARISGGGRKVGQVVWHPTASNLLAAATAD 166
Cdd:PTZ00421   82 AFNPFDPQKLFTASEDGTIMGWGIPEEGLTQ-NISD-------------PIVHLQGHTKKVGIVSFHPSAMNVLASAGAD 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 167 HVVKLFDVSHASTACSspsiaLRGFTDTIQSLDWDWSGTTLIATSRDRKIRTFDPRQGErPVQIADSHGGIKGARVIWCG 246
Cdd:PTZ00421  148 MVVNVWDVERGKAVEV-----IKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGT-IVSSVEAHASAKSQRCLWAK 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 247 DKDRAISTGFSKMSDRQMFLWDTNNLASgPLKQITLDASSGIIMPFW-SDNNIVFLAGKGDGNIRYYELEKDELHYLTES 325
Cdd:PTZ00421  222 RKDLIITLGCSKSQQRQIMLWDTRKMAS-PYSTVDLDQSSALFIPFFdEDTNLLYIGSKGEGNIRCFELMNERLTFCSSY 300
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2274520717 326 KSSEPQRGLTFVPRRFLNTEENEIAKAYKITGTTIQPVSFCVPRKA--ESFQSDIFPPAPSNVASLTAKDFFQG 397
Cdd:PTZ00421  301 SSVEPHKGLCMMPKWSLDTRKCEIARFYALTYHSLYTIQMLLPRKQadSELQVDVYPPTFADHPAITADEYFSG 374
PspA COG1842
Phage shock protein A [Transcription, Signal transduction mechanisms];
482-526 5.47e-04

Phage shock protein A [Transcription, Signal transduction mechanisms];



Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 41.35  E-value: 5.47e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 2274520717 482 AQVEDLKKQIEKLRRAVEERDTQIRHLEQENETLKAN------QEKVREAL 526
Cdd:COG1842   105 AQLAQLEEQVEKLKEALRQLESKLEELKAKKDTLKARakaakaQEKVNEAL 155
 
Name Accession Description Interval E-value
PTZ00421 PTZ00421
coronin; Provisional
8-397 1.70e-79

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 257.51  E-value: 1.70e-79
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717   8 SKYRHVYPNVAKKEACYENVKVSNNAWD-TNLISANGTYISINWNASGGGAfaVLPINRPGKLPDIYPLCRGHTAAVLDT 86
Cdd:PTZ00421    4 SRFRHTQGVPARPDRHFLNVTPSTALWDcSNTIACNDRFIAVPWQQLGSTA--VLKHTDYGKLASNPPILLGQEGPIIDV 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717  87 ALNPFQDNVVASASDDGTIGLWKIEDCNYDQlEWSDkerernggvkdfePLARISGGGRKVGQVVWHPTASNLLAAATAD 166
Cdd:PTZ00421   82 AFNPFDPQKLFTASEDGTIMGWGIPEEGLTQ-NISD-------------PIVHLQGHTKKVGIVSFHPSAMNVLASAGAD 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 167 HVVKLFDVSHASTACSspsiaLRGFTDTIQSLDWDWSGTTLIATSRDRKIRTFDPRQGErPVQIADSHGGIKGARVIWCG 246
Cdd:PTZ00421  148 MVVNVWDVERGKAVEV-----IKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGT-IVSSVEAHASAKSQRCLWAK 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 247 DKDRAISTGFSKMSDRQMFLWDTNNLASgPLKQITLDASSGIIMPFW-SDNNIVFLAGKGDGNIRYYELEKDELHYLTES 325
Cdd:PTZ00421  222 RKDLIITLGCSKSQQRQIMLWDTRKMAS-PYSTVDLDQSSALFIPFFdEDTNLLYIGSKGEGNIRCFELMNERLTFCSSY 300
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2274520717 326 KSSEPQRGLTFVPRRFLNTEENEIAKAYKITGTTIQPVSFCVPRKA--ESFQSDIFPPAPSNVASLTAKDFFQG 397
Cdd:PTZ00421  301 SSVEPHKGLCMMPKWSLDTRKCEIARFYALTYHSLYTIQMLLPRKQadSELQVDVYPPTFADHPAITADEYFSG 374
DUF1899 pfam08953
Domain of unknown function (DUF1899); This set of domains is found in various eukaryotic ...
3-68 8.73e-36

Domain of unknown function (DUF1899); This set of domains is found in various eukaryotic proteins. Function is unknown.


Pssm-ID: 462645 [Multi-domain]  Cd Length: 66  Bit Score: 127.62  E-value: 8.73e-36
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2274520717   3 RFVRPSKYRHVYPNVAKKEACYENVKVSNNAWDTNLISANGTYISINWNASGGGAFAVLPINRPGK 68
Cdd:pfam08953   1 RFVRASKFRHVYGKPAKKELCYDNIKVTKNAWDSNFIAANPKFLAVNWESSGGGAFAVLPLNQTGR 66
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
77-322 7.51e-19

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 87.01  E-value: 7.51e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717  77 RGHTAAVLDTALNPfQDNVVASASDDGTIGLWKIEDCnydqlewsdkerernggvkdfEPLARISGGGRKVGQVVWHPtA 156
Cdd:cd00200     6 KGHTGGVTCVAFSP-DGKLLATGSGDGTIKVWDLETG---------------------ELLRTLKGHTGPVRDVAASA-D 62
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 157 SNLLAAATADHVVKLFDVShaSTACSSpsiALRGFTDTIQSLDWDWSGTTLIATSRDRKIRTFDPRQGErPVQIADSH-G 235
Cdd:cd00200    63 GTYLASGSSDKTIRLWDLE--TGECVR---TLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGK-CLTTLRGHtD 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 236 GIKGARVIWCGdkdRAISTGfskMSDRQMFLWDtnnLASGPLKQiTLDASSGIIMPF-WSDNNIVFLAGKGDGNIRYYEL 314
Cdd:cd00200   137 WVNSVAFSPDG---TFVASS---SQDGTIKLWD---LRTGKCVA-TLTGHTGEVNSVaFSPDGEKLLSSSSDGTIKLWDL 206

                  ....*...
gi 2274520717 315 EKDELHYL 322
Cdd:cd00200   207 STGKCLGT 214
WD40 COG2319
WD40 repeat [General function prediction only];
50-315 8.60e-18

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 85.35  E-value: 8.60e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717  50 WNASGGGAFAVLpinrpgklpdiyplcRGHTAAVLDTALNPfQDNVVASASDDGTIGLWKiedcnydqlewsdkererng 129
Cdd:COG2319   189 WDLATGKLLRTL---------------TGHTGAVRSVAFSP-DGKLLASGSADGTVRLWD-------------------- 232
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 130 gVKDFEPLARISGGGRKVGQVVWHPTaSNLLAAATADHVVKLFDVSHastacSSPSIALRGFTDTIQSLDWDWSGTTLIA 209
Cdd:COG2319   233 -LATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWDLAT-----GELLRTLTGHSGGVNSVAFSPDGKLLAS 305
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 210 TSRDRKIRTFDPRQGERPVQIADSHGGIKGarVIWCGDKDRAISTGfskmSDRQMFLWDtnnLASGPLKQiTLDASSGII 289
Cdd:COG2319   306 GSDDGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPDGKTLASGS----DDGTVRLWD---LATGELLR-TLTGHTGAV 375
                         250       260
                  ....*....|....*....|....*...
gi 2274520717 290 MP--FWSDNNIVFLAGkGDGNIRYYELE 315
Cdd:COG2319   376 TSvaFSPDGRTLASGS-ADGTVRLWDLA 402
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
76-109 6.79e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.29  E-value: 6.79e-04
                           10        20        30
                   ....*....|....*....|....*....|....
gi 2274520717   76 CRGHTAAVLDTALNPfQDNVVASASDDGTIGLWK 109
Cdd:smart00320   8 LKGHTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
MAT1 pfam06391
CDK-activating kinase assembly factor MAT1; MAT1 is an assembly/targeting factor for ...
481-530 1.06e-03

CDK-activating kinase assembly factor MAT1; MAT1 is an assembly/targeting factor for cyclin-dependent kinase-activating kinase (CAK), which interacts with the transcription factor TFIIH. The domain found to the N-terminal side of this domain is a C3HC4 RING finger.


Pssm-ID: 461894 [Multi-domain]  Cd Length: 202  Bit Score: 40.30  E-value: 1.06e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 2274520717 481 GAQVEDLKKQIEKLRRAVEERDTQIRHLEQENetlKANQEKVREALLNSL 530
Cdd:pfam06391  89 SQEEEELEELLELEKREKEERRKEEKQEEEEE---KEKKEKAKQELIDEL 135
ZIP_TSC22D cd21936
leucine zipper domain found in the TSC22 domain family of leucine zipper transcription factors; ...
484-518 2.79e-03

leucine zipper domain found in the TSC22 domain family of leucine zipper transcription factors; The TGF-beta-stimulated clone-22 domain (TSC22D) family includes TSC22D1-4 and similar proteins. They have diverse physiological functions, including cell growth, development, homeostasis, and immune regulation. All family members contain a conserved leucine zipper (ZIP) domain located at the C-terminus. Its first helix is not basic and does not contain the consensus sequence, NXX(A)(A)XX(C/S)R, found in most basic region/leucine zipper (bZIP) proteins. In the bZIP family of transcription factors, the leucine zipper acts as a dimerization domain and the upstream basic region as a DNA-binding domain. However, DNA-binding capability of TSC22D family proteins is not obvious, due to the lack of the basic region found in the original bZIP DNA-binding domains. Similar to bZIP, ZIP forms homo- and heterodimers, resulting in many dimers that may have different effects on transcription.


Pssm-ID: 409276  Cd Length: 49  Bit Score: 36.00  E-value: 2.79e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 2274520717 484 VEDLKKQIEKLrravEERdtqIRHLEQENETLKAN 518
Cdd:cd21936    17 VDVLKEQIAEL----EER---ISQLERENSLLRSN 44
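To make an aligned pair like the one above easier to compare, a minimal Python sketch can count identical residues over the aligned columns. The function name `alignment_identity` is hypothetical. The handling of lowercase letters and dashes follows the usual CD-Search display convention (lowercase for query residues not aligned to the model, dashes for gaps), which is assumed here rather than stated on this page.

```python
def alignment_identity(query_row, subject_row):
    """Count identities over aligned columns of two alignment rows.

    Returns (identical, aligned). Columns where either row has a gap
    character '-' are skipped; comparison is case-insensitive.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


# The two rows of the cd21936 alignment shown above.
query = "VEDLKKQIEKLrravEERdtqIRHLEQENETLKAN"
model = "VDVLKEQIAEL----EER---ISQLERENSLLRSN"
identical, aligned = alignment_identity(query, model)
print(f"{identical} identical of {aligned} aligned columns")
```

Percent identity over a short window like this says little on its own; the E-value and bit score reported for each hit remain the primary measures of significance.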
SH3_and_anchor TIGR04211
SH3 domain protein; Members of this protein family have a signal peptide, a strongly conserved ...
482-521 4.69e-03

SH3 domain protein; Members of this protein family have a signal peptide, a strongly conserved SH3 domain, a variable region, and then a C-terminal hydrophobic transmembrane alpha helix region.


Pssm-ID: 275056 [Multi-domain]  Cd Length: 198  Bit Score: 38.45  E-value: 4.69e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 2274520717 482 AQVEDLKKQIEKLRRAVEERDTQIRHLEQENETLKANQEK 521
Cdd:TIGR04211 125 ANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164
 
PTZ00420 PTZ00420
coronin; Provisional
11-408 1.74e-52

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 187.46  E-value: 1.74e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717  11 RHVYPNVAKKeaCYENVKVSNNAWDTNLISANGTYISINWNASGGGAFAVLPINRPGKLPDIYPLcRGHTAAVLDTALNP 90
Cdd:PTZ00420    8 KNLYPDPSNN--LFDDLRICSRVIDSCGIACSSGFVAVPWEVEGGGLIGAIRLENQMRKPPVIKL-KGHTSSILDLQFNP 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717  91 FQDNVVASASDDGTIGLWKIEdcnydqlewsdKERERNGGVKDfePLARISGGGRKVGQVVWHPTASNLLAAATADHVVK 170
Cdd:PTZ00420   85 CFSEILASGSEDLTIRVWEIP-----------HNDESVKEIKD--PQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVN 151
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 171 LFDVSHASTACSspsIALrgfTDTIQSLDWDWSGTTLIATSRDRKIRTFDPRQGErpvqIADS---HGGIKGARVIWC-- 245
Cdd:PTZ00420  152 IWDIENEKRAFQ---INM---PKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQE----IASSfhiHDGGKNTKNIWIdg 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 246 --GDKDRAISTGFSKMSDRQMFLWDTNNLASgPLKQITLDASSGIIMPFWSDN-NIVFLAGKGDGNIRYYELEKDELHYL 322
Cdd:PTZ00420  222 lgGDDNYILSTGFSKNNMREMKLWDLKNTTS-ALVTMSIDNASAPLIPHYDEStGLIYLIGKGDGNCRYYQHSLGSIRKV 300
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 323 TESKSSEPQRGLTFVPRRFLNTEENEIAKAYK-ITGTTIQPVSFCVPRKAES-FQSDIFPPAPSNVASLTAKDFFQGKRG 400
Cdd:PTZ00420  301 NEYKSCSPFRSFGFLPKQICDVYKCEIGRVYKnENNSSIRPISFYVPRKNPTkFQEDLYPPILMHDPERSSRNWIDGKDN 380

                  ....*...
gi 2274520717 401 KRMMVSLE 408
Cdd:PTZ00420  381 KMKRINIK 388
WD40 COG2319
WD40 repeat [General function prediction only];
50-319 8.49e-17

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 82.27  E-value: 8.49e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717  50 WNASGGGAFAVLpinrpgklpdiyplcRGHTAAVLDTALNPfQDNVVASASDDGTIGLWKIEDcnydqlewsdkerernG 129
Cdd:COG2319   105 WDLATGLLLRTL---------------TGHTGAVRSVAFSP-DGKTLASGSADGTVRLWDLAT----------------G 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 130 gvkdfEPLARISGGGRKVGQVVWHPTaSNLLAAATADHVVKLFDVSHAstacsSPSIALRGFTDTIQSLDWDWSGTTLIA 209
Cdd:COG2319   153 -----KLLRTLTGHSGAVTSVAFSPD-GKLLASGSDDGTVRLWDLATG-----KLLRTLTGHTGAVRSVAFSPDGKLLAS 221
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 210 TSRDRKIRTFDPRQGERPVQIADSHGGIKGarVIWCGDKDRAISTGfskmSDRQMFLWDtnnLASGPLKQiTLDASSGII 289
Cdd:COG2319   222 GSADGTVRLWDLATGKLLRTLTGHSGSVRS--VAFSPDGRLLASGS----ADGTVRLWD---LATGELLR-TLTGHSGGV 291
                         250       260       270
                  ....*....|....*....|....*....|.
gi 2274520717 290 MPF-WSDNNIVFLAGKGDGNIRYYELEKDEL 319
Cdd:COG2319   292 NSVaFSPDGKLLASGSDDGTVRLWDLATGKL 322
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
76-268 1.01e-13

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 71.60  E-value: 1.01e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717  76 CRGHTAAVLDTALNPFQDnVVASASDDGTIGLWKIEDCnydqlewsdkerernggvkdfEPLARISGGGRKVGQVVWHPT 155
Cdd:cd00200   131 LRGHTDWVNSVAFSPDGT-FVASSSQDGTIKLWDLRTG---------------------KCVATLTGHTGEVNSVAFSPD 188
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 156 ASNLLAAAtADHVVKLFDVSHASTACSspsiaLRGFTDTIQSLDWDWSGTTLIATSRDRKIRTFDPRQGErPVQIADSH- 234
Cdd:cd00200   189 GEKLLSSS-SDGTIKLWDLSTGKCLGT-----LRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGE-CVQTLSGHt 261
                         170       180       190
                  ....*....|....*....|....*....|....
gi 2274520717 235 GGIKGARviWCGDKDRAISTGFskmsDRQMFLWD 268
Cdd:cd00200   262 NSVTSLA--WSPDGKRLASGSA----DGTIRIWD 289
WD40_4 pfam16300
Type of WD40 repeat; Most members of this family form part of the 7-bladed beta-propeller at ...
360-398 1.50e-13

Type of WD40 repeat; Most members of this family form part of the 7-bladed beta-propeller at the N-terminus of coronin proteins.


Pssm-ID: 465087 [Multi-domain]  Cd Length: 44  Bit Score: 64.84  E-value: 1.50e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 2274520717 360 IQPVSFCVPRKA-ESFQSDIFPPAPSNVASLTAKDFFQGK 398
Cdd:pfam16300   1 IEPISFTVPRKSkEDFQDDLYPDTAGTEPALTAEEWLSGK 40
WD40 COG2319
WD40 repeat [General function prediction only];
136-338 1.62e-08

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 56.84  E-value: 1.62e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 136 PLARISGGGRKVGQVVWHPTASNLLAAATADHVVKLFDVSHastacSSPSIALRGFTDTIQSLDWDWSGTTLIATSRDRK 215
Cdd:COG2319    69 ALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLAT-----GLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT 143
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 216 IRTFDPRQGERPVQIADSHGGIKGarVIWCGDKDRAISTGfskmSDRQMFLWDtnnLASGPLKQiTLDASSGIIMP--FW 293
Cdd:COG2319   144 VRLWDLATGKLLRTLTGHSGAVTS--VAFSPDGKLLASGS----DDGTVRLWD---LATGKLLR-TLTGHTGAVRSvaFS 213
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 2274520717 294 SDNNIVFLAGkGDGNIRYYELEKDELHYLTESKSSEPqRGLTFVP 338
Cdd:COG2319   214 PDGKLLASGS-ADGTVRLWDLATGKLLRTLTGHSGSV-RSVAFSP 256
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
75-173 1.26e-05

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 46.94  E-value: 1.26e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717  75 LCRGHTAAVLDTALNPfQDNVVASASDDGTIGLWKiedcnydqlewsdkerernggVKDFEPLARISGGGRKVGQVVWHP 154
Cdd:cd00200   214 TLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVWD---------------------LRTGECVQTLSGHTNSVTSLAWSP 271
                          90
                  ....*....|....*....
gi 2274520717 155 TaSNLLAAATADHVVKLFD 173
Cdd:cd00200   272 D-GKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
188-321 1.36e-04

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 43.86  E-value: 1.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2274520717 188 LRGFTDTIQSLDWDWSGTTLIATSRDRKIRTFDPRQGERPVQIADSHGGIkgARVIWCGDKDRAISTGfskmSDRQMFLW 267
Cdd:cd00200     5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPV--RDVAASADGTYLASGS----SDKTIRLW 78
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 2274520717 268 DTNnlaSGPLKQITLDASSGIIMPFWSDNNIVFLAGKGDGNIRYYELEKDELHY 321
Cdd:cd00200    79 DLE---TGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLT 129
PspA COG1842
Phage shock protein A [Transcription, Signal transduction mechanisms];
482-526 5.47e-04

Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 41.35  E-value: 5.47e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 2274520717 482 AQVEDLKKQIEKLRRAVEERDTQIRHLEQENETLKAN------QEKVREAL 526
Cdd:COG1842   105 AQLAQLEEQVEKLKEALRQLESKLEELKAKKDTLKARakaakaQEKVNEAL 155
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
482-528 6.19e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.45  E-value: 6.19e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 2274520717 482 AQVEDLKKQIEKLRRAVEERDTQIRHLEQENETLKANQEKVREALLN 528
Cdd:COG1579    38 DELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGN 84
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
76-109 6.79e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.29  E-value: 6.79e-04
                           10        20        30
                   ....*....|....*....|....*....|....
gi 2274520717   76 CRGHTAAVLDTALNPfQDNVVASASDDGTIGLWK 109
Cdd:smart00320   8 LKGHTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
MAT1 pfam06391
CDK-activating kinase assembly factor MAT1; MAT1 is an assembly/targeting factor for ...
481-530 1.06e-03

CDK-activating kinase assembly factor MAT1; MAT1 is an assembly/targeting factor for cyclin-dependent kinase-activating kinase (CAK), which interacts with the transcription factor TFIIH. The domain found to the N-terminal side of this domain is a C3HC4 RING finger.


Pssm-ID: 461894 [Multi-domain]  Cd Length: 202  Bit Score: 40.30  E-value: 1.06e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 2274520717 481 GAQVEDLKKQIEKLRRAVEERDTQIRHLEQENetlKANQEKVREALLNSL 530
Cdd:pfam06391  89 SQEEEELEELLELEKREKEERRKEEKQEEEEE---KEKKEKAKQELIDEL 135
WD40 COG2319
WD40 repeat [General function prediction only];
50-112 1.62e-03

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 41.05  E-value: 1.62e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2274520717  50 WNASGGGAFAVLpinrpgklpdiyplcRGHTAAVLDTALNPfQDNVVASASDDGTIGLWKIED 112
Cdd:COG2319   357 WDLATGELLRTL---------------TGHTGAVTSVAFSP-DGRTLASGSADGTVRLWDLAT 403
ZIP_TSC22D cd21936
leucine zipper domain found in the TSC22 domain family of leucine zipper transcription factors; ...
484-518 2.79e-03

leucine zipper domain found in the TSC22 domain family of leucine zipper transcription factors; The TGF-beta-stimulated clone-22 domain (TSC22D) family includes TSC22D1-4 and similar proteins. They have diverse physiological functions, including cell growth, development, homeostasis, and immune regulation. All family members contain a conserved leucine zipper (ZIP) domain located at the C-terminus. Its first helix is not basic and does not contain the consensus sequence, NXX(A)(A)XX(C/S)R, found in most basic region/leucine zipper (bZIP) proteins. In the bZIP family of transcription factors, the leucine zipper acts as a dimerization domain and the upstream basic region as a DNA-binding domain. However, DNA-binding capability of TSC22D family proteins is not obvious, due to the lack of the basic region found in the original bZIP DNA-binding domains. Similar to bZIP, ZIP forms homo- and heterodimers, resulting in many dimers that may have different effects on transcription.


Pssm-ID: 409276  Cd Length: 49  Bit Score: 36.00  E-value: 2.79e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 2274520717 484 VEDLKKQIEKLrravEERdtqIRHLEQENETLKAN 518
Cdd:cd21936    17 VDVLKEQIAEL----EER---ISQLERENSLLRSN 44
SH3_and_anchor TIGR04211
SH3 domain protein; Members of this protein family have a signal peptide, a strongly conserved ...
482-521 4.69e-03

SH3 domain protein; Members of this protein family have a signal peptide, a strongly conserved SH3 domain, a variable region, and then a C-terminal hydrophobic transmembrane alpha helix region.


Pssm-ID: 275056 [Multi-domain]  Cd Length: 198  Bit Score: 38.45  E-value: 4.69e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 2274520717 482 AQVEDLKKQIEKLRRAVEERDTQIRHLEQENETLKANQEK 521
Cdd:TIGR04211 125 ANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164
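
The pairwise alignment rows listed above can be summarized numerically. The following is a minimal Python sketch, not part of the CD-Search output, that counts identical and aligned columns for one query/subject row pair. It assumes that lowercase letters mark residues left unaligned and that `-` marks a gap; the function name `aligned_identity` is hypothetical.

```python
from typing import Tuple


def aligned_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for one alignment row pair.

    Assumption: '-' is a gap and lowercase letters are unaligned residues,
    so columns containing either are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned column
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Row pair copied from the smart00320 hit (query residues 76-109).
query_row = "CRGHTAAVLDTALNPfQDNVVASASDDGTIGLWK"
subject_row = "LKGHTGPVTSVAFSP-DGKYLASGSDDGTIKLWD"
identical, aligned = aligned_identity(query_row, subject_row)
print(f"{identical} of {aligned} aligned columns are identical")
```

Identity counted this way is only a rough descriptor; the bit score and E-value reported for each hit come from the position-specific scoring matrix, not from simple identity.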
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
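
The E-value threshold above determines which hits are reported. As a hedged illustration, the Python sketch below parses the per-hit summary lines (`Pssm-ID`, `Cd Length`, `Bit Score`, `E-value`) from a saved plain-text copy of this page and filters them by a cutoff. The regular expression, the function names `parse_summary` and `filter_hits`, and the inclusive comparison are assumptions made for illustration; they do not describe how CD-Search itself applies the threshold.

```python
import re
from typing import Dict, List, Optional

# Matches lines such as:
# "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 46.94  E-value: 1.26e-05"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> Optional[Dict[str, object]]:
    """Parse one hit summary line; return None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": "[Multi-domain]" in line,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def filter_hits(lines: List[str], max_e_value: float) -> List[Dict[str, object]]:
    """Keep parsed hits whose E-value is at or below max_e_value (assumed inclusive)."""
    hits = [parse_summary(line) for line in lines]
    return [hit for hit in hits if hit is not None and hit["e_value"] <= max_e_value]


# Summary lines copied from the hit list on this page.
summary_lines = [
    "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 46.94  E-value: 1.26e-05",
    "Pssm-ID: 409276  Cd Length: 49  Bit Score: 36.00  E-value: 2.79e-03",
    "Pssm-ID: 275056 [Multi-domain]  Cd Length: 198  Bit Score: 38.45  E-value: 4.69e-03",
]

reported = filter_hits(summary_lines, max_e_value=0.01)  # threshold used on this page
stricter = filter_hits(summary_lines, max_e_value=1e-3)  # hypothetical stricter cutoff
print(len(reported), len(stricter))
```

Applying a stricter cutoff after the fact is a quick way to separate the strong WD40 hits from the short, low-scoring coiled-coil-like matches near the C-terminus.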

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.