NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907189679|ref|XP_036009982|]
View 

NACHT domain- and WD repeat-containing protein 1 isoform X3 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
1020-1387 1.48e-26

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 114.24  E-value: 1.48e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1020 AVSIQSRARLVAGFSSGSIALVSAGEDRLLEKLPE---AVGFLVVSEDDSLLV-AGFGRFVRIFLADSQGFHRFMAsdlE 1095
Cdd:COG2319     42 LAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGhtaAVLSVAFSPDGRLLAsASADGTVRLWDLATGLLLRTLT---G 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1096 HEDMVETAVLGPENNLIITGSRDALIQVWSLsEQGTLLNVLEGVGAPVSLLV--RGGTLVVSASRkSSSFKVWDLKSTKK 1173
Cdd:COG2319    119 HTGAVRSVAFSPDGKTLASGSADGTVRLWDL-ATGKLLRTLTGHSGAVTSVAfsPDGKLLASGSD-DGTVRLWDLATGKL 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1174 LQSPTPFLDRTGLAAVSHHGSFVYFpkVGDKNKVTIWDLAEGEEQDCLDT-SNEVRCLEVAEQAKLLFTGLVSGIVLVFP 1252
Cdd:COG2319    197 LRTLTGHTGAVRSVAFSPDGKLLAS--GSADGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSADGTVRLWD 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1253 LNSRQDVLCIPPPEARkaVNCMSLSKSENRLAIA-YDNIVLVLDISPGDPCPAIEGPTytfytqlpETIVSVAVLAD-YR 1330
Cdd:COG2319    275 LATGELLRTLTGHSGG--VNSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGHT--------GAVRSVAFSPDgKT 344
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907189679 1331 VVYGMSDGSLFLYDCACSK-VFPLEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWDLQ 1387
Cdd:COG2319    345 LASGSDDGTVRLWDLATGElLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
870-1167 1.29e-14

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 76.22  E-value: 1.29e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  870 LRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWSLLSGQEKVTIldggsQNPTEPqSWDLHVDERNNVVYSTSGAR 949
Cdd:cd00200      1 LRRTLKG-HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTL-----KGHTGP-VRDVAASADGTYLASGSSDK 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  950 -INMWNLETSKLVFCITGDVSDPWvCVALLAAQGLLLALSKGGQVSLWSSAMGKLqeKHQLSSIkeETPTCAVSIQSRAR 1028
Cdd:cd00200     74 tIRLWDLETGECVRTLTGHTSYVS-SVAFSPDGRILSSSSRDKTIKVWDVETGKC--LTTLRGH--TDWVNSVAFSPDGT 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1029 LVAGFSS-GSIALVSAGEDRLLEKLP---EAVGFLVVSEDDSLLVAGFG-RFVRIFLADSQgfhRFMASDLEHEDMVETA 1103
Cdd:cd00200    149 FVASSSQdGTIKLWDLRTGKCVATLTghtGEVNSVAFSPDGEKLLSSSSdGTIKLWDLSTG---KCLGTLRGHENGVNSV 225
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907189679 1104 VLGPENNLIITGSRDALIQVWSLsEQGTLLNVLEGVGAPVSLL--VRGGTLVVSASrKSSSFKVWD 1167
Cdd:cd00200    226 AFSPDGYLLASGSEDGTIRVWDL-RTGECVQTLSGHTNSVTSLawSPDGKRLASGS-ADGTIRIWD 289
ExeA super family cl30749
Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, ...
315-421 1.79e-04

Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


The actual alignment was detected with superfamily member COG3267:

Pssm-ID: 442498 [Multi-domain]  Cd Length: 261  Bit Score: 45.16  E-value: 1.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  315 GHQELLAQLRQQLRQDESrthtPLVLFGPPGIGKTSLMCKLAQQVPEllghkTVVVLRLLGTsklSLDARSLLRslsfQV 394
Cdd:COG3267     27 SHREALARLEYALAQGGG----FVVLTGEVGTGKTTLLRRLLERLPD-----DVKVAYIPNP---QLSPAELLR----AI 90
                           90       100
                   ....*....|....*....|....*..
gi 1907189679  395 CLAYGLPLPPAQVLEAHSRVGHFFHTL 421
Cdd:COG3267     91 ADELGLEPKGASKADLLRQLQEFLLEL 117
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
1020-1387 1.48e-26

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 114.24  E-value: 1.48e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1020 AVSIQSRARLVAGFSSGSIALVSAGEDRLLEKLPE---AVGFLVVSEDDSLLV-AGFGRFVRIFLADSQGFHRFMAsdlE 1095
Cdd:COG2319     42 LAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGhtaAVLSVAFSPDGRLLAsASADGTVRLWDLATGLLLRTLT---G 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1096 HEDMVETAVLGPENNLIITGSRDALIQVWSLsEQGTLLNVLEGVGAPVSLLV--RGGTLVVSASRkSSSFKVWDLKSTKK 1173
Cdd:COG2319    119 HTGAVRSVAFSPDGKTLASGSADGTVRLWDL-ATGKLLRTLTGHSGAVTSVAfsPDGKLLASGSD-DGTVRLWDLATGKL 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1174 LQSPTPFLDRTGLAAVSHHGSFVYFpkVGDKNKVTIWDLAEGEEQDCLDT-SNEVRCLEVAEQAKLLFTGLVSGIVLVFP 1252
Cdd:COG2319    197 LRTLTGHTGAVRSVAFSPDGKLLAS--GSADGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSADGTVRLWD 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1253 LNSRQDVLCIPPPEARkaVNCMSLSKSENRLAIA-YDNIVLVLDISPGDPCPAIEGPTytfytqlpETIVSVAVLAD-YR 1330
Cdd:COG2319    275 LATGELLRTLTGHSGG--VNSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGHT--------GAVRSVAFSPDgKT 344
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907189679 1331 VVYGMSDGSLFLYDCACSK-VFPLEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWDLQ 1387
Cdd:COG2319    345 LASGSDDGTVRLWDLATGElLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1107-1430 2.57e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 84.31  E-value: 2.57e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1107 PENNLIITGSRDALIQVWSLsEQGTLLNVLEGVGAPVS--LLVRGGTLVVSASrksssfkvwdlkstkklqsptpfLDRT 1184
Cdd:cd00200     19 PDGKLLATGSGDGTIKVWDL-ETGELLRTLKGHTGPVRdvAASADGTYLASGS-----------------------SDKT 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1185 glaavshhgsfvyfpkvgdknkVTIWDLAEGEeqdCLDT----SNEVRCLEVAEQAKLLFTGLVSGIVLVFPLNSRQDVL 1260
Cdd:cd00200     75 ----------------------IRLWDLETGE---CVRTltghTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLT 129
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1261 CIPPPEArkAVNCMSLSKSENRLAIA-YDNIVLVLDISPGDPCPAIEGPTytfytqlpETIVSVAVLAD-YRVVYGMSDG 1338
Cdd:cd00200    130 TLRGHTD--WVNSVAFSPDGTFVASSsQDGTIKLWDLRTGKCVATLTGHT--------GEVNSVAFSPDgEKLLSSSSDG 199
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1339 SLFLYDCACSKVF-PLEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWDLQACRGMFEMS-YENSccrgVRCACFSRDDKH 1416
Cdd:cd00200    200 TIKLWDLSTGKCLgTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSgHTNS----VTSLAWSPDGKR 275
                          330
                   ....*....|....
gi 1907189679 1417 VFAGMEDRSVTAWS 1430
Cdd:cd00200    276 LASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
870-1167 1.29e-14

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 76.22  E-value: 1.29e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  870 LRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWSLLSGQEKVTIldggsQNPTEPqSWDLHVDERNNVVYSTSGAR 949
Cdd:cd00200      1 LRRTLKG-HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTL-----KGHTGP-VRDVAASADGTYLASGSSDK 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  950 -INMWNLETSKLVFCITGDVSDPWvCVALLAAQGLLLALSKGGQVSLWSSAMGKLqeKHQLSSIkeETPTCAVSIQSRAR 1028
Cdd:cd00200     74 tIRLWDLETGECVRTLTGHTSYVS-SVAFSPDGRILSSSSRDKTIKVWDVETGKC--LTTLRGH--TDWVNSVAFSPDGT 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1029 LVAGFSS-GSIALVSAGEDRLLEKLP---EAVGFLVVSEDDSLLVAGFG-RFVRIFLADSQgfhRFMASDLEHEDMVETA 1103
Cdd:cd00200    149 FVASSSQdGTIKLWDLRTGKCVATLTghtGEVNSVAFSPDGEKLLSSSSdGTIKLWDLSTG---KCLGTLRGHENGVNSV 225
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907189679 1104 VLGPENNLIITGSRDALIQVWSLsEQGTLLNVLEGVGAPVSLL--VRGGTLVVSASrKSSSFKVWD 1167
Cdd:cd00200    226 AFSPDGYLLASGSEDGTIRVWDL-RTGECVQTLSGHTNSVTSLawSPDGKRLASGS-ADGTIRIWD 289
ExeA COG3267
Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, ...
315-421 1.79e-04

Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 442498 [Multi-domain]  Cd Length: 261  Bit Score: 45.16  E-value: 1.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  315 GHQELLAQLRQQLRQDESrthtPLVLFGPPGIGKTSLMCKLAQQVPEllghkTVVVLRLLGTsklSLDARSLLRslsfQV 394
Cdd:COG3267     27 SHREALARLEYALAQGGG----FVVLTGEVGTGKTTLLRRLLERLPD-----DVKVAYIPNP---QLSPAELLR----AI 90
                           90       100
                   ....*....|....*....|....*..
gi 1907189679  395 CLAYGLPLPPAQVLEAHSRVGHFFHTL 421
Cdd:COG3267     91 ADELGLEPKGASKADLLRQLQEFLLEL 117
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
868-907 2.02e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 2.02e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 1907189679   868 GPLRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWS 907
Cdd:smart00320    2 GELLKTLKG-HTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
868-907 6.16e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.87  E-value: 6.16e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1907189679  868 GPLRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWS 907
Cdd:pfam00400    1 GKLLKTLEG-HTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
AAA cd00009
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily ...
315-361 6.95e-04

The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.


Pssm-ID: 99707 [Multi-domain]  Cd Length: 151  Bit Score: 41.75  E-value: 6.95e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1907189679  315 GHQELLAQLRQQLRQDESRthtPLVLFGPPGIGKTSLMCKLAQQVPE 361
Cdd:cd00009      2 GQEEAIEALREALELPPPK---NLLLYGPPGTGKTTLARAIANELFR 45
FxSxx_TPR NF040586
FxSxx-COOH system tetratricopeptide repeat protein; Members of this family are typically about ...
313-351 7.42e-04

FxSxx-COOH system tetratricopeptide repeat protein; Members of this family are typically about 850 amino acids long, or 1300 long because of an additional N-terminal domain. Proteins have a P-loop motif, GxGGxGKT, near the N-terminus of the region covered by this HMM, and a region over 400 residues long of tetratricopeptide repeat sequence. The family is found regularly next to other components of FxSxx-COOH systems, which feature an FxsB family radical SAM protein and a protein modified by it, FxsA. Members of this FxsA family typically have an FxSxx motif as the final five amino acids.


Pssm-ID: 468560 [Multi-domain]  Cd Length: 836  Bit Score: 44.14  E-value: 7.42e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1907189679  313 FCGHQELLAQLRQQLRQdESRTHTPLVLFGPPGIGKTSL 351
Cdd:NF040586     8 FTGREELLERLRDQLRS-GGAAVVPQALHGLGGVGKTQL 45
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1348-1385 2.14e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.29  E-value: 2.14e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1907189679  1348 SKVFPLEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWD 1385
Cdd:smart00320    3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
NACHT pfam05729
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ...
337-504 2.46e-03

NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.


Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 40.37  E-value: 2.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  337 PLVLFGPPGIGKTSLMCKLAQ--QVPELLGHKTVVVLRLLGTSKLSLDARSlLRSLSFQVCLAYGLPLPP--AQVLEAHS 412
Cdd:pfam05729    2 TVILQGEAGSGKTTLLQKLALlwAQGKLPQGFDFVFFLPCRELSRSGNARS-LADLLFSQWPEPAAPVSEvwAVILELPE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  413 RVGHFFHTLLHTVSQRNFESLVLLLDSVDDldsichsprvSWLPLKCPPRVHLILSTCSGQqvLHNLQQTLKDPsTYWEV 492
Cdd:pfam05729   81 RLLLILDGLDELVSDLGQLDGPCPVLTLLS----------SLLRKKLLPGASLLLTVRPDA--LRDLRRGLEEP-RYLEV 147
                          170
                   ....*....|..
gi 1907189679  493 KALSGSQGQEFI 504
Cdd:pfam05729  148 RGFSESDRKQYV 159
WD40 pfam00400
WD domain, G-beta repeat;
1353-1385 2.62e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.94  E-value: 2.62e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1907189679 1353 LEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWD 1385
Cdd:pfam00400    7 LEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
ruvB TIGR00635
Holliday junction DNA helicase, RuvB subunit; All proteins in this family for which functions ...
313-351 6.23e-03

Holliday junction DNA helicase, RuvB subunit; All proteins in this family for which functions are known are 5'-3' DNA helicases that, as part of a complex with RuvA homologs serve as a 5'-3' Holliday junction helicase. RuvA specifically binds Holliday junctions as a sandwich of two tetramers and maintains the configuration of the junction. It forms a complex with two hexameric rings of RuvB, the subunit that contains helicase activity. The complex drives ATP-dependent branch migration of the Holliday junction recombination intermediate. The endonuclease RuvC resolves junctions. [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129721 [Multi-domain]  Cd Length: 305  Bit Score: 40.36  E-value: 6.23e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1907189679  313 FCGHQELLAQLRQQLRQDESRTHTP--LVLFGPPGIGKTSL 351
Cdd:TIGR00635    6 FIGQEKVKEQLQLFIEAAKMRQEALdhLLLYGPPGLGKTTL 46
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
1020-1387 1.48e-26

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 114.24  E-value: 1.48e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1020 AVSIQSRARLVAGFSSGSIALVSAGEDRLLEKLPE---AVGFLVVSEDDSLLV-AGFGRFVRIFLADSQGFHRFMAsdlE 1095
Cdd:COG2319     42 LAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGhtaAVLSVAFSPDGRLLAsASADGTVRLWDLATGLLLRTLT---G 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1096 HEDMVETAVLGPENNLIITGSRDALIQVWSLsEQGTLLNVLEGVGAPVSLLV--RGGTLVVSASRkSSSFKVWDLKSTKK 1173
Cdd:COG2319    119 HTGAVRSVAFSPDGKTLASGSADGTVRLWDL-ATGKLLRTLTGHSGAVTSVAfsPDGKLLASGSD-DGTVRLWDLATGKL 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1174 LQSPTPFLDRTGLAAVSHHGSFVYFpkVGDKNKVTIWDLAEGEEQDCLDT-SNEVRCLEVAEQAKLLFTGLVSGIVLVFP 1252
Cdd:COG2319    197 LRTLTGHTGAVRSVAFSPDGKLLAS--GSADGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSADGTVRLWD 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1253 LNSRQDVLCIPPPEARkaVNCMSLSKSENRLAIA-YDNIVLVLDISPGDPCPAIEGPTytfytqlpETIVSVAVLAD-YR 1330
Cdd:COG2319    275 LATGELLRTLTGHSGG--VNSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGHT--------GAVRSVAFSPDgKT 344
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907189679 1331 VVYGMSDGSLFLYDCACSK-VFPLEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWDLQ 1387
Cdd:COG2319    345 LASGSDDGTVRLWDLATGElLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
1020-1431 2.19e-26

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 113.47  E-value: 2.19e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1020 AVSIQSRARLVAGFSSGSIALVSAGEDRLLEKLPEAVGFLVVSEDDSLLVAGFGRFVRIFLADSQGFHRfmASDLEHEDM 1099
Cdd:COG2319      3 SADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALL--ATLLGHTAA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1100 VETAVLGPENNLIITGSRDALIQVWSLSeQGTLLNVLEGVGAPVSLLV--RGGTLVVSASRkSSSFKVWDLKSTKKLQSP 1177
Cdd:COG2319     81 VLSVAFSPDGRLLASASADGTVRLWDLA-TGLLLRTLTGHTGAVRSVAfsPDGKTLASGSA-DGTVRLWDLATGKLLRTL 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1178 TPFLDRTGLAAVSHHGSFVYFpkVGDKNKVTIWDLAEGEEQDCLD-TSNEVRCLEVAEQAKLLFTGLVSGIVLVFPLNSR 1256
Cdd:COG2319    159 TGHSGAVTSVAFSPDGKLLAS--GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATG 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1257 QDVLCIPPPEARkaVNCMSLSKSENRLAIA-YDNIVLVLDISPGDPCPAIEGPTytfytqlpETIVSVAVLAD-YRVVYG 1334
Cdd:COG2319    237 KLLRTLTGHSGS--VRSVAFSPDGRLLASGsADGTVRLWDLATGELLRTLTGHS--------GGVNSVAFSPDgKLLASG 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1335 MSDGSLFLYDCACSK-VFPLEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWDLQACRGMFEMSYENSccrGVRCACFSRD 1413
Cdd:COG2319    307 SDDGTVRLWDLATGKlLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTG---AVTSVAFSPD 383
                          410
                   ....*....|....*...
gi 1907189679 1414 DKHVFAGMEDRSVTAWST 1431
Cdd:COG2319    384 GRTLASGSADGTVRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
1060-1443 2.17e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.61  E-value: 2.17e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1060 VVSEDDSLLVAGFGRFVRIFLADSQGFHRfmASDLEHEDMVETAVLGPENNLIITGSRDALIQVWSLSEQGTLLNVLEGV 1139
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALL--LLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHT 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1140 GAPVSLLVRGGTLVVSASRKSSSFKVWDLKSTKKLQSPTPFLDRTGLAAVSHHGSFVYFpkVGDKNKVTIWDLAEGEEQD 1219
Cdd:COG2319     79 AAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLAS--GSADGTVRLWDLATGKLLR 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1220 CLDT-SNEVRCLEVAEQAKLLFTGLVSGIVLVFPLNSRQDVLCIPPPEARkaVNCMSLSKSENRLAIA-YDNIVLVLDIS 1297
Cdd:COG2319    157 TLTGhSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGA--VRSVAFSPDGKLLASGsADGTVRLWDLA 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1298 PGDPCPAIEGPTytfytqlpETIVSVAVLAD-YRVVYGMSDGSLFLYDCACSK-VFPLEAHGSRVSCVEVSHSEQLAVSG 1375
Cdd:COG2319    235 TGKLLRTLTGHS--------GSVRSVAFSPDgRLLASGSADGTVRLWDLATGElLRTLTGHSGGVNSVAFSPDGKLLASG 306
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907189679 1376 AEDALLCLWDLQACRGMFEMSYENSccrGVRCACFSRDDKHVFAGMEDRSVTAWSTVDGTLLAVQFVH 1443
Cdd:COG2319    307 SDDGTVRLWDLATGKLLRTLTGHTG---AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGH 371
WD40 COG2319
WD40 repeat [General function prediction only];
867-1242 9.18e-21

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 96.52  E-value: 9.18e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  867 GGPLRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWSLLSGQEKVTILDggsqnptepqswdlHVDERNNVVYSTS 946
Cdd:COG2319     67 AGALLATLLG-HTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG--------------HTGAVRSVAFSPD 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  947 GARInmwnletsklvfcITGdvsdpwvcvallaaqglllalSKGGQVSLWSSAMGKLQekHQLSSIKEETPTCAVSIQSR 1026
Cdd:COG2319    132 GKTL-------------ASG---------------------SADGTVRLWDLATGKLL--RTLTGHSGAVTSVAFSPDGK 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1027 aRLVAGFSSGSIALVSAGEDRLLEKLP---EAVGFLVVSEDDSLLV-AGFGRFVRIFLADSQGFHRFMASdleHEDMVET 1102
Cdd:COG2319    176 -LLASGSDDGTVRLWDLATGKLLRTLTghtGAVRSVAFSPDGKLLAsGSADGTVRLWDLATGKLLRTLTG---HSGSVRS 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1103 AVLGPENNLIITGSRDALIQVWSLSEqGTLLNVLEGVGAPVSLLV--RGGTLVVSASRkSSSFKVWDLKSTKKLQSPTPF 1180
Cdd:COG2319    252 VAFSPDGRLLASGSADGTVRLWDLAT-GELLRTLTGHSGGVNSVAfsPDGKLLASGSD-DGTVRLWDLATGKLLRTLTGH 329
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907189679 1181 LDRTGLAAVSHHGSFVYFpkVGDKNKVTIWDLAEGEEQDCLDT-SNEVRCLEVAEQAKLLFTG 1242
Cdd:COG2319    330 TGAVRSVAFSPDGKTLAS--GSDDGTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASG 390
WD40 COG2319
WD40 repeat [General function prediction only];
864-1213 1.95e-18

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 89.59  E-value: 1.95e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  864 QPPGGPLRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWSLLSGQEkVTILDGgsqnptepqswdlHVDERNNVVY 943
Cdd:COG2319    106 DLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKL-LRTLTG-------------HSGAVTSVAF 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  944 STSGARInmwnletsklvfcITGdvsdpwvcvallaaqglllalSKGGQVSLWSSAMGKLQekHQLSSIKEETPTCAVSI 1023
Cdd:COG2319    171 SPDGKLL-------------ASG---------------------SDDGTVRLWDLATGKLL--RTLTGHTGAVRSVAFSP 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1024 QSRaRLVAGFSSGSIALVSAGEDRLLEKLP---EAVGFLVVSEDDSLLV-AGFGRFVRIFLADSQGFHRFMASdleHEDM 1099
Cdd:COG2319    215 DGK-LLASGSADGTVRLWDLATGKLLRTLTghsGSVRSVAFSPDGRLLAsGSADGTVRLWDLATGELLRTLTG---HSGG 290
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1100 VETAVLGPENNLIITGSRDALIQVWSLSEqGTLLNVLEGVGAPVSLLV--RGGTLVVSASRkSSSFKVWDLKSTKKLQSP 1177
Cdd:COG2319    291 VNSVAFSPDGKLLASGSDDGTVRLWDLAT-GKLLRTLTGHTGAVRSVAfsPDGKTLASGSD-DGTVRLWDLATGELLRTL 368
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1907189679 1178 TPFLDRTGLAAVSHHGSFVYFpkVGDKNKVTIWDLA 1213
Cdd:COG2319    369 TGHTGAVTSVAFSPDGRTLAS--GSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1107-1430 2.57e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 84.31  E-value: 2.57e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1107 PENNLIITGSRDALIQVWSLsEQGTLLNVLEGVGAPVS--LLVRGGTLVVSASrksssfkvwdlkstkklqsptpfLDRT 1184
Cdd:cd00200     19 PDGKLLATGSGDGTIKVWDL-ETGELLRTLKGHTGPVRdvAASADGTYLASGS-----------------------SDKT 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1185 glaavshhgsfvyfpkvgdknkVTIWDLAEGEeqdCLDT----SNEVRCLEVAEQAKLLFTGLVSGIVLVFPLNSRQDVL 1260
Cdd:cd00200     75 ----------------------IRLWDLETGE---CVRTltghTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLT 129
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1261 CIPPPEArkAVNCMSLSKSENRLAIA-YDNIVLVLDISPGDPCPAIEGPTytfytqlpETIVSVAVLAD-YRVVYGMSDG 1338
Cdd:cd00200    130 TLRGHTD--WVNSVAFSPDGTFVASSsQDGTIKLWDLRTGKCVATLTGHT--------GEVNSVAFSPDgEKLLSSSSDG 199
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1339 SLFLYDCACSKVF-PLEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWDLQACRGMFEMS-YENSccrgVRCACFSRDDKH 1416
Cdd:cd00200    200 TIKLWDLSTGKCLgTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSgHTNS----VTSLAWSPDGKR 275
                          330
                   ....*....|....
gi 1907189679 1417 VFAGMEDRSVTAWS 1430
Cdd:cd00200    276 LASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
870-1167 1.29e-14

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 76.22  E-value: 1.29e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  870 LRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWSLLSGQEKVTIldggsQNPTEPqSWDLHVDERNNVVYSTSGAR 949
Cdd:cd00200      1 LRRTLKG-HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTL-----KGHTGP-VRDVAASADGTYLASGSSDK 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  950 -INMWNLETSKLVFCITGDVSDPWvCVALLAAQGLLLALSKGGQVSLWSSAMGKLqeKHQLSSIkeETPTCAVSIQSRAR 1028
Cdd:cd00200     74 tIRLWDLETGECVRTLTGHTSYVS-SVAFSPDGRILSSSSRDKTIKVWDVETGKC--LTTLRGH--TDWVNSVAFSPDGT 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1029 LVAGFSS-GSIALVSAGEDRLLEKLP---EAVGFLVVSEDDSLLVAGFG-RFVRIFLADSQgfhRFMASDLEHEDMVETA 1103
Cdd:cd00200    149 FVASSSQdGTIKLWDLRTGKCVATLTghtGEVNSVAFSPDGEKLLSSSSdGTIKLWDLSTG---KCLGTLRGHENGVNSV 225
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907189679 1104 VLGPENNLIITGSRDALIQVWSLsEQGTLLNVLEGVGAPVSLL--VRGGTLVVSASrKSSSFKVWD 1167
Cdd:cd00200    226 AFSPDGYLLASGSEDGTIRVWDL-RTGECVQTLSGHTNSVTSLawSPDGKRLASGS-ADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
867-1125 1.79e-14

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 75.83  E-value: 1.79e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  867 GGPLRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWSlLSGQEKVTILDGGSQNPTepqSWDLHVDERnnVVYSTS 946
Cdd:cd00200     40 TGELLRTLKG-HTGPVRDVAASADGTYLASGSSDKTIRLWD-LETGECVRTLTGHTSYVS---SVAFSPDGR--ILSSSS 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  947 GAR-INMWNLETSKLVFCITGdVSDPWVCVALLAAQGLLLALSKGGQVSLWSSAMGKLqeKHQLSSIKEETPTCAVSiQS 1025
Cdd:cd00200    113 RDKtIKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKC--VATLTGHTGEVNSVAFS-PD 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1026 RARLVAGFSSGSIALVSAGEDRLLEKLP---EAVGFLVVSEDDSLLVAG-FGRFVRIFlaDSQGFhRFMASDLEHEDMVE 1101
Cdd:cd00200    189 GEKLLSSSSDGTIKLWDLSTGKCLGTLRgheNGVNSVAFSPDGYLLASGsEDGTIRVW--DLRTG-ECVQTLSGHTNSVT 265
                          250       260
                   ....*....|....*....|....
gi 1907189679 1102 TAVLGPENNLIITGSRDALIQVWS 1125
Cdd:cd00200    266 SLAWSPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1224-1443 1.73e-13

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 72.75  E-value: 1.73e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1224 SNEVRCLEVAEQAKLLFTGLVSGIVLVFPLNSRQDVLCIPppEARKAVNCMSLSKSENRLAIA-YDNIVLVLDISPGDPC 1302
Cdd:cd00200      9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLK--GHTGPVRDVAASADGTYLASGsSDKTIRLWDLETGECV 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1303 PAIEGPTytfytqlpETIVSVAVLADYRVVYG-MSDGSLFLYDCACSK-VFPLEAHGSRVSCVEVSHSEQLAVSGAEDAL 1380
Cdd:cd00200     87 RTLTGHT--------SYVSSVAFSPDGRILSSsSRDKTIKVWDVETGKcLTTLRGHTDWVNSVAFSPDGTFVASSSQDGT 158
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907189679 1381 LCLWDLQA--CRGMFEMSYENsccrgVRCACFSRDDKHVFAGMEDRSVTAWSTVDGTLLAVQFVH 1443
Cdd:cd00200    159 IKLWDLRTgkCVATLTGHTGE-----VNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH 218
ExeA COG3267
Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, ...
315-421 1.79e-04

Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 442498 [Multi-domain]  Cd Length: 261  Bit Score: 45.16  E-value: 1.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  315 GHQELLAQLRQQLRQDESrthtPLVLFGPPGIGKTSLMCKLAQQVPEllghkTVVVLRLLGTsklSLDARSLLRslsfQV 394
Cdd:COG3267     27 SHREALARLEYALAQGGG----FVVLTGEVGTGKTTLLRRLLERLPD-----DVKVAYIPNP---QLSPAELLR----AI 90
                           90       100
                   ....*....|....*....|....*..
gi 1907189679  395 CLAYGLPLPPAQVLEAHSRVGHFFHTL 421
Cdd:COG3267     91 ADELGLEPKGASKADLLRQLQEFLLEL 117
WDR74 cd22857
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ...
1265-1387 1.96e-04

WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double ring, AAA-ATPase, that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness, by inducing nuclear beta-catenin accumulation and driving downstream Wnt-responsive genes expression. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain, and is required for nucleolar localization.


Pssm-ID: 439303 [Multi-domain]  Cd Length: 325  Bit Score: 45.30  E-value: 1.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679 1265 PEARKAVNCMSLS--KSENRLAIAY-DNIVLVLDISPGDPCPAIEGPTYTFYTQLPETIVSVAVLAdyRVVYGMSD-GSL 1340
Cdd:cd22857     27 PDKSKAVQALSIAdrESEPLLAVARkNGTVEVLDPENGDLLASFSDSEPATKLSEEDHFVGLHLFS--GTLLTCTSkGSL 104
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907189679 1341 FLY-----DCACSKVFPLEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWDLQ 1387
Cdd:cd22857    105 RSTklpddSTASSSPTAWVCLGGNLLCMRVDPNENYFAFGGKEVELNVWDLE 156
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
868-907 2.02e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 2.02e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 1907189679   868 GPLRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWS 907
Cdd:smart00320    2 GELLKTLKG-HTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
868-907 6.16e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.87  E-value: 6.16e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1907189679  868 GPLRATLTGcHKAEVKCVRVFAQGTLAISASKDHTLRLWS 907
Cdd:pfam00400    1 GKLLKTLEG-HTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
AAA cd00009
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily ...
315-361 6.95e-04

The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.


Pssm-ID: 99707 [Multi-domain]  Cd Length: 151  Bit Score: 41.75  E-value: 6.95e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1907189679  315 GHQELLAQLRQQLRQDESRthtPLVLFGPPGIGKTSLMCKLAQQVPE 361
Cdd:cd00009      2 GQEEAIEALREALELPPPK---NLLLYGPPGTGKTTLARAIANELFR 45
FxSxx_TPR NF040586
FxSxx-COOH system tetratricopeptide repeat protein; Members of this family are typically about ...
313-351 7.42e-04

FxSxx-COOH system tetratricopeptide repeat protein; Members of this family are typically about 850 amino acids long, or 1300 long because of an additional N-terminal domain. Proteins have a P-loop motif, GxGGxGKT, near the N-terminus of the region covered by this HMM, and a region over 400 residues long of tetratricopeptide repeat sequence. The family is found regularly next to other components of FxSxx-COOH systems, which feature an FxsB family radical SAM protein and a protein modified by it, FxsA. Members of this FxsA family typically have an FxSxx motif as the final five amino acids.


Pssm-ID: 468560 [Multi-domain]  Cd Length: 836  Bit Score: 44.14  E-value: 7.42e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1907189679  313 FCGHQELLAQLRQQLRQdESRTHTPLVLFGPPGIGKTSL 351
Cdd:NF040586     8 FTGREELLERLRDQLRS-GGAAVVPQALHGLGGVGKTQL 45
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1348-1385 2.14e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.29  E-value: 2.14e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1907189679  1348 SKVFPLEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWD 1385
Cdd:smart00320    3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
NACHT pfam05729
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ...
337-504 2.46e-03

NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.


Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 40.37  E-value: 2.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  337 PLVLFGPPGIGKTSLMCKLAQ--QVPELLGHKTVVVLRLLGTSKLSLDARSlLRSLSFQVCLAYGLPLPP--AQVLEAHS 412
Cdd:pfam05729    2 TVILQGEAGSGKTTLLQKLALlwAQGKLPQGFDFVFFLPCRELSRSGNARS-LADLLFSQWPEPAAPVSEvwAVILELPE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907189679  413 RVGHFFHTLLHTVSQRNFESLVLLLDSVDDldsichsprvSWLPLKCPPRVHLILSTCSGQqvLHNLQQTLKDPsTYWEV 492
Cdd:pfam05729   81 RLLLILDGLDELVSDLGQLDGPCPVLTLLS----------SLLRKKLLPGASLLLTVRPDA--LRDLRRGLEEP-RYLEV 147
                          170
                   ....*....|..
gi 1907189679  493 KALSGSQGQEFI 504
Cdd:pfam05729  148 RGFSESDRKQYV 159
WD40 pfam00400
WD domain, G-beta repeat;
1353-1385 2.62e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.94  E-value: 2.62e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1907189679 1353 LEAHGSRVSCVEVSHSEQLAVSGAEDALLCLWD 1385
Cdd:pfam00400    7 LEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
ruvB TIGR00635
Holliday junction DNA helicase, RuvB subunit; All proteins in this family for which functions ...
313-351 6.23e-03

Holliday junction DNA helicase, RuvB subunit; All proteins in this family for which functions are known are 5'-3' DNA helicases that, as part of a complex with RuvA homologs serve as a 5'-3' Holliday junction helicase. RuvA specifically binds Holliday junctions as a sandwich of two tetramers and maintains the configuration of the junction. It forms a complex with two hexameric rings of RuvB, the subunit that contains helicase activity. The complex drives ATP-dependent branch migration of the Holliday junction recombination intermediate. The endonuclease RuvC resolves junctions. [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129721 [Multi-domain]  Cd Length: 305  Bit Score: 40.36  E-value: 6.23e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1907189679  313 FCGHQELLAQLRQQLRQDESRTHTP--LVLFGPPGIGKTSL 351
Cdd:TIGR00635    6 FIGQEKVKEQLQLFIEAAKMRQEALdhLLLYGPPGLGKTTL 46
HolB COG0470
DNA polymerase III, delta prime subunit [Replication, recombination and repair];
316-366 7.43e-03

DNA polymerase III, delta prime subunit [Replication, recombination and repair];


Pssm-ID: 440238 [Multi-domain]  Cd Length: 289  Bit Score: 40.34  E-value: 7.43e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907189679  316 HQELLAQLRQQLRQDesRTHTPLVLFGPPGIGKTSLMCKLAQqvpELLGHK 366
Cdd:COG0470      1 QEEAWEQLLAAAESG--RLPHALLLHGPPGIGKTTLALALAR---DLLCEN 46
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH