Conserved domains on [gi|22094987|ref|NP_065812|]
regulatory-associated protein of mTOR isoform 1 [Homo sapiens]

Protein Classification

raptor family protein (domain architecture ID 13861801)

raptor (regulatory-associated protein of mTOR) family protein, similar to Homo sapiens regulatory-associated protein of mTOR, which functions as a scaffold for recruiting mTORC1 substrates

List of domain hits

Name Accession Description Interval E-value
Raptor_N pfam14538
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor ...
55-206 2.97e-96

Raptor N-terminal CASPase-like domain; This domain is found at the N-terminus of the Raptor protein. It has been found to have a CASPase-like structure. It conserves the characteristic Cys/His dyad of the caspases, suggesting it may have peptidase activity.


Pssm-ID: 464202  Cd Length: 152  Bit Score: 304.58  E-value: 2.97e-96
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987     55 MKTVSVALVLCLNVGVDPPDVVKTTPCARLECWIDPLSMGPQKALETIGANLQKQYENWQPRARYKQSLDPTVDEVKKLC 134
Cdd:pfam14538    1 LKTVSVALVLCLNIGVDPPDVVKTKPCARLECWIDPSSMSPQKALEEIGKNLQDQYESWQPRARYKQSLDPSVEDVKKLC 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 22094987    135 TSLRRNAKEERVLFHYNGHGVPRPTVNGEVWVFNKNYTQYIPLSIYDLQTWMGSPSIFVYDCSNAGLIVKSF 206
Cdd:pfam14538   81 SKLRRNAKDERVLFHYNGHGVPRPTSNGEIWVFNKDYTQYIPLSIYDLFSWLGSPSIFIFDCSNAGNLLNAF 152
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1027-1322 1.57e-23

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 102.41  E-value: 1.57e-23
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1027 SVVKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRVWknfaDLE 1103
Cdd:cd00200   13 TCVAFSPDGKLLATGSGDGTIKvWDLETGELLRTLkgHTGPVRDVAASA-------DGTYLASGSSDKTIRLW----DLE 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1104 KNpEMVTAWQGLSDMLpttrgagMVVDWEQETGLLMSSGDVRIVRIWDTDREMKVQDIpTGADSCVTSLSCDSHRSLIVA 1183
Cdd:cd00200   82 TG-ECVRTLTGHTSYV-------SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL-RGHTDWVNSVAFSPDGTFVAS 152
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1184 GLGDGSIRVYDrrMALSECrVMTYREHTAWVvkASLQKRPDG-HIVSVSVNGDVRIFDPRMPESVNVLQI-VKGLTALDI 1261
Cdd:cd00200  153 SSQDGTIKLWD--LRTGKC-VATLTGHTGEV--NSVAFSPDGeKLLSSSSDGTIKLWDLSTGKCLGTLRGhENGVNSVAF 227
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 22094987 1262 HPQADLIACGSVNqftaiynssgeliNNIKYYDGFMGQRV-------GAISCLAFHPHWPHLAVGSND 1322
Cdd:cd00200  228 SPDGYLLASGSED-------------GTIRVWDLRTGECVqtlsghtNSVTSLAWSPDGKRLASGSAD 282
HEAT COG1413
HEAT repeat [General function prediction only];
563-670 6.70e-08

HEAT repeat [General function prediction only];


Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 52.71  E-value: 6.70e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987  563 LEQLNDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 642
Cdd:COG1413   53 LEALKDPDPEVRAAAAEALGRIGD-----------PEAVPALIAALKDEDPEVRRAAAEALG--------RLGDPAAVPA 113
                         90       100
                 ....*....|....*....|....*...
gi 22094987  643 nvammLAQLVSDGSPMVRKELVVALSHL 670
Cdd:COG1413  114 -----LLEALKDPDWEVRRAAARALGRL 136
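The hit summaries above each pair a name and accession with an "Interval E-value" line. As a minimal sketch (the `DomainHit` type and `parse_hit` helper are hypothetical names introduced here, not part of any NCBI tool), the three hits can be parsed, ordered along the protein, and given a covered length as follows:

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the domain hit list (hypothetical container for illustration)."""
    name: str
    accession: str
    start: int   # first query residue covered by the hit (1-based)
    end: int     # last query residue covered by the hit (inclusive)
    evalue: float

    @property
    def length(self) -> int:
        # Both interval ends are inclusive, hence the +1.
        return self.end - self.start + 1


def parse_hit(header: str, interval_line: str) -> DomainHit:
    """Parse e.g. ('Raptor_N pfam14538', '55-206 2.97e-96') into a DomainHit."""
    # The accession is the last whitespace-separated token of the header;
    # the name may itself contain spaces ("WD40 super family").
    name, accession = header.rsplit(" ", 1)
    interval, evalue = interval_line.split()
    start, end = (int(x) for x in interval.split("-"))
    return DomainHit(name, accession, start, end, float(evalue))


# Header and interval lines copied from the hit list above.
hits = [
    parse_hit("Raptor_N pfam14538", "55-206 2.97e-96"),
    parse_hit("WD40 super family cl29593", "1027-1322 1.57e-23"),
    parse_hit("HEAT COG1413", "563-670 6.70e-08"),
]

# Order the hits from N-terminus to C-terminus of the query protein.
for hit in sorted(hits, key=lambda h: h.start):
    print(f"{hit.start:>5}-{hit.end:<5} {hit.length:>4} aa  {hit.name} ({hit.accession})  E={hit.evalue:.2e}")
```

Sorting by start position rather than by E-value gives the order of the domains along the sequence: Raptor_N, then HEAT, then WD40.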
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1027-1322 1.57e-23

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 102.41  E-value: 1.57e-23
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1027 SVVKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRVWknfaDLE 1103
Cdd:cd00200   13 TCVAFSPDGKLLATGSGDGTIKvWDLETGELLRTLkgHTGPVRDVAASA-------DGTYLASGSSDKTIRLW----DLE 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1104 KNpEMVTAWQGLSDMLpttrgagMVVDWEQETGLLMSSGDVRIVRIWDTDREMKVQDIpTGADSCVTSLSCDSHRSLIVA 1183
Cdd:cd00200   82 TG-ECVRTLTGHTSYV-------SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL-RGHTDWVNSVAFSPDGTFVAS 152
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1184 GLGDGSIRVYDrrMALSECrVMTYREHTAWVvkASLQKRPDG-HIVSVSVNGDVRIFDPRMPESVNVLQI-VKGLTALDI 1261
Cdd:cd00200  153 SSQDGTIKLWD--LRTGKC-VATLTGHTGEV--NSVAFSPDGeKLLSSSSDGTIKLWDLSTGKCLGTLRGhENGVNSVAF 227
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 22094987 1262 HPQADLIACGSVNqftaiynssgeliNNIKYYDGFMGQRV-------GAISCLAFHPHWPHLAVGSND 1322
Cdd:cd00200  228 SPDGYLLASGSED-------------GTIRVWDLRTGECVqtlsghtNSVTSLAWSPDGKRLASGSAD 282
WD40 COG2319
WD40 repeat [General function prediction only];
1038-1331 6.04e-18

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 87.66  E-value: 6.04e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1038 IAVADKD-SICFWDWEKGEKLDYF--HNGnprytRVTAMEYL-NGQdcsLLLTATDDGAIRVWknfaDLEKNPEMVTawq 1113
Cdd:COG2319  135 LASGSADgTVRLWDLATGKLLRTLtgHSG-----AVTSVAFSpDGK---LLASGSDDGTVRLW----DLATGKLLRT--- 199
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1114 glsdmLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDREmKVQDIPTGADSCVTSLSCDSHRSLIVAGLGDGSIRVY 1193
Cdd:COG2319  200 -----LTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLW 273
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1194 DRRmalSECRVMTYREHTAWVVKASLqkRPDG-HIVSVSVNGDVRIFDPRMPESVNVLQI-VKGLTALDIHPQADLIACG 1271
Cdd:COG2319  274 DLA---TGELLRTLTGHSGGVNSVAF--SPDGkLLASGSDDGTVRLWDLATGKLLRTLTGhTGAVRSVAFSPDGKTLASG 348
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 22094987 1272 SVNQFTAIYN-SSGELINNIKyydgfmgQRVGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1331
Cdd:COG2319  349 SDDGTVRLWDlATGELLRTLT-------GHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
HEAT_2 pfam13646
HEAT repeats; This family includes multiple HEAT repeats.
566-668 5.16e-05

HEAT repeats; This family includes multiple HEAT repeats.


Pssm-ID: 433376 [Multi-domain]  Cd Length: 88  Bit Score: 43.10  E-value: 5.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987    566 LNDPHPLLRQWVAICLGRIwqNFDSARwcgvrdsahEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDHnva 645
Cdd:pfam13646    9 LRDPDPEVRAAAIRALGRI--GDPEAV---------PALLELLKDEDPAVRRAAAEALG--------KIGDPEALPA--- 66
                           90       100
                   ....*....|....*....|...
gi 22094987    646 mMLAQLVSDGSPMVRKELVVALS 668
Cdd:pfam13646   67 -LLELLRDDDDDVVRAAAAEALA 88
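Each alignment block pairs a query row (`gi ...`) with a domain-model row (`Cdd:...`). The sketch below estimates percent identity over the aligned columns of one such pair. It assumes that lowercase query residues and dashes mark unaligned positions, which is how these rows appear to be laid out; the helper names `row_sequence` and `percent_identity` are hypothetical and not part of any NCBI tool.

```python
def row_sequence(line: str) -> str:
    """Return the residue string of one alignment row.

    Rows look like '<label> <start> <residues> <end>', so the residues
    are the second-to-last whitespace-separated field.
    """
    return line.split()[-2]


def percent_identity(query: str, subject: str) -> float:
    """Percent identity over columns where both rows hold an aligned residue.

    Assumption: uppercase letters are aligned residues; lowercase letters
    and '-' mark insertions or gaps and are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    aligned = matches = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            matches += q == s
    if aligned == 0:
        raise ValueError("no aligned columns")
    return 100.0 * matches / aligned


# Last alignment block of the HEAT_2 (pfam13646) hit, copied from above.
query_row = "gi 22094987    646 MMLAQLVSDGSPMVRKELVVALS 668"
model_row = "Cdd:pfam13646   67 -LLELLRDDDDDVVRAAAAEALA 88"

identity = percent_identity(row_sequence(query_row), row_sequence(model_row))
print(f"identity over aligned columns: {identity:.1f}%")
```

A single block covers only part of a hit; to score a whole hit, the residue strings of all its blocks would be concatenated per row before calling `percent_identity`.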
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1069-1334 8.20e-20

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 91.24  E-value: 8.20e-20
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1069 RVTAMEYLNGQDcsLLLTATDDGAIRVWknfaDLEKNpEMVTAWQGLSdmlpttrGAGMVVDWEQETGLLMSSGDVRIVR 1148
Cdd:cd00200   11 GVTCVAFSPDGK--LLATGSGDGTIKVW----DLETG-ELLRTLKGHT-------GPVRDVAASADGTYLASGSSDKTIR 76
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1149 IWDTDREMKVQDIpTGADSCVTSLSCDSHRSLIVAGLGDGSIRVYDrrMALSECrVMTYREHTAWVVkaSLQKRPDGHIV 1228
Cdd:cd00200   77 LWDLETGECVRTL-TGHTSYVSSVAFSPDGRILSSSSRDKTIKVWD--VETGKC-LTTLRGHTDWVN--SVAFSPDGTFV 150
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1229 -SVSVNGDVRIFDPRMPESVNVLQI-VKGLTALDIHPQADLIACGSVNQFTAIYN-SSGELINNIKYYDGFmgqrvgaIS 1305
Cdd:cd00200  151 aSSSQDGTIKLWDLRTGKCVATLTGhTGEVNSVAFSPDGEKLLSSSSDGTIKLWDlSTGKCLGTLRGHENG-------VN 223
                        250       260
                 ....*....|....*....|....*....
gi 22094987 1306 CLAFHPHWPHLAVGSNDYYISVYSVEKRV 1334
Cdd:cd00200  224 SVAFSPDGYLLASGSEDGTIRVWDLRTGE 252
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1123-1330 6.23e-19

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 88.93  E-value: 6.23e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1123 RGAGMVVDWEQETGLLMSSGDVRIVRIWDTDReMKVQDIPTGADSCVTSLSCDSHRSLIVAGLGDGSIRVYDRRmalSEC 1202
Cdd:cd00200    9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLET-GELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLE---TGE 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1203 RVMTYREHTAWVvkASLQKRPDGHIVSVS-VNGDVRIFDPRMPESVNVLQ-IVKGLTALDIHPQADLIACGSVNQFTAIY 1280
Cdd:cd00200   85 CVRTLTGHTSYV--SSVAFSPDGRILSSSsRDKTIKVWDVETGKCLTTLRgHTDWVNSVAFSPDGTFVASSSQDGTIKLW 162
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|.
gi 22094987 1281 N-SSGELINNIKYYDGFmgqrvgaISCLAFHPHWPHLAVGSNDYYISVYSV 1330
Cdd:cd00200  163 DlRTGKCVATLTGHTGE-------VNSVAFSPDGEKLLSSSSDGTIKLWDL 206
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1029-1240 6.32e-15

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 76.99  E-value: 6.32e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1029 VKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNprytrVTAMEYLngQDCSLLLTATDDGAIRVWknfaDLEKN 1105
Cdd:cd00200   57 VAASADGTYLASGSSDKTIRlWDLETGECVRTLtgHTSY-----VSSVAFS--PDGRILSSSSRDKTIKVW----DVETG 125
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1106 pEMVTAWQGLSDmlpttrgAGMVVDWEQETGLLMSSGDVRIVRIWDT--------------------------------- 1152
Cdd:cd00200  126 -KCLTTLRGHTD-------WVNSVAFSPDGTFVASSSQDGTIKLWDLrtgkcvatltghtgevnsvafspdgekllssss 197
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1153 DREMKVQDIPTGADSC--------VTSLSCDSHRSLIVAGLGDGSIRVYDRRMAlsECrVMTYREHTAWVVKASLQkrPD 1224
Cdd:cd00200  198 DGTIKLWDLSTGKCLGtlrghengVNSVAFSPDGYLLASGSEDGTIRVWDLRTG--EC-VQTLSGHTNSVTSLAWS--PD 272
                        250
                 ....*....|....*..
gi 22094987 1225 GH-IVSVSVNGDVRIFD 1240
Cdd:cd00200  273 GKrLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
1110-1331 1.79e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 67.63  E-value: 1.79e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1110 TAWQGLSDMLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDREmKVQDIPTGADSCVTSLSCDSHRSLIVAGLGDGS 1189
Cdd:COG2319   23 AALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAG-ALLATLLGHTAAVLSVAFSPDGRLLASASADGT 101
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1190 IRVYDrrmALSECRVMTYREHTAWVVKASLqkRPDGH-IVSVSVNGDVRIFDPRMPESVNVLQIVKG-LTALDIHPQADL 1267
Cdd:COG2319  102 VRLWD---LATGLLLRTLTGHTGAVRSVAF--SPDGKtLASGSADGTVRLWDLATGKLLRTLTGHSGaVTSVAFSPDGKL 176
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 22094987 1268 IACGSVNQFTAIYN-SSGELINNIKYYDgfmgqrvGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1331
Cdd:COG2319  177 LASGSDDGTVRLWDlATGKLLRTLTGHT-------GAVRSVAFSPDGKLLASGSADGTVRLWDLA 234
WDR74 cd22857
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ...
1038-1333 2.32e-08

WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double ring, AAA-ATPase, that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness, by inducing nuclear beta-catenin accumulation and driving downstream Wnt-responsive genes expression. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain, and is required for nucleolar localization.


Pssm-ID: 439303 [Multi-domain]  Cd Length: 325  Bit Score: 57.62  E-value: 2.32e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1038 IAVADKD-SICFWDWEKGEKLDYFHNGNPRYT-----RVTAMEYLNGQdcslLLTATDDGAIRVWKNFADLEKNPEmVTA 1111
Cdd:cd22857   47 LAVARKNgTVEVLDPENGDLLASFSDSEPATKlseedHFVGLHLFSGT----LLTCTSKGSLRSTKLPDDSTASSS-PTA 121
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1112 WQGLSDMLPTTRGagmvvdwEQETGLLMSSGDVRIVRIWDTdrEMKVQDI---------------PTgadsCVTS---LS 1173
Cdd:cd22857  122 WVCLGGNLLCMRV-------DPNENYFAFGGKEVELNVWDL--EEKPGKIwraknvpndslglrvPV----WVTDltfLS 188
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1174 CDSHRSlIVAGLGDGSIRVYD----RRmalsecRVM--TYREHTAWVVKASlqkrPDGHIVSVSVN-GDVRIFDPRMpes 1246
Cdd:cd22857  189 KDDHRK-IVTGTGYHQVRLYDtraqRR------PVVsvDFGETPIKAVAED----PDGHTVYVGDTsGDLASIDLRT--- 254
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987 1247 vnvlqivkgltaldihpqadliacgsvnqftaiynssGELINNikyYDGFMGqrvGAISCLAFHPHWPHLAVGSNDYYIS 1326
Cdd:cd22857  255 -------------------------------------GKLLGC---FKGKCG---GSIRSIARHPELPLIASCGLDRYLR 291

                 ....*..
gi 22094987 1327 VYSVEKR 1333
Cdd:cd22857  292 IWDTETR 298
HEAT COG1413
HEAT repeat [General function prediction only];
563-670 1.28e-06
Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 49.24  E-value: 1.28e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987  563 LEQLNDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 642
Cdd:COG1413   22 IAALADEDPDVRAAAARALGRLGD-----------PRAVPALLEALKDPDPEVRAAAAEALG--------RIGDPEAVPA 82
                         90       100
                 ....*....|....*....|....*...
gi 22094987  643 nvammLAQLVSDGSPMVRKELVVALSHL 670
Cdd:COG1413   83 -----LIAALKDEDPEVRRAAAEALGRL 105
HEAT COG1413
HEAT repeat [General function prediction only];
559-624 4.30e-05
Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 44.62  E-value: 4.30e-05
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 22094987  559 IAICLEQLNDPHPLLRQWVAICLGRIWqnfdsarwcgvRDSAHEKLYSLLSDPIPEVRCAAVFALG 624
Cdd:COG1413   80 VPALIAALKDEDPEVRRAAAEALGRLG-----------DPAAVPALLEALKDPDWEVRRAAARALG 134
HEAT_2 pfam13646
HEAT repeats; This family includes multiple HEAT repeats.
566-668 5.16e-05
Pssm-ID: 433376 [Multi-domain]  Cd Length: 88  Bit Score: 43.10  E-value: 5.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987    566 LNDPHPLLRQWVAICLGRIwqNFDSARwcgvrdsahEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDHnva 645
Cdd:pfam13646    9 LRDPDPEVRAAAIRALGRI--GDPEAV---------PALLELLKDEDPAVRRAAAEALG--------KIGDPEALPA--- 66
                           90       100
                   ....*....|....*....|...
gi 22094987    646 mMLAQLVSDGSPMVRKELVVALS 668
Cdd:pfam13646   67 -LLELLRDDDDDVVRAAAAEALA 88
COG5096 COG5096
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular transport];
472-676 2.07e-04
Pssm-ID: 227427 [Multi-domain]  Cd Length: 757  Bit Score: 45.87  E-value: 2.07e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987  472 IFPYVLKLLQSSARELRPLLVFIW---AK------ILAVDSscqadLVKDNGHKyflsvlaDPYMpaehRTMTAFILAVI 542
Cdd:COG5096   56 LFPDVIKNVATRDVELKRLLYLYLeryAKlkpelaLLAVNT-----IQKDLQDP-------NEEI----RGFALRTLSLL 119
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 22094987  543 vnsyhtgQEACLQGNLIAICLEQLNDPHPLLRQWVAICLGRIWQnFDSARWCGVRDSAHEKLysLLSDPIPEVRCAAVFA 622
Cdd:COG5096  120 -------RVKELLGNIIDPIKKLLTDPHAYVRKTAALAVAKLYR-LDKDLYHELGLIDILKE--LVADSDPIVIANALAS 189
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....
gi 22094987  623 LGTFVGNSAErtDHSTTIDHNVAMMLAQLVSDGSPMVRKELVVALSHLVVQYES 676
Cdd:COG5096  190 LAEIDPELAH--GYSLEVILRIPQLDLLSLSVSTEWLLLIILEVLTERVPTTPD 241
HEAT_2 pfam13646
HEAT repeats; This family includes multiple HEAT repeats.
607-668 1.19e-03
Pssm-ID: 433376 [Multi-domain]  Cd Length: 88  Bit Score: 39.24  E-value: 1.19e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 22094987    607 LLSDPIPEVRCAAVFALGTFvgnsaertDHSTTIDhnvamMLAQLVSDGSPMVRKELVVALS 668
Cdd:pfam13646    8 LLRDPDPEVRAAAIRALGRI--------GDPEAVP-----ALLELLKDEDPAVRRAAAEALG 56
HEAT COG1413
HEAT repeat [General function prediction only];
598-670 3.17e-03
Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 39.23  E-value: 3.17e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 22094987  598 DSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDHnvammLAQLVSDGSPMVRKELVVALSHL 670
Cdd:COG1413   15 PAAVPALIAALADEDPDVRAAAARALG--------RLGDPRAVPA-----LLEALKDPDPEVRAAAAEALGRI 74
HEAT pfam02985
HEAT repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514).
604-627 7.11e-03
Pssm-ID: 460773  Cd Length: 31  Bit Score: 35.20  E-value: 7.11e-03
                           10        20
                   ....*....|....*....|....
gi 22094987    604 LYSLLSDPIPEVRCAAVFALGTFV 627
Cdd:pfam02985    5 LLKLLNDPSPEVREAAAEALGELA 28
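The pairwise rows above can be compared column by column. The following is a minimal sketch, not part of the CDD output, that counts identical residues between the query row (gi 22094987, residues 604-627) and the pfam02985 consensus row. The function name `percent_identity` is hypothetical, and treating every aligned column (including gap columns) as part of the denominator is an assumption; other identity definitions exist.

```python
def percent_identity(query: str, subject: str) -> tuple[int, int, float]:
    """Return (identical columns, total columns, percent identity).

    Assumption: both rows come from the same alignment block and therefore
    have equal length; columns containing a gap ('-') never count as identical.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = sum(
        1
        for q, s in zip(query, subject)
        if q != "-" and s != "-" and q.upper() == s.upper()
    )
    return identical, len(query), 100.0 * identical / len(query)


# Rows copied from the pfam02985 (HEAT) alignment block above.
query = "LYSLLSDPIPEVRCAAVFALGTFV"    # gi 22094987, residues 604-627
subject = "LLKLLNDPSPEVREAAAEALGELA"  # Cdd:pfam02985, positions 5-28

identical, columns, pct = percent_identity(query, subject)
print(f"{identical}/{columns} identical columns ({pct:.1f}%)")
```

The same function can be applied to any other query/Cdd row pair in this report, provided both rows are taken from the same block so that the columns line up.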
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
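As an illustration of how the E-value threshold relates to the hit list, the sketch below parses the "interval E-value" summary lines shown for the HEAT and COG5096 hits in this report and keeps those at or below 0.01. The regular expression, the helper name `parse_hit`, and the choice of an inclusive comparison are assumptions made for this example; they do not describe how CD-Search applies the threshold internally.

```python
import re

# Threshold copied from the search parameters above.
E_VALUE_THRESHOLD = 0.01

# Hypothetical pattern for summary lines such as "563-670 6.70e-08".
HIT_LINE = re.compile(r"^(\d+)-(\d+)\s+(\d+(?:\.\d+)?e[+-]?\d+)$", re.IGNORECASE)

# Interval / E-value lines copied from the domain hits listed in this report.
summary_lines = [
    "563-670 6.70e-08",  # HEAT COG1413
    "563-670 1.28e-06",  # HEAT COG1413
    "559-624 4.30e-05",  # HEAT COG1413
    "566-668 5.16e-05",  # HEAT_2 pfam13646
    "472-676 2.07e-04",  # COG5096
    "607-668 1.19e-03",  # HEAT_2 pfam13646
    "598-670 3.17e-03",  # HEAT COG1413
    "604-627 7.11e-03",  # HEAT pfam02985
]


def parse_hit(line: str):
    """Parse one summary line into (start, end, e_value), or None if it does not match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2)), float(match.group(3))


hits = [hit for hit in map(parse_hit, summary_lines) if hit is not None]

# Assumption: a hit is kept when its E-value is less than or equal to the threshold.
reported = [hit for hit in hits if hit[2] <= E_VALUE_THRESHOLD]

for start, end, e_value in reported:
    print(f"{start}-{end}\t{e_value:.2e}")
```

All eight hits listed here fall below the threshold, which is consistent with their appearing in the report.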
