Conserved domains on [gi|31711792|gb|AAP68252|]

At3g08850 [Arabidopsis thaliana]

Protein Classification

raptor family protein (domain architecture ID 13861801)

raptor (regulatory-associated protein of mTOR) family protein similar to Homo sapiens regulatory-associated protein of mTOR that functions as a scaffold for recruiting mTORC1 substrates

List of domain hits

Name Accession Description Interval E-value
Raptor_N pfam14538
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor ...
104-255 1.19e-90

Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor protein. It has been identified as having a caspase-like structure. It conserves the characteristic Cys/His dyad of the caspases, suggesting it may have peptidase activity.


Pssm-ID: 464202  Cd Length: 152  Bit Score: 289.18  E-value: 1.19e-90
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792    104 MKTGCVALVLCLNITVDPPDVIKISPCARIEAWIDPFSMAPPKALETIGKNLSTQYERWQPRARYKVQLDPTVDEVRKLC 183
Cdd:pfam14538    1 LKTVSVALVLCLNIGVDPPDVVKTKPCARLECWIDPSSMSPQKALEEIGKNLQDQYESWQPRARYKQSLDPSVEDVKKLC 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 31711792    184 LTCRKYAKTERVLFHYNGHGVPKPTANGEIWVFNKSYTQYIPLPISELDSWLKTPSIYVFDCSAARMILNAF 255
Cdd:pfam14538   81 SKLRRNAKDERVLFHYNGHGVPRPTSNGEIWVFNKDYTQYIPLSIYDLFSWLGSPSIFIFDCSNAGNLLNAF 152
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1035-1336 3.62e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 112.81  E-value: 3.62e-27
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1035 LHPFSPIVVAADENERIRVWNYEEATLLNGFDNHDFPDKGISKLClinelDDSLLLVASCDGSVRIWKnyaTKGKQKLVT 1114
Cdd:cd00200   17 FSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA-----DGTYLASGSSDKTIRLWD---LETGECVRT 88
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1115 gfssIQGHKPGARDlnavVDWQQQSGYLYASGETSTVTLWDLEKEQLVRSVPSEsECGVTALSASQvHGGQLAAGFADGS 1194
Cdd:cd00200   89 ----LTGHTSYVSS----VAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGH-TDWVNSVAFSP-DGTFVASSSQDGT 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1195 LRLYDVRSPEPlvCATRP-HQKveRVVGLSFQPglDPAKVVSASQAGDIQFLDLRTTRdTYLTIDAHRGSLTALAVHRHA 1273
Cdd:cd00200  159 IKLWDLRTGKC--VATLTgHTG--EVNSVAFSP--DGEKLLSSSSDGTIKLWDLSTGK-CLGTLRGHENGVNSVAFSPDG 231
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 31711792 1274 PIIASGSAKQLIKVFSLQ-GEQLGIIRyypsfmaQKIGSVSCLTFHPYQVLLAAGAADSFVSIY 1336
Cdd:cd00200  232 YLLASGSEDGTIRVWDLRtGECVQTLS-------GHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
HEAT COG1413
HEAT repeat [General function prediction only];
635-719 1.89e-06

HEAT repeat [General function prediction only];


Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 48.47  E-value: 1.89e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792  635 REANAFEKLAPLLSEPQPEVRAAAVFALGtllDIGFDSNKSVVEDEF-DDDEKIRAE----------DAIIKSLLDVVSD 703
Cdd:COG1413   13 GDPAAVPALIAALADEDPDVRAAAARALG---RLGDPRAVPALLEALkDPDPEVRAAaaealgrigdPEAVPALIAALKD 89
                         90
                 ....*....|....*.
gi 31711792  704 GSPLVRAEVAVALARF 719
Cdd:COG1413   90 EDPEVRRAAAEALGRL 105
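The alignment blocks above pair a query row (`gi 31711792`) with a model row (`Cdd:...`). As a rough illustration of how such a pair can be compared, the following sketch counts identical residues over aligned columns. It assumes that upper-case letters in both rows mark an aligned column, and that lower-case letters and `-` mark unaligned or gapped positions; the function name `aligned_identity` is hypothetical and not part of any NCBI tool.

```python
def aligned_identity(query: str, subject: str) -> float:
    """Fraction of aligned columns in which query and subject residues match.

    Assumption: a column counts as aligned only when both characters are
    upper-case letters; lower-case residues and '-' gaps are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    # Avoid division by zero for rows with no aligned columns.
    return identical / aligned if aligned else 0.0


# Last block of the HEAT (COG1413) alignment, copied from the page:
# query residues 704-719 against model positions 90-105.
query_row = "GSPLVRAEVAVALARF"
subject_row = "EDPEVRRAAAEALGRL"
print(f"identity over aligned columns: {aligned_identity(query_row, subject_row):.2f}")
```

Longer alignments can be handled by concatenating the residue strings of each block before calling the function. The result is only a simple identity measure and is not the bit score or E-value reported on the page.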
 
Name Accession Description Interval E-value
Raptor_N pfam14538
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor ...
104-255 1.19e-90

Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor protein. It has been identified as having a caspase-like structure. It conserves the characteristic Cys/His dyad of the caspases, suggesting it may have peptidase activity.


Pssm-ID: 464202  Cd Length: 152  Bit Score: 289.18  E-value: 1.19e-90
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792    104 MKTGCVALVLCLNITVDPPDVIKISPCARIEAWIDPFSMAPPKALETIGKNLSTQYERWQPRARYKVQLDPTVDEVRKLC 183
Cdd:pfam14538    1 LKTVSVALVLCLNIGVDPPDVVKTKPCARLECWIDPSSMSPQKALEEIGKNLQDQYESWQPRARYKQSLDPSVEDVKKLC 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 31711792    184 LTCRKYAKTERVLFHYNGHGVPKPTANGEIWVFNKSYTQYIPLPISELDSWLKTPSIYVFDCSAARMILNAF 255
Cdd:pfam14538   81 SKLRRNAKDERVLFHYNGHGVPRPTSNGEIWVFNKDYTQYIPLSIYDLFSWLGSPSIFIFDCSNAGNLLNAF 152
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1035-1336 3.62e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 112.81  E-value: 3.62e-27
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1035 LHPFSPIVVAADENERIRVWNYEEATLLNGFDNHDFPDKGISKLClinelDDSLLLVASCDGSVRIWKnyaTKGKQKLVT 1114
Cdd:cd00200   17 FSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA-----DGTYLASGSSDKTIRLWD---LETGECVRT 88
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1115 gfssIQGHKPGARDlnavVDWQQQSGYLYASGETSTVTLWDLEKEQLVRSVPSEsECGVTALSASQvHGGQLAAGFADGS 1194
Cdd:cd00200   89 ----LTGHTSYVSS----VAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGH-TDWVNSVAFSP-DGTFVASSSQDGT 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1195 LRLYDVRSPEPlvCATRP-HQKveRVVGLSFQPglDPAKVVSASQAGDIQFLDLRTTRdTYLTIDAHRGSLTALAVHRHA 1273
Cdd:cd00200  159 IKLWDLRTGKC--VATLTgHTG--EVNSVAFSP--DGEKLLSSSSDGTIKLWDLSTGK-CLGTLRGHENGVNSVAFSPDG 231
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 31711792 1274 PIIASGSAKQLIKVFSLQ-GEQLGIIRyypsfmaQKIGSVSCLTFHPYQVLLAAGAADSFVSIY 1336
Cdd:cd00200  232 YLLASGSEDGTIRVWDLRtGECVQTLS-------GHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
WD40 COG2319
WD40 repeat [General function prediction only];
1038-1336 1.25e-24

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 108.07  E-value: 1.25e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1038 FSP---IVVAADENERIRVWNYEEATLLNGFDNHDFPdkgisklclINEL----DDSLLLVASCDGSVRIWKnyATKGKQ 1110
Cdd:COG2319  128 FSPdgkTLASGSADGTVRLWDLATGKLLRTLTGHSGA---------VTSVafspDGKLLASGSDDGTVRLWD--LATGKL 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1111 klvtgFSSIQGHKPGARDlnavVDWQQQSGYLYASGETSTVTLWDLEKEQLVRSVPSESEcGVTALSASQvHGGQLAAGF 1190
Cdd:COG2319  197 -----LRTLTGHTGAVRS----VAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSG-SVRSVAFSP-DGRLLASGS 265
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1191 ADGSLRLYDVRSPEPLvcATRPHQKvERVVGLSFQPglDPAKVVSASQAGDIQFLDLRTTRDTYlTIDAHRGSLTALAVH 1270
Cdd:COG2319  266 ADGTVRLWDLATGELL--RTLTGHS-GGVNSVAFSP--DGKLLASGSDDGTVRLWDLATGKLLR-TLTGHTGAVRSVAFS 339
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 31711792 1271 RHAPIIASGSAKQLIKVFSLQ-GEQLGIIRYYPsfmaqkiGSVSCLTFHPYQVLLAAGAADSFVSIY 1336
Cdd:COG2319  340 PDGKTLASGSDDGTVRLWDLAtGELLRTLTGHT-------GAVTSVAFSPDGRTLASGSADGTVRLW 399
WD40 COG2319
WD40 repeat [General function prediction only];
1038-1292 5.20e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 103.07  E-value: 5.20e-23
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1038 FSP---IVVAADENERIRVWNYEEATLLNGFDNHDFPdkgisklclINEL----DDSLLLVASCDGSVRIWkNYATkgkQ 1110
Cdd:COG2319  170 FSPdgkLLASGSDDGTVRLWDLATGKLLRTLTGHTGA---------VRSVafspDGKLLASGSADGTVRLW-DLAT---G 236
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1111 KLVTgfsSIQGHKPGARDlnavVDWQQQSGYLYASGETSTVTLWDLEKEQLVRSVPSESEcGVTALSASQvHGGQLAAGF 1190
Cdd:COG2319  237 KLLR---TLTGHSGSVRS----VAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSG-GVNSVAFSP-DGKLLASGS 307
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1191 ADGSLRLYDVRSPEPLvcatRPHQ-KVERVVGLSFQPglDPAKVVSASQAGDIQFLDLRTTRDTYlTIDAHRGSLTALAV 1269
Cdd:COG2319  308 DDGTVRLWDLATGKLL----RTLTgHTGAVRSVAFSP--DGKTLASGSDDGTVRLWDLATGELLR-TLTGHTGAVTSVAF 380
                        250       260
                 ....*....|....*....|...
gi 31711792 1270 HRHAPIIASGSAKQLIKVFSLQG 1292
Cdd:COG2319  381 SPDGRTLASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
1033-1336 2.49e-18

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 89.20  E-value: 2.49e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1033 ALLHPFSPIVVAADENERIRVWNYEEATLLNGFDNHDFPDKGISKLclineLDDSLLLVASCDGSVRIWKnyaTKGKQKL 1112
Cdd:COG2319   42 LAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS-----PDGRLLASASADGTVRLWD---LATGLLL 113
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1113 VTgfssIQGHKPGARDlnavVDWQQQSGYLYASGETSTVTLWDLEKEQLVRSVPSESEcGVTALSASQvHGGQLAAGFAD 1192
Cdd:COG2319  114 RT----LTGHTGAVRS----VAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSG-AVTSVAFSP-DGKLLASGSDD 183
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1193 GSLRLYDVRSPEPLVcATRPHQkvERVVGLSFQPglDPAKVVSASQAGDIQFLDLRTTRDTYlTIDAHRGSLTALAVHRH 1272
Cdd:COG2319  184 GTVRLWDLATGKLLR-TLTGHT--GAVRSVAFSP--DGKLLASGSADGTVRLWDLATGKLLR-TLTGHSGSVRSVAFSPD 257
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 31711792 1273 APIIASGSAKQLIKVFSLQGEQLgiiryyPSFMAQKIGSVSCLTFHPYQVLLAAGAADSFVSIY 1336
Cdd:COG2319  258 GRLLASGSADGTVRLWDLATGEL------LRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLW 315
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1119-1342 4.20e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 83.54  E-value: 4.20e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1119 IQGHKPGARdlnaVVDWQQQSGYLYASGETSTVTLWDLEKEQLVRSVPSESEcGVTALSASqVHGGQLAAGFADGSLRLY 1198
Cdd:cd00200    5 LKGHTGGVT----CVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTG-PVRDVAAS-ADGTYLASGSSDKTIRLW 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1199 DVRSPEPLVCATrPHQKveRVVGLSFQPglDPAKVVSASQAGDIQFLDLRTTRDTYlTIDAHRGSLTALAVHRHAPIIAS 1278
Cdd:cd00200   79 DLETGECVRTLT-GHTS--YVSSVAFSP--DGRILSSSSRDKTIKVWDVETGKCLT-TLRGHTDWVNSVAFSPDGTFVAS 152
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 31711792 1279 GSAKQLIKVFSLQGEQLgiIRYYPSFmaqkIGSVSCLTFHPYQVLLAAGAADSFVSIYTHDNSQ 1342
Cdd:cd00200  153 SSQDGTIKLWDLRTGKC--VATLTGH----TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK 210
WD40 COG2319
WD40 repeat [General function prediction only];
1125-1336 2.25e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 67.63  E-value: 2.25e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1125 GARDLNAVVDWQQQSGYLYASGETSTVTLWDLEKEQLVRSVPSESEcGVTALSASQvHGGQLAAGFADGSLRLYDVRSPE 1204
Cdd:COG2319   34 GLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA-AVLSVAFSP-DGRLLASASADGTVRLWDLATGL 111
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1205 PLVcATRPHQkvERVVGLSFQPglDPAKVVSASQAGDIQFLDLRTTRDTYlTIDAHRGSLTALAVHRHAPIIASGSAKQL 1284
Cdd:COG2319  112 LLR-TLTGHT--GAVRSVAFSP--DGKTLASGSADGTVRLWDLATGKLLR-TLTGHSGAVTSVAFSPDGKLLASGSDDGT 185
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|...
gi 31711792 1285 IKVFSLQ-GEQLGIIRYYPsfmaqkiGSVSCLTFHPYQVLLAAGAADSFVSIY 1336
Cdd:COG2319  186 VRLWDLAtGKLLRTLTGHT-------GAVRSVAFSPDGKLLASGSADGTVRLW 231
HEAT COG1413
HEAT repeat [General function prediction only];
635-719 1.89e-06

HEAT repeat [General function prediction only];


Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 48.47  E-value: 1.89e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792  635 REANAFEKLAPLLSEPQPEVRAAAVFALGtllDIGFDSNKSVVEDEF-DDDEKIRAE----------DAIIKSLLDVVSD 703
Cdd:COG1413   13 GDPAAVPALIAALADEDPDVRAAAARALG---RLGDPRAVPALLEALkDPDPEVRAAaaealgrigdPEAVPALIAALKD 89
                         90
                 ....*....|....*.
gi 31711792  704 GSPLVRAEVAVALARF 719
Cdd:COG1413   90 EDPEVRRAAAEALGRL 105
WDR74 cd22857
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ...
1072-1336 3.70e-05

WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double-ring AAA-ATPase that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness by inducing nuclear beta-catenin accumulation and driving expression of downstream Wnt-responsive genes. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain and is required for nucleolar localization.


Pssm-ID: 439303 [Multi-domain]  Cd Length: 325  Bit Score: 47.61  E-value: 3.70e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1072 DKGISKLCLINELDDSLLLVASCDGSVRIWknyatkgkqKLVTG--FSSIQGHKPGARDLN--AVVDWQQQSGYLYASGE 1147
Cdd:cd22857   30 SKAVQALSIADRESEPLLAVARKNGTVEVL---------DPENGdlLASFSDSEPATKLSEedHFVGLHLFSGTLLTCTS 100
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1148 TSTVTLWDLEKEQLVRSVPSESECGVTALSASQVHG-GQLAAGFADG-SLRLYDVR-SPEPLVCATRP-------HQKVe 1217
Cdd:cd22857  101 KGSLRSTKLPDDSTASSSPTAWVCLGGNLLCMRVDPnENYFAFGGKEvELNVWDLEeKPGKIWRAKNVpndslglRVPV- 179
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1218 RVVGLSFQPGLDPAKVVSASQAGDIQFLDLRTTRDTYLTIDAHRGSLTALAV--HRHAPIIASGSAkQLIKvFSL-QGEQ 1294
Cdd:cd22857  180 WVTDLTFLSKDDHRKIVTGTGYHQVRLYDTRAQRRPVVSVDFGETPIKAVAEdpDGHTVYVGDTSG-DLAS-IDLrTGKL 257
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|..
gi 31711792 1295 LGIiryYPSFMAQKIGSVSCltfHPYQVLLAAGAADSFVSIY 1336
Cdd:cd22857  258 LGC---FKGKCGGSIRSIAR---HPELPLIASCGLDRYLRIW 293
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1256-1336 8.65e-05

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 46.17  E-value: 8.65e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792 1256 TIDAHRGSLTALAVHRHAPIIASGSAKQLIKVFSLQGEQLgiiryyPSFMAQKIGSVSCLTFHPYQVLLAAGAADSFVSI 1335
Cdd:cd00200    4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGEL------LRTLKGHTGPVRDVAASADGTYLASGSSDKTIRL 77

                 .
gi 31711792 1336 Y 1336
Cdd:cd00200   78 W 78
HEAT COG1413
HEAT repeat [General function prediction only];
639-720 1.10e-04

HEAT repeat [General function prediction only];


Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 43.46  E-value: 1.10e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 31711792  639 AFEKLAPLLSEPQPEVRAAAVFALGTLldigfdsnksvvedefdddekirAEDAIIKSLLDVVSDGSPLVRAEVAVALAR 718
Cdd:COG1413   79 AVPALIAALKDEDPEVRRAAAEALGRL-----------------------GDPAAVPALLEALKDPDWEVRRAAARALGR 135

                 ..
gi 31711792  719 FA 720
Cdd:COG1413  136 LG 137
HEAT_2 pfam13646
HEAT repeats; This family includes multiple HEAT repeats.
646-719 6.73e-04

HEAT repeats; This family includes multiple HEAT repeats.


Pssm-ID: 433376 [Multi-domain]  Cd Length: 88  Bit Score: 40.01  E-value: 6.73e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 31711792    646 LLSEPQPEVRAAAVFALGtlldigfdsnksvvedEFDDDEkiraedaIIKSLLDVVSDGSPLVRAEVAVALARF 719
Cdd:pfam13646    8 LLRDPDPEVRAAAIRALG----------------RIGDPE-------AVPALLELLKDEDPAVRRAAAEALGKI 58
HEAT_EZ pfam13513
HEAT-like repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats ...
652-719 1.85e-03

HEAT-like repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). These EZ repeats are found in subunits of cyanobacterial phycocyanin lyase and other proteins and probably carry out a scaffolding role.


Pssm-ID: 463906 [Multi-domain]  Cd Length: 55  Bit Score: 37.73  E-value: 1.85e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 31711792    652 PEVRAAAVFALGTLLDIGFDSNKSVVEDefdddekiraedaIIKSLLDVVSDGSPLVRAEVAVALARF 719
Cdd:pfam13513    1 WRVREAAALALGSLAEGGPDLLAPAVPE-------------LLPALLPLLNDDSDLVREAAAWALGRL 55
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
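
The E-value threshold listed in the search parameters determines which hits are reported. The sketch below shows how such a cutoff could be applied to a hit list. It uses one representative hit per model, with intervals and E-values copied from the page. The `DomainHit` type and `filter_hits` function are hypothetical names introduced for illustration, and the inclusive comparison (`<=`) is an assumption about how the cutoff is applied.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    name: str
    accession: str
    start: int
    end: int
    evalue: float


# One hit per model, values copied from the domain hit list on this page.
HITS = [
    DomainHit("Raptor_N", "pfam14538", 104, 255, 1.19e-90),
    DomainHit("WD40", "cd00200", 1035, 1336, 3.62e-27),
    DomainHit("HEAT", "COG1413", 635, 719, 1.89e-06),
    DomainHit("HEAT_2", "pfam13646", 646, 719, 6.73e-04),
    DomainHit("HEAT_EZ", "pfam13513", 652, 719, 1.85e-03),
]


def filter_hits(hits: List[DomainHit], threshold: float = 0.01) -> List[DomainHit]:
    """Keep hits whose E-value does not exceed the threshold (assumed inclusive)."""
    return [h for h in hits if h.evalue <= threshold]


# With the page's threshold of 0.01 every listed hit is retained;
# a stricter cutoff drops the weaker HEAT-family matches.
for hit in filter_hits(HITS, threshold=1e-5):
    print(f"{hit.name} {hit.accession} {hit.start}-{hit.end} {hit.evalue:.2e}")
```

A stricter cutoff is a simple way to focus on the most confident assignments, here the Raptor_N and WD40 hits plus the strongest HEAT match.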
