NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1958762266|ref|XP_038961074|]
View 

cleavage stimulation factor subunit 1 isoform X1 [Rattus norvegicus]

Protein Classification

CSTF1_dimer and WD40 domain-containing protein( domain architecture ID 11245140)

CSTF1_dimer and WD40 domain-containing protein

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
100-424 4.77e-60

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


:

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 197.17  E-value: 4.77e-60
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 100 ETCYVTSHKGPCRVATYSRDGQLIATGSADASIKILDTERMLaksampievmmnetaqqnmenhpVIRTLYDHVDEVTCL 179
Cdd:cd00200     1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGE-----------------------LLRTLKGHTGPVRDV 57
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 180 AFHPTEQILASGSRDYTLKLFDYSKPsaKRAFKYIQEAEMLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCNpqd 259
Cdd:cd00200    58 AASADGTYLASGSSDKTIRLWDLETG--ECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLR--- 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 260 QHTDAICSVNYNPSANMYVTGSKDGCIKLWDGVSNRCITTFEkAHDGaEVCSAIFSKNSKYILSSGKDSVAKLWEISTGR 339
Cdd:cd00200   133 GHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLT-GHTG-EVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 340 TLVRYTGaglsgrqvHR---TQAVFNHTEDYILLPDErTISLCCWDSRTAERRNLLSlGHNNIVRCIVHSPTNPGFMTCS 416
Cdd:cd00200   211 CLGTLRG--------HEngvNSVAFSPDGYLLASGSE-DGTIRVWDLRTGECVQTLS-GHTNSVTSLAWSPDGKRLASGS 280

                  ....*...
gi 1958762266 417 DDFRARFW 424
Cdd:cd00200   281 ADGTIRIW 288
CSTF1_dimer pfam16699
Cleavage stimulation factor subunit 1, dimerization domain; This family is the dimerization ...
8-59 3.03e-19

Cleavage stimulation factor subunit 1, dimerization domain; This family is the dimerization domain, at the N-terminal, of a family of cleavage stimulation factor subunit 1 proteins from eukaryotes. This domain allows for homodimerization such that the functional state of CSTF1 is a heterohexamer. The cleavage stimulation factor (CstF) complex is composed of three subunits and is essential for pre-mRNA 3'-end processing. CstF recognizes U and G/U-rich cis-acting RNA sequence elements and helps to stabilize the cleavage and polyadenylation specificity factor (CPSF) at the polyadenylation site as required for productive RNA cleavage.


:

Pssm-ID: 465240  Cd Length: 57  Bit Score: 80.78  E-value: 3.03e-19
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1958762266   8 LKDRQQLYKLIISQLLYDGYISIANGLINEIKPQSVCAPSEQLLHLIKLGME 59
Cdd:pfam16699   3 IKERELLYRLIISQLFYDGHQSIAVQLANLVSADPPCPPSDRLLHLVKLGLQ 54
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
100-424 4.77e-60

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 197.17  E-value: 4.77e-60
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 100 ETCYVTSHKGPCRVATYSRDGQLIATGSADASIKILDTERMLaksampievmmnetaqqnmenhpVIRTLYDHVDEVTCL 179
Cdd:cd00200     1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGE-----------------------LLRTLKGHTGPVRDV 57
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 180 AFHPTEQILASGSRDYTLKLFDYSKPsaKRAFKYIQEAEMLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCNpqd 259
Cdd:cd00200    58 AASADGTYLASGSSDKTIRLWDLETG--ECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLR--- 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 260 QHTDAICSVNYNPSANMYVTGSKDGCIKLWDGVSNRCITTFEkAHDGaEVCSAIFSKNSKYILSSGKDSVAKLWEISTGR 339
Cdd:cd00200   133 GHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLT-GHTG-EVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 340 TLVRYTGaglsgrqvHR---TQAVFNHTEDYILLPDErTISLCCWDSRTAERRNLLSlGHNNIVRCIVHSPTNPGFMTCS 416
Cdd:cd00200   211 CLGTLRG--------HEngvNSVAFSPDGYLLASGSE-DGTIRVWDLRTGECVQTLS-GHTNSVTSLAWSPDGKRLASGS 280

                  ....*...
gi 1958762266 417 DDFRARFW 424
Cdd:cd00200   281 ADGTIRIW 288
WD40 COG2319
WD40 repeat [General function prediction only];
105-424 3.63e-49

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 172.02  E-value: 3.63e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 105 TSHKGPCRVATYSRDGQLIATGSADASIKILDTErmlaksampievmmnetaqqnmeNHPVIRTLYDHVDEVTCLAFHPT 184
Cdd:COG2319   117 TGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA-----------------------TGKLLRTLTGHSGAVTSVAFSPD 173
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 185 EQILASGSRDYTLKLFDYSKPSAKRAFKyiQEAEMLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCNPqdqHTDA 264
Cdd:COG2319   174 GKLLASGSDDGTVRLWDLATGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTG---HSGS 248
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 265 ICSVNYNPSANMYVTGSKDGCIKLWDGVSNRCITTFEkaHDGAEVCSAIFSKNSKYILSSGKDSVAKLWEISTGRTLvry 344
Cdd:COG2319   249 VRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLT--GHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLL--- 323
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 345 tgAGLSGRQVHRTQAVFNHTEDYILLP-DERTISLccWDSRTAERRNLLSlGHNNIVRCIVHSPTNPGFMTCSDDFRARF 423
Cdd:COG2319   324 --RTLTGHTGAVRSVAFSPDGKTLASGsDDGTVRL--WDLATGELLRTLT-GHTGAVTSVAFSPDGRTLASGSADGTVRL 398

                  .
gi 1958762266 424 W 424
Cdd:COG2319   399 W 399
CSTF1_dimer pfam16699
Cleavage stimulation factor subunit 1, dimerization domain; This family is the dimerization ...
8-59 3.03e-19

Cleavage stimulation factor subunit 1, dimerization domain; This family is the dimerization domain, at the N-terminal, of a family of cleavage stimulation factor subunit 1 proteins from eukaryotes. This domain allows for homodimerization such that the functional state of CSTF1 is a heterohexamer. The cleavage stimulation factor (CstF) complex is composed of three subunits and is essential for pre-mRNA 3'-end processing. CstF recognizes U and G/U-rich cis-acting RNA sequence elements and helps to stabilize the cleavage and polyadenylation specificity factor (CPSF) at the polyadenylation site as required for productive RNA cleavage.


Pssm-ID: 465240  Cd Length: 57  Bit Score: 80.78  E-value: 3.03e-19
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1958762266   8 LKDRQQLYKLIISQLLYDGYISIANGLINEIKPQSVCAPSEQLLHLIKLGME 59
Cdd:pfam16699   3 IKERELLYRLIISQLFYDGHQSIAVQLANLVSADPPCPPSDRLLHLVKLGLQ 54
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
165-201 2.79e-08

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 49.62  E-value: 2.79e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1958762266  165 VIRTLYDHVDEVTCLAFHPTEQILASGSRDYTLKLFD 201
Cdd:smart00320   4 LLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
166-201 1.74e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 47.34  E-value: 1.74e-07
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 1958762266 166 IRTLYDHVDEVTCLAFHPTEQILASGSRDYTLKLFD 201
Cdd:pfam00400   4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
100-424 4.77e-60

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 197.17  E-value: 4.77e-60
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 100 ETCYVTSHKGPCRVATYSRDGQLIATGSADASIKILDTERMLaksampievmmnetaqqnmenhpVIRTLYDHVDEVTCL 179
Cdd:cd00200     1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGE-----------------------LLRTLKGHTGPVRDV 57
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 180 AFHPTEQILASGSRDYTLKLFDYSKPsaKRAFKYIQEAEMLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCNpqd 259
Cdd:cd00200    58 AASADGTYLASGSSDKTIRLWDLETG--ECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLR--- 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 260 QHTDAICSVNYNPSANMYVTGSKDGCIKLWDGVSNRCITTFEkAHDGaEVCSAIFSKNSKYILSSGKDSVAKLWEISTGR 339
Cdd:cd00200   133 GHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLT-GHTG-EVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 340 TLVRYTGaglsgrqvHR---TQAVFNHTEDYILLPDErTISLCCWDSRTAERRNLLSlGHNNIVRCIVHSPTNPGFMTCS 416
Cdd:cd00200   211 CLGTLRG--------HEngvNSVAFSPDGYLLASGSE-DGTIRVWDLRTGECVQTLS-GHTNSVTSLAWSPDGKRLASGS 280

                  ....*...
gi 1958762266 417 DDFRARFW 424
Cdd:cd00200   281 ADGTIRIW 288
WD40 COG2319
WD40 repeat [General function prediction only];
105-424 3.63e-49

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 172.02  E-value: 3.63e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 105 TSHKGPCRVATYSRDGQLIATGSADASIKILDTErmlaksampievmmnetaqqnmeNHPVIRTLYDHVDEVTCLAFHPT 184
Cdd:COG2319   117 TGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA-----------------------TGKLLRTLTGHSGAVTSVAFSPD 173
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 185 EQILASGSRDYTLKLFDYSKPSAKRAFKyiQEAEMLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCNPqdqHTDA 264
Cdd:COG2319   174 GKLLASGSDDGTVRLWDLATGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTG---HSGS 248
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 265 ICSVNYNPSANMYVTGSKDGCIKLWDGVSNRCITTFEkaHDGAEVCSAIFSKNSKYILSSGKDSVAKLWEISTGRTLvry 344
Cdd:COG2319   249 VRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLT--GHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLL--- 323
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 345 tgAGLSGRQVHRTQAVFNHTEDYILLP-DERTISLccWDSRTAERRNLLSlGHNNIVRCIVHSPTNPGFMTCSDDFRARF 423
Cdd:COG2319   324 --RTLTGHTGAVRSVAFSPDGKTLASGsDDGTVRL--WDLATGELLRTLT-GHTGAVTSVAFSPDGRTLASGSADGTVRL 398

                  .
gi 1958762266 424 W 424
Cdd:COG2319   399 W 399
WD40 COG2319
WD40 repeat [General function prediction only];
104-424 1.97e-43

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 157.00  E-value: 1.97e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 104 VTSHKGPCRVATYSRDGQLIATGSADASIKILDTErmlaksampievmmnetaqqnmeNHPVIRTLYDHVDEVTCLAFHP 183
Cdd:COG2319    74 LLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLA-----------------------TGLLLRTLTGHTGAVRSVAFSP 130
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 184 TEQILASGSRDYTLKLFDYSKPSAKRAFKyiQEAEMLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCNPqdqHTD 263
Cdd:COG2319   131 DGKTLASGSADGTVRLWDLATGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTG---HTG 205
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 264 AICSVNYNPSANMYVTGSKDGCIKLWDGVSNRCITTFEkaHDGAEVCSAIFSKNSKYILSSGKDSVAKLWEISTGRTLVR 343
Cdd:COG2319   206 AVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT--GHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRT 283
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 344 YTGaglSGRQVhrTQAVFNHTEDYILLPDE-RTISLccWDSRTAERRNLLSlGHNNIVRCIVHSPTNPGFMTCSDDFRAR 422
Cdd:COG2319   284 LTG---HSGGV--NSVAFSPDGKLLASGSDdGTVRL--WDLATGKLLRTLT-GHTGAVRSVAFSPDGKTLASGSDDGTVR 355

                  ..
gi 1958762266 423 FW 424
Cdd:COG2319   356 LW 357
WD40 COG2319
WD40 repeat [General function prediction only];
104-337 3.76e-39

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 145.44  E-value: 3.76e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 104 VTSHKGPCRVATYSRDGQLIATGSADASIKILDTErmlaksampievmmneTAQQnmenhpvIRTLYDHVDEVTCLAFHP 183
Cdd:COG2319   200 LTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLA----------------TGKL-------LRTLTGHSGSVRSVAFSP 256
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 184 TEQILASGSRDYTLKLFDYSKPSAKRAFKYIQEAemLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCNPqdqHTD 263
Cdd:COG2319   257 DGRLLASGSADGTVRLWDLATGELLRTLTGHSGG--VNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTG---HTG 331
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958762266 264 AICSVNYNPSANMYVTGSKDGCIKLWDGVSNRCITTFeKAHDGAeVCSAIFSKNSKYILSSGKDSVAKLWEIST 337
Cdd:COG2319   332 AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTL-TGHTGA-VTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
155-424 9.34e-36

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 136.19  E-value: 9.34e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 155 TAQQNMENHPVIRTLYDHVDEVTCLAFHPTEQILASGSRDYTLKLFDYSKPSAKRAFKYIQEAemLRSISFHPSGDFILV 234
Cdd:COG2319    60 LLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGA--VRSVAFSPDGKTLAS 137
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 235 GTQHPTLRLYDINTFQCFVSCNPqdqHTDAICSVNYNPSANMYVTGSKDGCIKLWDGVSNRCITTFeKAHDGAeVCSAIF 314
Cdd:COG2319   138 GSADGTVRLWDLATGKLLRTLTG---HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTL-TGHTGA-VRSVAF 212
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 315 SKNSKYILSSGKDSVAKLWEISTGRTLVRYTGAGLSGRQVhrtqaVFNhtedyillPDERTI-------SLCCWDSRTAE 387
Cdd:COG2319   213 SPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSV-----AFS--------PDGRLLasgsadgTVRLWDLATGE 279
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 1958762266 388 RRNLLSlGHNNIVRCIVHSPTNPGFMTCSDDFRARFW 424
Cdd:COG2319   280 LLRTLT-GHSGGVNSVAFSPDGKLLASGSDDGTVRLW 315
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
97-334 7.42e-35

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 130.92  E-value: 7.42e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266  97 SEYETCYVTSHKGPCRVATYSRDGQLIATGSADASIKILDTermlaksampievmmnETAQQnmenhpvIRTLYDHVDEV 176
Cdd:cd00200    82 TGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDV----------------ETGKC-------LTTLRGHTDWV 138
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 177 TCLAFHPTEQILASGSRDYTLKLFDYSKPSAKRAFKyiQEAEMLRSISFHPSGDFILVGTQHPTLRLYDINTFQCFVSCN 256
Cdd:cd00200   139 NSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLT--GHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLR 216
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958762266 257 PqdqHTDAICSVNYNPSANMYVTGSKDGCIKLWDGVSNRCITTFeKAHDGAeVCSAIFSKNSKYILSSGKDSVAKLWE 334
Cdd:cd00200   217 G---HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL-SGHTNS-VTSLAWSPDGKRLASGSADGTIRIWD 289
CSTF1_dimer pfam16699
Cleavage stimulation factor subunit 1, dimerization domain; This family is the dimerization ...
8-59 3.03e-19

Cleavage stimulation factor subunit 1, dimerization domain; This family is the dimerization domain, at the N-terminal, of a family of cleavage stimulation factor subunit 1 proteins from eukaryotes. This domain allows for homodimerization such that the functional state of CSTF1 is a heterohexamer. The cleavage stimulation factor (CstF) complex is composed of three subunits and is essential for pre-mRNA 3'-end processing. CstF recognizes U and G/U-rich cis-acting RNA sequence elements and helps to stabilize the cleavage and polyadenylation specificity factor (CPSF) at the polyadenylation site as required for productive RNA cleavage.


Pssm-ID: 465240  Cd Length: 57  Bit Score: 80.78  E-value: 3.03e-19
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1958762266   8 LKDRQQLYKLIISQLLYDGYISIANGLINEIKPQSVCAPSEQLLHLIKLGME 59
Cdd:pfam16699   3 IKERELLYRLIISQLFYDGHQSIAVQLANLVSADPPCPPSDRLLHLVKLGLQ 54
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
165-201 2.79e-08

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 49.62  E-value: 2.79e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1958762266  165 VIRTLYDHVDEVTCLAFHPTEQILASGSRDYTLKLFD 201
Cdd:smart00320   4 LLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
166-201 1.74e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 47.34  E-value: 1.74e-07
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 1958762266 166 IRTLYDHVDEVTCLAFHPTEQILASGSRDYTLKLFD 201
Cdd:pfam00400   4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
261-290 3.40e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.46  E-value: 3.40e-06
                           10        20        30
                   ....*....|....*....|....*....|
gi 1958762266  261 HTDAICSVNYNPSANMYVTGSKDGCIKLWD 290
Cdd:smart00320  11 HTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
261-290 1.81e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 41.56  E-value: 1.81e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 1958762266 261 HTDAICSVNYNPSANMYVTGSKDGCIKLWD 290
Cdd:pfam00400  10 HTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
NBCH_WD40 pfam20426
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ...
107-247 5.93e-05

Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.


Pssm-ID: 466575 [Multi-domain]  Cd Length: 350  Bit Score: 45.06  E-value: 5.93e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958762266 107 HKGPCRVATYSRDGQLIATGSADASIKILDTERmlaksAMPIEVMMNETAQQNMENHPVI-----RTLYDHVDEVTCLAF 181
Cdd:pfam20426 123 HKDVVSCVAVTSDGSILATGSYDTTVMVWEVLR-----GRSSEKRSRNTQTEFPRKDHVIaetpfHILCGHDDIITCLYV 197
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958762266 182 HPTEQILASGSRDYTLklfdyskpsakrAFKYIQEAEMLRSISfHPSGDFI--LVGTQHP----------TLRLYDIN 247
Cdd:pfam20426 198 SVELDIVISGSKDGTC------------IFHTLREGRYVRSIR-HPSGCPLskLVASRHGrivlyadddlSLHLYSIN 262
WD40 pfam00400
WD domain, G-beta repeat;
395-424 2.79e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 35.40  E-value: 2.79e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1958762266 395 GHNNIVRCIVHSPTNPGFMTCSDDFRARFW 424
Cdd:pfam00400   9 GHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
105-136 6.36e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 34.21  E-value: 6.36e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1958762266  105 TSHKGPCRVATYSRDGQLIATGSADASIKILD 136
Cdd:smart00320   9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
294-334 7.19e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 34.24  E-value: 7.19e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 1958762266 294 NRCITTFeKAHDGAeVCSAIFSKNSKYILSSGKDSVAKLWE 334
Cdd:pfam00400   1 GKLLKTL-EGHTGS-VTSLAFSPDGKLLASGSDDGTVKVWD 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH