Conserved domains on  [gi|1622939797|ref|XP_002804113|]

protein transport protein Sec31A isoform X21 [Macaca mulatta]

List of domain hits

Name Accession Description Interval E-value
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
84-327 1.08e-23

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 102.80  E-value: 1.08e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797   84 LIAGGENGNIILYDPSKiiaGDKEVVIAQndkHTGPVRALDvniFQTN--LVASGANESEIYIWDLNNFATPMTPGAKTQ 161
Cdd:cd00200     66 LASGSSDKTIRLWDLET---GECVRTLTG---HTSYVSSVA---FSPDgrILSSSSRDKTIKVWDVETGKCLTTLRGHTD 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  162 PpedISCIAWNrQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDiATQMVLASEDDrlpVIQMWDL 241
Cdd:cd00200    137 W---VNSVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKLWDL 206
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  242 RfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDG 321
Cdd:cd00200    207 S-TGKCLGTLRGHENGVNSVAFS-PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GSADG 283

                   ....*.
gi 1622939797  322 RISVYS 327
Cdd:cd00200    284 TIRIWD 289
ACE1-Sec16-like super family cl14807
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
568-691 4.66e-09

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


The actual alignment was detected with superfamily member cd09233:

Pssm-ID: 449359 [Multi-domain]  Cd Length: 314  Bit Score: 59.19  E-value: 4.66e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  568 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKyFAKSQSKIT---RLITAVVMKNWKEIVESC---- 639
Cdd:cd09233     69 FRNLLLTGNRKEALELAL-DNGLwAHALLLASSLGKETWAEVVSR-FARSESKLNdplQTLYQLFSGNSPEAITELadnp 146
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622939797  640 -----DLKNWREALAAVLTYAKPD-EFSALCDLlgtrleneGDSLLQTQ----ACLCYICAG 691
Cdd:cd09233    147 aeaewALGNWREHLAIILSNRTSNlDLEALVEL--------GDLLAQRGlveaAHICYLLAG 200
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
938-1095 2.26e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.00  E-value: 2.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  938 PPPPSSGASFQHGGPGAPPSSSAYALP----PGTTGPQNGWNDP----------------------PALNRVPKKKKTPE 991
Cdd:pfam03154  181 ASPPSPPPPGTTQAATAGPTPSAPSVPpqgsPATSQPPNQTQSTaaphtliqqtptlhpqrlpsphPPLQPMTQPPPPSQ 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  992 NfmPPVPITSPIMNPLGDPQSQMLQQQPS--------APIPLSSQSS------FPQPHLSgGQPFHGIQQPLGQTgMPPS 1057
Cdd:pfam03154  261 V--SPQPLPQPSLHGQMPPMPHSLQTGPShmqhpvppQPFPLTPQSSqsqvppGPSPAAP-GQSQQRIHTPPSQS-QLQS 336
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1622939797 1058 FSKPNIEGAPGAPIgnTFQHVQSLPTKKITKKPIPDEH 1095
Cdd:pfam03154  337 QQPPREQPLPPAPL--SMPHIKPPPTTPIPQLPNPQSH 372
Med15 super family cl26621
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
745-1134 8.30e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


The actual alignment was detected with superfamily member pfam09606:

Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 43.46  E-value: 8.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  745 MSQYANLLAAQGSIAAALAFLPDNTNQPNIVQLrdrlcrAQGEPVAGHESPKIPYEKQQLPK--GRPGPVGHHQMPRVQT 822
Cdd:pfam09606  117 PGTASNLLASLGRPQMPMGGAGFPSQMSRVGRM------QPGGQAGGMMQPSSGQPGSGTPNqmGPNGGPGQGQAGGMNG 190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  823 QQYYPHGENPPPPGF--IMHGNVNPNA--------AGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAMYR---- 888
Cdd:pfam09606  191 GQQGPMGGQMPPQMGvpGMPGPADAGAqmgqqaqaNGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQmpqg 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  889 ---------PQQPVAPPtsnayPNTPYISSASSYSGQSQLYAAQHQASSPTSSPATSfpPPPSSGASFQHGGPGAPPSSS 959
Cdd:pfam09606  271 vgggagqggPGQPMGPP-----GQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNH--PAAHQQQMNQSVGQGGQVVAL 343
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  960 AYALPPGTTGPQN--GWNDPPALNRVP-------------KKKKTPENFMPPVPitSPIMNPLGDPQSQMLQQQPSAPIP 1024
Cdd:pfam09606  344 GGLNHLETWNPGNfgGLGANPMQRGQPgmmsspspvpgqqVRQVTPNQFMRQSP--QPSVPSPQGPGSQPPQSHPGGMIP 421
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797 1025 LSSQSSFPQPHLSGGQPfhgiqqplgQTGMPPSFSKPNIEGAPG-----APIGNTFQHVQSLPTKKITKKPIPDEHLILK 1099
Cdd:pfam09606  422 SPALIPSPSPQMSQQPA---------QQRTIGQDSPGGSLNTPGqsavnSPLNPQEEQLYREKYRQLTKYIEPLKRMIAK 492
                          410       420       430
                   ....*....|....*....|....*....|....*
gi 1622939797 1100 TTfedliqrclssaTDPQTKRKLDDASKRLEFLYD 1134
Cdd:pfam09606  493 ME------------NDPGDIDKMNKMKRLLEILSN 515
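
The alignment blocks above follow a regular layout: a label (`gi 1622939797` or `Cdd:<accession>`), a start coordinate, the aligned residues, and an end coordinate. A minimal Python sketch for reading one query/subject line pair and counting identical columns is given here. The function names are hypothetical, and treating lowercase letters as unaligned residues and `-` as gaps is an assumption about the CD-Search display convention, not something stated on this page.

```python
def parse_alignment_line(line):
    """Split one alignment line into (label, start, residues, end).

    Assumes the last three whitespace-separated fields are the start
    coordinate, the aligned residues, and the end coordinate.
    """
    parts = line.split()
    label = " ".join(parts[:-3])
    return label, int(parts[-3]), parts[-2], int(parts[-1])


def coordinates_consistent(start, residues, end):
    """Check that the number of non-gap characters matches end - start + 1."""
    return sum(ch != "-" for ch in residues) == end - start + 1


def count_identities(query, subject):
    """Return (identical, aligned) over columns where both residues are uppercase.

    Lowercase letters and '-' are skipped (assumed to mark unaligned
    residues and gaps, respectively).
    """
    if len(query) != len(subject):
        raise ValueError("query and subject rows must have equal width")
    identical = aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Last block of the WD40 (cd00200) alignment shown above.
q_label, q_start, q_seq, q_end = parse_alignment_line("gi 1622939797  322 RISVYS 327")
s_label, s_start, s_seq, s_end = parse_alignment_line("Cdd:cd00200    284 TIRIWD 289")
print(count_identities(q_seq, s_seq))
```

For this six-column block only the isoleucine at the second position matches, so the sketch reports 1 identical column out of 6 aligned. Summing the counts over all blocks of a hit would give a rough identity figure for the whole alignment.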
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
84-327 1.08e-23

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 102.80  E-value: 1.08e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797   84 LIAGGENGNIILYDPSKiiaGDKEVVIAQndkHTGPVRALDvniFQTN--LVASGANESEIYIWDLNNFATPMTPGAKTQ 161
Cdd:cd00200     66 LASGSSDKTIRLWDLET---GECVRTLTG---HTSYVSSVA---FSPDgrILSSSSRDKTIKVWDVETGKCLTTLRGHTD 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  162 PpedISCIAWNrQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDiATQMVLASEDDrlpVIQMWDL 241
Cdd:cd00200    137 W---VNSVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKLWDL 206
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  242 RfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDG 321
Cdd:cd00200    207 S-TGKCLGTLRGHENGVNSVAFS-PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GSADG 283

                   ....*.
gi 1622939797  322 RISVYS 327
Cdd:cd00200    284 TIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
84-328 4.83e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 103.07  E-value: 4.83e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797   84 LIAGGENGNIILYDpskiIAGDKEvvIAQNDKHTGPVRALDVNiFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPp 163
Cdd:COG2319    177 LASGSDDGTVRLWD----LATGKL--LRTLTGHTGAVRSVAFS-PDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGS- 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  164 edISCIAWNRQVQHiLASASPSGRATVWDLRKNEPIIKVSDHSNRMHcsGLAWHPDiATQMVLASEDDRlpvIQMWDLRf 243
Cdd:COG2319    249 --VRSVAFSPDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVN--SVAFSPD-GKLLASGSDDGT---VRLWDLA- 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  244 ASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDGRI 323
Cdd:COG2319    319 TGKLLRTLTGHTGAVRSVAFS-PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLAS-GSADGTV 396

                   ....*
gi 1622939797  324 SVYSI 328
Cdd:COG2319    397 RLWDL 401
ACE1-Sec16-like cd09233
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
568-691 4.66e-09

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


Pssm-ID: 187750 [Multi-domain]  Cd Length: 314  Bit Score: 59.19  E-value: 4.66e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  568 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKyFAKSQSKIT---RLITAVVMKNWKEIVESC---- 639
Cdd:cd09233     69 FRNLLLTGNRKEALELAL-DNGLwAHALLLASSLGKETWAEVVSR-FARSESKLNdplQTLYQLFSGNSPEAITELadnp 146
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622939797  640 -----DLKNWREALAAVLTYAKPD-EFSALCDLlgtrleneGDSLLQTQ----ACLCYICAG 691
Cdd:cd09233    147 aeaewALGNWREHLAIILSNRTSNlDLEALVEL--------GDLLAQRGlveaAHICYLLAG 200
Sec16_C pfam12931
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal ...
568-762 5.60e-07

Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal region is the part that binds to Sec23, a COPII vesicle coat protein. This association is part of the transport vesicle coat structure.


Pssm-ID: 432884  Cd Length: 279  Bit Score: 52.56  E-value: 5.60e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  568 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKY----FAKSQSKITRLItAVVMK----NWKEIVE- 637
Cdd:pfam12931    1 IRALLLTGDREKALWLAL-DKKLwAHALLIASTLGKEKWKEVVQEFvrseFKGSNNKSGESL-AALYQvfagNSEEAVDe 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  638 --------SCDLKNWREALAAVLTYAKPDEFSALCDlLGTRLENEGdslLQTQACLCYICAG---NVEKLVACWTKAQDG 706
Cdd:pfam12931   79 lvppsknaLWALDNWRETLALVLSNRSPGDVEALLA-LGDLLAQYG---RTEAAHICFLLAGlplSQTVLLGADHVRFPS 154
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  707 SHPLSLQDLI--EkvvILRKAVQLTqAMDTSTVGV--LLAAKMsQYANLLAAQGSIAAAL 762
Cdd:pfam12931  155 TFGNDLESILltE---IYEYALSLS-PPQPPFVGLphLLPYKL-QHAAVLAEYGLVSEAQ 209
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
938-1095 2.26e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.00  E-value: 2.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  938 PPPPSSGASFQHGGPGAPPSSSAYALP----PGTTGPQNGWNDP----------------------PALNRVPKKKKTPE 991
Cdd:pfam03154  181 ASPPSPPPPGTTQAATAGPTPSAPSVPpqgsPATSQPPNQTQSTaaphtliqqtptlhpqrlpsphPPLQPMTQPPPPSQ 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  992 NfmPPVPITSPIMNPLGDPQSQMLQQQPS--------APIPLSSQSS------FPQPHLSgGQPFHGIQQPLGQTgMPPS 1057
Cdd:pfam03154  261 V--SPQPLPQPSLHGQMPPMPHSLQTGPShmqhpvppQPFPLTPQSSqsqvppGPSPAAP-GQSQQRIHTPPSQS-QLQS 336
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1622939797 1058 FSKPNIEGAPGAPIgnTFQHVQSLPTKKITKKPIPDEH 1095
Cdd:pfam03154  337 QQPPREQPLPPAPL--SMPHIKPPPTTPIPQLPNPQSH 372
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
198-328 4.00e-04

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 44.69  E-value: 4.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  198 PIIKVSdhsNRMHCSGLAWHPDIATQMVLASEDDrlpVIQMWDLrfASSPLRV-LENHARGILAIAWSMADPELLLSCGK 276
Cdd:PLN00181   525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1622939797  277 DAKILCSNPNTGEVLYELPTNTQWCFdIQWCPRNPAVLSAASFDGRISVYSI 328
Cdd:PLN00181   597 DGSVKLWSINQGVSIGTIKTKANICC-VQFPSESGRSLAFGSADHKVYYYDL 647
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
745-1134 8.30e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 43.46  E-value: 8.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  745 MSQYANLLAAQGSIAAALAFLPDNTNQPNIVQLrdrlcrAQGEPVAGHESPKIPYEKQQLPK--GRPGPVGHHQMPRVQT 822
Cdd:pfam09606  117 PGTASNLLASLGRPQMPMGGAGFPSQMSRVGRM------QPGGQAGGMMQPSSGQPGSGTPNqmGPNGGPGQGQAGGMNG 190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  823 QQYYPHGENPPPPGF--IMHGNVNPNA--------AGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAMYR---- 888
Cdd:pfam09606  191 GQQGPMGGQMPPQMGvpGMPGPADAGAqmgqqaqaNGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQmpqg 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  889 ---------PQQPVAPPtsnayPNTPYISSASSYSGQSQLYAAQHQASSPTSSPATSfpPPPSSGASFQHGGPGAPPSSS 959
Cdd:pfam09606  271 vgggagqggPGQPMGPP-----GQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNH--PAAHQQQMNQSVGQGGQVVAL 343
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  960 AYALPPGTTGPQN--GWNDPPALNRVP-------------KKKKTPENFMPPVPitSPIMNPLGDPQSQMLQQQPSAPIP 1024
Cdd:pfam09606  344 GGLNHLETWNPGNfgGLGANPMQRGQPgmmsspspvpgqqVRQVTPNQFMRQSP--QPSVPSPQGPGSQPPQSHPGGMIP 421
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797 1025 LSSQSSFPQPHLSGGQPfhgiqqplgQTGMPPSFSKPNIEGAPG-----APIGNTFQHVQSLPTKKITKKPIPDEHLILK 1099
Cdd:pfam09606  422 SPALIPSPSPQMSQQPA---------QQRTIGQDSPGGSLNTPGqsavnSPLNPQEEQLYREKYRQLTKYIEPLKRMIAK 492
                          410       420       430
                   ....*....|....*....|....*....|....*
gi 1622939797 1100 TTfedliqrclssaTDPQTKRKLDDASKRLEFLYD 1134
Cdd:pfam09606  493 ME------------NDPGDIDKMNKMKRLLEILSN 515
PHA03378 PHA03378
EBNA-3B; Provisional
788-1091 6.65e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 40.82  E-value: 6.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  788 PVAGHESPKIPYEKQQLPKGRPGPVGHHQMPRVQTQQYYPHGENPPppgfimHGNVNP---NAAGQLPTS--PGHMhtqv 862
Cdd:PHA03378   625 PMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIG------HIPYQPsptGANTMLPIQwaPGTM---- 694
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  863 ppypqpqpyqpaqpypfgtggsamyrpQQPVAPPTSNAYPNTPYISSASSYSGQSQLYAAQHQASSPTSSPATSFPPPPS 942
Cdd:PHA03378   695 ---------------------------QPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPP 747
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  943 SGASFQHGGP-GAPPSSSAYALPPGTTGPQNGWNDPPALNRVPKKKKTPenfmppvpitspimnplgdpqsqmlQQQPSA 1021
Cdd:PHA03378   748 AAAPGRARPPaAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTP-------------------------QPPPQA 802
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622939797 1022 PiPLSSQSSFPQPHLSGGQPFHGIQQPL--GQTGMPPSFSKPNiEGAPGAPIGNTfQHVQSLPTKKITKKPI 1091
Cdd:PHA03378   803 G-PTSMQLMPRAAPGQQGPTKQILRQLLtgGVKRGRPSLKKPA-ALERQAAAGPT-PSPGSGTSDKIVQAPV 871
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
116-335 1.87e-23

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 102.03  E-value: 1.87e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  116 HTGPVRALDVNIfQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQV-------------------- 175
Cdd:cd00200      8 HTGGVTCVAFSP-DGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLasgssdktirlwdletgecv 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  176 ------------------QHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDiaTQMVLASEDDRLpvIQ 237
Cdd:cd00200     87 rtltghtsyvssvafspdGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IK 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  238 MWDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPrNPAVLSAA 317
Cdd:cd00200    161 LWDLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASG 237
                          250
                   ....*....|....*...
gi 1622939797  318 SFDGRISVYSIMGGSTDG 335
Cdd:cd00200    238 SEDGTIRVWDLRTGECVQ 255
WD40 COG2319
WD40 repeat [General function prediction only];
84-328 1.32e-21

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 98.83  E-value: 1.32e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797   84 LIAGGENGNIILYDpskiIAGDKEVVIAQNdkHTGPVRALDvniFQTN--LVASGANESEIYIWDLNNFATPMTPGAKTQ 161
Cdd:COG2319    135 LASGSADGTVRLWD----LATGKLLRTLTG--HSGAVTSVA---FSPDgkLLASGSDDGTVRLWDLATGKLLRTLTGHTG 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  162 PpedISCIAWNRQvQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPD---IATqmvlASEDDRlpvIQM 238
Cdd:COG2319    206 A---VRSVAFSPD-GKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRS--VAFSPDgrlLAS----GSADGT---VRL 272
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  239 WDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaAS 318
Cdd:COG2319    273 WDLA-TGELLRTLTGHSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLAS-GS 349
                          250
                   ....*....|
gi 1622939797  319 FDGRISVYSI 328
Cdd:COG2319    350 DDGTVRLWDL 359
WD40 COG2319
WD40 repeat [General function prediction only];
116-331 2.01e-19

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 92.28  E-value: 2.01e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  116 HTGPVRALDVNiFQTNLVASGANESEIYIWDLnnfATPMTPGAKTQPPEDISCIAWNRQvQHILASASPSGRATVWDLRK 195
Cdd:COG2319     77 HTAAVLSVAFS-PDGRLLASASADGTVRLWDL---ATGLLLRTLTGHTGAVRSVAFSPD-GKTLASGSADGTVRLWDLAT 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  196 NEPIIKVSDHSNRMHCsgLAWHPD---IATqmvlASEDDRlpvIQMWDLRfASSPLRVLENHARGILAIAWSmADPELLL 272
Cdd:COG2319    152 GKLLRTLTGHSGAVTS--VAFSPDgklLAS----GSDDGT---VRLWDLA-TGKLLRTLTGHTGAVRSVAFS-PDGKLLA 220
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1622939797  273 SCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPrNPAVLSAASFDGRISVYSIMGG 331
Cdd:COG2319    221 SGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATG 278
WD40 COG2319
WD40 repeat [General function prediction only];
84-242 2.19e-09

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 61.08  E-value: 2.19e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797   84 LIAGGENGNIILYDpskiIAGDKEVVIAQNdkHTGPVRALDVNiFQTNLVASGANESEIYIWDLNNFATPMTPGAKTqpp 163
Cdd:COG2319    261 LASGSADGTVRLWD----LATGELLRTLTG--HSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHT--- 330
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  164 EDISCIAWNRQVQhILASASPSGRATVWDLRKNEPIIKVSDHSNRMHcsGLAWHPD---IATqmvlASEDDRlpvIQMWD 240
Cdd:COG2319    331 GAVRSVAFSPDGK-TLASGSDDGTVRLWDLATGELLRTLTGHTGAVT--SVAFSPDgrtLAS----GSADGT---VRLWD 400

                   ..
gi 1622939797  241 LR 242
Cdd:COG2319    401 LA 402
ACE1-Sec16-like cd09233
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
568-691 4.66e-09

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way, Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


Pssm-ID: 187750 [Multi-domain]  Cd Length: 314  Bit Score: 59.19  E-value: 4.66e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  568 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKyFAKSQSKIT---RLITAVVMKNWKEIVESC---- 639
Cdd:cd09233     69 FRNLLLTGNRKEALELAL-DNGLwAHALLLASSLGKETWAEVVSR-FARSESKLNdplQTLYQLFSGNSPEAITELadnp 146
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622939797  640 -----DLKNWREALAAVLTYAKPD-EFSALCDLlgtrleneGDSLLQTQ----ACLCYICAG 691
Cdd:cd09233    147 aeaewALGNWREHLAIILSNRTSNlDLEALVEL--------GDLLAQRGlveaAHICYLLAG 200
Sec16_C pfam12931
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal ...
568-762 5.60e-07

Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal region is the part that binds to Sec23, a COPII vesicle coat protein. This association is part of the transport vesicle coat structure.


Pssm-ID: 432884  Cd Length: 279  Bit Score: 52.56  E-value: 5.60e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  568 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKY----FAKSQSKITRLItAVVMK----NWKEIVE- 637
Cdd:pfam12931    1 IRALLLTGDREKALWLAL-DKKLwAHALLIASTLGKEKWKEVVQEFvrseFKGSNNKSGESL-AALYQvfagNSEEAVDe 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  638 --------SCDLKNWREALAAVLTYAKPDEFSALCDlLGTRLENEGdslLQTQACLCYICAG---NVEKLVACWTKAQDG 706
Cdd:pfam12931   79 lvppsknaLWALDNWRETLALVLSNRSPGDVEALLA-LGDLLAQYG---RTEAAHICFLLAGlplSQTVLLGADHVRFPS 154
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  707 SHPLSLQDLI--EkvvILRKAVQLTqAMDTSTVGV--LLAAKMsQYANLLAAQGSIAAAL 762
Cdd:pfam12931  155 TFGNDLESILltE---IYEYALSLS-PPQPPFVGLphLLPYKL-QHAAVLAEYGLVSEAQ 209
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
247-331 3.66e-06


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 50.03  E-value: 3.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  247 PLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDGRISVY 326
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFS-PDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLW 78

                   ....*
gi 1622939797  327 SIMGG 331
Cdd:cd00200     79 DLETG 83
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
938-1095 2.26e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.00  E-value: 2.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  938 PPPPSSGASFQHGGPGAPPSSSAYALP----PGTTGPQNGWNDP----------------------PALNRVPKKKKTPE 991
Cdd:pfam03154  181 ASPPSPPPPGTTQAATAGPTPSAPSVPpqgsPATSQPPNQTQSTaaphtliqqtptlhpqrlpsphPPLQPMTQPPPPSQ 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  992 NfmPPVPITSPIMNPLGDPQSQMLQQQPS--------APIPLSSQSS------FPQPHLSgGQPFHGIQQPLGQTgMPPS 1057
Cdd:pfam03154  261 V--SPQPLPQPSLHGQMPPMPHSLQTGPShmqhpvppQPFPLTPQSSqsqvppGPSPAAP-GQSQQRIHTPPSQS-QLQS 336
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1622939797 1058 FSKPNIEGAPGAPIgnTFQHVQSLPTKKITKKPIPDEH 1095
Cdd:pfam03154  337 QQPPREQPLPPAPL--SMPHIKPPPTTPIPQLPNPQSH 372
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
788-1094 5.10e-05


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.84  E-value: 5.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  788 PVAGHESPKIPYEKQQLPKGRPGPVGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPpypq 867
Cdd:pfam03154  201 PSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMP---- 276
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  868 pqpyqpAQPYPFGTGGSAMYRP--QQPVAPPTSNAY---PNTPYISSASSYSGQSQLYAAQHQASSPTSSPATSFPPPPS 942
Cdd:pfam03154  277 ------PMPHSLQTGPSHMQHPvpPQPFPLTPQSSQsqvPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPL 350
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  943 SGASFQhGGPGAP----PSSSAYALPPGTTGPQ-----NGWNDPPALNRV--------PKKKKTPENFMPPV-PITSPIM 1004
Cdd:pfam03154  351 SMPHIK-PPPTTPipqlPNPQSHKHPPHLSGPSpfqmnSNLPPPPALKPLsslsthhpPSAHPPPLQLMPQSqQLPPPPA 429
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797 1005 NPLGDPQSQML------QQQPSAPIPLSSQSSFPQ-PHLSGGQPfhGIQQPLG-QTGMPPSFSKPNIEGAPGAPIGNTFQ 1076
Cdd:pfam03154  430 QPPVLTQSQSLpppaasHPPTSGLHQVPSQSPFPQhPFVPGGPP--PITPPSGpPTSTSSAMPGIQPPSSASVSSSGPVP 507
                          330       340
                   ....*....|....*....|
gi 1622939797 1077 HVQS--LPTKKITKKPiPDE 1094
Cdd:pfam03154  508 AAVScpLPPVQIKEEA-LDE 526
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
198-328 4.00e-04

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 44.69  E-value: 4.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  198 PIIKVSdhsNRMHCSGLAWHPDIATQMVLASEDDrlpVIQMWDLrfASSPLRV-LENHARGILAIAWSMADPELLLSCGK 276
Cdd:PLN00181   525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1622939797  277 DAKILCSNPNTGEVLYELPTNTQWCFdIQWCPRNPAVLSAASFDGRISVYSI 328
Cdd:PLN00181   597 DGSVKLWSINQGVSIGTIKTKANICC-VQFPSESGRSLAFGSADHKVYYYDL 647
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
745-1134 8.30e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 43.46  E-value: 8.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  745 MSQYANLLAAQGSIAAALAFLPDNTNQPNIVQLrdrlcrAQGEPVAGHESPKIPYEKQQLPK--GRPGPVGHHQMPRVQT 822
Cdd:pfam09606  117 PGTASNLLASLGRPQMPMGGAGFPSQMSRVGRM------QPGGQAGGMMQPSSGQPGSGTPNqmGPNGGPGQGQAGGMNG 190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  823 QQYYPHGENPPPPGF--IMHGNVNPNA--------AGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAMYR---- 888
Cdd:pfam09606  191 GQQGPMGGQMPPQMGvpGMPGPADAGAqmgqqaqaNGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQmpqg 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  889 ---------PQQPVAPPtsnayPNTPYISSASSYSGQSQLYAAQHQASSPTSSPATSfpPPPSSGASFQHGGPGAPPSSS 959
Cdd:pfam09606  271 vgggagqggPGQPMGPP-----GQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNH--PAAHQQQMNQSVGQGGQVVAL 343
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  960 AYALPPGTTGPQN--GWNDPPALNRVP-------------KKKKTPENFMPPVPitSPIMNPLGDPQSQMLQQQPSAPIP 1024
Cdd:pfam09606  344 GGLNHLETWNPGNfgGLGANPMQRGQPgmmsspspvpgqqVRQVTPNQFMRQSP--QPSVPSPQGPGSQPPQSHPGGMIP 421
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797 1025 LSSQSSFPQPHLSGGQPfhgiqqplgQTGMPPSFSKPNIEGAPG-----APIGNTFQHVQSLPTKKITKKPIPDEHLILK 1099
Cdd:pfam09606  422 SPALIPSPSPQMSQQPA---------QQRTIGQDSPGGSLNTPGqsavnSPLNPQEEQLYREKYRQLTKYIEPLKRMIAK 492
                          410       420       430
                   ....*....|....*....|....*....|....*
gi 1622939797 1100 TTfedliqrclssaTDPQTKRKLDDASKRLEFLYD 1134
Cdd:pfam09606  493 ME------------NDPGDIDKMNKMKRLLEILSN 515
PHA03378 PHA03378
EBNA-3B; Provisional
788-1091 6.65e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 40.82  E-value: 6.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  788 PVAGHESPKIPYEKQQLPKGRPGPVGHHQMPRVQTQQYYPHGENPPppgfimHGNVNP---NAAGQLPTS--PGHMhtqv 862
Cdd:PHA03378   625 PMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIG------HIPYQPsptGANTMLPIQwaPGTM---- 694
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  863 ppypqpqpyqpaqpypfgtggsamyrpQQPVAPPTSNAYPNTPYISSASSYSGQSQLYAAQHQASSPTSSPATSFPPPPS 942
Cdd:PHA03378   695 ---------------------------QPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPP 747
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  943 SGASFQHGGP-GAPPSSSAYALPPGTTGPQNGWNDPPALNRVPKKKKTPenfmppvpitspimnplgdpqsqmlQQQPSA 1021
Cdd:PHA03378   748 AAAPGRARPPaAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTP-------------------------QPPPQA 802
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622939797 1022 PiPLSSQSSFPQPHLSGGQPFHGIQQPL--GQTGMPPSFSKPNiEGAPGAPIGNTfQHVQSLPTKKITKKPI 1091
Cdd:PHA03378   803 G-PTSMQLMPRAAPGQQGPTKQILRQLLtgGVKRGRPSLKKPA-ALERQAAAGPT-PSPGSGTSDKIVQAPV 871
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
792-1034 9.03e-03


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 9.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  792 HESPKIPYEKQQLPKGRPgpvghhqmPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPY 871
Cdd:pfam03154  154 NESDSDSSAQQQILQTQP--------PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTA 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  872 QPAQPYPFGTG--GSAMYRPQQPVAPPTSNAYPNTPYISSASSYSGQSQLYAAQHQASSPTSSPATSFPPPPSSGASFQH 949
Cdd:pfam03154  226 APHTLIQQTPTlhPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSS 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939797  950 GGPGAPPSSSAYALPPGTTGPQngwndPPALNRVPKKKKTPENFMPPVPITSPIMNP--------LGDPQSQMLQQQPSA 1021
Cdd:pfam03154  306 QSQVPPGPSPAAPGQSQQRIHT-----PPSQSQLQSQQPPREQPLPPAPLSMPHIKPppttpipqLPNPQSHKHPPHLSG 380
                          250
                   ....*....|...
gi 1622939797 1022 PIPLSSQSSFPQP 1034
Cdd:pfam03154  381 PSPFQMNSNLPPP 393
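
Many of the hits listed above overlap on the query sequence. As a rough illustration, and not something the CD-Search page itself computes, the Python sketch below merges the interval lines quoted in this section into contiguous covered regions. The helper name `merge_intervals` is hypothetical, and only hits whose interval line appears in this part of the listing are included.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) residue intervals, inclusive on both ends."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Interval lines copied from the hits in this listing (query coordinates).
hit_intervals = [
    (84, 328), (84, 328), (116, 331), (84, 242),   # WD40 COG2319
    (568, 691),                                    # ACE1-Sec16-like cd09233
    (568, 762),                                    # Sec16_C pfam12931
    (247, 331),                                    # WD40 cd00200
    (938, 1095), (788, 1094), (792, 1034),         # Atrophin-1 pfam03154
    (198, 328),                                    # PLN00181
    (745, 1134),                                   # Med15 pfam09606
    (788, 1091),                                   # PHA03378
]
print(merge_intervals(hit_intervals))  # prints [(84, 331), (568, 1134)]
```

Under these inputs the hits collapse into two regions: the N-terminal WD40 region and one long C-terminal stretch, because the Med15 hit (745-1134) bridges the Sec16-like hits and the later proline-rich hits.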
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
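
To show how the E-value threshold relates to the hit headers, here is a minimal Python sketch that parses a `Pssm-ID:` header line as it is printed on this page and checks it against the 0.01 threshold. The regular expression, the `parse_header` and `passes_threshold` names, and the inclusive comparison are assumptions made for illustration. This is not an NCBI API.

```python
import re

# Pattern for header lines such as:
# "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 102.03  E-value: 1.87e-23"
HEADER_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    m = HEADER_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


headers = [
    "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 102.03  E-value: 1.87e-23",
    "Pssm-ID: 432884  Cd Length: 279  Bit Score: 52.56  E-value: 5.60e-07",
    "Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 9.03e-03",
]
hits = [parse_header(h) for h in headers]
for hit in hits:
    print(hit["pssm_id"], hit["e_value"], passes_threshold(hit))
```

The weakest hit shown on the page (E-value 9.03e-03) sits just under the 0.01 threshold, which is consistent with the listing being cut off at that value.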
