|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
13-332 |
2.99e-24 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 104.34 E-value: 2.99e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 13 AWSPAQNhpiYLATGtsaqqldatfSTNASLEIFELD-------LSDPSLDMKSCATFSSSHRyhkliwgpykmdskgdv 85
Cdd:cd00200 16 AFSPDGK---LLATG----------SGDGTIKVWDLEtgellrtLKGHTGPVRDVAASADGTY----------------- 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 86 sgvLIAGGENGNIILYDPSKiiaGDKEVVIAQndkHTGPVRALDvniFQTN--LVASGANESEIYIWDLNNFATPMTPGA 163
Cdd:cd00200 66 ---LASGSSDKTIRLWDLET---GECVRTLTG---HTSYVSSVA---FSPDgrILSSSSRDKTIKVWDVETGKCLTTLRG 133
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 164 KTQPpedISCIAWNrQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDiATQMVLASEDDrlpVIQM 243
Cdd:cd00200 134 HTDW---VNSVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKL 203
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 244 WDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaAS 323
Cdd:cd00200 204 WDLS-TGKCLGTLRGHENGVNSVAFS-PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GS 280
|
....*....
gi 1622939783 324 FDGRISVYS 332
Cdd:cd00200 281 ADGTIRIWD 289
|
|
| ACE1-Sec16-like super family |
cl14807 |
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... |
573-696 |
4.37e-09 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthezised proteins from the endoplasmatic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladded beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein, all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site. The actual alignment was detected with superfamily member cd09233:
Pssm-ID: 449359 [Multi-domain] Cd Length: 314 Bit Score: 59.19 E-value: 4.37e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 573 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKyFAKSQSKIT---RLITAVVMKNWKEIVESC---- 644
Cdd:cd09233 69 FRNLLLTGNRKEALELAL-DNGLwAHALLLASSLGKETWAEVVSR-FARSESKLNdplQTLYQLFSGNSPEAITELadnp 146
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622939783 645 -----DLKNWREALAAVLTYAKPD-EFSALCDLlgtrleneGDSLLQTQ----ACLCYICAG 696
Cdd:cd09233 147 aeaewALGNWREHLAIILSNRTSNlDLEALVEL--------GDLLAQRGlveaAHICYLLAG 200
|
|
| Atrophin-1 super family |
cl38111 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
772-1027 |
2.84e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity. The actual alignment was detected with superfamily member pfam03154:
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.69 E-value: 2.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 772 DNTNQPNIVQLRDRLCRAQGEPVAGHESPkiPYEKQQLPKGRPGPVGHHQMPRVQTQQYYPHVRIAPTVTTWS--NKTPT 849
Cdd:pfam03154 159 DSSAQQQILQTQPPVLQAQSGAASPPSPP--PPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTliQQTPT 236
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 850 ALPSHPPAASPSDTQGENPPPPgfimhgnvnPNAAGQlPTSPGHMHTQVppypqpqrpqngwndPPalnrVPKKKKTPEN 929
Cdd:pfam03154 237 LHPQRLPSPHPPLQPMTQPPPP---------SQVSPQ-PLPQPSLHGQM---------------PP----MPHSLQTGPS 287
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 930 FMP-PVPiTSPIMNPLGDPQSQMlqqqPSAPIPLSSQSSFPQPHLSGGQPFHGIQQPLGQTGMPPS-FSKPNIEGAPGAP 1007
Cdd:pfam03154 288 HMQhPVP-PQPFPLTPQSSQSQV----PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPApLSMPHIKPPPTTP 362
|
250 260
....*....|....*....|
gi 1622939783 1008 IgntfqhvQSLPTKKITKKP 1027
Cdd:pfam03154 363 I-------PQLPNPQSHKHP 375
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
13-332 |
2.99e-24 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 104.34 E-value: 2.99e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 13 AWSPAQNhpiYLATGtsaqqldatfSTNASLEIFELD-------LSDPSLDMKSCATFSSSHRyhkliwgpykmdskgdv 85
Cdd:cd00200 16 AFSPDGK---LLATG----------SGDGTIKVWDLEtgellrtLKGHTGPVRDVAASADGTY----------------- 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 86 sgvLIAGGENGNIILYDPSKiiaGDKEVVIAQndkHTGPVRALDvniFQTN--LVASGANESEIYIWDLNNFATPMTPGA 163
Cdd:cd00200 66 ---LASGSSDKTIRLWDLET---GECVRTLTG---HTSYVSSVA---FSPDgrILSSSSRDKTIKVWDVETGKCLTTLRG 133
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 164 KTQPpedISCIAWNrQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDiATQMVLASEDDrlpVIQM 243
Cdd:cd00200 134 HTDW---VNSVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKL 203
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 244 WDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaAS 323
Cdd:cd00200 204 WDLS-TGKCLGTLRGHENGVNSVAFS-PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GS 280
|
....*....
gi 1622939783 324 FDGRISVYS 332
Cdd:cd00200 281 ADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
89-333 |
4.49e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 103.07 E-value: 4.49e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 89 LIAGGENGNIILYDpskiIAGDKEvvIAQNDKHTGPVRALDVNiFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPp 168
Cdd:COG2319 177 LASGSDDGTVRLWD----LATGKL--LRTLTGHTGAVRSVAFS-PDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGS- 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 169 edISCIAWNRQVQHiLASASPSGRATVWDLRKNEPIIKVSDHSNRMHcsGLAWHPDiATQMVLASEDDRlpvIQMWDLRf 248
Cdd:COG2319 249 --VRSVAFSPDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVN--SVAFSPD-GKLLASGSDDGT---VRLWDLA- 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 249 ASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDGRI 328
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFS-PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLAS-GSADGTV 396
|
....*
gi 1622939783 329 SVYSI 333
Cdd:COG2319 397 RLWDL 401
|
|
| ACE1-Sec16-like |
cd09233 |
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... |
573-696 |
4.37e-09 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthezised proteins from the endoplasmatic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladded beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein, all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.
Pssm-ID: 187750 [Multi-domain] Cd Length: 314 Bit Score: 59.19 E-value: 4.37e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 573 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKyFAKSQSKIT---RLITAVVMKNWKEIVESC---- 644
Cdd:cd09233 69 FRNLLLTGNRKEALELAL-DNGLwAHALLLASSLGKETWAEVVSR-FARSESKLNdplQTLYQLFSGNSPEAITELadnp 146
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622939783 645 -----DLKNWREALAAVLTYAKPD-EFSALCDLlgtrleneGDSLLQTQ----ACLCYICAG 696
Cdd:cd09233 147 aeaewALGNWREHLAIILSNRTSNlDLEALVEL--------GDLLAQRGlveaAHICYLLAG 200
|
|
| Sec16_C |
pfam12931 |
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal ... |
573-767 |
7.03e-07 |
|
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal region is the part that binds to Sec23, a COPII vesicle coat protein. This association is part of the transport vesicle coat structure.
Pssm-ID: 432884 Cd Length: 279 Bit Score: 52.18 E-value: 7.03e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 573 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKY----FAKSQSKITRLItAVVMK----NWKEIVE- 642
Cdd:pfam12931 1 IRALLLTGDREKALWLAL-DKKLwAHALLIASTLGKEKWKEVVQEFvrseFKGSNNKSGESL-AALYQvfagNSEEAVDe 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 643 --------SCDLKNWREALAAVLTYAKPDEFSALCDlLGTRLENEGdslLQTQACLCYICAG---NVEKLVACWTKAQDG 711
Cdd:pfam12931 79 lvppsknaLWALDNWRETLALVLSNRSPGDVEALLA-LGDLLAQYG---RTEAAHICFLLAGlplSQTVLLGADHVRFPS 154
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 712 SHPLSLQDLI--EkvvILRKAVQLTqAMDTSTVGV--LLAAKMsQYANLLAAQGSIAAAL 767
Cdd:pfam12931 155 TFGNDLESILltE---IYEYALSLS-PPQPPFVGLphLLPYKL-QHAAVLAEYGLVSEAQ 209
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
772-1027 |
2.84e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.69 E-value: 2.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 772 DNTNQPNIVQLRDRLCRAQGEPVAGHESPkiPYEKQQLPKGRPGPVGHHQMPRVQTQQYYPHVRIAPTVTTWS--NKTPT 849
Cdd:pfam03154 159 DSSAQQQILQTQPPVLQAQSGAASPPSPP--PPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTliQQTPT 236
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 850 ALPSHPPAASPSDTQGENPPPPgfimhgnvnPNAAGQlPTSPGHMHTQVppypqpqrpqngwndPPalnrVPKKKKTPEN 929
Cdd:pfam03154 237 LHPQRLPSPHPPLQPMTQPPPP---------SQVSPQ-PLPQPSLHGQM---------------PP----MPHSLQTGPS 287
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 930 FMP-PVPiTSPIMNPLGDPQSQMlqqqPSAPIPLSSQSSFPQPHLSGGQPFHGIQQPLGQTGMPPS-FSKPNIEGAPGAP 1007
Cdd:pfam03154 288 HMQhPVP-PQPFPLTPQSSQSQV----PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPApLSMPHIKPPPTTP 362
|
250 260
....*....|....*....|
gi 1622939783 1008 IgntfqhvQSLPTKKITKKP 1027
Cdd:pfam03154 363 I-------PQLPNPQSHKHP 375
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
803-1007 |
1.92e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 1.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 803 PYEKQQLPKGRPGPvgHHQMPRVQTQQYYPHVRIAPTvttwSNKTPTALP-----SHPPAASPSDTQGENPPPPGfimhg 877
Cdd:PHA03247 2562 AAPDRSVPPPRPAP--RPSEPAVTSRARRPDAPPQSA----RPRAPVDDRgdprgPAPPSPLPPDTHAPDPPPPS----- 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 878 nvnPNAAgqlPTSPGHMHTQVPPYPQPQrpqngwNDPPALNRVPKKKKTPENFMPPVPiTSPIMNPlgdpqsqmlqQQPS 957
Cdd:PHA03247 2631 ---PSPA---ANEPDPHPPPTVPPPERP------RDDPAPGRVSRPRRARRLGRAAQA-SSPPQRP----------RRRA 2687
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 1622939783 958 APIPLSSQSSFPQPHLSGGQP-------FHGIQQPLGQTGMPPSFSKPNIEGAPGAP 1007
Cdd:PHA03247 2688 ARPTVGSLTSLADPPPPPPTPepaphalVSATPLPPGPAAARQASPALPAAPAPPAV 2744
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
203-333 |
3.76e-04 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 44.69 E-value: 3.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 203 PIIKVSdhsNRMHCSGLAWHPDIATQMVLASEDDrlpVIQMWDLrfASSPLRV-LENHARGILAIAWSMADPELLLSCGK 281
Cdd:PLN00181 525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1622939783 282 DAKILCSNPNTGEVLYELPTNTQWCFdIQWCPRNPAVLSAASFDGRISVYSI 333
Cdd:PLN00181 597 DGSVKLWSINQGVSIGTIKTKANICC-VQFPSESGRSLAFGSADHKVYYYDL 647
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
13-332 |
2.99e-24 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 104.34 E-value: 2.99e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 13 AWSPAQNhpiYLATGtsaqqldatfSTNASLEIFELD-------LSDPSLDMKSCATFSSSHRyhkliwgpykmdskgdv 85
Cdd:cd00200 16 AFSPDGK---LLATG----------SGDGTIKVWDLEtgellrtLKGHTGPVRDVAASADGTY----------------- 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 86 sgvLIAGGENGNIILYDPSKiiaGDKEVVIAQndkHTGPVRALDvniFQTN--LVASGANESEIYIWDLNNFATPMTPGA 163
Cdd:cd00200 66 ---LASGSSDKTIRLWDLET---GECVRTLTG---HTSYVSSVA---FSPDgrILSSSSRDKTIKVWDVETGKCLTTLRG 133
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 164 KTQPpedISCIAWNrQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDiATQMVLASEDDrlpVIQM 243
Cdd:cd00200 134 HTDW---VNSVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKL 203
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 244 WDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaAS 323
Cdd:cd00200 204 WDLS-TGKCLGTLRGHENGVNSVAFS-PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GS 280
|
....*....
gi 1622939783 324 FDGRISVYS 332
Cdd:cd00200 281 ADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
121-340 |
1.75e-23 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 102.03 E-value: 1.75e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 121 HTGPVRALDVNIfQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQV-------------------- 180
Cdd:cd00200 8 HTGGVTCVAFSP-DGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLasgssdktirlwdletgecv 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 181 ------------------QHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDiaTQMVLASEDDRLpvIQ 242
Cdd:cd00200 87 rtltghtsyvssvafspdGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 243 MWDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPrNPAVLSAA 322
Cdd:cd00200 161 LWDLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASG 237
|
250
....*....|....*...
gi 1622939783 323 SFDGRISVYSIMGGSTDG 340
Cdd:cd00200 238 SEDGTIRVWDLRTGECVQ 255
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
89-333 |
4.49e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 103.07 E-value: 4.49e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 89 LIAGGENGNIILYDpskiIAGDKEvvIAQNDKHTGPVRALDVNiFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPp 168
Cdd:COG2319 177 LASGSDDGTVRLWD----LATGKL--LRTLTGHTGAVRSVAFS-PDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGS- 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 169 edISCIAWNRQVQHiLASASPSGRATVWDLRKNEPIIKVSDHSNRMHcsGLAWHPDiATQMVLASEDDRlpvIQMWDLRf 248
Cdd:COG2319 249 --VRSVAFSPDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVN--SVAFSPD-GKLLASGSDDGT---VRLWDLA- 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 249 ASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDGRI 328
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFS-PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLAS-GSADGTV 396
|
....*
gi 1622939783 329 SVYSI 333
Cdd:COG2319 397 RLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
89-333 |
1.23e-21 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 98.83 E-value: 1.23e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 89 LIAGGENGNIILYDpskiIAGDKEVVIAQNdkHTGPVRALDvniFQTN--LVASGANESEIYIWDLNNFATPMTPGAKTQ 166
Cdd:COG2319 135 LASGSADGTVRLWD----LATGKLLRTLTG--HSGAVTSVA---FSPDgkLLASGSDDGTVRLWDLATGKLLRTLTGHTG 205
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 167 PpedISCIAWNRQvQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPD---IATqmvlASEDDRlpvIQM 243
Cdd:COG2319 206 A---VRSVAFSPD-GKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRS--VAFSPDgrlLAS----GSADGT---VRL 272
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 244 WDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaAS 323
Cdd:COG2319 273 WDLA-TGELLRTLTGHSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLAS-GS 349
|
250
....*....|
gi 1622939783 324 FDGRISVYSI 333
Cdd:COG2319 350 DDGTVRLWDL 359
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
121-336 |
1.87e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 92.28 E-value: 1.87e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 121 HTGPVRALDVNiFQTNLVASGANESEIYIWDLnnfATPMTPGAKTQPPEDISCIAWNRQvQHILASASPSGRATVWDLRK 200
Cdd:COG2319 77 HTAAVLSVAFS-PDGRLLASASADGTVRLWDL---ATGLLLRTLTGHTGAVRSVAFSPD-GKTLASGSADGTVRLWDLAT 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 201 NEPIIKVSDHSNRMHCsgLAWHPD---IATqmvlASEDDRlpvIQMWDLRfASSPLRVLENHARGILAIAWSmADPELLL 277
Cdd:COG2319 152 GKLLRTLTGHSGAVTS--VAFSPDgklLAS----GSDDGT---VRLWDLA-TGKLLRTLTGHTGAVRSVAFS-PDGKLLA 220
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1622939783 278 SCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPrNPAVLSAASFDGRISVYSIMGG 336
Cdd:COG2319 221 SGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATG 278
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
89-247 |
2.05e-09 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 61.08 E-value: 2.05e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 89 LIAGGENGNIILYDpskiIAGDKEVVIAQNdkHTGPVRALDVNiFQTNLVASGANESEIYIWDLNNFATPMTPGAKTqpp 168
Cdd:COG2319 261 LASGSADGTVRLWD----LATGELLRTLTG--HSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHT--- 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 169 EDISCIAWNRQVQhILASASPSGRATVWDLRKNEPIIKVSDHSNRMHcsGLAWHPD---IATqmvlASEDDRlpvIQMWD 245
Cdd:COG2319 331 GAVRSVAFSPDGK-TLASGSDDGTVRLWDLATGELLRTLTGHTGAVT--SVAFSPDgrtLAS----GSADGT---VRLWD 400
|
..
gi 1622939783 246 LR 247
Cdd:COG2319 401 LA 402
|
|
| ACE1-Sec16-like |
cd09233 |
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... |
573-696 |
4.37e-09 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthezised proteins from the endoplasmatic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladded beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein, all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.
Pssm-ID: 187750 [Multi-domain] Cd Length: 314 Bit Score: 59.19 E-value: 4.37e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 573 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKyFAKSQSKIT---RLITAVVMKNWKEIVESC---- 644
Cdd:cd09233 69 FRNLLLTGNRKEALELAL-DNGLwAHALLLASSLGKETWAEVVSR-FARSESKLNdplQTLYQLFSGNSPEAITELadnp 146
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622939783 645 -----DLKNWREALAAVLTYAKPD-EFSALCDLlgtrleneGDSLLQTQ----ACLCYICAG 696
Cdd:cd09233 147 aeaewALGNWREHLAIILSNRTSNlDLEALVEL--------GDLLAQRGlveaAHICYLLAG 200
|
|
| Sec16_C |
pfam12931 |
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal ... |
573-767 |
7.03e-07 |
|
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal region is the part that binds to Sec23, a COPII vesicle coat protein. This association is part of the transport vesicle coat structure.
Pssm-ID: 432884 Cd Length: 279 Bit Score: 52.18 E-value: 7.03e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 573 ITQALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLARTQKKY----FAKSQSKITRLItAVVMK----NWKEIVE- 642
Cdd:pfam12931 1 IRALLLTGDREKALWLAL-DKKLwAHALLIASTLGKEKWKEVVQEFvrseFKGSNNKSGESL-AALYQvfagNSEEAVDe 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 643 --------SCDLKNWREALAAVLTYAKPDEFSALCDlLGTRLENEGdslLQTQACLCYICAG---NVEKLVACWTKAQDG 711
Cdd:pfam12931 79 lvppsknaLWALDNWRETLALVLSNRSPGDVEALLA-LGDLLAQYG---RTEAAHICFLLAGlplSQTVLLGADHVRFPS 154
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 712 SHPLSLQDLI--EkvvILRKAVQLTqAMDTSTVGV--LLAAKMsQYANLLAAQGSIAAAL 767
Cdd:pfam12931 155 TFGNDLESILltE---IYEYALSLS-PPQPPFVGLphLLPYKL-QHAAVLAEYGLVSEAQ 209
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
772-1027 |
2.84e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.69 E-value: 2.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 772 DNTNQPNIVQLRDRLCRAQGEPVAGHESPkiPYEKQQLPKGRPGPVGHHQMPRVQTQQYYPHVRIAPTVTTWS--NKTPT 849
Cdd:pfam03154 159 DSSAQQQILQTQPPVLQAQSGAASPPSPP--PPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTliQQTPT 236
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 850 ALPSHPPAASPSDTQGENPPPPgfimhgnvnPNAAGQlPTSPGHMHTQVppypqpqrpqngwndPPalnrVPKKKKTPEN 929
Cdd:pfam03154 237 LHPQRLPSPHPPLQPMTQPPPP---------SQVSPQ-PLPQPSLHGQM---------------PP----MPHSLQTGPS 287
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 930 FMP-PVPiTSPIMNPLGDPQSQMlqqqPSAPIPLSSQSSFPQPHLSGGQPFHGIQQPLGQTGMPPS-FSKPNIEGAPGAP 1007
Cdd:pfam03154 288 HMQhPVP-PQPFPLTPQSSQSQV----PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPApLSMPHIKPPPTTP 362
|
250 260
....*....|....*....|
gi 1622939783 1008 IgntfqhvQSLPTKKITKKP 1027
Cdd:pfam03154 363 I-------PQLPNPQSHKHP 375
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
788-1014 |
3.37e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.31 E-value: 3.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 788 RAQGEPVAGHESPKIPYEKQQLPKGrPGPVGH---------HQMPRVQTQQYYPHVRIAPTVTTWSNKTP-------TAL 851
Cdd:pfam03154 324 RIHTPPSQSQLQSQQPPREQPLPPA-PLSMPHikpppttpiPQLPNPQSHKHPPHLSGPSPFQMNSNLPPppalkplSSL 402
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 852 PSH-PPAASP------SDTQGENPPP---PGFIMHGNVNPNAAGQLPTSPGHMHTQVppypqpqrpqngwndPPAlnrvp 921
Cdd:pfam03154 403 STHhPPSAHPpplqlmPQSQQLPPPPaqpPVLTQSQSLPPPAASHPPTSGLHQVPSQ---------------SPF----- 462
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 922 kkkkTPENFMP--PVPITSPIMNPLGDPQSQMLQQQPSApIPLSSQSSFPQPHLSGGQPFHGIQQPLGQTGMPPSFSKPN 999
Cdd:pfam03154 463 ----PQHPFVPggPPPITPPSGPPTSTSSAMPGIQPPSS-ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPP 537
|
250
....*....|....*
gi 1622939783 1000 IEGAPGAPIGNTFQH 1014
Cdd:pfam03154 538 RSPSPEPTVVNTPSH 552
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
252-336 |
3.44e-06 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 50.03 E-value: 3.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 252 PLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDGRISVY 331
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFS-PDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLW 78
|
....*
gi 1622939783 332 SIMGG 336
Cdd:cd00200 79 DLETG 83
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
776-1032 |
5.84e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 50.54 E-value: 5.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 776 QPNIVQLRDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVGHHQMPRVQTQQYY-PHVRIAPTVTTWSNKTPT---AL 851
Cdd:pfam03154 170 QPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAaPHTLIQQTPTLHPQRLPSphpPL 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 852 PSHPPAASPSDTQGENPPPPgfIMHGNVNPnAAGQLPTSPGHMHTQVPPYPQPQRPQNGWND------------------ 913
Cdd:pfam03154 250 QPMTQPPPPSQVSPQPLPQP--SLHGQMPP-MPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQvppgpspaapgqsqqrih 326
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 914 -PPALNRVPKKKKTPENFMPPVPITSPIMNPlgdpqsqmlqqQPSAPIPL--SSQSSFPQPHLSGGQPFH--------GI 982
Cdd:pfam03154 327 tPPSQSQLQSQQPPREQPLPPAPLSMPHIKP-----------PPTTPIPQlpNPQSHKHPPHLSGPSPFQmnsnlpppPA 395
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622939783 983 QQPLGQ--TGMPPSFSKP---------NIEGAPGAPIGNTfqHVQSLPTKKITKKPIPDEH 1032
Cdd:pfam03154 396 LKPLSSlsTHHPPSAHPPplqlmpqsqQLPPPPAQPPVLT--QSQSLPPPAASHPPTSGLH 454
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
803-1007 |
1.92e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 1.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 803 PYEKQQLPKGRPGPvgHHQMPRVQTQQYYPHVRIAPTvttwSNKTPTALP-----SHPPAASPSDTQGENPPPPGfimhg 877
Cdd:PHA03247 2562 AAPDRSVPPPRPAP--RPSEPAVTSRARRPDAPPQSA----RPRAPVDDRgdprgPAPPSPLPPDTHAPDPPPPS----- 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 878 nvnPNAAgqlPTSPGHMHTQVPPYPQPQrpqngwNDPPALNRVPKKKKTPENFMPPVPiTSPIMNPlgdpqsqmlqQQPS 957
Cdd:PHA03247 2631 ---PSPA---ANEPDPHPPPTVPPPERP------RDDPAPGRVSRPRRARRLGRAAQA-SSPPQRP----------RRRA 2687
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 1622939783 958 APIPLSSQSSFPQPHLSGGQP-------FHGIQQPLGQTGMPPSFSKPNIEGAPGAP 1007
Cdd:PHA03247 2688 ARPTVGSLTSLADPPPPPPTPepaphalVSATPLPPGPAAARQASPALPAAPAPPAV 2744
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
793-1007 |
3.13e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.31 E-value: 3.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 793 PVAGHESPKIPYEKQQLPKGRPGPVGHHQMPRVQTQQYYPHVRIAPTVTTWSNKTPTALPSHPPAASP-SDTQGENPPPP 871
Cdd:PHA03247 2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPtAPPPPPGPPPP 2848
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 872 GFIMHGNVNP----------NAAGQLPTSPGHMHTQVPPYPQPQRPQNGWNDPPAlnrVPKKKKTPENFMPPVPITSPIM 941
Cdd:PHA03247 2849 SLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPD---QPERPPQPQAPPPPQPQPQPPP 2925
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622939783 942 NPLGDPQSQMlQQQPSAPIPlssqssfPQPHLSG-GQPFHGIQQPLGQTGMPPSFSKPNIEGAPGAP 1007
Cdd:PHA03247 2926 PPQPQPPPPP-PPRPQPPLA-------PTTDPAGaGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP 2984
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
203-333 |
3.76e-04 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 44.69 E-value: 3.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 203 PIIKVSdhsNRMHCSGLAWHPDIATQMVLASEDDrlpVIQMWDLrfASSPLRV-LENHARGILAIAWSMADPELLLSCGK 281
Cdd:PLN00181 525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1622939783 282 DAKILCSNPNTGEVLYELPTNTQWCFdIQWCPRNPAVLSAASFDGRISVYSI 333
Cdd:PLN00181 597 DGSVKLWSINQGVSIGTIKTKANICC-VQFPSESGRSLAFGSADHKVYYYDL 647
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
793-1028 |
7.21e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 43.90 E-value: 7.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 793 PVAGHESPKIPYEKQQLPKGRPGPVGHHQMPRVQTQQYYP------HVRIAPTVT--------TWS---NKTPTALPS-- 853
Cdd:PHA03378 625 PMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPtwtqigHIPYQPSPTgantmlpiQWApgtMQPPPRAPTpm 704
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 854 HPPAASPSDTQ------GENPPPPGfimhgnvNPNAAGQLPTSPGHMHTQVPPYPQPQRPQNGWNDPPALNRVP-KKKKT 926
Cdd:PHA03378 705 RPPAAPPGRAQrpaaatGRARPPAA-------APGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPgAPTPQ 777
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 927 PENFMPPVPITSPIMNPLGdpqsqmlQQQPSAPiPLSSQSSFPQPHLSGGQPFHGIQQPL--GQTGMPPSFSKPNiEGAP 1004
Cdd:PHA03378 778 PPPQAPPAPQQRPRGAPTP-------QPPPQAG-PTSMQLMPRAAPGQQGPTKQILRQLLtgGVKRGRPSLKKPA-ALER 848
|
250 260
....*....|....*....|....
gi 1622939783 1005 GAPIGNTfQHVQSLPTKKITKKPI 1028
Cdd:PHA03378 849 QAAAGPT-PSPGSGTSDKIVQAPV 871
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
789-1071 |
1.34e-03 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 42.69 E-value: 1.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 789 AQGEP-VAGHESPKIPyekQQLPKGRPGPVGHHQMPrvQTQQYYPHVRIAPTVTTWSNKTPTALPSHP-----PAASPSD 862
Cdd:pfam09606 268 PQGVGgGAGQGGPGQP---MGPPGQQPGAMPNVMSI--GDQNNYQQQQTRQQQQQQGGNHPAAHQQQMnqsvgQGGQVVA 342
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 863 TQGENPPPPGFimHGNVNP-NAAGQLPTSPGHMhtqvppypqpqrpqnGWNDPPALNRVpkKKKTPENFMPPVPitSPIM 941
Cdd:pfam09606 343 LGGLNHLETWN--PGNFGGlGANPMQRGQPGMM---------------SSPSPVPGQQV--RQVTPNQFMRQSP--QPSV 401
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 942 NPLGDPQSQMLQQQPSAPIPLSSQSSFPQPHLSGGQPfhgiqqplgQTGMPPSFSKPNIEGAPG-----APIGNTFQHVQ 1016
Cdd:pfam09606 402 PSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPA---------QQRTIGQDSPGGSLNTPGqsavnSPLNPQEEQLY 472
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1622939783 1017 SLPTKKITKKPIPDEHLILKttfedlIQRclssatDPQTKRKLDDASKRLEFLYD 1071
Cdd:pfam09606 473 REKYRQLTKYIEPLKRMIAK------MEN------DPGDIDKMNKMKRLLEILSN 515
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
788-895 |
2.82e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 41.95 E-value: 2.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 788 RAQGEPVAGHESP-KIPYEKQQLPKGRPGPVGHHQMPRVQTQQYYPHVRIAPTVTTWSNKTPTALPSHPPAASPS--DTQ 864
Cdd:pfam09770 210 PAQQPAPAPAQPPaAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQaqQFH 289
|
90 100 110
....*....|....*....|....*....|...
gi 1622939783 865 GENPPPPGFIMHGNVNPN--AAGQLPTSPGHMH 895
Cdd:pfam09770 290 QQPPPVPVQPTQILQNPNrlSAARVGYPQNPQP 322
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
751-1008 |
6.60e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 40.69 E-value: 6.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 751 SQYANLLAAQGSIAAALAFLPDNTNQPNIVQLRDRlCRAQGEPVAGHESPKIPyekqQLPKGRP--GPVGHHQMPRVQTQ 828
Cdd:PHA03247 2632 SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-ARRLGRAAQASSPPQRP----RRRAARPtvGSLTSLADPPPPPP 2706
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 829 QYYPhvriAPTVTTWSNKTPTAlPSHPPAASPSDTQGENPPPP--GFIMHGNVNPNAAGQLPTSPGHmhtqvppypqpqr 906
Cdd:PHA03247 2707 TPEP----APHALVSATPLPPG-PAAARQASPALPAAPAPPAVpaGPATPGGPARPARPPTTAGPPA------------- 2768
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622939783 907 pqngwndpPALNRVPKKKKTPENFMPPVPITSPIMNPLGDPQS----QMLQQQPSAPIPLSSQSSFPQPHLSGGQPfhgi 982
Cdd:PHA03247 2769 --------PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDpadpPAAVLAPAAALPPAASPAGPLPPPTSAQP---- 2836
|
250 260
....*....|....*....|....*.
gi 1622939783 983 QQPLGQTGMPPSFSKPNIEGAPGAPI 1008
Cdd:PHA03247 2837 TAPPPPPGPPPPSLPLGGSVAPGGDV 2862
|
|
|