|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
108-355 |
1.52e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 116.67 E-value: 1.52e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 108 VIAGGGDsGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPFqGNLLASGASDSEIFIWDLN--HLTVPMTpGSKS 185
Cdd:cd00200 24 LATGSGD-GTIKVWDLE-----TGELLRTLKG-HTGPVRDVAASAD-GTYLASGSSDKTIRLWDLEtgECVRTLT-GHTS 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 186 qnppeDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiaTQLVLCSEDDRLpvIQLW 265
Cdd:cd00200 95 -----YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKLW 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 266 DLRfASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSF 345
Cdd:cd00200 163 DLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSE 239
|
250
....*....|
gi 1958646703 346 DGWISLYSVM 355
Cdd:cd00200 240 DGTIRVWDLR 249
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
31-354 |
7.01e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.62 E-value: 7.01e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 31 AVQAWSPAKQYPVYLATGTSAQQLDASFSTNATLEIFEVDFRDPSLDLKRKGILSVSSRFHklIWGSSSSGLLENTGVIA 110
Cdd:COG2319 17 ALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH--TAAVLSVAFSPDGRLLA 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 111 GGGDSGMLTLYNVTHilspgkEPLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQNPPE 190
Cdd:COG2319 95 SASADGTVRLWDLAT------GLLLRTLTGHTGAVRSVAFSP-DGKTLASGSADGTVRLWDLATGKLLRTL----TGHSG 163
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 191 DIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLRfA 270
Cdd:COG2319 164 AVTSVAFSPD-GKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPD-GKLLASGSADGT---VRLWDLA-T 235
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 271 SSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWIS 350
Cdd:COG2319 236 GKLLRTLTGHSGSVRSVAFS-PDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASG-SDDGTVR 313
|
....
gi 1958646703 351 LYSV 354
Cdd:COG2319 314 LWDL 317
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
819-1097 |
4.99e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 64.19 E-value: 4.99e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 819 PFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVMPFLPSHPIPSvGSWTQSSSDYRVPKPQATLPVHFV 898
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRRLTRPAV 2788
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 899 PGVRPAF-SQPQPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSThlpetprllplppvgppgptPLSSQPAAS 977
Cdd:PHA03247 2789 ASLSESReSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP--------------------PPPPGPPPP 2848
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 978 PVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPiitaplmslGPEPQQALLP 1057
Cdd:PHA03247 2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP---------QPQAPPPPQP 2919
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 1958646703 1058 QSlvsgaslPPPGAPREcslqQLQPLPPEKTQKELPPEHQ 1097
Cdd:PHA03247 2920 QP-------QPPPPPQP----QPPPPPPPRPQPPLAPTTD 2948
|
|
| PTZ00420 |
PTZ00420 |
coronin; Provisional |
73-223 |
7.58e-06 |
|
coronin; Provisional
Pssm-ID: 240412 [Multi-domain] Cd Length: 568 Bit Score: 49.95 E-value: 7.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 73 DPSLDLKrkGILSVSSRFHKLIWGSSSSGLLENTGVIAGGGDSGMLTLYNVTHilspgKEPLIAQKqKHTGAVRALDFNP 152
Cdd:PTZ00420 13 DPSNNLF--DDLRICSRVIDSCGIACSSGFVAVPWEVEGGGLIGAIRLENQMR-----KPPVIKLK-GHTSSILDLQFNP 84
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958646703 153 FQGNLLASGASDSEIFIWDLNH----LTVPMTPGSKSQNPPEDIKALSWNRQVQHILSSAHPSGKAVVWDLrKNE 223
Cdd:PTZ00420 85 CFSEILASGSEDLTIRVWEIPHndesVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDI-ENE 158
|
|
| ACE1-Sec16-like |
cd09233 |
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... |
581-673 |
1.16e-05 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way, Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.
Pssm-ID: 187750 [Multi-domain] Cd Length: 314 Bit Score: 48.79 E-value: 1.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 581 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 650
Cdd:cd09233 54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
|
90 100 110
....*....|....*....|....*....|..
gi 1958646703 651 KNWKDLVCACS---------LKNWREALALLL 673
Cdd:cd09233 133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
141-171 |
2.44e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 42.30 E-value: 2.44e-05
10 20 30
....*....|....*....|....*....|.
gi 1958646703 141 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 171
Cdd:smart00320 11 HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
141-171 |
1.68e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.02 E-value: 1.68e-04
10 20 30
....*....|....*....|....*....|.
gi 1958646703 141 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 171
Cdd:pfam00400 10 HTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
810-1094 |
6.90e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.99 E-value: 6.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 810 VLGRQAPAFPFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPR-AAAQPSVMPFLPSHPIPSVGSwTQSSSDYRVPK 888
Cdd:pfam03154 166 ILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSpATSQPPNQTQSTAAPHTLIQQ-TPTLHPQRLPS 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 889 PQatlpvhfvPGVRPAfSQPQPFGGQSVQAINPVGFCGTWPlPGPTPVMAPPDVMQ-PGSTHLpetprlLPLPPVGPPGP 967
Cdd:pfam03154 245 PH--------PPLQPM-TQPPPPSQVSPQPLPQPSLHGQMP-PMPHSLQTGPSHMQhPVPPQP------FPLTPQSSQSQ 308
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 968 TPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQ----ENLQRKKLPETFMPPAPIitap 1043
Cdd:pfam03154 309 VPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPipqlPNPQSHKHPPHLSGPSPF---- 384
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 1958646703 1044 LMSLGPEPQQALLPQSLVSGASLPPPGAPRECSLQQLQPLPPEKTQkelPP 1094
Cdd:pfam03154 385 QMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQ---PP 432
|
|
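Each hit above carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch of how such a line could be read into a record, and how the fraction of the domain model covered by an alignment could be computed from the `Cdd:` coordinates. The names `HitSummary`, `parse_summary`, and `model_coverage` are hypothetical helpers introduced here for illustration; they are not part of any NCBI tool, and the pattern assumes the field order seen in this listing.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Assumed layout, copied from the summary lines in this listing, e.g.
# "Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 116.67 E-value: 1.52e-28"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\]\s*)?"          # optional bracketed tag
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


@dataclass
class HitSummary:
    """Hypothetical record for one summary line."""
    pssm_id: int
    hit_type: Optional[str]
    cd_length: int
    bit_score: float
    e_value: float


def parse_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed summary, or None if the line has a different shape."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m["pssm_id"]),
        hit_type=m["hit_type"],
        cd_length=int(m["cd_length"]),
        bit_score=float(m["bit_score"]),
        e_value=float(m["e_value"]),
    )


def model_coverage(cd_from: int, cd_to: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned region.

    Coordinates are taken as 1-based and inclusive, as in the alignment rows.
    """
    return (cd_to - cd_from + 1) / cd_length


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 238121 [Multi-domain] Cd Length: 289 "
        "Bit Score: 116.67 E-value: 1.52e-28"
    )
    print(hit)
    # The cd00200 alignment above spans model positions 24-249 of 289.
    print(round(model_coverage(24, 249, 289), 3))
```

For the first cd00200 hit, the aligned region covers model positions 24 to 249 of a 289-residue model, so 226 of 289 positions are spanned; gaps inside the alignment are not subtracted in this sketch.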
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
34-353 |
7.16e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 114.74 E-value: 7.16e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 34 AWSPAKQYpvyLATGtsaqqldasfSTNATLEIFEVDFRDPSLDLKrkgilsvsSRFHKLIWGSSSSglleNTGVIAGGG 113
Cdd:cd00200 16 AFSPDGKL---LATG----------SGDGTIKVWDLETGELLRTLK--------GHTGPVRDVAASA----DGTYLASGS 70
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 114 DSGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSQnppeDIK 193
Cdd:cd00200 71 SDKTIRLWDLE-----TGECVRTLTG-HTSYVSSVAFSP-DGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD----WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 194 ALSWNrQVQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDrlpVIQLWDLRfASSP 273
Cdd:cd00200 140 SVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKLWDLS-TGKC 211
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 274 LKVLESHSRGILSVSWSQaDAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWISLYS 353
Cdd:cd00200 212 LGTLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG-SADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
109-354 |
3.23e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 112.70 E-value: 3.23e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 109 IAGGGDSGMLTLYNVTHilspGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSQNp 188
Cdd:COG2319 177 LASGSDDGTVRLWDLAT----GK--LLRTLTGHTGAVRSVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTGHSGS- 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 189 pedIKALSWNRQVQHILSsAHPSGKAVVWDLRKNEPIIKVSDHSSRMNcsGLAWNPDiATQLVLCSEDDRlpvIQLWDLR 268
Cdd:COG2319 249 ---VRSVAFSPDGRLLAS-GSADGTVRLWDLATGELLRTLTGHSGGVN--SVAFSPD-GKLLASGSDDGT---VRLWDLA 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 269 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGW 348
Cdd:COG2319 319 -TGKLLRTLTGHTGAVRSVAFS-PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASG-SADGT 395
|
....*.
gi 1958646703 349 ISLYSV 354
Cdd:COG2319 396 VRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
109-354 |
7.33e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.46 E-value: 7.33e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 109 IAGGGDSGMLTLYNvthiLSPGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQNP 188
Cdd:COG2319 135 LASGSADGTVRLWD----LATGK--LLRTLTGHSGAVTSVAFSP-DGKLLASGSDDGTVRLWDLATGKLLRTL----TGH 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 189 PEDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLR 268
Cdd:COG2319 204 TGAVRSVAFSPD-GKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRS--VAFSPD-GRLLASGSADGT---VRLWDLA 276
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 269 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGW 348
Cdd:COG2319 277 -TGELLRTLTGHSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASG-SDDGT 353
|
....*.
gi 1958646703 349 ISLYSV 354
Cdd:COG2319 354 VRLWDL 359
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
109-313 |
1.71e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 92.28 E-value: 1.71e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 109 IAGGGDSGMLTLYNVThilspgKEPLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSksqnP 188
Cdd:COG2319 219 LASGSADGTVRLWDLA------TGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG----H 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 189 PEDIKALSWNRQVQHILSSAHpSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLR 268
Cdd:COG2319 288 SGGVNSVAFSPDGKLLASGSD-DGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPD-GKTLASGSDDGT---VRLWDLA 360
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 1958646703 269 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSS 313
Cdd:COG2319 361 -TGELLRTLTGHTGAVTSVAFS-PDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
231-354 |
2.88e-10 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 62.74 E-value: 2.88e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 231 HSSRMNCsgLAWNPD---IATqlvlCSEDDRlpvIQLWDLRFaSSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIF 307
Cdd:cd00200 8 HTGGVTC--VAFSPDgklLAT----GSGDGT---IKVWDLET-GELLRTLKGHTGPVRDVAAS-ADGTYLASGSSDKTIR 76
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1958646703 308 CWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSFDGWISLYSV 354
Cdd:cd00200 77 LWDLETGECVRTLTGHTSYVSSVAFSP-DGRILSSSSRDKTIKVWDV 122
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
812-1097 |
2.44e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 55.33 E-value: 2.44e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 812 GRQAPAFPFPRVAVGAAlhPKETSSHRMGFQPPR--QVPAPSVRPRAAAQPsvmpflpshpiPSVGSWTQSSSDYRVPKP 889
Cdd:PHA03247 2641 HPPPTVPPPERPRDDPA--PGRVSRPRRARRLGRaaQASSPPQRPRRRAAR-----------PTVGSLTSLADPPPPPPT 2707
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 890 QATLPVHFVPGVrpafsqPQPFGGQSVQAINPVGFCGTWPLP---------GPTPVMAPPDVMQPgsthlpetprllplp 960
Cdd:PHA03247 2708 PEPAPHALVSAT------PLPPGPAAARQASPALPAAPAPPAvpagpatpgGPARPARPPTTAGP--------------- 2766
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 961 pvgppgptpLSSQPAASPVTfsvAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPP--AP 1038
Cdd:PHA03247 2767 ---------PAPAPPAAPAA---GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPptSA 2834
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958646703 1039 IITAPLMSLGPEPQQALLPQSLVSGASL---PPPGA----------PRECSLQQLQPLPPEKTQKELPPEHQ 1097
Cdd:PHA03247 2835 QPTAPPPPPGPPPPSLPLGGSVAPGGDVrrrPPSRSpaakpaaparPPVRRLARPAVSRSTESFALPPDQPE 2906
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
758-1093 |
1.26e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 1.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 758 PGPATTHRFTQYASLLAAQGSLA-IAMSVLPSDCTQPAVLQLKDRLFHAQGSTVLGRQAPAFPF----PRVAVGAALHPK 832
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTVGSLTsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAapapPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 833 ETSSHR--MGFQPPRQVPA---PSVRPRAAAQPSVMPFLPSHP-IPSvgSWTQSSSDYRVPKPQATLPvhfvPGVRPAFS 906
Cdd:PHA03247 2754 PARPARppTTAGPPAPAPPaapAAGPPRRLTRPAVASLSESREsLPS--PWDPADPPAAVLAPAAALP----PAASPAGP 2827
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 907 QPQPFGGQSVQAINPVGfcgtwPLPGPTPV---MAP-PDVMQPGSTHLPETPRLLPLPPVGPPGptplsSQPAASPVTFS 982
Cdd:PHA03247 2828 LPPPTSAQPTAPPPPPG-----PPPPSLPLggsVAPgGDVRRRPPSRSPAAKPAAPARPPVRRL-----ARPAVSRSTES 2897
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 983 VAHPPGGPGAPRSSALPssgilaTRPGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAPLMSLGPEPQQ---ALLPQS 1059
Cdd:PHA03247 2898 FALPPDQPERPPQPQAP------PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPwlgALVPGR 2971
|
330 340 350
....*....|....*....|....*....|....*
gi 1958646703 1060 L-VSGASLPPPGAPRECSlqqlQPLPPEKTQKELP 1093
Cdd:PHA03247 2972 VaVPRFRVPQPAPSREAP----ASSTPPLTGHSLS 3002
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
273-356 |
3.25e-06 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 50.41 E-value: 3.25e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 273 PLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWISLY 352
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFS-PDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASG-SSDKTIRLW 78
|
....
gi 1958646703 353 SVMG 356
Cdd:cd00200 79 DLET 82
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
811-1097 |
3.36e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 3.36e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 811 LGRQAPAFPFPRvavgAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPsVMPFLPSHPIP---SVGSWTQSSSDY-RV 886
Cdd:PHA03247 2791 LSESRESLPSPW----DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP-TAPPPPPGPPPpslPLGGSVAPGGDVrRR 2865
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 887 PKPQATLPVHFVPGVRPAFSQPQPFGGQSVQ--AINPVGfcgtwPLPGPTPVMAPPDVMQPgsthlpeTPRLLPLPPVGP 964
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTEsfALPPDQ-----PERPPQPQAPPPPQPQP-------QPPPPPQPQPPP 2933
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 965 PGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATR---PGPQDTwKVAPASQENLQRKKlpetfmpPAPIIT 1041
Cdd:PHA03247 2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRfrvPQPAPS-REAPASSTPPLTGH-------SLSRVS 3005
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958646703 1042 APLMSLG----PEPQQALLPQSLV-------SGASLPPPGAPRECSLQQLQPLPPEKTqkeLPPEHQ 1097
Cdd:PHA03247 3006 SWASSLAlheeTDPPPVSLKQTLWppddtedSDADSLFDSDSERSDLEALDPLPPEPH---DPFAHE 3069
|
|
| PTZ00421 |
PTZ00421 |
coronin; Provisional |
135-333 |
7.64e-06 |
|
coronin; Provisional
Pssm-ID: 173611 [Multi-domain] Cd Length: 493 Bit Score: 49.89 E-value: 7.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 135 IAQKQKHTGAVRALDFNPFQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSqnppEDIKALSWNRQvQHILSSAHPSGKA 214
Cdd:PTZ00421 118 IVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHS----DQITSLEWNLD-GSLLCTTSKDKKL 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 215 VVWDLRKNEPIIKVSDHSSRMNCSGLaWNPDIATQLVLCSEDDRLPVIQLWDLRFASSPLKVLESHSRGILSVSWSQADA 294
Cdd:PTZ00421 193 NIIDPRDGTIVSSVEAHASAKSQRCL-WAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDT 271
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 1958646703 295 ELL-LSSAKDNQI-----------FCWNLSSSEVVYKLPTQSSWCFDVQWC 333
Cdd:PTZ00421 272 NLLyIGSKGEGNIrcfelmnerltFCSSYSSVEPHKGLCMMPKWSLDTRKC 322
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
108-220 |
1.07e-05 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 49.14 E-value: 1.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 108 VIAGGGDSGMLTLYNVthilSPGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQN 187
Cdd:COG2319 302 LLASGSDDGTVRLWDL----ATGK--LLRTLTGHTGAVRSVAFSP-DGKTLASGSDDGTVRLWDLATGELLRTL----TG 370
|
90 100 110
....*....|....*....|....*....|...
gi 1958646703 188 PPEDIKALSWNRQVQHILSSAHpSGKAVVWDLR 220
Cdd:COG2319 371 HTGAVTSVAFSPDGRTLASGSA-DGTVRLWDLA 402
|
|
| ACE1-Sec16-like |
cd09233 |
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... |
581-673 |
1.16e-05 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way, Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.
Pssm-ID: 187750 [Multi-domain] Cd Length: 314 Bit Score: 48.79 E-value: 1.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 581 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 650
Cdd:cd09233 54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
|
90 100 110
....*....|....*....|....*....|..
gi 1958646703 651 KNWKDLVCACS---------LKNWREALALLL 673
Cdd:cd09233 133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
141-171 |
2.44e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 42.30 E-value: 2.44e-05
10 20 30
....*....|....*....|....*....|.
gi 1958646703 141 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 171
Cdd:smart00320 11 HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
224-327 |
2.47e-05 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 48.54 E-value: 2.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 224 PIIKVSdhsSRMNCSGLAWNPDIATQLVLCSEDDrlpVIQLWDLrfASSPLKV-LESHSRGILSVSWSQADAELLLSSAK 302
Cdd:PLN00181 525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
|
90 100
....*....|....*....|....*
gi 1958646703 303 DNQIFCWNLSSSEVVYKLPTQSSWC 327
Cdd:PLN00181 597 DGSVKLWSINQGVSIGTIKTKANIC 621
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
812-1102 |
5.68e-05 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 47.35 E-value: 5.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 812 GRQAPAFPFPRVAVGAALHPKETSSHR---MGFQPPRQVPAPSV---------RPRAAAQPSVMPFLPSHPIpsvgswtQ 879
Cdd:PHA03377 644 GPKPKSFWEMRAGRDGSGIQQEPSSRRqpaTQSTPPRPSWLPSVfvlpsvdagRAQPSEESHLSSMSPTQPI-------S 716
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 880 SSSDYRVPKPQATLPVHFVPGVRPAFSQPQPFGG----QSVQAINPvgfcGTW-PLPGPTPVMAppdVMQPGSTHLPETP 954
Cdd:PHA03377 717 HEEQPRYEDPDDPLDLSLHPDQAPPPSHQAPYSGheepQAQQAPYP----GYWePRPPQAPYLG---YQEPQAQGVQVSS 789
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 955 RLLPLPPVGPPGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGpQDTWKVAPASQenlqrkklPETfM 1034
Cdd:PHA03377 790 YPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAGHG-QDQVSQFPHLQ--------SET-G 859
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958646703 1035 PPAPIITaplmslgpEPQQALLPQSLVSGASL----PPPGAPrecslqqLQPLPpektqKELPPEHQCLKDS 1102
Cdd:PHA03377 860 PPRLQLS--------QVPQLPYSQTLVSSSAPswssPQPRAP-------IRPIP-----TRFPPPPMPLQDS 911
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
842-1043 |
9.31e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 46.77 E-value: 9.31e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 842 QPPRQVPAPsvRPRAAAQPSVMPFLPSHPIPSVGSwtqsssdyRVPKPQATLPVHFVPGVRPAFSQPQPFGGQSVQAINP 921
Cdd:PRK07003 375 RVAGAVPAP--GARAAAAVGASAVPAVTAVTGAAG--------AALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAAD 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 922 vgfcGTWPLPGPTPVMAPPDvmqpgsthlpetprllplppvgpPGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSS 1001
Cdd:PRK07003 445 ----GDAPVPAKANARASAD-----------------------SRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAA 497
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1958646703 1002 GILATRPGPQDTWKVAPASQENlqrkKLPETFMPPAPIITAP 1043
Cdd:PRK07003 498 APSAATPAAVPDARAPAAASRE----DAPAAAAPPAPEARPP 535
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
141-171 |
1.68e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.02 E-value: 1.68e-04
10 20 30
....*....|....*....|....*....|.
gi 1958646703 141 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 171
Cdd:pfam00400 10 HTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
271-310 |
6.21e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.48 E-value: 6.21e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1958646703 271 SSPLKVLESHSRGILSVSWSQaDAELLLSSAKDNQIFCWN 310
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
810-1094 |
6.90e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.99 E-value: 6.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 810 VLGRQAPAFPFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPR-AAAQPSVMPFLPSHPIPSVGSwTQSSSDYRVPK 888
Cdd:pfam03154 166 ILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSpATSQPPNQTQSTAAPHTLIQQ-TPTLHPQRLPS 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 889 PQatlpvhfvPGVRPAfSQPQPFGGQSVQAINPVGFCGTWPlPGPTPVMAPPDVMQ-PGSTHLpetprlLPLPPVGPPGP 967
Cdd:pfam03154 245 PH--------PPLQPM-TQPPPPSQVSPQPLPQPSLHGQMP-PMPHSLQTGPSHMQhPVPPQP------FPLTPQSSQSQ 308
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 968 TPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQ----ENLQRKKLPETFMPPAPIitap 1043
Cdd:pfam03154 309 VPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPipqlPNPQSHKHPPHLSGPSPF---- 384
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 1958646703 1044 LMSLGPEPQQALLPQSLVSGASLPPPGAPRECSLQQLQPLPPEKTQkelPP 1094
Cdd:pfam03154 385 QMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQ---PP 432
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
812-1085 |
9.74e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 9.74e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 812 GRQAPAFpFPRVAVGAALHPK---------ETSSHRMGFQPPrqvPAPSVRPRAAAQPSVMPflpSHPIPsvgswtqsss 882
Cdd:PHA03247 2513 SRLAPAI-LPDEPVGEPVHPRmltwirgleELASDDAGDPPP---PLPPAAPPAAPDRSVPP---PRPAP---------- 2575
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 883 dyRVPKPQATLPVHfVPGVRPAFSQPQpfggqsvqainpvgfcgtwplpgpTPVmAPPDvmqpgsthlpetprllplppv 962
Cdd:PHA03247 2576 --RPSEPAVTSRAR-RPDAPPQSARPR------------------------APV-DDRG--------------------- 2606
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 963 gppgPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKL----------PET 1032
Cdd:PHA03247 2607 ----DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRlgraaqasspPQR 2682
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 1033 FMPPA-PIITAPLMSLG----PEPQQALLPQSLVSGASLPP-PGAPRECS-LQQLQPLPP 1085
Cdd:PHA03247 2683 PRRRAaRPTVGSLTSLAdpppPPPTPEPAPHALVSATPLPPgPAAARQASpALPAAPAPP 2742
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
840-1068 |
1.19e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.94 E-value: 1.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 840 GFQPPRQVPAPSVRPRAAAQ-PSVMPFLPSHPIPSVGSWTQSSSdyrVPKPQATLPVHFVPGVRPAFSQPQPFGGQSVQA 918
Cdd:PRK12323 371 GAGPATAAAAPVAQPAPAAAaPAAAAPAPAAPPAAPAAAPAAAA---AARAVAAAPARRSPAPEALAAARQASARGPGGA 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 919 INPVgfcgtwPLPGPTPVMAPPDVMQPgsthlpetprllPLPPVGPPGPTPLSSQPAASPVTFSVAHPPGGPGAPrssAL 998
Cdd:PRK12323 448 PAPA------PAPAAAPAAAARPAAAG------------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPP---EF 506
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 999 PSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAPLMSLGPEPQQALLPQSLvSGASLPP 1068
Cdd:PRK12323 507 ASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRA-SASGLPD 575
|
|
| PTZ00420 |
PTZ00420 |
coronin; Provisional |
221-332 |
1.34e-03 |
|
coronin; Provisional
Pssm-ID: 240412 [Multi-domain] Cd Length: 568 Bit Score: 42.63 E-value: 1.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 221 KNEPIIKVSDHSSRMncSGLAWNPDIATQLVLCSEDdrlPVIQLWDLRF-------ASSPLKVLESHSRGILSVSWSQAD 293
Cdd:PTZ00420 63 RKPPVIKLKGHTSSI--LDLQFNPCFSEILASGSED---LTIRVWEIPHndesvkeIKDPQCILKGHKKKISIIDWNPMN 137
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 1958646703 294 AELLLSSAKDNQIFCWNLSSSEVVYK--LPTQSSwcfDVQW 332
Cdd:PTZ00420 138 YYIMCSSGFDSFVNIWDIENEKRAFQinMPKKLS---SLKW 175
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
830-1072 |
2.54e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 41.98 E-value: 2.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 830 HPKETSSHRMGFQPPRQVPAPSVRPRAAA-QPSVMPfLPSHP-----IPSVGSWTQSSSDYRVPKP---QATLPVHFVPG 900
Cdd:PHA03378 614 HIPETSAPRQWPMPLRPIPMRPLRMQPITfNVLVFP-TPHQPpqveiTPYKPTWTQIGHIPYQPSPtgaNTMLPIQWAPG 692
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 901 -----------VRPAFSQP---QPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSTHLPETPRLLPLPPVGPPG 966
Cdd:PHA03378 693 tmqpppraptpMRPPAAPPgraQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG 772
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 967 PTPLSSQPAASPVtfSVAHPPGGPG-APRSSALPSSGILATR--PGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAP 1043
Cdd:PHA03378 773 APTPQPPPQAPPA--PQQRPRGAPTpQPPPQAGPTSMQLMPRaaPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQA 850
|
250 260
....*....|....*....|....*....
gi 1958646703 1044 LMSLGPEPQQALLPQSLVSGASLPPPGAP 1072
Cdd:PHA03378 851 AAGPTPSPGSGTSDKIVQAPVFYPPVLQP 879
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
814-1051 |
5.64e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.91 E-value: 5.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 814 QAPAFPFPRVAVGAA--LHPKETSSHRMGFQPPRQVPAPsvrPRAAAQPSVMPfLPSHPIPSVG--------SWTQSSSD 883
Cdd:pfam03154 308 QVPPGPSPAAPGQSQqrIHTPPSQSQLQSQQPPREQPLP---PAPLSMPHIKP-PPTTPIPQLPnpqshkhpPHLSGPSP 383
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 884 YRVP---------KPQATLPVHFVPGVRPAFSQPQPfGGQSVQA--INPVGFCGTWPLPGPTPVMAPPDVMQPGSTHLPE 952
Cdd:pfam03154 384 FQMNsnlppppalKPLSSLSTHHPPSAHPPPLQLMP-QSQQLPPppAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPF 462
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 953 TPRLLPLPPVGPpgPTPLSSQPAASPVTFSVAHPPggpgaprSSALPSSGIlatrPGPQDTWKVAPASQenLQRKKLPET 1032
Cdd:pfam03154 463 PQHPFVPGGPPP--ITPPSGPPTSTSSAMPGIQPP-------SSASVSSSG----PVPAAVSCPLPPVQ--IKEEALDEA 527
|
250
....*....|....*....
gi 1958646703 1033 FMPPAPiiTAPLMSLGPEP 1051
Cdd:pfam03154 528 EEPESP--PPPPRSPSPEP 544
|
|
|