Conserved domains on  [gi|1958646703|ref|XP_038936266|]

protein transport protein Sec31B isoform X7 [Rattus norvegicus]



List of domain hits

Name Accession Description Interval E-value
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
108-355 1.52e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
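The repeat layout described above can be turned into a rough pattern scan. If the GH dipeptide sits 11-24 residues from the start of a 40-residue repeat and WD closes it, then about 12-25 residues separate GH from WD. The sketch below encodes that window; the `slack` parameter and the function names are assumptions added for illustration, since real repeats vary in length, and this is not how CD-Search itself detects the domain.

```python
import re

def wd_repeat_pattern(min_gap=12, max_gap=25, slack=5):
    """Build a regex for GH ... WD with a tolerant spacer length.

    min_gap/max_gap follow from the description (GH 11-24 residues in,
    WD at the end of a 40-residue repeat). slack is an assumed tolerance.
    """
    lo = max(0, min_gap - slack)
    hi = max_gap + slack
    # Non-greedy spacer so the nearest WD after each GH is preferred.
    return re.compile("GH.{%d,%d}?WD" % (lo, hi))

def find_wd_repeats(seq, pattern=None):
    """Return 1-based (start, end) spans of candidate repeats in an ungapped sequence."""
    pattern = pattern or wd_repeat_pattern()
    return [(m.start() + 1, m.end()) for m in pattern.finditer(seq.upper())]
```

Candidate spans found this way are only hints; many WD40 repeats lack an exact GH or WD dipeptide, so a profile search such as the one reported on this page is the more reliable evidence.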


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 116.67  E-value: 1.52e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  108 VIAGGGDsGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPFqGNLLASGASDSEIFIWDLN--HLTVPMTpGSKS 185
Cdd:cd00200     24 LATGSGD-GTIKVWDLE-----TGELLRTLKG-HTGPVRDVAASAD-GTYLASGSSDKTIRLWDLEtgECVRTLT-GHTS 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  186 qnppeDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiaTQLVLCSEDDRLpvIQLW 265
Cdd:cd00200     95 -----YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKLW 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  266 DLRfASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSF 345
Cdd:cd00200    163 DLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSE 239
                          250
                   ....*....|
gi 1958646703  346 DGWISLYSVM 355
Cdd:cd00200    240 DGTIRVWDLR 249
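Alignment blocks in this report follow a fixed row shape: a label (`gi <number>` for the query, `Cdd:<accession>` for the domain model), a start coordinate, the aligned residues, and an end coordinate. Lowercase letters mark unaligned query residues and `-` marks gaps. A minimal sketch for reading such a block and computing identity over the aligned columns is given below; the function names are hypothetical and the row pattern is inferred from the blocks on this page, not from a published format specification.

```python
import re

# Row shape assumed from this report: label, start, residues, end.
ROW = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")

def parse_rows(text):
    """Concatenate the aligned segments of each sequence across all chunks."""
    query, subject = [], []
    for line in text.splitlines():
        m = ROW.match(line.strip())
        if m is None:
            continue  # ruler and tick lines carry no residues
        target = query if m.group(1).startswith("gi") else subject
        target.append(m.group(3))
    return "".join(query), "".join(subject)

def identity(query, subject):
    """Fraction of identical residues over columns aligned in both rows.

    Columns holding a gap or a lowercase (unaligned) residue are skipped.
    """
    pairs = [(q, s) for q, s in zip(query, subject)
             if q.isupper() and s.isupper()]
    if not pairs:
        return 0.0
    return sum(q == s for q, s in pairs) / len(pairs)
```

Passing the text of one alignment block to `parse_rows` and the result to `identity` gives a quick similarity figure to read alongside the reported bit score and E-value.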
PHA03247 super family cl33720
large tegument protein UL36; Provisional
819-1097 4.99e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 4.99e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  819 PFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVMPFLPSHPIPSvGSWTQSSSDYRVPKPQATLPVHFV 898
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRRLTRPAV 2788
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  899 PGVRPAF-SQPQPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSThlpetprllplppvgppgptPLSSQPAAS 977
Cdd:PHA03247  2789 ASLSESReSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP--------------------PPPPGPPPP 2848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  978 PVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPiitaplmslGPEPQQALLP 1057
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP---------QPQAPPPPQP 2919
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958646703 1058 QSlvsgaslPPPGAPREcslqQLQPLPPEKTQKELPPEHQ 1097
Cdd:PHA03247  2920 QP-------QPPPPPQP----QPPPPPPPRPQPPLAPTTD 2948
ACE1-Sec16-like super family cl14807
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
581-673 1.16e-05

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. Sec13 interacts in the same way with Sec31 and with Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


The actual alignment was detected with superfamily member cd09233:

Pssm-ID: 449359 [Multi-domain]  Cd Length: 314  Bit Score: 48.79  E-value: 1.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  581 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 650
Cdd:cd09233     54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1958646703  651 KNWKDLVCACS---------LKNWREALALLL 673
Cdd:cd09233    133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
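Each hit carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The sketch below pulls those fields into a dictionary and computes how much of the domain model an alignment spans, using the model coordinates printed on the `Cdd:` rows. The field names and both helper functions are illustrative assumptions, not part of any NCBI tool.

```python
import re

# Field layout assumed from the summary lines in this report.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

def parse_summary(line):
    """Return the numeric fields of a hit summary line, or None if absent."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }

def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the alignment (inclusive coordinates)."""
    return (model_end - model_start + 1) / cd_length
```

For the ACE1-Sec16-like hit, the model rows run from position 54 to 164 of a 314-residue model, so `model_coverage(54, 164, 314)` reports that the alignment spans roughly a third of the model, which is worth keeping in mind when reading its modest E-value.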
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
108-355 1.52e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 116.67  E-value: 1.52e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  108 VIAGGGDsGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPFqGNLLASGASDSEIFIWDLN--HLTVPMTpGSKS 185
Cdd:cd00200     24 LATGSGD-GTIKVWDLE-----TGELLRTLKG-HTGPVRDVAASAD-GTYLASGSSDKTIRLWDLEtgECVRTLT-GHTS 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  186 qnppeDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiaTQLVLCSEDDRLpvIQLW 265
Cdd:cd00200     95 -----YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKLW 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  266 DLRfASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSF 345
Cdd:cd00200    163 DLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSE 239
                          250
                   ....*....|
gi 1958646703  346 DGWISLYSVM 355
Cdd:cd00200    240 DGTIRVWDLR 249
WD40 COG2319
WD40 repeat [General function prediction only];
31-354 7.01e-27

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 114.62  E-value: 7.01e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703   31 AVQAWSPAKQYPVYLATGTSAQQLDASFSTNATLEIFEVDFRDPSLDLKRKGILSVSSRFHklIWGSSSSGLLENTGVIA 110
Cdd:COG2319     17 ALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH--TAAVLSVAFSPDGRLLA 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  111 GGGDSGMLTLYNVTHilspgkEPLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQNPPE 190
Cdd:COG2319     95 SASADGTVRLWDLAT------GLLLRTLTGHTGAVRSVAFSP-DGKTLASGSADGTVRLWDLATGKLLRTL----TGHSG 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  191 DIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLRfA 270
Cdd:COG2319    164 AVTSVAFSPD-GKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPD-GKLLASGSADGT---VRLWDLA-T 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  271 SSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWIS 350
Cdd:COG2319    236 GKLLRTLTGHSGSVRSVAFS-PDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASG-SDDGTVR 313

                   ....
gi 1958646703  351 LYSV 354
Cdd:COG2319    314 LWDL 317
PHA03247 PHA03247
large tegument protein UL36; Provisional
819-1097 4.99e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 4.99e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  819 PFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVMPFLPSHPIPSvGSWTQSSSDYRVPKPQATLPVHFV 898
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRRLTRPAV 2788
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  899 PGVRPAF-SQPQPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSThlpetprllplppvgppgptPLSSQPAAS 977
Cdd:PHA03247  2789 ASLSESReSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP--------------------PPPPGPPPP 2848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  978 PVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPiitaplmslGPEPQQALLP 1057
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP---------QPQAPPPPQP 2919
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958646703 1058 QSlvsgaslPPPGAPREcslqQLQPLPPEKTQKELPPEHQ 1097
Cdd:PHA03247  2920 QP-------QPPPPPQP----QPPPPPPPRPQPPLAPTTD 2948
PTZ00420 PTZ00420
coronin; Provisional
73-223 7.58e-06

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 49.95  E-value: 7.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703   73 DPSLDLKrkGILSVSSRFHKLIWGSSSSGLLENTGVIAGGGDSGMLTLYNVTHilspgKEPLIAQKqKHTGAVRALDFNP 152
Cdd:PTZ00420    13 DPSNNLF--DDLRICSRVIDSCGIACSSGFVAVPWEVEGGGLIGAIRLENQMR-----KPPVIKLK-GHTSSILDLQFNP 84
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958646703  153 FQGNLLASGASDSEIFIWDLNH----LTVPMTPGSKSQNPPEDIKALSWNRQVQHILSSAHPSGKAVVWDLrKNE 223
Cdd:PTZ00420    85 CFSEILASGSEDLTIRVWEIPHndesVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDI-ENE 158
ACE1-Sec16-like cd09233
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
581-673 1.16e-05

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. Sec13 interacts in the same way with Sec31 and with Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


Pssm-ID: 187750 [Multi-domain]  Cd Length: 314  Bit Score: 48.79  E-value: 1.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  581 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 650
Cdd:cd09233     54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1958646703  651 KNWKDLVCACS---------LKNWREALALLL 673
Cdd:cd09233    133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
141-171 2.44e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.30  E-value: 2.44e-05
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1958646703   141 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 171
Cdd:smart00320   11 HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
141-171 1.68e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.02  E-value: 1.68e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1958646703  141 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 171
Cdd:pfam00400   10 HTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
810-1094 6.90e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.99  E-value: 6.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  810 VLGRQAPAFPFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPR-AAAQPSVMPFLPSHPIPSVGSwTQSSSDYRVPK 888
Cdd:pfam03154  166 ILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSpATSQPPNQTQSTAAPHTLIQQ-TPTLHPQRLPS 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  889 PQatlpvhfvPGVRPAfSQPQPFGGQSVQAINPVGFCGTWPlPGPTPVMAPPDVMQ-PGSTHLpetprlLPLPPVGPPGP 967
Cdd:pfam03154  245 PH--------PPLQPM-TQPPPPSQVSPQPLPQPSLHGQMP-PMPHSLQTGPSHMQhPVPPQP------FPLTPQSSQSQ 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  968 TPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQ----ENLQRKKLPETFMPPAPIitap 1043
Cdd:pfam03154  309 VPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPipqlPNPQSHKHPPHLSGPSPF---- 384
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958646703 1044 LMSLGPEPQQALLPQSLVSGASLPPPGAPRECSLQQLQPLPPEKTQkelPP 1094
Cdd:pfam03154  385 QMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQ---PP 432
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
34-353 7.16e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 114.74  E-value: 7.16e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703   34 AWSPAKQYpvyLATGtsaqqldasfSTNATLEIFEVDFRDPSLDLKrkgilsvsSRFHKLIWGSSSSglleNTGVIAGGG 113
Cdd:cd00200     16 AFSPDGKL---LATG----------SGDGTIKVWDLETGELLRTLK--------GHTGPVRDVAASA----DGTYLASGS 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  114 DSGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSQnppeDIK 193
Cdd:cd00200     71 SDKTIRLWDLE-----TGECVRTLTG-HTSYVSSVAFSP-DGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD----WVN 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  194 ALSWNrQVQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDrlpVIQLWDLRfASSP 273
Cdd:cd00200    140 SVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKLWDLS-TGKC 211
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  274 LKVLESHSRGILSVSWSQaDAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWISLYS 353
Cdd:cd00200    212 LGTLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG-SADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
109-354 3.23e-26

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 112.70  E-value: 3.23e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  109 IAGGGDSGMLTLYNVTHilspGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSQNp 188
Cdd:COG2319    177 LASGSDDGTVRLWDLAT----GK--LLRTLTGHTGAVRSVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTGHSGS- 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  189 pedIKALSWNRQVQHILSsAHPSGKAVVWDLRKNEPIIKVSDHSSRMNcsGLAWNPDiATQLVLCSEDDRlpvIQLWDLR 268
Cdd:COG2319    249 ---VRSVAFSPDGRLLAS-GSADGTVRLWDLATGELLRTLTGHSGGVN--SVAFSPD-GKLLASGSDDGT---VRLWDLA 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  269 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGW 348
Cdd:COG2319    319 -TGKLLRTLTGHTGAVRSVAFS-PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASG-SADGT 395

                   ....*.
gi 1958646703  349 ISLYSV 354
Cdd:COG2319    396 VRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
109-354 7.33e-25

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 108.46  E-value: 7.33e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  109 IAGGGDSGMLTLYNvthiLSPGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQNP 188
Cdd:COG2319    135 LASGSADGTVRLWD----LATGK--LLRTLTGHSGAVTSVAFSP-DGKLLASGSDDGTVRLWDLATGKLLRTL----TGH 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  189 PEDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLR 268
Cdd:COG2319    204 TGAVRSVAFSPD-GKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRS--VAFSPD-GRLLASGSADGT---VRLWDLA 276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  269 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGW 348
Cdd:COG2319    277 -TGELLRTLTGHSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASG-SDDGT 353

                   ....*.
gi 1958646703  349 ISLYSV 354
Cdd:COG2319    354 VRLWDL 359
WD40 COG2319
WD40 repeat [General function prediction only];
109-313 1.71e-19

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 92.28  E-value: 1.71e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  109 IAGGGDSGMLTLYNVThilspgKEPLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSksqnP 188
Cdd:COG2319    219 LASGSADGTVRLWDLA------TGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG----H 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  189 PEDIKALSWNRQVQHILSSAHpSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLR 268
Cdd:COG2319    288 SGGVNSVAFSPDGKLLASGSD-DGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPD-GKTLASGSDDGT---VRLWDLA 360
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1958646703  269 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSS 313
Cdd:COG2319    361 -TGELLRTLTGHTGAVTSVAFS-PDGRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
231-354 2.88e-10

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 62.74  E-value: 2.88e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  231 HSSRMNCsgLAWNPD---IATqlvlCSEDDRlpvIQLWDLRFaSSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIF 307
Cdd:cd00200      8 HTGGVTC--VAFSPDgklLAT----GSGDGT---IKVWDLET-GELLRTLKGHTGPVRDVAAS-ADGTYLASGSSDKTIR 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1958646703  308 CWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSFDGWISLYSV 354
Cdd:cd00200     77 LWDLETGECVRTLTGHTSYVSSVAFSP-DGRILSSSSRDKTIKVWDV 122
PHA03247 PHA03247
large tegument protein UL36; Provisional
819-1097 4.99e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.19  E-value: 4.99e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  819 PFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVMPFLPSHPIPSvGSWTQSSSDYRVPKPQATLPVHFV 898
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRRLTRPAV 2788
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  899 PGVRPAF-SQPQPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSThlpetprllplppvgppgptPLSSQPAAS 977
Cdd:PHA03247  2789 ASLSESReSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP--------------------PPPPGPPPP 2848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  978 PVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPiitaplmslGPEPQQALLP 1057
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP---------QPQAPPPPQP 2919
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958646703 1058 QSlvsgaslPPPGAPREcslqQLQPLPPEKTQKELPPEHQ 1097
Cdd:PHA03247  2920 QP-------QPPPPPQP----QPPPPPPPRPQPPLAPTTD 2948
PHA03247 PHA03247
large tegument protein UL36; Provisional
812-1097 2.44e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.33  E-value: 2.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  812 GRQAPAFPFPRVAVGAAlhPKETSSHRMGFQPPR--QVPAPSVRPRAAAQPsvmpflpshpiPSVGSWTQSSSDYRVPKP 889
Cdd:PHA03247  2641 HPPPTVPPPERPRDDPA--PGRVSRPRRARRLGRaaQASSPPQRPRRRAAR-----------PTVGSLTSLADPPPPPPT 2707
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  890 QATLPVHFVPGVrpafsqPQPFGGQSVQAINPVGFCGTWPLP---------GPTPVMAPPDVMQPgsthlpetprllplp 960
Cdd:PHA03247  2708 PEPAPHALVSAT------PLPPGPAAARQASPALPAAPAPPAvpagpatpgGPARPARPPTTAGP--------------- 2766
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  961 pvgppgptpLSSQPAASPVTfsvAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPP--AP 1038
Cdd:PHA03247  2767 ---------PAPAPPAAPAA---GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPptSA 2834
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958646703 1039 IITAPLMSLGPEPQQALLPQSLVSGASL---PPPGA----------PRECSLQQLQPLPPEKTQKELPPEHQ 1097
Cdd:PHA03247  2835 QPTAPPPPPGPPPPSLPLGGSVAPGGDVrrrPPSRSpaakpaaparPPVRRLARPAVSRSTESFALPPDQPE 2906
PHA03247 PHA03247
large tegument protein UL36; Provisional
758-1093 1.26e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 1.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  758 PGPATTHRFTQYASLLAAQGSLA-IAMSVLPSDCTQPAVLQLKDRLFHAQGSTVLGRQAPAFPF----PRVAVGAALHPK 832
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTVGSLTsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAapapPAVPAGPATPGG 2753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  833 ETSSHR--MGFQPPRQVPA---PSVRPRAAAQPSVMPFLPSHP-IPSvgSWTQSSSDYRVPKPQATLPvhfvPGVRPAFS 906
Cdd:PHA03247  2754 PARPARppTTAGPPAPAPPaapAAGPPRRLTRPAVASLSESREsLPS--PWDPADPPAAVLAPAAALP----PAASPAGP 2827
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  907 QPQPFGGQSVQAINPVGfcgtwPLPGPTPV---MAP-PDVMQPGSTHLPETPRLLPLPPVGPPGptplsSQPAASPVTFS 982
Cdd:PHA03247  2828 LPPPTSAQPTAPPPPPG-----PPPPSLPLggsVAPgGDVRRRPPSRSPAAKPAAPARPPVRRL-----ARPAVSRSTES 2897
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  983 VAHPPGGPGAPRSSALPssgilaTRPGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAPLMSLGPEPQQ---ALLPQS 1059
Cdd:PHA03247  2898 FALPPDQPERPPQPQAP------PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPwlgALVPGR 2971
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1958646703 1060 L-VSGASLPPPGAPRECSlqqlQPLPPEKTQKELP 1093
Cdd:PHA03247  2972 VaVPRFRVPQPAPSREAP----ASSTPPLTGHSLS 3002
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
273-356 3.25e-06

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 50.41  E-value: 3.25e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  273 PLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWISLY 352
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFS-PDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASG-SSDKTIRLW 78

                   ....
gi 1958646703  353 SVMG 356
Cdd:cd00200     79 DLET 82
PHA03247 PHA03247
large tegument protein UL36; Provisional
811-1097 3.36e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 3.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  811 LGRQAPAFPFPRvavgAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPsVMPFLPSHPIP---SVGSWTQSSSDY-RV 886
Cdd:PHA03247  2791 LSESRESLPSPW----DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP-TAPPPPPGPPPpslPLGGSVAPGGDVrRR 2865
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  887 PKPQATLPVHFVPGVRPAFSQPQPFGGQSVQ--AINPVGfcgtwPLPGPTPVMAPPDVMQPgsthlpeTPRLLPLPPVGP 964
Cdd:PHA03247  2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTEsfALPPDQ-----PERPPQPQAPPPPQPQP-------QPPPPPQPQPPP 2933
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  965 PGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATR---PGPQDTwKVAPASQENLQRKKlpetfmpPAPIIT 1041
Cdd:PHA03247  2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRfrvPQPAPS-REAPASSTPPLTGH-------SLSRVS 3005
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958646703 1042 APLMSLG----PEPQQALLPQSLV-------SGASLPPPGAPRECSLQQLQPLPPEKTqkeLPPEHQ 1097
Cdd:PHA03247  3006 SWASSLAlheeTDPPPVSLKQTLWppddtedSDADSLFDSDSERSDLEALDPLPPEPH---DPFAHE 3069
PTZ00420 PTZ00420
coronin; Provisional
73-223 7.58e-06

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 49.95  E-value: 7.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703   73 DPSLDLKrkGILSVSSRFHKLIWGSSSSGLLENTGVIAGGGDSGMLTLYNVTHilspgKEPLIAQKqKHTGAVRALDFNP 152
Cdd:PTZ00420    13 DPSNNLF--DDLRICSRVIDSCGIACSSGFVAVPWEVEGGGLIGAIRLENQMR-----KPPVIKLK-GHTSSILDLQFNP 84
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958646703  153 FQGNLLASGASDSEIFIWDLNH----LTVPMTPGSKSQNPPEDIKALSWNRQVQHILSSAHPSGKAVVWDLrKNE 223
Cdd:PTZ00420    85 CFSEILASGSEDLTIRVWEIPHndesVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDI-ENE 158
PTZ00421 PTZ00421
coronin; Provisional
135-333 7.64e-06

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 49.89  E-value: 7.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  135 IAQKQKHTGAVRALDFNPFQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSqnppEDIKALSWNRQvQHILSSAHPSGKA 214
Cdd:PTZ00421   118 IVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHS----DQITSLEWNLD-GSLLCTTSKDKKL 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  215 VVWDLRKNEPIIKVSDHSSRMNCSGLaWNPDIATQLVLCSEDDRLPVIQLWDLRFASSPLKVLESHSRGILSVSWSQADA 294
Cdd:PTZ00421   193 NIIDPRDGTIVSSVEAHASAKSQRCL-WAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDT 271
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958646703  295 ELL-LSSAKDNQI-----------FCWNLSSSEVVYKLPTQSSWCFDVQWC 333
Cdd:PTZ00421   272 NLLyIGSKGEGNIrcfelmnerltFCSSYSSVEPHKGLCMMPKWSLDTRKC 322
WD40 COG2319
WD40 repeat [General function prediction only];
108-220 1.07e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 49.14  E-value: 1.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  108 VIAGGGDSGMLTLYNVthilSPGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQN 187
Cdd:COG2319    302 LLASGSDDGTVRLWDL----ATGK--LLRTLTGHTGAVRSVAFSP-DGKTLASGSDDGTVRLWDLATGELLRTL----TG 370
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1958646703  188 PPEDIKALSWNRQVQHILSSAHpSGKAVVWDLR 220
Cdd:COG2319    371 HTGAVTSVAFSPDGRTLASGSA-DGTVRLWDLA 402
ACE1-Sec16-like cd09233
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
581-673 1.16e-05

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. Sec13 interacts in the same way with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


Pssm-ID: 187750 [Multi-domain]  Cd Length: 314  Bit Score: 48.79  E-value: 1.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  581 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 650
Cdd:cd09233     54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1958646703  651 KNWKDLVCACS---------LKNWREALALLL 673
Cdd:cd09233    133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
141-171 2.44e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.30  E-value: 2.44e-05
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1958646703   141 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 171
Cdd:smart00320   11 HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
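
The paired rows above can be compared column by column. The following is a minimal sketch, not part of the CD-Search output: it counts identical residues between a query row and a Cdd row, using the smart00320 rows shown above as input. It assumes that `-` marks a gap and that lowercase letters mark residues left out of the alignment; the function name `alignment_identity` is hypothetical.

```python
from typing import Tuple


def alignment_identity(query_row: str, subject_row: str) -> Tuple[int, int]:
    """Return (identical columns, aligned columns) for two equal-length rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        # Assumption: '-' is a gap and lowercase residues are unaligned,
        # so neither kind of column counts toward the comparison.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the smart00320 alignment (query residues 141-171).
query = "HTGAVRALDFNPfQGNLLASGASDSEIFIWD"
subject = "HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD"

identical, aligned = alignment_identity(query, subject)
print(f"{identical} identical residues over {aligned} aligned columns")
```

For hits that span several alignment blocks, the rows of each block would need to be concatenated before calling the function.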
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
224-327 2.47e-05

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 48.54  E-value: 2.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  224 PIIKVSdhsSRMNCSGLAWNPDIATQLVLCSEDDrlpVIQLWDLrfASSPLKV-LESHSRGILSVSWSQADAELLLSSAK 302
Cdd:PLN00181   525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
                           90       100
                   ....*....|....*....|....*
gi 1958646703  303 DNQIFCWNLSSSEVVYKLPTQSSWC 327
Cdd:PLN00181   597 DGSVKLWSINQGVSIGTIKTKANIC 621
PHA03377 PHA03377
EBNA-3C; Provisional
812-1102 5.68e-05

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 47.35  E-value: 5.68e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  812 GRQAPAFPFPRVAVGAALHPKETSSHR---MGFQPPRQVPAPSV---------RPRAAAQPSVMPFLPSHPIpsvgswtQ 879
Cdd:PHA03377   644 GPKPKSFWEMRAGRDGSGIQQEPSSRRqpaTQSTPPRPSWLPSVfvlpsvdagRAQPSEESHLSSMSPTQPI-------S 716
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  880 SSSDYRVPKPQATLPVHFVPGVRPAFSQPQPFGG----QSVQAINPvgfcGTW-PLPGPTPVMAppdVMQPGSTHLPETP 954
Cdd:PHA03377   717 HEEQPRYEDPDDPLDLSLHPDQAPPPSHQAPYSGheepQAQQAPYP----GYWePRPPQAPYLG---YQEPQAQGVQVSS 789
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  955 RLLPLPPVGPPGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGpQDTWKVAPASQenlqrkklPETfM 1034
Cdd:PHA03377   790 YPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAGHG-QDQVSQFPHLQ--------SET-G 859
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958646703 1035 PPAPIITaplmslgpEPQQALLPQSLVSGASL----PPPGAPrecslqqLQPLPpektqKELPPEHQCLKDS 1102
Cdd:PHA03377   860 PPRLQLS--------QVPQLPYSQTLVSSSAPswssPQPRAP-------IRPIP-----TRFPPPPMPLQDS 911
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
842-1043 9.31e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 46.77  E-value: 9.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  842 QPPRQVPAPsvRPRAAAQPSVMPFLPSHPIPSVGSwtqsssdyRVPKPQATLPVHFVPGVRPAFSQPQPFGGQSVQAINP 921
Cdd:PRK07003   375 RVAGAVPAP--GARAAAAVGASAVPAVTAVTGAAG--------AALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAAD 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  922 vgfcGTWPLPGPTPVMAPPDvmqpgsthlpetprllplppvgpPGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSS 1001
Cdd:PRK07003   445 ----GDAPVPAKANARASAD-----------------------SRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAA 497
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1958646703 1002 GILATRPGPQDTWKVAPASQENlqrkKLPETFMPPAPIITAP 1043
Cdd:PRK07003   498 APSAATPAAVPDARAPAAASRE----DAPAAAAPPAPEARPP 535
WD40 pfam00400
WD domain, G-beta repeat;
141-171 1.68e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.02  E-value: 1.68e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1958646703  141 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 171
Cdd:pfam00400   10 HTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
WD40 pfam00400
WD domain, G-beta repeat;
271-310 6.21e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.48  E-value: 6.21e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1958646703  271 SSPLKVLESHSRGILSVSWSQaDAELLLSSAKDNQIFCWN 310
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
810-1094 6.90e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.99  E-value: 6.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  810 VLGRQAPAFPFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPR-AAAQPSVMPFLPSHPIPSVGSwTQSSSDYRVPK 888
Cdd:pfam03154  166 ILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSpATSQPPNQTQSTAAPHTLIQQ-TPTLHPQRLPS 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  889 PQatlpvhfvPGVRPAfSQPQPFGGQSVQAINPVGFCGTWPlPGPTPVMAPPDVMQ-PGSTHLpetprlLPLPPVGPPGP 967
Cdd:pfam03154  245 PH--------PPLQPM-TQPPPPSQVSPQPLPQPSLHGQMP-PMPHSLQTGPSHMQhPVPPQP------FPLTPQSSQSQ 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  968 TPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQ----ENLQRKKLPETFMPPAPIitap 1043
Cdd:pfam03154  309 VPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPipqlPNPQSHKHPPHLSGPSPF---- 384
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958646703 1044 LMSLGPEPQQALLPQSLVSGASLPPPGAPRECSLQQLQPLPPEKTQkelPP 1094
Cdd:pfam03154  385 QMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQ---PP 432
PHA03247 PHA03247
large tegument protein UL36; Provisional
812-1085 9.74e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 9.74e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  812 GRQAPAFpFPRVAVGAALHPK---------ETSSHRMGFQPPrqvPAPSVRPRAAAQPSVMPflpSHPIPsvgswtqsss 882
Cdd:PHA03247  2513 SRLAPAI-LPDEPVGEPVHPRmltwirgleELASDDAGDPPP---PLPPAAPPAAPDRSVPP---PRPAP---------- 2575
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  883 dyRVPKPQATLPVHfVPGVRPAFSQPQpfggqsvqainpvgfcgtwplpgpTPVmAPPDvmqpgsthlpetprllplppv 962
Cdd:PHA03247  2576 --RPSEPAVTSRAR-RPDAPPQSARPR------------------------APV-DDRG--------------------- 2606
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  963 gppgPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKL----------PET 1032
Cdd:PHA03247  2607 ----DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRlgraaqasspPQR 2682
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703 1033 FMPPA-PIITAPLMSLG----PEPQQALLPQSLVSGASLPP-PGAPRECS-LQQLQPLPP 1085
Cdd:PHA03247  2683 PRRRAaRPTVGSLTSLAdpppPPPTPEPAPHALVSATPLPPgPAAARQASpALPAAPAPP 2742
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
840-1068 1.19e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 1.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  840 GFQPPRQVPAPSVRPRAAAQ-PSVMPFLPSHPIPSVGSWTQSSSdyrVPKPQATLPVHFVPGVRPAFSQPQPFGGQSVQA 918
Cdd:PRK12323   371 GAGPATAAAAPVAQPAPAAAaPAAAAPAPAAPPAAPAAAPAAAA---AARAVAAAPARRSPAPEALAAARQASARGPGGA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  919 INPVgfcgtwPLPGPTPVMAPPDVMQPgsthlpetprllPLPPVGPPGPTPLSSQPAASPVTFSVAHPPGGPGAPrssAL 998
Cdd:PRK12323   448 PAPA------PAPAAAPAAAARPAAAG------------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPP---EF 506
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  999 PSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAPLMSLGPEPQQALLPQSLvSGASLPP 1068
Cdd:PRK12323   507 ASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRA-SASGLPD 575
PTZ00420 PTZ00420
coronin; Provisional
221-332 1.34e-03

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 42.63  E-value: 1.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  221 KNEPIIKVSDHSSRMncSGLAWNPDIATQLVLCSEDdrlPVIQLWDLRF-------ASSPLKVLESHSRGILSVSWSQAD 293
Cdd:PTZ00420    63 RKPPVIKLKGHTSSI--LDLQFNPCFSEILASGSED---LTIRVWEIPHndesvkeIKDPQCILKGHKKKISIIDWNPMN 137
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1958646703  294 AELLLSSAKDNQIFCWNLSSSEVVYK--LPTQSSwcfDVQW 332
Cdd:PTZ00420   138 YYIMCSSGFDSFVNIWDIENEKRAFQinMPKKLS---SLKW 175
PHA03378 PHA03378
EBNA-3B; Provisional
830-1072 2.54e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.98  E-value: 2.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  830 HPKETSSHRMGFQPPRQVPAPSVRPRAAA-QPSVMPfLPSHP-----IPSVGSWTQSSSDYRVPKP---QATLPVHFVPG 900
Cdd:PHA03378   614 HIPETSAPRQWPMPLRPIPMRPLRMQPITfNVLVFP-TPHQPpqveiTPYKPTWTQIGHIPYQPSPtgaNTMLPIQWAPG 692
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  901 -----------VRPAFSQP---QPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSTHLPETPRLLPLPPVGPPG 966
Cdd:PHA03378   693 tmqpppraptpMRPPAAPPgraQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG 772
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  967 PTPLSSQPAASPVtfSVAHPPGGPG-APRSSALPSSGILATR--PGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAP 1043
Cdd:PHA03378   773 APTPQPPPQAPPA--PQQRPRGAPTpQPPPQAGPTSMQLMPRaaPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQA 850
                          250       260
                   ....*....|....*....|....*....
gi 1958646703 1044 LMSLGPEPQQALLPQSLVSGASLPPPGAP 1072
Cdd:PHA03378   851 AAGPTPSPGSGTSDKIVQAPVFYPPVLQP 879
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
814-1051 5.64e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.91  E-value: 5.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  814 QAPAFPFPRVAVGAA--LHPKETSSHRMGFQPPRQVPAPsvrPRAAAQPSVMPfLPSHPIPSVG--------SWTQSSSD 883
Cdd:pfam03154  308 QVPPGPSPAAPGQSQqrIHTPPSQSQLQSQQPPREQPLP---PAPLSMPHIKP-PPTTPIPQLPnpqshkhpPHLSGPSP 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  884 YRVP---------KPQATLPVHFVPGVRPAFSQPQPfGGQSVQA--INPVGFCGTWPLPGPTPVMAPPDVMQPGSTHLPE 952
Cdd:pfam03154  384 FQMNsnlppppalKPLSSLSTHHPPSAHPPPLQLMP-QSQQLPPppAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPF 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958646703  953 TPRLLPLPPVGPpgPTPLSSQPAASPVTFSVAHPPggpgaprSSALPSSGIlatrPGPQDTWKVAPASQenLQRKKLPET 1032
Cdd:pfam03154  463 PQHPFVPGGPPP--ITPPSGPPTSTSSAMPGIQPP-------SSASVSSSG----PVPAAVSCPLPPVQ--IKEEALDEA 527
                          250
                   ....*....|....*....
gi 1958646703 1033 FMPPAPiiTAPLMSLGPEP 1051
Cdd:pfam03154  528 EEPESP--PPPPRSPSPEP 544
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
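
Each hit in this report carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value), and the preset options list an E-value threshold of 0.01. The following is a minimal sketch of parsing those summary lines and checking them against that threshold. The regular expression and the names `parse_summary` and `E_VALUE_THRESHOLD` are illustrative assumptions, not part of any NCBI tool, and treating the threshold as inclusive is also an assumption.

```python
import re

# Assumed layout of a per-hit summary line, taken from the lines in this report.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

E_VALUE_THRESHOLD = 0.01  # value listed under "Preset Options"


def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    match = SUMMARY.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Two summary lines copied from hits in this report.
lines = [
    "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 62.74  E-value: 2.88e-10",
    "Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.91  E-value: 5.64e-03",
]

hits = [parse_summary(line) for line in lines]
# Keep hits at or below the threshold (inclusive comparison is an assumption).
kept = [hit for hit in hits if hit is not None and hit["e_value"] <= E_VALUE_THRESHOLD]

for hit in kept:
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Sorting `kept` by `e_value` would reproduce the ordering used in the list of domain hits, where the most significant hit comes first.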
