|
Name |
Accession |
Description |
Interval |
E-value |
| Tub super family |
cl08308 |
Tub family; |
1513-1583 |
1.79e-23 |
|
Tub family; The actual alignment was detected with superfamily member pfam01167:
Pssm-ID: 460094 Cd Length: 251 Bit Score: 101.50 E-value: 1.79e-23
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391005 1513 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1583
Cdd:pfam01167 174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-217 |
7.02e-11 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 66.09 E-value: 7.02e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319 246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
|
170
....*....|...
gi 1720391005 205 LLHESDGILSMSW 217
Cdd:COG2319 326 LTGHTGAVRSVAF 338
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
1049-1380 |
2.10e-07 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.10 E-value: 2.10e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1049 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1128
Cdd:PHA03247 2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1129 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1208
Cdd:PHA03247 2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1209 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1283
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1284 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1353
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 1720391005 1354 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1380
Cdd:PHA03247 2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
|
|
| SOCS super family |
cl02533 |
SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region ... |
373-408 |
3.85e-04 |
|
SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region of CIS/SOCS family proteins (in combination with a SH2 domain), ASBs (ankyrin repeat-containing proteins with a SOCS box), SSBs (SPRY domain-containing proteins with a SOCS box), and WSBs (WD40 repeat-containing proteins with a SOCS box), as well as, other miscellaneous proteins. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions. The actual alignment was detected with superfamily member cd03717:
Pssm-ID: 470605 Cd Length: 39 Bit Score: 39.50 E-value: 3.85e-04
10 20 30
....*....|....*....|....*....|....*.
gi 1720391005 373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717 2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Tub |
pfam01167 |
Tub family; |
1513-1583 |
1.79e-23 |
|
Tub family;
Pssm-ID: 460094 Cd Length: 251 Bit Score: 101.50 E-value: 1.79e-23
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391005 1513 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1583
Cdd:pfam01167 174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-217 |
7.02e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 66.09 E-value: 7.02e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319 246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
|
170
....*....|...
gi 1720391005 205 LLHESDGILSMSW 217
Cdd:COG2319 326 LTGHTGAVRSVAF 338
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1049-1380 |
2.10e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.10 E-value: 2.10e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1049 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1128
Cdd:PHA03247 2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1129 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1208
Cdd:PHA03247 2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1209 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1283
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1284 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1353
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 1720391005 1354 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1380
Cdd:PHA03247 2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
78-218 |
8.00e-07 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 52.72 E-value: 8.00e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVNDRGAqVSDFTWSHDGTQALISYRDGFVLVGSVSGQ 157
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLETG 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720391005 158 R-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV-----------MDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:cd00200 84 EcvrtltgH-------TSYVSSVAFSPDGRILSSSSRDKTIKVwdvetgkclttLRGHTDWVNSVAFSPDGTFVASSSQ 155
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1093-1349 |
8.02e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.45 E-value: 8.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1093 SSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAFTEDEALSQHCQLEKPLRHPP-----------LPEAAV 1161
Cdd:pfam03154 156 SDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPnqtqstaaphtLIQQTP 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1162 TMKRP------PPYQWDPMLGEDVWVPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP-PKSPSSPTATFPTGYGM 1234
Cdd:pfam03154 236 TLHPQrlpsphPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfPLTPQSSQSQVPPGPSP 315
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1235 GMPYPgSYNNPSLPGVQapcspkdalSQAQFAQQESAVVLQPAyPPSLSYCTLPPTYPGSstcssvQLPPIALHPWNSYS 1314
Cdd:pfam03154 316 AAPGQ-SQQRIHTPPSQ---------SQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIP------QLPNPQSHKHPPHL 378
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1720391005 1315 TCPPMQNTQGTLPPKPHLvveKPLVS----------PPPAELQSH 1349
Cdd:pfam03154 379 SGPSPFQMNSNLPPPPAL---KPLSSlsthhppsahPPPLQLMPQ 420
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
146-219 |
3.06e-04 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 41.11 E-value: 3.06e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720391005 146 DGFVLVGSVSGQRHWS-SEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWNY 219
Cdd:pfam12894 16 DGELLLHRLNWQRVWTlSPDKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGSDLITCLGWGE 90
|
|
| SOCS_SOCS_like |
cd03717 |
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ... |
373-408 |
3.85e-04 |
|
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.
Pssm-ID: 239687 Cd Length: 39 Bit Score: 39.50 E-value: 3.85e-04
10 20 30
....*....|....*....|....*....|....*.
gi 1720391005 373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717 2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
|
|
| SOCS_box |
smart00969 |
The SOCS box acts as a bridge between specific substrate- binding domains and more generic ... |
375-409 |
6.51e-04 |
|
The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;
Pssm-ID: 198037 Cd Length: 34 Bit Score: 38.54 E-value: 6.51e-04
10 20 30
....*....|....*....|....*....|....*
gi 1720391005 375 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 409
Cdd:smart00969 1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
|
|
| SOCS_box |
pfam07525 |
SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more ... |
374-408 |
1.96e-03 |
|
SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
Pssm-ID: 462192 Cd Length: 39 Bit Score: 37.53 E-value: 1.96e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1720391005 374 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 408
Cdd:pfam07525 2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
78-109 |
8.67e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 35.75 E-value: 8.67e-03
10 20 30
....*....|....*....|....*....|..
gi 1720391005 78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320 8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Tub |
pfam01167 |
Tub family; |
1513-1583 |
1.79e-23 |
|
Tub family;
Pssm-ID: 460094 Cd Length: 251 Bit Score: 101.50 E-value: 1.79e-23
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391005 1513 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1583
Cdd:pfam01167 174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-217 |
7.02e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 66.09 E-value: 7.02e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319 246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
|
170
....*....|...
gi 1720391005 205 LLHESDGILSMSW 217
Cdd:COG2319 326 LTGHTGAVRSVAF 338
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-217 |
1.93e-10 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 64.93 E-value: 1.93e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 45 WLATGNGRGVVGVTftsshcrrDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 218 LLASGSADGTVRLW--------DLATGKLL-RTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLW-DLATGELLRTLTGH 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319 288 SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRT 367
|
170
....*....|...
gi 1720391005 205 LLHESDGILSMSW 217
Cdd:COG2319 368 LTGHTGAVTSVAF 380
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1049-1380 |
2.10e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.10 E-value: 2.10e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1049 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1128
Cdd:PHA03247 2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1129 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1208
Cdd:PHA03247 2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1209 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1283
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1284 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1353
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 1720391005 1354 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1380
Cdd:PHA03247 2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-193 |
2.82e-07 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 54.92 E-value: 2.82e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 45 WLATGNGRGVVGVtftsshcrRDRSTPQRINFnLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 260 LLASGSADGTVRL--------WDLATGELLRT-LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLW-DLATGKLLRTLTGH 329
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720391005 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIV 193
Cdd:COG2319 330 TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRL 398
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
78-218 |
8.00e-07 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 52.72 E-value: 8.00e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVNDRGAqVSDFTWSHDGTQALISYRDGFVLVGSVSGQ 157
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLETG 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720391005 158 R-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV-----------MDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:cd00200 84 EcvrtltgH-------TSYVSSVAFSPDGRILSSSSRDKTIKVwdvetgkclttLRGHTDWVNSVAFSPDGTFVASSSQ 155
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
992-1344 |
2.21e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 2.21e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 992 RLTVPRYSIPTGDPPPYP------EIASQLAQGRSAA----QRLDNSLIHATLRRNNREVALKMAQLADSS-RAPLQPLA 1060
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPpppspsPAANEPDPHPPPTvpppERPRDDPAPGRVSRPRRARRLGRAAQASSPpQRPRRRAA 2688
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1061 KPKGGAAGAVAQLPARPP---PALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAfte 1137
Cdd:PHA03247 2689 RPTVGSLTSLADPPPPPPtpePAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG--- 2765
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1138 dealsqhcqlekplrhPPLPEAAVTMKRPPPYQWDPMLGEDVWVpqERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP 1217
Cdd:PHA03247 2766 ----------------PPAPAPPAAPAAGPPRRLTRPAVASLSE--SRESLPSPWDPADPPAAVLAPAAALPPAASPAGP 2827
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1218 PKSPSSPTATFPtgygmgmPYPGSYNNPSLP--GVQAPCSPkdaLSQAQFAQQESAVVLQPAYPPSLSyctLPPTYPGSS 1295
Cdd:PHA03247 2828 LPPPTSAQPTAP-------PPPPGPPPPSLPlgGSVAPGGD---VRRRPPSRSPAAKPAAPARPPVRR---LARPAVSRS 2894
|
330 340 350 360
....*....|....*....|....*....|....*....|....*....
gi 1720391005 1296 TCSSVQLPPIALHPWNSYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPA 1344
Cdd:PHA03247 2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
67-218 |
1.77e-05 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 48.49 E-value: 1.77e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 67 DRSTPQRINfNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiqyEGRWSVELVNDRG--AQVSDFTWSHDGTQALISY 144
Cdd:cd00200 79 DLETGECVR-TLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW---DVETGKCLTTLRGhtDWVNSVAFSPDGTFVASSS 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 145 RDGFVLVGSVSGQR-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 217
Cdd:cd00200 155 QDGTIKLWDLRTGKcvatltgH-------TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAF 227
|
.
gi 1720391005 218 N 218
Cdd:cd00200 228 S 228
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
944-1346 |
2.27e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 49.30 E-value: 2.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 944 VEEVCRPRTRMLCSQNTYTLPGPGSSATLRLTATEKkVPQPCTSatlnrLTVPrYSIPTGDPPPYPEIASQLAQGRSAAQ 1023
Cdd:PHA03378 426 IEEEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQP-LEGPTGP-----LSVQ-APLEPWQPLPHPQVTPVILHQPPAQG 498
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1024 RLDNSLIHATLRRNNREVALK-MAQLADssRAPLQPLAKPKG-----GAAGAVAQLPARPPPALYTCSQCSGAGP---SS 1094
Cdd:PHA03378 499 VQAHGSMLDLLEKDDEDMEQRvMATLLP--PSPPQPRAGRRApcvytEDLDIESDEPASTEPVHDQLLPAPGLGPlqiQP 576
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1095 QSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTdyVNSAFTEDEALSQHCQLEKPLRHPPLPEAAVTMK---RPPPY-- 1169
Cdd:PHA03378 577 LTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPT--TQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNvlvFPTPHqp 654
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1170 -QWDPMLGEDVWV-PQERTAQPTVPNPLKLSPLMLGQGQHLDVARVP--FVPPKSPSSPtATFPTGYGMGMPYPGSYNNP 1245
Cdd:PHA03378 655 pQVEITPYKPTWTqIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPtpMRPPAAPPGR-AQRPAAATGRARPPAAAPGR 733
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1246 SLPGVQAPcSPKDALSQAQFAQQESAVVLQPAYPPSLSYCTLPPTYPGsstcssvQLPPIalhpwnsystcpPMQNTQGT 1325
Cdd:PHA03378 734 ARPPAAAP-GRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPP-------QAPPA------------PQQRPRGA 793
|
410 420
....*....|....*....|.
gi 1720391005 1326 LPPKPhlvveKPLVSPPPAEL 1346
Cdd:PHA03378 794 PTPQP-----PPQAGPTSMQL 809
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
65-218 |
3.46e-05 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 47.98 E-value: 3.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 65 RRDRSTPQRINFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDRGAQVSDFTWSHDGTQALISY 144
Cdd:COG2319 61 LLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLW-DLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGS 139
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391005 145 RDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:COG2319 140 ADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS 213
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
65-217 |
4.76e-05 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 47.60 E-value: 4.76e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 65 RRDRSTPQRINFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVnDRGAQVSDFTWSHDGTQALISY 144
Cdd:COG2319 19 ALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLL-GHTAAVLSVAFSPDGRLLASAS 97
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720391005 145 RDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 217
Cdd:COG2319 98 ADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAF 170
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1093-1349 |
8.02e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.45 E-value: 8.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1093 SSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAFTEDEALSQHCQLEKPLRHPP-----------LPEAAV 1161
Cdd:pfam03154 156 SDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPnqtqstaaphtLIQQTP 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1162 TMKRP------PPYQWDPMLGEDVWVPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP-PKSPSSPTATFPTGYGM 1234
Cdd:pfam03154 236 TLHPQrlpsphPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfPLTPQSSQSQVPPGPSP 315
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1235 GMPYPgSYNNPSLPGVQapcspkdalSQAQFAQQESAVVLQPAyPPSLSYCTLPPTYPGSstcssvQLPPIALHPWNSYS 1314
Cdd:pfam03154 316 AAPGQ-SQQRIHTPPSQ---------SQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIP------QLPNPQSHKHPPHL 378
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1720391005 1315 TCPPMQNTQGTLPPKPHLvveKPLVS----------PPPAELQSH 1349
Cdd:pfam03154 379 SGPSPFQMNSNLPPPPAL---KPLSSlsthhppsahPPPLQLMPQ 420
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
989-1237 |
2.89e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 45.64 E-value: 2.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 989 TLNRLTVPRYSIPTGDPPPYPEIASQLAQGRSAAqrldnslihatlrrnnreVALKMAQLADSSRAPLQPLAKPKGGAAG 1068
Cdd:PRK12323 356 TLLRMLAFRPGQSGGGAGPATAAAAPVAQPAPAA------------------AAPAAAAPAPAAPPAAPAAAPAAAAAAR 417
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1069 AVAQLPARPPP---ALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSYNLLSPPDTSrdrtdyVNSAFTEDEALSQHC 1145
Cdd:PRK12323 418 AVAAAPARRSPapeALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAA------AAAPARAAPAAAPAP 491
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 1146 QLEKPlrhPPLPEAAVTMKRPPPYQWDPMLGEDVW--VPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVPPKSPSS 1223
Cdd:PRK12323 492 ADDDP---PPWEELPPEFASPAPAQPDAAPAGWVAesIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRA 568
|
250
....*....|....
gi 1720391005 1224 PTATFPTGYGMGMP 1237
Cdd:PRK12323 569 SASGLPDMFDGDWP 582
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
146-219 |
3.06e-04 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 41.11 E-value: 3.06e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720391005 146 DGFVLVGSVSGQRHWS-SEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWNY 219
Cdd:pfam12894 16 DGELLLHRLNWQRVWTlSPDKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGSDLITCLGWGE 90
|
|
| SOCS_SOCS_like |
cd03717 |
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ... |
373-408 |
3.85e-04 |
|
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.
Pssm-ID: 239687 Cd Length: 39 Bit Score: 39.50 E-value: 3.85e-04
10 20 30
....*....|....*....|....*....|....*.
gi 1720391005 373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717 2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
|
|
| SOCS_box |
smart00969 |
The SOCS box acts as a bridge between specific substrate- binding domains and more generic ... |
375-409 |
6.51e-04 |
|
The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;
Pssm-ID: 198037 Cd Length: 34 Bit Score: 38.54 E-value: 6.51e-04
10 20 30
....*....|....*....|....*....|....*
gi 1720391005 375 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 409
Cdd:smart00969 1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
|
|
| SOCS_box |
pfam07525 |
SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more ... |
374-408 |
1.96e-03 |
|
SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
Pssm-ID: 462192 Cd Length: 39 Bit Score: 37.53 E-value: 1.96e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1720391005 374 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 408
Cdd:pfam07525 2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
53-217 |
2.76e-03 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 41.55 E-value: 2.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 53 GVVGVTFTSSH-----CRRDRS-------TPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRwSVEL 120
Cdd:cd00200 95 YVSSVAFSPDGrilssSSRDKTikvwdveTGKCL-TTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGK-CVAT 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391005 121 VNDRGAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQR-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV 193
Cdd:cd00200 173 LTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKclgtlrgH-------ENGVNSVAFSPDGYLLASGSEDGTIRV 245
|
170 180
....*....|....*....|....*
gi 1720391005 194 MD-CHGRMLAHVLLHESdGILSMSW 217
Cdd:cd00200 246 WDlRTGECVQTLSGHTN-SVTSLAW 269
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
78-109 |
8.67e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 35.75 E-value: 8.67e-03
10 20 30
....*....|....*....|....*....|..
gi 1720391005 78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320 8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
|