|
Name |
Accession |
Description |
Interval |
E-value |
| Tub super family |
cl08308 |
Tub family; |
1469-1539 |
1.41e-23 |
|
Tub family; The actual alignment was detected with superfamily member pfam01167:
Pssm-ID: 460094 Cd Length: 251 Bit Score: 101.50 E-value: 1.41e-23
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 157820391 1469 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1539
Cdd:pfam01167 174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-217 |
6.79e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 66.09 E-value: 6.79e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319 246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
|
170
....*....|...
gi 157820391 205 LLHESDGILSMSW 217
Cdd:COG2319 326 LTGHTGAVRSVAF 338
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
1010-1299 |
1.81e-05 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 1.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1010 APLQP--PAKPKGGTGGAVAQLPARPPPALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSyNLLSPPDTSRDRtdyv 1087
Cdd:PHA03247 2591 APPQSarPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR-DDPAPGRVSRPR---- 2665
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1088 nsaftEDEALSQHCQLEKPLRHP-----PLPEATVT-MKRPPPYQWDPVLGEDVWVPQerTAQPTVPN---------PLK 1152
Cdd:PHA03247 2666 -----RARRLGRAAQASSPPQRPrrraaRPTVGSLTsLADPPPPPPTPEPAPHALVSA--TPLPPGPAaarqaspalPAA 2738
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1153 LSPLMIGQGQHLDVARVPFVSPKSPS---------SPTATFQTGYGMGVPYPGSYNTPSLPGVQAPCSPKDALSPAQFAQ 1223
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAgppapappaAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1224 QESAVVLQPAYPPSLSYCTLPPTYPGSSTCSslqLPPI-ALHPWNSYSTCPPMQNTQGTLPSKPHLVVEK---PLVSPPP 1299
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPS---LPLGgSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRlarPAVSRST 2895
|
|
| SOCS super family |
cl02533 |
SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region ... |
373-408 |
3.71e-04 |
|
SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region of CIS/SOCS family proteins (in combination with an SH2 domain), ASBs (ankyrin repeat-containing proteins with a SOCS box), SSBs (SPRY domain-containing proteins with a SOCS box), and WSBs (WD40 repeat-containing proteins with a SOCS box), as well as other miscellaneous proteins. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions. The actual alignment was detected with superfamily member cd03717:
Pssm-ID: 470605 Cd Length: 39 Bit Score: 39.50 E-value: 3.71e-04
10 20 30
....*....|....*....|....*....|....*.
gi 157820391 373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717 2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Tub |
pfam01167 |
Tub family; |
1469-1539 |
1.41e-23 |
|
Tub family;
Pssm-ID: 460094 Cd Length: 251 Bit Score: 101.50 E-value: 1.41e-23
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 157820391 1469 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1539
Cdd:pfam01167 174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-217 |
6.79e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 66.09 E-value: 6.79e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319 246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
|
170
....*....|...
gi 157820391 205 LLHESDGILSMSW 217
Cdd:COG2319 326 LTGHTGAVRSVAF 338
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
78-218 |
7.76e-07 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 52.72 E-value: 7.76e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVNDRGAqVSDFTWSHDGTQALISYRDGFVLVGSVSGQ 157
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLETG 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 157820391 158 R-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV-----------MDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:cd00200 84 EcvrtltgH-------TSYVSSVAFSPDGRILSSSSRDKTIKVwdvetgkclttLRGHTDWVNSVAFSPDGTFVASSSQ 155
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1010-1299 |
1.81e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 1.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1010 APLQP--PAKPKGGTGGAVAQLPARPPPALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSyNLLSPPDTSRDRtdyv 1087
Cdd:PHA03247 2591 APPQSarPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR-DDPAPGRVSRPR---- 2665
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1088 nsaftEDEALSQHCQLEKPLRHP-----PLPEATVT-MKRPPPYQWDPVLGEDVWVPQerTAQPTVPN---------PLK 1152
Cdd:PHA03247 2666 -----RARRLGRAAQASSPPQRPrrraaRPTVGSLTsLADPPPPPPTPEPAPHALVSA--TPLPPGPAaarqaspalPAA 2738
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1153 LSPLMIGQGQHLDVARVPFVSPKSPS---------SPTATFQTGYGMGVPYPGSYNTPSLPGVQAPCSPKDALSPAQFAQ 1223
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAgppapappaAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1224 QESAVVLQPAYPPSLSYCTLPPTYPGSSTCSslqLPPI-ALHPWNSYSTCPPMQNTQGTLPSKPHLVVEK---PLVSPPP 1299
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPS---LPLGgSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRlarPAVSRST 2895
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
146-219 |
2.97e-04 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 41.11 E-value: 2.97e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 157820391 146 DGFVLVGSVSGQRHWS-SEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWNY 219
Cdd:pfam12894 16 DGELLLHRLNWQRVWTlSPDKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGSDLITCLGWGE 90
|
|
| SOCS_SOCS_like |
cd03717 |
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ... |
373-408 |
3.71e-04 |
|
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.
Pssm-ID: 239687 Cd Length: 39 Bit Score: 39.50 E-value: 3.71e-04
10 20 30
....*....|....*....|....*....|....*.
gi 157820391 373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717 2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
|
|
| SOCS_box |
smart00969 |
The SOCS box acts as a bridge between specific substrate-binding domains and more generic ... |
375-409 |
5.91e-04 |
|
The SOCS box acts as a bridge between specific substrate-binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;
Pssm-ID: 198037 Cd Length: 34 Bit Score: 38.54 E-value: 5.91e-04
10 20 30
....*....|....*....|....*....|....*
gi 157820391 375 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 409
Cdd:smart00969 1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
|
|
| SOCS_box |
pfam07525 |
SOCS box; The SOCS box acts as a bridge between specific substrate-binding domains and more ... |
374-408 |
1.81e-03 |
|
SOCS box; The SOCS box acts as a bridge between specific substrate-binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
Pssm-ID: 462192 Cd Length: 39 Bit Score: 37.53 E-value: 1.81e-03
10 20 30
....*....|....*....|....*....|....*..
gi 157820391 374 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 408
Cdd:pfam07525 2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
78-109 |
8.43e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 35.75 E-value: 8.43e-03
10 20 30
....*....|....*....|....*....|..
gi 157820391 78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320 8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Tub |
pfam01167 |
Tub family; |
1469-1539 |
1.41e-23 |
|
Tub family;
Pssm-ID: 460094 Cd Length: 251 Bit Score: 101.50 E-value: 1.41e-23
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 157820391 1469 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1539
Cdd:pfam01167 174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-217 |
6.79e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 66.09 E-value: 6.79e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319 246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
|
170
....*....|...
gi 157820391 205 LLHESDGILSMSW 217
Cdd:COG2319 326 LTGHTGAVRSVAF 338
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-217 |
1.87e-10 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 64.93 E-value: 1.87e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 45 WLATGNGRGVVGVTftsshcrrDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 218 LLASGSADGTVRLW--------DLATGKLL-RTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLW-DLATGELLRTLTGH 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319 288 SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRT 367
|
170
....*....|...
gi 157820391 205 LLHESDGILSMSW 217
Cdd:COG2319 368 LTGHTGAVTSVAF 380
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-193 |
2.73e-07 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 54.92 E-value: 2.73e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 45 WLATGNGRGVVGVtftsshcrRDRSTPQRINFnLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319 260 LLASGSADGTVRL--------WDLATGELLRT-LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLW-DLATGKLLRTLTGH 329
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 157820391 125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIV 193
Cdd:COG2319 330 TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRL 398
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
78-218 |
7.76e-07 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 52.72 E-value: 7.76e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVNDRGAqVSDFTWSHDGTQALISYRDGFVLVGSVSGQ 157
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLETG 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 157820391 158 R-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV-----------MDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:cd00200 84 EcvrtltgH-------TSYVSSVAFSPDGRILSSSSRDKTIKVwdvetgkclttLRGHTDWVNSVAFSPDGTFVASSSQ 155
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
67-218 |
1.72e-05 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 48.49 E-value: 1.72e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 67 DRSTPQRINfNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiqyEGRWSVELVNDRG--AQVSDFTWSHDGTQALISY 144
Cdd:cd00200 79 DLETGECVR-TLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW---DVETGKCLTTLRGhtDWVNSVAFSPDGTFVASSS 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 145 RDGFVLVGSVSGQR-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 217
Cdd:cd00200 155 QDGTIKLWDLRTGKcvatltgH-------TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAF 227
|
.
gi 157820391 218 N 218
Cdd:cd00200 228 S 228
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1010-1299 |
1.81e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 1.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1010 APLQP--PAKPKGGTGGAVAQLPARPPPALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSyNLLSPPDTSRDRtdyv 1087
Cdd:PHA03247 2591 APPQSarPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR-DDPAPGRVSRPR---- 2665
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1088 nsaftEDEALSQHCQLEKPLRHP-----PLPEATVT-MKRPPPYQWDPVLGEDVWVPQerTAQPTVPN---------PLK 1152
Cdd:PHA03247 2666 -----RARRLGRAAQASSPPQRPrrraaRPTVGSLTsLADPPPPPPTPEPAPHALVSA--TPLPPGPAaarqaspalPAA 2738
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1153 LSPLMIGQGQHLDVARVPFVSPKSPS---------SPTATFQTGYGMGVPYPGSYNTPSLPGVQAPCSPKDALSPAQFAQ 1223
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAgppapappaAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1224 QESAVVLQPAYPPSLSYCTLPPTYPGSSTCSslqLPPI-ALHPWNSYSTCPPMQNTQGTLPSKPHLVVEK---PLVSPPP 1299
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPS---LPLGgSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRlarPAVSRST 2895
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
65-218 |
3.35e-05 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 47.98 E-value: 3.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 65 RRDRSTPQRINFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDRGAQVSDFTWSHDGTQALISY 144
Cdd:COG2319 61 LLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLW-DLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGS 139
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 157820391 145 RDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:COG2319 140 ADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS 213
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
65-217 |
4.61e-05 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 47.60 E-value: 4.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 65 RRDRSTPQRINFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVnDRGAQVSDFTWSHDGTQALISY 144
Cdd:COG2319 19 ALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLL-GHTAAVLSVAFSPDGRLLASAS 97
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 157820391 145 RDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 217
Cdd:COG2319 98 ADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAF 170
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
146-219 |
2.97e-04 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 41.11 E-value: 2.97e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 157820391 146 DGFVLVGSVSGQRHWS-SEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWNY 219
Cdd:pfam12894 16 DGELLLHRLNWQRVWTlSPDKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGSDLITCLGWGE 90
|
|
| SOCS_SOCS_like |
cd03717 |
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ... |
373-408 |
3.71e-04 |
|
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.
Pssm-ID: 239687 Cd Length: 39 Bit Score: 39.50 E-value: 3.71e-04
10 20 30
....*....|....*....|....*....|....*.
gi 157820391 373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717 2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
|
|
| SOCS_box |
smart00969 |
The SOCS box acts as a bridge between specific substrate-binding domains and more generic ... |
375-409 |
5.91e-04 |
|
The SOCS box acts as a bridge between specific substrate-binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;
Pssm-ID: 198037 Cd Length: 34 Bit Score: 38.54 E-value: 5.91e-04
10 20 30
....*....|....*....|....*....|....*
gi 157820391 375 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 409
Cdd:smart00969 1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
|
|
| SOCS_box |
pfam07525 |
SOCS box; The SOCS box acts as a bridge between specific substrate-binding domains and more ... |
374-408 |
1.81e-03 |
|
SOCS box; The SOCS box acts as a bridge between specific substrate-binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
Pssm-ID: 462192 Cd Length: 39 Bit Score: 37.53 E-value: 1.81e-03
10 20 30
....*....|....*....|....*....|....*..
gi 157820391 374 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 408
Cdd:pfam07525 2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
53-217 |
2.67e-03 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 41.55 E-value: 2.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 53 GVVGVTFTSSH-----CRRDRS-------TPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRwSVEL 120
Cdd:cd00200 95 YVSSVAFSPDGrilssSSRDKTikvwdveTGKCL-TTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGK-CVAT 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 121 VNDRGAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQR-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV 193
Cdd:cd00200 173 LTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKclgtlrgH-------ENGVNSVAFSPDGYLLASGSEDGTIRV 245
|
170 180
....*....|....*....|....*
gi 157820391 194 MD-CHGRMLAHVLLHESdGILSMSW 217
Cdd:cd00200 246 WDlRTGECVQTLSGHTN-SVTSLAW 269
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1005-1366 |
4.17e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 4.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1005 ADSSRAPLQPPAKPKGGTGGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1084
Cdd:PHA03247 2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1085 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEATVTMKRPPPYQWDPVLGEDVWVPQERTAQPTVPNPLklsplmigqGQHL 1164
Cdd:PHA03247 2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1165 DVARVPFVSPKSPSSPTATfqtgygmgvpypgsynTPSLPGVQAPCSPKDALSPAQFAQQESAVVLQPAYPPSLSYCTLP 1244
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP 2760
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1245 PTYPGSSTCSSLQLPPIALHPWNSYSTCPPMQNTQGTLPSKPHLVVEKPLVSPPPAELQSHMGTEVMVETADNFQEVLSL 1324
Cdd:PHA03247 2761 PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 157820391 1325 TESPVPQRTEKF------GKKNRKRLDSRAEEGSVQAITEGKVRKDAR 1366
Cdd:PHA03247 2841 PPPGPPPPSLPLggsvapGGDVRRRPPSRSPAAKPAAPARPPVRRLAR 2888
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
78-109 |
8.43e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 35.75 E-value: 8.43e-03
10 20 30
....*....|....*....|....*....|..
gi 157820391 78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320 8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
900-1300 |
8.71e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 40.82 E-value: 8.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 900 VEEVCRPRTRMLCSQNTYTLPGPGSSATLRLTATEKkVPQPCTSatlnrLTVPrYSIPTGDPPPYPEIASQLAQGRNATQ 979
Cdd:PHA03378 426 IEEEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQP-LEGPTGP-----LSVQ-APLEPWQPLPHPQVTPVILHQPPAQG 498
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 980 RLDNSLIHATLRRNNREVALK-MAQLADssRAPLQPPAKPKG-----GTGGAVAQLPARPPPALYTCSQCSGAGP---SS 1050
Cdd:PHA03378 499 VQAHGSMLDLLEKDDEDMEQRvMATLLP--PSPPQPRAGRRApcvytEDLDIESDEPASTEPVHDQLLPAPGLGPlqiQP 576
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1051 QSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTdyVNSAFTEDEALSQHCQLEKPLRHPPLPEATVTMK---RPPPY-- 1125
Cdd:PHA03378 577 LTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPT--TQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNvlvFPTPHqp 654
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1126 -QWDPVLGEDVWV-PQERTAQPTVPNPLKLSPLMIGQGQHLDVARVPFVSPKSPSSPTATFQTGYGMGVPYPGSYNTPSL 1203
Cdd:PHA03378 655 pQVEITPYKPTWTqIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRA 734
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157820391 1204 PGVQAPCSPKDALSPAQFAQQESAVVLQPAYPPSLSYCTLPPTYPGsstcsslQLPPIalhpwnsystcpPMQNTQGT-L 1282
Cdd:PHA03378 735 RPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPP-------QAPPA------------PQQRPRGApT 795
|
410
....*....|....*...
gi 157820391 1283 PSKPHLVVEKPLVSPPPA 1300
Cdd:PHA03378 796 PQPPPQAGPTSMQLMPRA 813
|
|
|
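The pairwise alignments listed above report a bit score and an E-value but not a percent identity. As a rough cross-check, identity can be counted directly from the two alignment rows. The sketch below does this for the cd03717 SOCS box hit (query residues 373-408). The function name `percent_identity` is hypothetical, and the sketch assumes that `-` marks a gap and that lowercase query letters mark residues not aligned to the domain model; gap columns are skipped and letter case is ignored.

```python
def percent_identity(query: str, subject: str) -> tuple[int, int, float]:
    """Return (identical, aligned, percent) for two equal-length alignment rows.

    Columns where either row has a gap ('-') are excluded from the aligned count.
    Comparison is case-insensitive (assumption: case only flags unaligned residues).
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # skip gap columns
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    percent = 100.0 * identical / aligned if aligned else 0.0
    return identical, aligned, percent


# Rows copied from the cd03717 alignment listed above.
QUERY = "RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL"
SUBJECT = "SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL"

identical, aligned, percent = percent_identity(QUERY, SUBJECT)
print(f"{identical}/{aligned} identical columns ({percent:.1f}%)")
```

The same function can be applied to any other block by concatenating its wrapped rows first. Note that percent identity is not the quantity the search ranks by; the reported E-values come from position-specific scoring, so a low-identity hit can still score well.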