Conserved domains on  [gi|1622885920|ref|XP_001087439|]

proline and serine-rich protein 1 isoform X1 [Macaca mulatta]

Protein Classification

DUF4476 domain-containing protein (domain architecture ID 10632145)

DUF4476 domain-containing protein

List of domain hits

Name  Accession  Description  Interval  E-value
DUF4476  pfam14771  Domain of unknown function (DUF4476);  29-121  4.40e-19

Pssm-ID: 434196  Cd Length: 91  Bit Score: 82.62  E-value: 4.40e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  29 VHGYFSSEQVVDLLRYFSWAEPQLKAMKA--LQHKMVAVQpteVVNILNCFTFSKDKLVALELLASNIIDAQNSRPIEDL 106
Cdd:pfam14771   1 VMSDNDFDQLVEKLKRFSFDDDKLKLLEQalLNNYFTCSQ---AAQLLKIFSFDDDRLKALKLLYPNIVDKQNYEVIIDV 77
                          90
                  ....*....|....*
gi 1622885920 107 FRINmSEKKRCKRIL 121
Cdd:pfam14771  78 FTFS-SDKDKAREIL 91
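As a rough illustration of how the paired rows above can be read, the following sketch counts identical columns between a query row and a domain-model row. It assumes that lowercase letters mark unaligned residues and that `-` marks a gap, and it skips both; the function name `aligned_identity` is hypothetical and not part of any NCBI tool.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Assumption: gaps ('-') and lowercase residues are not aligned columns.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Second block of the DUF4476 alignment (query 107-121, pfam14771 78-91).
query_row = "FRINmSEKKRCKRIL"
subject_row = "FTFS-SDKDKAREIL"
identical, aligned = aligned_identity(query_row, subject_row)
print(f"{identical} of {aligned} aligned columns are identical")
```

The same function can be applied to each block of a longer alignment and the counts summed, provided the rows are copied without the leading identifiers and coordinates.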
PHA03247 super family  cl33720  large tegument protein UL36; Provisional  273-798  7.69e-09

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.95  E-value: 7.69e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  273 PHGSNPSTPAATPVPTASP---VKAINHPSASAAATVSGMNLPNTVLPVFPGQVSSAVHTPQPSTPNPTVIRTPSL-PTA 348
Cdd:PHA03247  2484 AEARFPFAAGAAPDPGGGGppdPDAPPAPSRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAApPAA 2563
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  349 PVTSIHSTTTTPVPSIFSGLVSLPGPSATPTPGpTPRSTLGSSETFASTSAPfTSLPFSISSTAASTSNPNSASlSSVFA 428
Cdd:PHA03247  2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSA-RPRAPVDDRGDPRGPAPP-SPLPPDTHAPDPPPPSPSPAA-NEPDP 2640
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  429 GLPLPLPPTSQGLSNPTP---------VIAGGSTPSVAGPLGVNSPLLSALKGFLTSSDTSLISSSALSSAVTSgLASLS 499
Cdd:PHA03247  2641 HPPPTVPPPERPRDDPAPgrvsrprraRRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHA-LVSAT 2719
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  500 SLTLQNSDSSASAPSKCYAPSAIPPPQRTSTPGLALFPGLP-SPVANSTSTPLTLPVQSP--LATVASASTSAPVSCGSS 576
Cdd:PHA03247  2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPpTTAGPPAPAPPAAPAAGPprRLTRPAVASLSESRESLP 2799
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  577 APLLHGPHPG-----TSDLHISSTPAvTTLPVMIKTEPTSPTPSafKGPSHSGNPSHGTLGLSGTLGRAYTSTSVPISLS 651
Cdd:PHA03247  2800 SPWDPADPPAavlapAAALPPAASPA-GPLPPPTSAQPTAPPPP--PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPA 2876
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  652 TCLNPALSGLSSLSThlngSNTLSSISLPPHGSSTPIAPVfTALPPFTSLTNNFPLTGNPSLNPSvslpgsliatsstaa 731
Cdd:PHA03247  2877 APARPPVRRLARPAV----SRSTESFALPPDQPERPPQPQ-APPPPQPQPQPPPPPQPQPPPPPP--------------- 2936
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622885920  732 tstslPHPSSTAAVLSGLSASAPVSAAPFPLNLSTAVPSLFSVTQGPLSSSNPSYPGfSVSNTPSVT 798
Cdd:PHA03247  2937 -----PRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREA-PASSTPPLT 2997
 
Atrophin-1  pfam03154  Atrophin-1 family  247-627  2.73e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 2.73e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 247 ENEDLSNQSKPIQNQTFSTPASQLFSPHGSNPSTPAATPVPTASPVkainhpSASAAATVSGMNLPNTVLPVFPGQVSSA 326
Cdd:pfam03154 153 DNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAG------PTPSAPSVPPQGSPATSQPPNQTQSTAA 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 327 VHTPQPSTPNPTVIRTPS-------LPTAPVTSIHSTTTTPVPSIFSGLVSLP-----GPSATPTPGPtPRSTLGSSETF 394
Cdd:pfam03154 227 PHTLIQQTPTLHPQRLPSphpplqpMTQPPPPSQVSPQPLPQPSLHGQMPPMPhslqtGPSHMQHPVP-PQPFPLTPQSS 305
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 395 ASTSAPFTSLPFSISSTAASTSNPNSASLSSVFAGLPLPLPPTSQGLSNPTPVIAGGSTPSVAGPLGVNSPLLSALKGFl 474
Cdd:pfam03154 306 QSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPF- 384
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 475 tssdtslisSSALSSAVTSGLASLSSLTLQNSDSSASAPSKCYAPSAIPPPQRTSTPGLALFPGLPSPVAN--STSTPLT 552
Cdd:pfam03154 385 ---------QMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAAShpPTSGLHQ 455
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 553 LPVQSPLAT---------VASASTSAPVSCGSSAPLLHGPH--PGTSDLHISSTPAVTTLPVMIKTEPTSPTPSAFKGPS 621
Cdd:pfam03154 456 VPSQSPFPQhpfvpggppPITPPSGPPTSTSSAMPGIQPPSsaSVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPP 535

                  ....*.
gi 1622885920 622 HSGNPS 627
Cdd:pfam03154 536 PPRSPS 541
Chi1  COG3469  Chitinase [Carbohydrate transport and metabolism];  252-425  1.99e-03


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 41.66  E-value: 1.99e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 252 SNQSKPIQNQTFSTPASQLFSPHGSNPSTPAATPVPTASPVKAINHPSASAAATVSGMNLPNTVLPVFPGQVSSAVHTPQ 331
Cdd:COG3469    36 AATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTT 115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 332 PSTPNPTVIRTPSLPTAPVTSIHSTT-----TTPVPSIFSGLVSLPGPSATPTPGPTPRSTLGSSETFASTSAPFTSLPF 406
Cdd:COG3469   116 STGAGSVTSTTSSTAGSTTTSGASATssagsTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATT 195
                         170
                  ....*....|....*....
gi 1622885920 407 SISSTAASTSNPNSASLSS 425
Cdd:COG3469   196 PSATTTATTTGPPTPGLPK 214
 
PHA03247  PHA03247  large tegument protein UL36; Provisional  169-600  9.66e-09


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.57  E-value: 9.66e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  169 EECTNEGKGIAARILGPSKPPPST---YNPHKPVPYPIPPC---------------RPHATIAPSAYNHAGLVPLASVIA 230
Cdd:PHA03247  2541 EELASDDAGDPPPPLPPAAPPAAPdrsVPPPRPAPRPSEPAvtsrarrpdappqsaRPRAPVDDRGDPRGPAPPSPLPPD 2620
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  231 PGTPPPPPYTPNPVGTENEDLSNQSKPIQNQTFSTPASQLFSPH--GSNPSTPAATPVPTASPVKAINHPSASAAATVSG 308
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD 2700
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  309 MNLPNTVLPVFPGQVSSAVHTP-------QPSTPNPTVIRTPSLPTAPVTSIhSTTTTPVPSIFSGLVSlPGPSATPTPG 381
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLPpgpaaarQASPALPAAPAPPAVPAGPATPG-GPARPARPPTTAGPPA-PAPPAAPAAG 2778
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  382 PTPRSTLGSSETfASTSAPFTSLPFSISSTAASTSNPNSASLSSVFAGLPLPLPPTSQGLSNPTP--------VIAGGST 453
Cdd:PHA03247  2779 PPRRLTRPAVAS-LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPpgppppslPLGGSVA 2857
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  454 PsvAGPLGVNSPLLSALKGFLTSSDTSLISSSALSSAVTSGLASLSSLTLQNSDSSASAPSKCYAPSAIPPPQRTS---T 530
Cdd:PHA03247  2858 P--GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPpppP 2935
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622885920  531 PGLALFPGLPSPVANSTSTPLTLPVQSPLATVASASTSAP-VSCGSSAPLLHGPHPGTSDLHISSTPAVTT 600
Cdd:PHA03247  2936 PPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
PHA03247  PHA03247  large tegument protein UL36; Provisional  141-647  5.88e-06


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 5.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  141 TIPGNPYPKGRPSRinGIFPGTPLKKDGEECTNEGKGIAARILGPSKP--PPSTYNPHKPVPYPIPPCRPHATIAPSAYN 218
Cdd:PHA03247  2458 TILGAPFSLSLLLG--ELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPpdPDAPPAPSRLAPAILPDEPVGEPVHPRMLT 2535
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  219 HA-GLVPLASVIAPGTPPPPPYTPNPVGTENEDLSNQ-----------------SKPIQNQTFSTPAsqlfSPHGSNPST 280
Cdd:PHA03247  2536 WIrGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRpaprpsepavtsrarrpDAPPQSARPRAPV----DDRGDPRGP 2611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  281 PAATPVPTASPVKAINHPSASAAATVSGMNLPNTVLPVFPGQVSSAVhtpqPSTPNPTVIRTPSLPTAPVTSIHSTTTTP 360
Cdd:PHA03247  2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAP----GRVSRPRRARRLGRAAQASSPPQRPRRRA 2687
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  361 VPSIFSGLVSLPGPSATP-TPGPTPRSTLGSSETFASTSAPFTSLPfSISSTAASTSNPNSASLSSVFAglPLPLPPTSQ 439
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPpTPEPAPHALVSATPLPPGPAAARQASP-ALPAAPAPPAVPAGPATPGGPA--RPARPPTTA 2764
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  440 GLSNPTPVIAGGSTPSVAGPLGVNSPLLSALKGFLTSSDTSLISSSALSSAVTSGLASLSSLTLQNSDSSASAPSkcyaP 519
Cdd:PHA03247  2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP----P 2840
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  520 SAIPPPQRTSTPGLALFPGLPSPVANSTSTPLTLPVQSPLATVASASTSAPVSCGSSAPL--LHGPHPGTSDLHISSTPA 597
Cdd:PHA03247  2841 PPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALppDQPERPPQPQAPPPPQPQ 2920
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|
gi 1622885920  598 vTTLPVMIKTEPTSPTPSAFKGPSHSGNPSHGTLGLSGTLGRAYTSTSVP 647
Cdd:PHA03247  2921 -PQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVP 2969
PHA03247  PHA03247  large tegument protein UL36; Provisional  143-626  1.72e-04


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 1.72e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  143 PGNPYPKGRPSRINGIFPGTPLKKDGEECTNEGKGIAARilgpskPPPSTYNPHKPVPYPIPPCRPHATIAPSAynhagL 222
Cdd:PHA03247  2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVS------RPRRARRLGRAAQASSPPQRPRRRAARPT-----V 2692
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  223 VPLASVIAPGTPPPPPYTPNPVGTENEDLSNQSKPIQNQTFSTPASQLFSPHGSNPSTPAATPVPTASPVKAINHPSASA 302
Cdd:PHA03247  2693 GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP 2772
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  303 AATVSGmnlPNTVLPVFPGQVSSAVHTPQPSTPNPTVIRTPSLPTAPVTSIHSTTTTPVPsifsglvslPGPSATPTPGP 382
Cdd:PHA03247  2773 AAPAAG---PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP---------PPTSAQPTAPP 2840
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  383 TPRSTLGSSETFASTSAPftSLPFSISSTAASTSNPNSASLSSVFAGLPLP-LPPTSQGLSNPTPVIAGGSTPSVAGPLG 461
Cdd:PHA03247  2841 PPPGPPPPSLPLGGSVAP--GGDVRRRPPSRSPAAKPAAPARPPVRRLARPaVSRSTESFALPPDQPERPPQPQAPPPPQ 2918
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  462 VNSPLLSALKGFLTSSDTSLISSSALSSAVTSGlASLSSLTLQNSDSSASAPSKCYAPSAIPPPQRTSTPGLALFP---- 537
Cdd:PHA03247  2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG-AGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTpplt 2997
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  538 GLPSPVANSTSTPLTLPVQSPLATVASASTSAPVSC--GSSAPLLHGPHPGTSDLH-ISSTPAVTTLPVMIKTEPTSPTP 614
Cdd:PHA03247  2998 GHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDDteDSDADSLFDSDSERSDLEaLDPLPPEPHDPFAHEPDPATPEA 3077
                          490
                   ....*....|...
gi 1622885920  615 SAFKGP-SHSGNP 626
Cdd:PHA03247  3078 GARESPsSQFGPP 3090
PHA03247  PHA03247  large tegument protein UL36; Provisional  510-937  2.33e-04


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 2.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  510 ASAPSKCYAPSAIPPPQRTSTPGLALFPGLPSPVANSTSTPLTLPVQSplatvasASTSAPVSCGSSAPLLHGPHPGTSD 589
Cdd:PHA03247  2548 AGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQS-------ARPRAPVDDRGDPRGPAPPSPLPPD 2620
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  590 LHISSTPAVTTLPVmiKTEPTSPTPSAFKGPSHSGN-PSHGTLGLS---GTLGRAYTSTSVPISLSTCLNPALSGlssls 665
Cdd:PHA03247  2621 THAPDPPPPSPSPA--ANEPDPHPPPTVPPPERPRDdPAPGRVSRPrraRRLGRAAQASSPPQRPRRRAARPTVG----- 2693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  666 thlngsnTLSSISLPPHGSSTPIAPVFTALPPFTSLTNNFPLTGNPSLNPSVSLPGSLIATSSTAATSTSLPHPSSTAAV 745
Cdd:PHA03247  2694 -------SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP 2766
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  746 LSGLSASAPVSAAPFPLNLSTAVPSLFSVTQGPLSSSNPSYPGFSVSNTPSVTPALPSFPGLQAPSTVAAVTPLPVAATV 825
Cdd:PHA03247  2767 PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920  826 PSPAP----VLPG--FASAFSSNFNSALVAQAGLSSGLQAAGSSVFPGLLSLPGIPGFSQNPSQSslqelqhnAAAQSAL 899
Cdd:PHA03247  2847 PPSLPlggsVAPGgdVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQP--------QAPPPPQ 2918
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 1622885920  900 LQQVHSASALESYPAQPDGFPSYPSAPGTPFSLQPSLS 937
Cdd:PHA03247  2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
Atrophin-1  pfam03154  Atrophin-1 family  505-830  8.38e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 8.38e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 505 NSDSSA------SAPSKCYAPSAIPPPQRTSTPGLALFPGLPSPVANSTSTPLTLPVQSPLAtVASASTSAPVSCGSSAP 578
Cdd:pfam03154 157 DSDSSAqqqilqTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPP-NQTQSTAAPHTLIQQTP 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 579 LLH-----GPHPGTSDLHISSTPAVTT---LPVMIKTEPTSPTPSAFK-GPSHSGNPShgtlglsGTLGRAYTSTSVPIS 649
Cdd:pfam03154 236 TLHpqrlpSPHPPLQPMTQPPPPSQVSpqpLPQPSLHGQMPPMPHSLQtGPSHMQHPV-------PPQPFPLTPQSSQSQ 308
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 650 LSTCLNPALSGLSSLSTHLNGSNTLSSISLPPHGSSTPIAPVFTA--LPPFTSLTNNFPLTGNPSLNPSVSLPGSLiats 727
Cdd:pfam03154 309 VPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPhiKPPPTTPIPQLPNPQSHKHPPHLSGPSPF---- 384
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 728 staATSTSLPHPSSTAAvLSGLSASAPVSAAPFPLNL--------------------------STAVPSLFSVTQGPLSS 781
Cdd:pfam03154 385 ---QMNSNLPPPPALKP-LSSLSTHHPPSAHPPPLQLmpqsqqlppppaqppvltqsqslpppAASHPPTSGLHQVPSQS 460
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1622885920 782 SNPSYPgFSVSNTPSVTPAL-------PSFPGLQAPSTVAAVTPLPVAATVPSPAP 830
Cdd:pfam03154 461 PFPQHP-FVPGGPPPITPPSgpptstsSAMPGIQPPSSASVSSSGPVPAAVSCPLP 515
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
252-425 1.99e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 41.66  E-value: 1.99e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 252 SNQSKPIQNQTFSTPASQLFSPHGSNPSTPAATPVPTASPVKAINHPSASAAATVSGMNLPNTVLPVFPGQVSSAVHTPQ 331
Cdd:COG3469    36 AATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTT 115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 332 PSTPNPTVIRTPSLPTAPVTSIHSTT-----TTPVPSIFSGLVSLPGPSATPTPGPTPRSTLGSSETFASTSAPFTSLPF 406
Cdd:COG3469   116 STGAGSVTSTTSSTAGSTTTSGASATssagsTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATT 195
                         170
                  ....*....|....*....
gi 1622885920 407 SISSTAASTSNPNSASLSS 425
Cdd:COG3469   196 PSATTTATTTGPPTPGLPK 214
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
272-400 4.36e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.06  E-value: 4.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 272 SPHGSNPSTPAATPVPTA-SPVKAINHPSASAAATVSGMNLPntvlpvfpgqvSSAVHTPQPSTPNPT-VIRTP----SL 345
Cdd:pfam05109 503 APDMTSPTSAVTTPTPNAtSPTPAVTTPTPNATSPTLGKTSP-----------TSAVTTPTPNATSPTpAVTTPtpnaTI 571
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1622885920 346 PTAPVTSIHSTTTTPVPSIFSGLVSLPGPSATptpgpTPRSTLGSSETFASTSAP 400
Cdd:pfam05109 572 PTLGKTSPTSAVTTPTPNATSPTVGETSPQAN-----TTNHTLGGTSSTPVVTSP 621
PRK11901 PRK11901
hypothetical protein; Reviewed
248-391 7.08e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 39.67  E-value: 7.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 248 NEDLSNQS-KPIQNQTFSTPASQLFSPHGSNPSTPAATP----VPTASPVKAINHPSASAA------------------- 303
Cdd:PRK11901   78 NIDLSGSSsLSSGNQSSPSAANNTSDGHDASGVKNTAPPqdisAPPISPTPTQAAPPQTPNgqqrielpgnisdalsqqq 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622885920 304 ----ATVSGMNLPNTVLPVFPGQVSSAVHTPQPSTPNPTVIRTPSLPTA-PVTSIHSTTTTPVPsifsglvslPGPSATP 378
Cdd:PRK11901  158 gqvnAASQNAQGNTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKkPAVNHHKTATVAVP---------PATSGKP 228
                         170
                  ....*....|...
gi 1622885920 379 TPGPTPRSTLGSS 391
Cdd:PRK11901  229 KSGAASARALSSA 241
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
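
Each domain hit on this page carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal Python sketch, assuming that layout holds for every hit, of how those lines could be parsed and checked against the E-value threshold of 0.01 given in the preset options. The function name `parse_hit_summary` and the dictionary keys are illustrative choices, not part of any NCBI tool or API.

```python
import re

# Pattern for the per-hit summary line as it appears on this page.
# The "[Multi-domain]" tag is treated as optional (an assumption).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Summary lines copied from the hits listed above.
summary_lines = [
    "Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 8.38e-04",
    "Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 41.66  E-value: 1.99e-03",
    "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.06  E-value: 4.36e-03",
    "Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 39.67  E-value: 7.08e-03",
]

E_VALUE_THRESHOLD = 0.01  # from "E-value threshold: 0.01" in the preset options

hits = [parse_hit_summary(line) for line in summary_lines]
passing = [h for h in hits if h is not None and h["e_value"] <= E_VALUE_THRESHOLD]

for hit in passing:
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

All four hits listed here fall at or below the threshold, which is consistent with their being reported on the page at all.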
