Conserved domains on  [gi|1907114282|ref|XP_036015495|]
protein FAM186A isoform X4 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1079-1483 1.64e-15


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 83.83  E-value: 1.64e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1079 QALGITPTPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPEQTQAL 1157
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHALV 2716
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1158 GITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPqPITLTPEQAQALGITPTPQPITLTPEQAQALGITP 1237
Cdd:PHA03247  2717 SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP-PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1238 TPQPITLTPEQAQALGITPTPqpiTLTPEQTQALGITPTPQPITLTPEQaqalgiTPTPQPITLTPEQVQALGITPTPQP 1317
Cdd:PHA03247  2796 ESLPSPWDPADPPAAVLAPAA---ALPPAASPAGPLPPPTSAQPTAPPP------PPGPPPPSLPLGGSVAPGGDVRRRP 2866
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1318 itlTPEQAQALGITPTPQPITLTPEQAqalgITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLT 1397
Cdd:PHA03247  2867 ---PSRSPAAKPAAPARPPVRRLARPA----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1398 PelvqalgitPTPQPITLTPEQAQALGITPTPQPTTLSPEQaqalgitpTPQPITLTPEQAQALgitPTPQPTTLSPEQA 1477
Cdd:PHA03247  2940 Q---------PPLAPTTDPAGAGEPSGAVPQPWLGALVPGR--------VAVPRFRVPQPAPSR---EAPASSTPPLTGH 2999

                   ....*.
gi 1907114282 1478 QALGIS 1483
Cdd:PHA03247  3000 SLSRVS 3005
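Each hit on this page carries a one-line summary of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`. The following is a minimal sketch of how such a line could be split into typed fields. The names `HitSummary` and `parse_hit_summary` and the regular expression are illustrative assumptions based only on the layout seen here, not part of any NCBI tool.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the summary lines shown on this page.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_hit_summary(line: str) -> HitSummary:
    """Parse one 'Pssm-ID: ...' summary line into typed fields."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["bits"]),
        e_value=float(m["evalue"]),
    )


summary = parse_hit_summary(
    "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  "
    "Bit Score: 83.83  E-value: 1.64e-15"
)
```

Parsing the E-value as a float makes it straightforward to sort or filter hits numerically instead of comparing strings.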
PHA03247 super family cl33720
large tegument protein UL36; Provisional
2281-2626 5.03e-05


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 5.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2281 QPAQAFPSPFTLEKPATLATSTDRLSQRWKDSYPASIPLQALRPSPTQAPFTPTTSLgigslldsekpwmsptyrqtltd 2360
Cdd:PHA03247  2642 PPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL----------------------- 2698
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2361 rgqdvlAQPLAPETPPSLRQLLAPGAPPTPGPPLGPRHFFKPrvpPTSGEVPGLVSGGSAAheelPMSRTTPLQPPEWQG 2440
Cdd:PHA03247  2699 ------ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA---LPAAPAPPAVPAGPAT----PGGPARPARPPTTAG 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2441 PSRLIPEQGfmPAISSIPLHPFTAEALPTPGRPQRSSKAKPLKPKSARGLPNVTLGFETSQA-PFPIEKTQIPKTPDTSE 2519
Cdd:PHA03247  2766 PPAPAPPAA--PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPP 2843
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2520 QTQALQDALG--VQPFGIF-------QPYGTSSGIARSQSPLIDEKALSREKPGTPLPSLTTQLPQTPQISTSEKGQ--- 2587
Cdd:PHA03247  2844 GPPPPSLPLGgsVAPGGDVrrrppsrSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpqp 2923
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907114282 2588 ------KPWLPPIDKPWTPTPVSSTREAKMIVSPTDQHPEDGYVV 2626
Cdd:PHA03247  2924 ppppqpQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALV 2968
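In the alignment blocks above, each row gives a label, a start coordinate, the aligned residues (with `-` for gaps), and an end coordinate. As a sanity check on a copied alignment, the number of non-gap characters in a row should equal `end - start + 1`. The sketch below assumes that row layout; `AlignmentRow`, `parse_alignment_row`, and `row_is_consistent` are hypothetical helper names. Lowercase letters are counted as residues here, on the assumption that they mark unaligned positions and not gaps.

```python
import re
from typing import NamedTuple


class AlignmentRow(NamedTuple):
    label: str
    start: int
    residues: str
    end: int


# Assumed row layout: "<label> <start> <residues-with-gaps> <end>",
# where the label is either "gi <number>" or "Cdd:<accession>".
_ROW_RE = re.compile(
    r"^\s*(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+"
    r"(?P<residues>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_alignment_row(line: str) -> AlignmentRow:
    """Split one alignment row into label, start, residues, and end."""
    m = _ROW_RE.match(line)
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return AlignmentRow(m["label"], int(m["start"]), m["residues"], int(m["end"]))


def row_is_consistent(row: AlignmentRow) -> bool:
    """True if the non-gap residue count matches the coordinate span."""
    ungapped = sum(1 for ch in row.residues if ch != "-")
    return ungapped == row.end - row.start + 1


query = parse_alignment_row(
    "gi 1907114282 2588 ------KPWLPPIDKPWTPTPVSSTREAKMIVSPTDQHPEDGYVV 2626"
)
subject = parse_alignment_row(
    "Cdd:PHA03247  2924 ppppqpQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALV 2968"
)
```

A row that fails this check was most likely truncated or re-wrapped when it was copied.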
 
Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1180-1479 9.13e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 64.79  E-value: 9.13e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1180 PTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPItltPEQAQALGITPTPQ 1259
Cdd:pfam03154  180 AASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRL---PSPHPPLQPMTQPP 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1260 PITLTPEQtqalgitPTPQPITLTPeqaqalgITPTPQPITLTPEQVQALGitpTPQPITLTPEQAQALGitpTPQPITL 1339
Cdd:pfam03154  257 PPSQVSPQ-------PLPQPSLHGQ-------MPPMPHSLQTGPSHMQHPV---PPQPFPLTPQSSQSQV---PPGPSPA 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1340 TPEQAQALGITPTPQPITLTPEQTQALGITPTPQPIT-LTPEQAQALGITPTPQPITLTPELVqalGITPTPQPITLTPE 1418
Cdd:pfam03154  317 APGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPhIKPPPTTPIPQLPNPQSHKHPPHLS---GPSPFQMNSNLPPP 393
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907114282 1419 QA-QALGITPTPQPTTLSPEQAQALgitptPQPITLTPEQAQALGITPTPqptTLSPEQAQA 1479
Cdd:pfam03154  394 PAlKPLSSLSTHHPPSAHPPPLQLM-----PQSQQLPPPPAQPPVLTQSQ---SLPPPAASH 447
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
1115-1466 1.76e-04

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 46.84  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1115 QTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQA 1194
Cdd:cd22540    122 TNQQYQISPQIQAAGQINNSGQIQIIPGTNQAIITPVQVLQQPQQAHKPVPIKPAPLQTSNTNSASLQVPGNVIKLQSGG 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1195 LGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPitlTPEQAQALGITPTPQPI-----TLTPEQTQ 1269
Cdd:cd22540    202 NVALTLPVNNLVGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVT---VAEQVETVLIETTADNIiqagnNLLIVQSP 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1270 ALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQVQALGITP-----TPQPITLTPEQAQALGITPTPQPITLTPEQA 1344
Cdd:cd22540    279 GTGQPAVLQQVQVLQPKQEQQVVQIPQQALRVVQAASATLPTVPqkplqNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEA 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1345 QALGITPTPQPITLTPEQTQALGiTPTPQPITLTPEQAQALGITPTPQPITLTPELVQALGITPTPQPITLTPEQAQALG 1424
Cdd:cd22540    359 PAATATPSSSTSTVQQQVTANNG-TGTSKPNYNVRKERTLPKIAPAGGIISLNAAQLAAAAQAIQTININGVQVQGVPVT 437
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907114282 1425 ITPTPQPTTLSPEQAQALGIT---PTPQPITLTPEQAQALGITPT 1466
Cdd:cd22540    438 ITNAGGQQQLTVQTVSSNNLTisgLSPTQIQLQMEQALEIETQPG 482
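The SP2_N description above states the consensus motif NNRCRCCYY in degenerate nucleotide codes (N is any nucleotide, R is A/G, Y is C/T). As a rough illustration of how such a motif could be searched for, the sketch below expands those codes into a regular expression and scans one strand of a DNA string. The helper names and the example sequence are made up for illustration; only the codes defined in the description are handled, and nothing here is meant to model actual SP/KLF binding.

```python
import re

# Only the codes used in the description above, plus the four plain bases.
DEGENERATE_CODES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any nucleotide
}


def motif_to_regex(motif: str) -> "re.Pattern[str]":
    """Expand a degenerate motif into a compiled regular expression."""
    return re.compile("".join(DEGENERATE_CODES[base] for base in motif.upper()))


def find_sites(motif: str, sequence: str) -> list:
    """Return 0-based start positions of all (possibly overlapping) matches."""
    pattern = motif_to_regex(motif).pattern
    # A zero-width lookahead lets overlapping matches be reported.
    scanner = re.compile(f"(?=({pattern}))")
    return [m.start() for m in scanner.finditer(sequence.upper())]


consensus = motif_to_regex("NNRCRCCYY")
sites = find_sites("NNRCRCCYY", "TTAAGCACCTTGG")  # hypothetical sequence
```

This scans only the strand it is given. A fuller search would also scan the reverse complement, since a binding site can lie on either strand.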
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
1085-1486 6.83e-15


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 81.91  E-value: 6.83e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1085 PTPQPITLTPEQAqaLGITPTPQPTTLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQ 1164
Cdd:PHA03247  2569 PPPRPAPRPSEPA--VTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1165 PITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPEQAQALGITPTPQPIT 1243
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHALVSATPLPPGPA 2726
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1244 LTPEQAQALGITPTPQPITLTPEQTQALGITPTPqPITLTPEQAQALGITPTPQPITLTPEQVQALGITPTPQPITLTPE 1323
Cdd:PHA03247  2727 AARQASPALPAAPAPPAVPAGPATPGGPARPARP-PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1324 QAQALGITPTPqpiTLTPEQAQALGITPTPQPITLTPEQTqalgitPTPQPITLTPEQAQALG----ITPTPQPITLTPE 1399
Cdd:PHA03247  2806 DPPAAVLAPAA---ALPPAASPAGPLPPPTSAQPTAPPPP------PGPPPPSLPLGGSVAPGgdvrRRPPSRSPAAKPA 2876
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1400 L--------VQALGITPTPQPITLTPEQAQALGITPTPQPTTLSPEQAQAlgitPTPQPITLTPEQAQalgitPTPQPTT 1471
Cdd:PHA03247  2877 AparppvrrLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPP----PQPQPPPPPPPRPQ-----PPLAPTT 2947
                          410
                   ....*....|....*
gi 1907114282 1472 LSPEQAQALGISLIP 1486
Cdd:PHA03247  2948 DPAGAGEPSGAVPQP 2962
PHA03247 PHA03247
large tegument protein UL36; Provisional
1123-1500 3.62e-14


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 3.62e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1123 PTPQPITLTPEQA-QALGITPTPQPITLTPeqtQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTP 1201
Cdd:PHA03247  2569 PPPRPAPRPSEPAvTSRARRPDAPPQSARP---RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1202 QPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPEQTQALGITPTPQPI 1280
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHALVSATPLPPGP 2725
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1281 TLTPEQAQALGITPTPQPITLTPEQVQALGITPTPqPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTP 1360
Cdd:PHA03247  2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARP-PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDP 2804
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1361 EQTQALGITPTPqpiTLTPEQAQALGITPTPQPITLTPELvqalgiTPTPQPITLTPEQAQALG---------ITPTPQP 1431
Cdd:PHA03247  2805 ADPPAAVLAPAA---ALPPAASPAGPLPPPTSAQPTAPPP------PPGPPPPSLPLGGSVAPGgdvrrrppsRSPAAKP 2875
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907114282 1432 TTLSPEQAQALG---ITPTPQPITLTPEQAQALGITPTPQPTTLSPEQAQALGISLIPKQQEISLSPEQAQA 1500
Cdd:PHA03247  2876 AAPARPPVRRLArpaVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT 2947
PHA03247 PHA03247
large tegument protein UL36; Provisional
1078-1400 2.37e-09


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 2.37e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1078 AQALGITPTPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTPQPiTLTPEQAQALGITPTPQPITLTPEQTQAL 1157
Cdd:PHA03247  2732 SPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAA 2810
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1158 GITPTPqpiTLTPEQAQALGITPTPQPITLTPEQtqalgiTPTPQPITLTPEQAQALGITPTPQPitlTPEQAQALGITP 1237
Cdd:PHA03247  2811 VLAPAA---ALPPAASPAGPLPPPTSAQPTAPPP------PPGPPPPSLPLGGSVAPGGDVRRRP---PSRSPAAKPAAP 2878
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1238 TPQPITLTPEQAqalgITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQAlgitPTPQPITLTPEQVQalgitPTPQP 1317
Cdd:PHA03247  2879 ARPPVRRLARPA----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPP----PQPQPPPPPPPRPQ-----PPLAP 2945
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1318 ITLTPEQAQALGITPTPQPITLTPEQAQAL-GITPTPQPITLTPEQTQAlgiTPTPQPITLTPEQAQ--ALGITPTPQPI 1394
Cdd:PHA03247  2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPASSTP---PLTGHSLSRVSSWASslALHEETDPPPV 3022

                   ....*.
gi 1907114282 1395 TLTPEL 1400
Cdd:PHA03247  3023 SLKQTL 3028
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1218-1507 5.52e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 62.09  E-value: 5.52e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1218 PTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPItltPEQAQALGITPTPQ 1297
Cdd:pfam03154  180 AASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRL---PSPHPPLQPMTQPP 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1298 PITLTPEQvqalgitPTPQPITLTPeqaqalgITPTPQPITLTPEQAQALGitpTPQPITLTPEQTQALGitpTPQPITL 1377
Cdd:pfam03154  257 PPSQVSPQ-------PLPQPSLHGQ-------MPPMPHSLQTGPSHMQHPV---PPQPFPLTPQSSQSQV---PPGPSPA 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1378 TPEQAQALGITPTPQPITLTPELVQALGITPTPQPIT-LTPEQAQALGITPTPQPTTLSPEQAqalGITPTPQPITLTPE 1456
Cdd:pfam03154  317 APGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPhIKPPPTTPIPQLPNPQSHKHPPHLS---GPSPFQMNSNLPPP 393
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907114282 1457 QAqalgITPTPQPTTLSPEQAQALGISLIPKQQEISLSPeqAQALGLTLTP 1507
Cdd:pfam03154  394 PA----LKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPP--AQPPVLTQSQ 438
PHA03247 PHA03247
large tegument protein UL36; Provisional
1009-1358 6.06e-09


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 62.26  E-value: 6.06e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1009 PTPQPITFTPEQTQALGITPTPqlitLTPEQAKALANTLTAEQVSLSPQQAEALGITPTPQPTTLTP---EQAQALGITP 1085
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLP----PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTagpPAPAPPAAPA 2776
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1086 TPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTP-QPITLTPEQAQALGITPTPQPITLTPEqtqalgitptPQ 1164
Cdd:PHA03247  2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAaLPPAASPAGPLPPPTSAQPTAPPPPPG----------PP 2846
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1165 PITLTPEQAQALGITPTPQPitlTPEQTQALGITPTPQPITLTPEQAqalgITPTPQPITLTPEQAQALGITPTPQPITL 1244
Cdd:PHA03247  2847 PPSLPLGGSVAPGGDVRRRP---PSRSPAAKPAAPARPPVRRLARPA----VSRSTESFALPPDQPERPPQPQAPPPPQP 2919
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1245 TPEQAQAlgitPTPQPITLTPEQTQalgitPTPQPITLTPEQAQALGITPTPQPITLTPEQVQAL-GITPTPQPITLTPE 1323
Cdd:PHA03247  2920 QPQPPPP----PQPQPPPPPPPRPQ-----PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPA 2990
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 1907114282 1324 QAQAlgiTPTPQPITLTPEQAQ--ALGITPTPQPITL 1358
Cdd:PHA03247  2991 SSTP---PLTGHSLSRVSSWASslALHEETDPPPVSL 3024
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1116-1475 1.49e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 60.55  E-value: 1.49e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1116 TQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQAL 1195
Cdd:pfam03154  192 TQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSL 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1196 --GITPTPQPITLTPEQAQALGitpTPQPITLTPEQAQALGitpTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGI 1273
Cdd:pfam03154  272 hgQMPPMPHSLQTGPSHMQHPV---PPQPFPLTPQSSQSQV---PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPL 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1274 TPTPQPIT-LTPEQAQALGITPTPQPITLTPeqvQALGITPTPQPITLTPEQA-QALGITPTPQPITLTPEQAQalgITP 1351
Cdd:pfam03154  346 PPAPLSMPhIKPPPTTPIPQLPNPQSHKHPP---HLSGPSPFQMNSNLPPPPAlKPLSSLSTHHPPSAHPPPLQ---LMP 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1352 TPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPELVQAlgitPTPQPITLTPEQAQALGITPTPQP 1431
Cdd:pfam03154  420 QSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGG----PPPITPPSGPPTSTSSAMPGIQPP 495
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1907114282 1432 TTLSPEQAQALGITPT----PQPITLTPEQAQALGITPTPQPTTLSPE 1475
Cdd:pfam03154  496 SSASVSSSGPVPAAVScplpPVQIKEEALDEAEEPESPPPPPRSPSPE 543
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1114-1500 1.32e-06

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 53.81  E-value: 1.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1114 EQTQALGITPTPQPITLTpEQAQALGITPTPQPITLTPEQTQAlgITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQ 1193
Cdd:pfam17823   56 EQ*NFCAATAAPAPVTLT-KGTSAAHLNSTEVTAEHTPHGTDL--SEPATREGAADGAASRALAAAASSSPSSAAQSLPA 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1194 ALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGI 1273
Cdd:pfam17823  133 AIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGI 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1274 TpTPQPITLTPEQAQALGITPTPQPITLTPEQVqalgiTPTPQPITLTPEQAQALGITPTPQPITLTPEQAQalgitpTP 1353
Cdd:pfam17823  213 S-TAATATGHPAAGTALAAVGNSSPAAGTVTAA-----VGTVTPAALATLAAAAGTVASAAGTINMGDPHAR------RL 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1354 QPITLTPEQTQALGITPTPQPitltpeQAQAlgitPTPQPITLTPELVQALGITPTPQPITLTPEQAQALGITPTPQPTT 1433
Cdd:pfam17823  281 SPAKHMPSDTMARNPAAPMGA------QAQG----PIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTT 350
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907114282 1434 lspEQAQALGITPTPQPITLTPEQAQALGITPTPQPTTLSPEQ-AQALGISLIPKQQEISLSPEQAQA 1500
Cdd:pfam17823  351 ---TKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQgAAGPGILLAPEQVATEATAGTASA 415
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1009-1393 1.14e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.31  E-value: 1.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1009 PTPQPITFTPEQTQALGITPTPQLITLTPEQAKALANTLTAEQVSLSPQQAEALGITPTPQPTTLTPEQAQALGITPTPQ 1088
Cdd:pfam03154  180 AASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPS 259
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1089 PITLTPEQAQALGITPTPQPTTLTPEQTQALGITPtPQPITLTPEQAQALGitpTPQPITLTPEQTQALGITPTPQPITL 1168
Cdd:pfam03154  260 QVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVP-PQPFPLTPQSSQSQV---PPGPSPAAPGQSQQRIHTPPSQSQLQ 335
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1169 TPEQAQALGITPTPQPIT-LTPEQTQALGITPTPQPITLTPEQAqalGITPTPQPITLTPEQA-QALGITPTPQPITLTP 1246
Cdd:pfam03154  336 SQQPPREQPLPPAPLSMPhIKPPPTTPIPQLPNPQSHKHPPHLS---GPSPFQMNSNLPPPPAlKPLSSLSTHHPPSAHP 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1247 EQAQalgITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPeqvqALGITPTPQPITLTPEQAQ 1326
Cdd:pfam03154  413 PPLQ---LMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHP----FVPGGPPPITPPSGPPTST 485
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907114282 1327 ALGITPTPQPITLTPEQAQALGITPTpqpITLTPEQTQALGITPTPQPITLTPEQAqalgiTPTPQP 1393
Cdd:pfam03154  486 SSAMPGIQPPSSASVSSSGPVPAAVS---CPLPPVQIKEEALDEAEEPESPPPPPR-----SPSPEP 544
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1232-1498 3.51e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 49.53  E-value: 3.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1232 ALGITPTPQPITLTPEQA----QALGITPTPQPITLTPeqtqALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQVQ 1307
Cdd:pfam05109  395 GLGTAPKTLIITRTATNAttttHKVIFSKAPESTTTSP----TLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVST 470
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1308 ALGITPTPQPITltpeqAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQ---- 1383
Cdd:pfam05109  471 ADVTSPTPAGTT-----SGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKtspt 545
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1384 ALGITPTPQPITLTPELVqalgiTPTPQPITLTPEQAQALGITPTPQPTTLSPEQAQ----------ALGITPTPQPITL 1453
Cdd:pfam05109  546 SAVTTPTPNATSPTPAVT-----TPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGEtspqanttnhTLGGTSSTPVVTS 620
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907114282 1454 TPEQAQALGITPTPQPTTLSPEQaqalgISLIPKQQEISLSPEQA 1498
Cdd:pfam05109  621 PPKNATSAVTTGQHNITSSSTSS-----MSLRPSSISETLSPSTS 660
PHA03247 PHA03247
large tegument protein UL36; Provisional
2281-2626 5.03e-05



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 5.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2281 QPAQAFPSPFTLEKPATLATSTDRLSQRWKDSYPASIPLQALRPSPTQAPFTPTTSLgigslldsekpwmsptyrqtltd 2360
Cdd:PHA03247  2642 PPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL----------------------- 2698
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2361 rgqdvlAQPLAPETPPSLRQLLAPGAPPTPGPPLGPRHFFKPrvpPTSGEVPGLVSGGSAAheelPMSRTTPLQPPEWQG 2440
Cdd:PHA03247  2699 ------ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA---LPAAPAPPAVPAGPAT----PGGPARPARPPTTAG 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2441 PSRLIPEQGfmPAISSIPLHPFTAEALPTPGRPQRSSKAKPLKPKSARGLPNVTLGFETSQA-PFPIEKTQIPKTPDTSE 2519
Cdd:PHA03247  2766 PPAPAPPAA--PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPP 2843
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2520 QTQALQDALG--VQPFGIF-------QPYGTSSGIARSQSPLIDEKALSREKPGTPLPSLTTQLPQTPQISTSEKGQ--- 2587
Cdd:PHA03247  2844 GPPPPSLPLGgsVAPGGDVrrrppsrSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpqp 2923
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907114282 2588 ------KPWLPPIDKPWTPTPVSSTREAKMIVSPTDQHPEDGYVV 2626
Cdd:PHA03247  2924 ppppqpQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALV 2968
PHA03378 PHA03378
EBNA-3B; Provisional
1225-1488 6.32e-05



Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 48.91  E-value: 6.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1225 LTPEQAQALGITPTPQPITLTPEQAQalgiTPTPQPitlTPEQTQALGITPTPQPITLTPEQaqalgiTPTP-QPITLTP 1303
Cdd:PHA03378   569 LGPLQIQPLTSPTTSQLASSAPSYAQ----TPWPVP---HPSQTPEPPTTQSHIPETSAPRQ------WPMPlRPIPMRP 635
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1304 EQVQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITpTPQPITLTPEQTQALGITPTP-QPITLTPEQA 1382
Cdd:PHA03378   636 LRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGAN-TMLPIQWAPGTMQPPPRAPTPmRPPAAPPGRA 714
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1383 QALGITPTP-QPITLTPELVQALGITPTPQPitlTPEQAQALGITPTPQPTTLSPEQAQALGITPTPQPITLTPEQAQAL 1461
Cdd:PHA03378   715 QRPAAATGRaRPPAAAPGRARPPAAAPGRAR---PPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPR 791
                          250       260
                   ....*....|....*....|....*..
gi 1907114282 1462 GiTPTPQPttlsPEQAQALGISLIPKQ 1488
Cdd:PHA03378   792 G-APTPQP----PPQAGPTSMQLMPRA 813
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
1115-1466 1.76e-04

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate or, in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 46.84  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1115 QTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQA 1194
Cdd:cd22540    122 TNQQYQISPQIQAAGQINNSGQIQIIPGTNQAIITPVQVLQQPQQAHKPVPIKPAPLQTSNTNSASLQVPGNVIKLQSGG 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1195 LGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPitlTPEQAQALGITPTPQPI-----TLTPEQTQ 1269
Cdd:cd22540    202 NVALTLPVNNLVGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVT---VAEQVETVLIETTADNIiqagnNLLIVQSP 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1270 ALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQVQALGITP-----TPQPITLTPEQAQALGITPTPQPITLTPEQA 1344
Cdd:cd22540    279 GTGQPAVLQQVQVLQPKQEQQVVQIPQQALRVVQAASATLPTVPqkplqNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEA 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1345 QALGITPTPQPITLTPEQTQALGiTPTPQPITLTPEQAQALGITPTPQPITLTPELVQALGITPTPQPITLTPEQAQALG 1424
Cdd:cd22540    359 PAATATPSSSTSTVQQQVTANNG-TGTSKPNYNVRKERTLPKIAPAGGIISLNAAQLAAAAQAIQTININGVQVQGVPVT 437
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907114282 1425 ITPTPQPTTLSPEQAQALGIT---PTPQPITLTPEQAQALGITPT 1466
Cdd:cd22540    438 ITNAGGQQQLTVQTVSSNNLTisgLSPTQIQLQMEQALEIETQPG 482
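The SP2_N description above quotes a degenerate consensus motif, NNRCRCCYY, using the letters N (any nucleotide), R (A/G) and Y (C/T). As a minimal sketch of how such a motif can be scanned for on one strand of a DNA string, the Python snippet below expands those letters into a regular expression. The helper names `motif_to_regex` and `find_sites` and the test sequence are hypothetical illustrations, not part of the CDD record, and only the three degenerate letters defined in the description are supported.

```python
import re

# Degenerate letters as defined in the SP2_N description, plus the four
# plain bases. Other IUPAC letters are deliberately not handled here.
CODES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any nucleotide
}


def motif_to_regex(motif):
    """Translate a degenerate motif such as NNRCRCCYY into a regex string."""
    return "".join(CODES[letter] for letter in motif.upper())


def find_sites(motif, sequence):
    """Return 0-based start positions of all (possibly overlapping) matches.

    Only the given strand is scanned; the reverse complement is not checked.
    """
    # A lookahead makes finditer report overlapping occurrences as well.
    lookahead = re.compile("(?=" + motif_to_regex(motif) + ")")
    return [m.start() for m in lookahead.finditer(sequence.upper())]


if __name__ == "__main__":
    # Made-up sequence used purely for illustration.
    print(motif_to_regex("NNRCRCCYY"))
    print(find_sites("NNRCRCCYY", "TTAAGCACCCTGG"))
```

Scanning both strands would require repeating the search on the reverse complement of the sequence, which this sketch leaves out for brevity.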
PHA03247 PHA03247
large tegument protein UL36; Provisional
1170-1512 8.63e-04



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 8.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1170 PEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEqaqALGITPTPQPIT----LTPEQAQALGITPTPQPITLT 1245
Cdd:PHA03247  2484 AEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDE---PVGEPVHPRMLTwirgLEELASDDAGDPPPPLPPAAP 2560
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1246 PeqAQALGITPTPQPITLTPE-QTQALGITPTPQPITLTPeqaQALGITPTPQPITLTPEQVQALGITPTPQPITLTPEQ 1324
Cdd:PHA03247  2561 P--AAPDRSVPPPRPAPRPSEpAVTSRARRPDAPPQSARP---RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA 2635
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1325 AQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPELVQA 1403
Cdd:PHA03247  2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHAL 2715
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1404 LGITPTPQPITLTPEQAQALGITPTPQPTTLSPeqaqALGITPTPQPITLTPeqAQALGITPTPQPTTLSPEQAQALGIS 1483
Cdd:PHA03247  2716 VSATPLPPGPAAARQASPALPAAPAPPAVPAGP----ATPGGPARPARPPTT--AGPPAPAPPAAPAAGPPRRLTRPAVA 2789
                          330       340
                   ....*....|....*....|....*....
gi 1907114282 1484 LIPKQQEISLSPEQAQALGLTLTPQQAQV 1512
Cdd:PHA03247  2790 SLSESRESLPSPWDPADPPAAVLAPAAAL 2818
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
2426-2623 8.80e-04



Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.07  E-value: 8.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2426 PMSRTTPLQPPEWQGPSRL-IPEQGFMPAISSIPLHPFTAEALPTPGRPQRSSKAKPLKP-------------------- 2484
Cdd:PTZ00449   600 PRSAQRPTRPKSPKLPELLdIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkspkppfdpkfkekfyddy 679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2485 --KSARGLPNVTLgFETSQAPFPIEKTQIPKTPDTSEQT-QALQDALGVQPFGIFQPYGTSSGIARSQS----PLIDEKA 2557
Cdd:PTZ00449   680 ldAAAKSKETKTT-VVLDESFESILKETLPETPGTPFTTpRPLPPKLPRDEEFPFEPIGDPDAEQPDDIefftPPEEERT 758
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907114282 2558 LSREKPG-TPLPSLTTQLPQTPQIsTSEKGQKPwlPPIDKPWTPTPVSSTREAKMIVSPTDQHPEDG 2623
Cdd:PTZ00449   759 FFHETPAdTPLPDILAEEFKEEDI-HAETGEPD--EAMKRPDSPSEHEDKPPGDHPSLPKKRHRLDG 822
PHA03247 PHA03247
large tegument protein UL36; Provisional
2163-2593 1.45e-03



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 1.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2163 SPLTPKQPQAVEPAKAKL-----PPLTPSQAQPLQKQLAPELTQTLLFTITLQKAQHLGVTFTYEQTQAAAVTLTSEQVA 2237
Cdd:PHA03247  2678 SPPQRPRRRAARPTVGSLtsladPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP 2757
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2238 ALEDALTENLAwrweiSVTPGMAQEAPNITTTKQLQALGITARQPAQAFPSPFTLEKPATLATSTDRLSQRWKDSYPASI 2317
Cdd:PHA03247  2758 ARPPTTAGPPA-----PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPT 2832
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2318 PLQALRPSPTQAPFTPTTSLGiGSLLDSEKPWMSPTYRQTltdrgqdvLAQPLAPETPPSLRqllapgaPPTPGPPLGPR 2397
Cdd:PHA03247  2833 SAQPTAPPPPPGPPPPSLPLG-GSVAPGGDVRRRPPSRSP--------AAKPAAPARPPVRR-------LARPAVSRSTE 2896
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2398 HFFKPRVPPTSGEVPGLVSGGSAAHEELPMSRTTPLQPPEWQGPSRLIPEQGFMPAISSIPLHPFTAEALPTPGR----- 2472
Cdd:PHA03247  2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRvavpr 2976
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 2473 -------PQRSSKAKPLKPKSARGLPNVT-----LGFETSQAPFPIEKTQIPKTPDTSEQTQAlqdalgvqpfgifqpyg 2540
Cdd:PHA03247  2977 frvpqpaPSREAPASSTPPLTGHSLSRVSswassLALHEETDPPPVSLKQTLWPPDDTEDSDA----------------- 3039
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907114282 2541 tsSGIARSQSPLIDEKALSREKPGTPLPSLTTQLPQTPQISTSEKGQKPWLPP 2593
Cdd:PHA03247  3040 --DSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPP 3090
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
1119-1356 2.13e-03



Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 43.41  E-value: 2.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1119 LGITPTPQPitltPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGIT 1198
Cdd:PRK14948   357 LGLLPSAFI----SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANA 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1199 PTPQPITLTPEQAQA-LGITPTPQPITLTPEQAQALGITPTPQPITLTP-------------EQA--QALGitptpQPIT 1262
Cdd:PRK14948   433 ANAPPSLNLEELWQQiLAKLELPSTRMLLSQQAELVSLDSNRAVIAVSPnwlgmvqsrkpllEQAfaKVLG-----RSIK 507
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1263 LTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPeqvqALGITPTPQPITLTPEQAQALGITPTPQPITLTPE 1342
Cdd:PRK14948   508 LNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAP----PPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEE 583
                          250
                   ....*....|....
gi 1907114282 1343 QAQALGITPTPQPI 1356
Cdd:PRK14948   584 PTPSPTKDSSPEEI 597
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1023-1433 3.01e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 3.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1023 ALGITPTPQLITLTPEQAkalanTLTAEQVSLSpqqaealgitptpqPTTLTPEQAQALGITPTPQPITLTPEQAQALGI 1102
Cdd:pfam05109  395 GLGTAPKTLIITRTATNA-----TTTTHKVIFS--------------KAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVP 455
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1103 TPTPQPTTLTPEQTQALGITPTPQPITltpeqAQALGITPTPQPI-----TLTPEQTQALGITPTPQPITLTPEQAQAlg 1177
Cdd:pfam05109  456 TNLTAPASTGPTVSTADVTSPTPAGTT-----SGASPVTPSPSPRdngteSKAPDMTSPTSAVTTPTPNATSPTPAVT-- 528
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1178 iTPTPQPITLTPEQTQALGITPTPQPITLTPEQAQAlgiTPTPQPITLTPEQAQALGITPTPQPITLTPEQAQAlgitpT 1257
Cdd:pfam05109  529 -TPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVT---TPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGET-----S 599
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1258 PQPITltpeQTQALGITPTPQPITLTPEQAQAlgiTPTPQPITLTPEQVQALGITPTPQPITLTPEQAQAlgiTPTPQPI 1337
Cdd:pfam05109  600 PQANT----TNHTLGGTSSTPVVTSPPKNATS---AVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDN---STSHMPL 669
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1338 TLTPEQAQALGITptpqpiTLTPEQTQALGI-TPTPQPITLTPEQAQALGITPTpqpiTLTPELVQALGITPtpqPITLT 1416
Cdd:pfam05109  670 LTSAHPTGGENIT------QVTPASTSTHHVsTSSPAPRPGTTSQASGPGNSST----STKPGEVNVTKGTP---PKNAT 736
                          410
                   ....*....|....*..
gi 1907114282 1417 PEQAQALGITPTPQPTT 1433
Cdd:pfam05109  737 SPQAPSGQKTAVPTVTS 753
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
1114-1337 3.12e-03



Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 43.03  E-value: 3.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1114 EQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQ 1193
Cdd:PRK14948   367 EIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSLNLEELWQ 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1194 A-LGITPTPQPITLTPEQAQALGITPTPQPITLTP-------------EQA--QALGitptpQPITLTPEQAQALGITPT 1257
Cdd:PRK14948   447 QiLAKLELPSTRMLLSQQAELVSLDSNRAVIAVSPnwlgmvqsrkpllEQAfaKVLG-----RSIKLNLESQSGSASNTA 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1258 PQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQVQAlgitPTPQPITLTPEQAQALGITPTPQPI 1337
Cdd:PRK14948   522 KTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPA----DSSPPPPIPEEPTPSPTKDSSPEEI 597
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
1271-1497 7.35e-03



Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 41.87  E-value: 7.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1271 LGITPTPQPitltPEQAQALGITPTPQPITLTPEQVQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGIT 1350
Cdd:PRK14948   357 LGLLPSAFI----SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANA 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1351 PTPQPITLTPEQTQA-LGITPTPQPITLTPEQAQALGITPTPQPITLTPELV---------------QALGitptpQPIT 1414
Cdd:PRK14948   433 ANAPPSLNLEELWQQiLAKLELPSTRMLLSQQAELVSLDSNRAVIAVSPNWLgmvqsrkplleqafaKVLG-----RSIK 507
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1415 LTPEQAQALGITPTPQPTTLSPEQAQALGITPTPQPITLTPEQAQALGITPTPQPTTLSPEQAQALGISLIPKQQEISLS 1494
Cdd:PRK14948   508 LNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPS 587

                   ...
gi 1907114282 1495 PEQ 1497
Cdd:PRK14948   588 PTK 590
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
924-1344 8.44e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 8.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282  924 GLPLIPPKPITFTreqTQALGITPTHQPITLTSEqvqalGITPTHQPITLTPEQAQALALILTTEQVKTQRInlsPDQTQ 1003
Cdd:pfam03154  179 GAASPPSPPPPGT---TQAATAGPTPSAPSVPPQ-----GSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRL---PSPHP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1004 ALGITPTPQPITFTPEQTqalgiTPTPQLITLTPEQAKALANTLTAEQVSLSPQqaealgitptPQPTTLTPEQAQalgI 1083
Cdd:pfam03154  248 PLQPMTQPPPPSQVSPQP-----LPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQ----------PFPLTPQSSQSQ---V 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1084 TPTPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTPQPiTLTPEQAQALGITPTPQPITLTPEQTqalGITPTP 1163
Cdd:pfam03154  310 PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMP-HIKPPPTTPIPQLPNPQSHKHPPHLS---GPSPFQ 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1164 QPITLTPEQA-QALGITPTPQPITLTPEQTQALgitptPQPITLTPEQAQALGITPTPqpiTLTPEQAQAlgitPTPQPI 1242
Cdd:pfam03154  386 MNSNLPPPPAlKPLSSLSTHHPPSAHPPPLQLM-----PQSQQLPPPPAQPPVLTQSQ---SLPPPAASH----PPTSGL 453
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114282 1243 TLTPEQ---AQALGITPTPQPIT--LTPEQTQALGITPTPQPITLTPEQAQALGITPTpqpITLTPEQVQALGITPTPQP 1317
Cdd:pfam03154  454 HQVPSQspfPQHPFVPGGPPPITppSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVS---CPLPPVQIKEEALDEAEEP 530
                          410       420
                   ....*....|....*....|....*...
gi 1907114282 1318 ITLTPEQAqalgiTPTPQP-ITLTPEQA 1344
Cdd:pfam03154  531 ESPPPPPR-----SPSPEPtVVNTPSHA 553
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
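
As a rough illustration of how the E-value threshold of 0.01 relates to the hits listed above, the following Python sketch records the three hits from this part of the listing and keeps those at or below a cutoff. The `DomainHit` type and the `filter_hits` and `span` helpers are hypothetical names introduced here for illustration; they are not part of any NCBI tool. Treating the cutoff as inclusive is also an assumption.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One row of the domain hit list (hypothetical container)."""
    name: str
    accession: str
    start: int      # first query residue of the interval
    end: int        # last query residue of the interval
    evalue: float


# Values copied from the hit rows in the listing above.
HITS = [
    DomainHit("PRK14948", "PRK14948", 1114, 1337, 3.12e-03),
    DomainHit("PRK14948", "PRK14948", 1271, 1497, 7.35e-03),
    DomainHit("Atrophin-1", "pfam03154", 924, 1344, 8.44e-03),
]


def filter_hits(hits: List[DomainHit], threshold: float = 0.01) -> List[DomainHit]:
    """Keep hits whose E-value is at or below the threshold (inclusive by assumption)."""
    return [h for h in hits if h.evalue <= threshold]


def span(hit: DomainHit) -> int:
    """Number of query residues covered by the interval, counting both ends."""
    return hit.end - hit.start + 1


if __name__ == "__main__":
    for h in filter_hits(HITS):
        print(f"{h.accession}\t{h.start}-{h.end}\t{span(h)} residues\tE={h.evalue:.2e}")
```

All three hits pass the 0.01 preset; a stricter cutoff such as 5e-3 would keep only the first PRK14948 interval.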
