Conserved domains on  [gi|1622920918|ref|XP_014988451|]

protein SON isoform X3 [Macaca mulatta]

Protein Classification

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
159-460 6.84e-08

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 6.84e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  159 DSEPSAMALELPTRAFGLSETNESPAVVLEPpvvsveVPEPHILETLKPATKTAELSVASTSVISEQSEQSV-AVTPEPS 237
Cdd:PHA03247  2709 EPAPHALVSATPLPPGPAAARQASPALPAAP------APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAApAAGPPRR 2782
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  238 MTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAK-VLEPSETLVVSSETPTEV 316
Cdd:PHA03247  2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPgPPPPSLPLGGSVAPGGDV 2862
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  317 YPEPSTSTTMDFPESSA-IEALRLPEQPVDVPSEiadsSMTRPQELPELPKTTalelqessvasamELPGPPATSMPELQ 395
Cdd:PHA03247  2863 RRRPPSRSPAAKPAAPArPPVRRLARPAVSRSTE----SFALPPDQPERPPQP-------------QAPPPPQPQPQPPP 2925
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622920918  396 GPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPS 460
Cdd:PHA03247  2926 PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
PHA03379 super family cl33730
EBNA-3A; Provisional
340-673 6.74e-07

EBNA-3A; Provisional


The actual alignment was detected with superfamily member PHA03379:

Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 55.06  E-value: 6.74e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  340 PEQPVDVPSeiadssmtrpqelPELPKttalELQESSVASAMELPGPPATSMPElQGPPVTPVPELPGPSA----TPVPE 415
Cdd:PHA03379   416 PRPPVEKPR-------------PEVPQ----SLETATSHGSAQVPEPPPVHDLE-PGPLHDQHSMAPCPVAqlppGPLQD 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  416 L-PGPLSTPVPELPGPPATAVPELPGPSVTP-VPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVT-AAVELP 492
Cdd:PHA03379   478 LePGDQLPGVVQDGRPACAPVPAPAGPIVRPwEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPlIAMQGP 557
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  493 EQPA--VTVAMELTEQPVTTTELEQPVGMTTVEHP--GHPE--VTTATGLLGQPEATMV-----LELPGQPVATTaleLP 561
Cdd:PHA03379   558 GETSgiVRVRERWRPAPWTPNPPRSPSQMSVRDRLarLRAEaqPYQASVEVQPPQLTQVspqqpMEYPLEPEQQM---FP 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  562 GQP--SVTGVPELPGLPS---ATRALELSgQPVATGAlelPGPLMAAGALEFS--GQSGAAGALELLGQPLATGVLE--- 631
Cdd:PHA03379   635 GSPfsQVADVMRAGGVPAmqpQYFDLPLQ-QPISQGA---PLAPLRASMGPVPpvPATQPQYFDIPLTEPINQGASAahf 710
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1622920918  632 LPGQPGAPEL--PGQPVATVALEISVQSVVTTSELSTMTVSQSL 673
Cdd:PHA03379   711 LPQQPMEGPLvpERWMFQGATLSQSVRPGVAQSQYFDLPLTQPI 754
rne super family cl35953
ribonuclease E; Reviewed
1220-1381 8.90e-05

ribonuclease E; Reviewed


The actual alignment was detected with superfamily member PRK10811:

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 48.11  E-value: 8.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1220 ISEPSAVPTDYSMSASDPSVLVSEATVTVPEPPPEpessiTSTPVESAVVAEEHEVVPERPVtcmVSETPTVSAEPTVVA 1299
Cdd:PRK10811   843 IRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAA-----AVEPVVSAPVVEAVAEVVEEPV---VVAEPQPEEVVVVET 914
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1300 SEPPVLSETA-ETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLES 1378
Cdd:PRK10811   915 THPEVIAAPVtEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETV 994

                   ...
gi 1622920918 1379 STV 1381
Cdd:PRK10811   995 TAV 997
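
Each hit in the listing above carries an interval line (for example `159-460 6.84e-08`) and a summary line beginning with `Pssm-ID:`. The following is a minimal sketch of how those two line types could be parsed for downstream filtering. The function names `parse_summary` and `parse_interval` are hypothetical helpers introduced here for illustration, and the regular expressions assume the exact field layout seen on this page rather than any documented format.

```python
import re

# Assumed layout, taken from the lines on this page, e.g.:
# "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 6.84e-08"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

# Assumed layout of an interval line, e.g. "159-460 6.84e-08"
INTERVAL_RE = re.compile(r"\s*(\d+)-(\d+)\s+([\d.eE+-]+)\s*")


def parse_summary(line):
    """Return the fields of a 'Pssm-ID: ...' line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),  # e.g. "Multi-domain"; None when absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def parse_interval(line):
    """Return (start, end, e_value) for an interval line, or None if no match."""
    m = INTERVAL_RE.fullmatch(line)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2)), float(m.group(3))


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 6.84e-08"
    )
    start, end, e_value = parse_interval("159-460 6.84e-08")
    # Number of query residues covered by the hit (inclusive interval).
    print(hit["pssm_id"], end - start + 1, e_value)
```

Returning `None` for non-matching lines lets a caller run both parsers over every line of the page and keep only the lines that match.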
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
159-460 6.84e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 6.84e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  159 DSEPSAMALELPTRAFGLSETNESPAVVLEPpvvsveVPEPHILETLKPATKTAELSVASTSVISEQSEQSV-AVTPEPS 237
Cdd:PHA03247  2709 EPAPHALVSATPLPPGPAAARQASPALPAAP------APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAApAAGPPRR 2782
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  238 MTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAK-VLEPSETLVVSSETPTEV 316
Cdd:PHA03247  2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPgPPPPSLPLGGSVAPGGDV 2862
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  317 YPEPSTSTTMDFPESSA-IEALRLPEQPVDVPSEiadsSMTRPQELPELPKTTalelqessvasamELPGPPATSMPELQ 395
Cdd:PHA03247  2863 RRRPPSRSPAAKPAAPArPPVRRLARPAVSRSTE----SFALPPDQPERPPQP-------------QAPPPPQPQPQPPP 2925
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622920918  396 GPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPS 460
Cdd:PHA03247  2926 PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
188-493 2.45e-07

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 55.93  E-value: 2.45e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  188 EPPVVSVEVPEPHILETLKPATKTAELSVASTSVISEQSEQSVAVTPE--PSMTKILDSFaaapvptTTVVLKSSEPVVT 265
Cdd:NF033839   158 KPETPQPENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKAKLAvaTYMSKILDDI-------QKHHLQKEKHRQI 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  266 MSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKVLEPSETLV------VSSETPTE-VYPEPSTSTT--MDFPESSAIEA 336
Cdd:NF033839   231 VALIKELDELKKQALSEIDNVNTKVEIENTVHKIFADMDAVVtkfkkgLTQDTPKEpGNKKPSAPKPgmQPSPQPEKKEV 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  337 LRLPEQPVDVPSEIADSSMTRPQELPELPKTTA---LELQESSVASAMELPGPPATSMPELQGPPVTPVPELPGPSATPV 413
Cdd:NF033839   311 KPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVkpqLETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQ 390
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  414 PELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQE--LPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVEL 491
Cdd:NF033839   391 PEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPevKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEK 470

                   ..
gi 1622920918  492 PE 493
Cdd:NF033839   471 PK 472
PHA03379 PHA03379
EBNA-3A; Provisional
340-673 6.74e-07

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 55.06  E-value: 6.74e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  340 PEQPVDVPSeiadssmtrpqelPELPKttalELQESSVASAMELPGPPATSMPElQGPPVTPVPELPGPSA----TPVPE 415
Cdd:PHA03379   416 PRPPVEKPR-------------PEVPQ----SLETATSHGSAQVPEPPPVHDLE-PGPLHDQHSMAPCPVAqlppGPLQD 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  416 L-PGPLSTPVPELPGPPATAVPELPGPSVTP-VPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVT-AAVELP 492
Cdd:PHA03379   478 LePGDQLPGVVQDGRPACAPVPAPAGPIVRPwEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPlIAMQGP 557
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  493 EQPA--VTVAMELTEQPVTTTELEQPVGMTTVEHP--GHPE--VTTATGLLGQPEATMV-----LELPGQPVATTaleLP 561
Cdd:PHA03379   558 GETSgiVRVRERWRPAPWTPNPPRSPSQMSVRDRLarLRAEaqPYQASVEVQPPQLTQVspqqpMEYPLEPEQQM---FP 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  562 GQP--SVTGVPELPGLPS---ATRALELSgQPVATGAlelPGPLMAAGALEFS--GQSGAAGALELLGQPLATGVLE--- 631
Cdd:PHA03379   635 GSPfsQVADVMRAGGVPAmqpQYFDLPLQ-QPISQGA---PLAPLRASMGPVPpvPATQPQYFDIPLTEPINQGASAahf 710
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1622920918  632 LPGQPGAPEL--PGQPVATVALEISVQSVVTTSELSTMTVSQSL 673
Cdd:PHA03379   711 LPQQPMEGPLvpERWMFQGATLSQSVRPGVAQSQYFDLPLTQPI 754
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
82-579 1.50e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 50.54  E-value: 1.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918   82 DLKEASRKSRCVSV--QTDPTDEIPTKKSKKHKKHKNKKKKKKKEKEKKYKRQPEESE----AKTKSHHDGNIDLESDSF 155
Cdd:pfam03154   38 DLRSSGRNSPSAAStsSNDSKAESMKKSSKKIKEEAPSPLKSAKRQREKGASDTEEPErataKKSKTQEISRPNSPSEGE 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  156 LKfDSEPSAMALELPTRAFGLSETNESpavvleppvVSVEVPEPHILETlkPATKTAELSVASTSVISEQSEQSVAVTPE 235
Cdd:pfam03154  118 GE-SSDGRSVNDEGSSDPKDIDQDNRS---------TSPSIPSPQDNES--DSDSSAQQQILQTQPPVLQAQSGAASPPS 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  236 PSMTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSV--LKSVEST-SPEPSKIMLVEPPVAKVLEPSEtlvvSSET 312
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAapHTLIQQTpTLHPQRLPSPHPPLQPMTQPPP----PSQV 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  313 PTEVYPEPSTSTTMDfPESSAIEA------LRLPEQPVDVPSEIADS------SMTRPQELPELPKTTALELQESSVASA 380
Cdd:pfam03154  262 SPQPLPQPSLHGQMP-PMPHSLQTgpshmqHPVPPQPFPLTPQSSQSqvppgpSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  381 MELPGPPA-TSMPELQGPPVTPVPELPGPSATPVP---ELPGPLSTPvPELPGPPA----TAVPELPGPSVTPVP-QL-- 449
Cdd:pfam03154  341 REQPLPPApLSMPHIKPPPTTPIPQLPNPQSHKHPphlSGPSPFQMN-SNLPPPPAlkplSSLSTHHPPSAHPPPlQLmp 419
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  450 -SQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPvTTTELEQPVGMTTVEHPGHP 528
Cdd:pfam03154  420 qSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGIQPPSSA 498
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1622920918  529 EVTTATGLLGQPEATmvleLPGQPVATTALELPGQPSVTGVPELPGLPSAT 579
Cdd:pfam03154  499 SVSSSGPVPAAVSCP----LPPVQIKEEALDEAEEPESPPPPPRSPSPEPT 545
rne PRK10811
ribonuclease E; Reviewed
1220-1381 8.90e-05

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 48.11  E-value: 8.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1220 ISEPSAVPTDYSMSASDPSVLVSEATVTVPEPPPEpessiTSTPVESAVVAEEHEVVPERPVtcmVSETPTVSAEPTVVA 1299
Cdd:PRK10811   843 IRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAA-----AVEPVVSAPVVEAVAEVVEEPV---VVAEPQPEEVVVVET 914
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1300 SEPPVLSETA-ETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLES 1378
Cdd:PRK10811   915 THPEVIAAPVtEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETV 994

                   ...
gi 1622920918 1379 STV 1381
Cdd:PRK10811   995 TAV 997
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
249-430 1.68e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.60  E-value: 1.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  249 PVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPE-PSKIMLVEPPVAKVLE-PSETLVVSSETPT-EVYPEPSTSTT 325
Cdd:NF033839   306 EKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEvKPQLETPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPETPKP 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  326 MDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTalelqeSSVASAMELPGPPATSMPELQGPPVTPVPEL 405
Cdd:NF033839   386 EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPK------PEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 459
                          170       180
                   ....*....|....*....|....*
gi 1622920918  406 PGPSATPVPELPGPLSTPVPELPGP 430
Cdd:NF033839   460 PKPEVKPQPEKPKPEVKPQPEKPKP 484
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
312-524 3.62e-03

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 42.36  E-value: 3.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  312 TPTEVYPEPSTSTTMdfPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPkttalelqESSVASAMELPG--PPAT 389
Cdd:TIGR01645  283 TPPDALLQPATVSAI--PAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATP--------SSSLPTDIGNKAvvSSAK 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  390 SMPELQG--PPVTPVPELPGPSATPVPELPGPLstPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPA------PSM 461
Cdd:TIGR01645  353 KEAEEVPplPQAAPAVVKPGPMEIPTPVPPPGL--AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVtfgaldDTL 430
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622920918  462 GLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPVTTTElEQPVGMTTVEH 524
Cdd:TIGR01645  431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMN-SEDASLASQEG 492
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
378-711 4.37e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 4.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  378 ASAMELPGPPATSMPELQGPPVTPVPELPGPSATPV-----------PELPGPLS--TPVPELPGPPATAVPELP----- 439
Cdd:pfam03154  193 QAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHtliqqtptlhpQRLPSPHPplQPMTQPPPPSQVSPQPLPqpslh 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  440 ------------GPSVTPVPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELP----EQPAVTVAMEL 503
Cdd:pfam03154  273 gqmppmphslqtGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQqpprEQPLPPAPLSM 352
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  504 TE-QPVTTTELEQPVGMTTVEHPGHpevttatgLLGQPEATMVLELPGQPVATTALELPG-QPSVTGVPELPGLPSATRA 581
Cdd:pfam03154  353 PHiKPPPTTPIPQLPNPQSHKHPPH--------LSGPSPFQMNSNLPPPPALKPLSSLSThHPPSAHPPPLQLMPQSQQL 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  582 LELSGQ-PVATGALELPGPLMAAgalefsgqsgaagalellgqPLATGVLELPGQPGAPELPGQPVATVAleISVQSVVT 660
Cdd:pfam03154  425 PPPPAQpPVLTQSQSLPPPAASH--------------------PPTSGLHQVPSQSPFPQHPFVPGGPPP--ITPPSGPP 482
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1622920918  661 TSELSTMTVSQS-LEVPSTTALESYNTVAQELPTTLVGETSVTVGVDPLMAP 711
Cdd:pfam03154  483 TSTSSAMPGIQPpSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPP 534
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
318-642 2.96e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 2.96e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  318 PEPSTSTTMDFPESSAIEALRLPEQPVDVPS---------EIADSSMTRPQELPELPKTTALELQESSVASAMELPGPPA 388
Cdd:PHA03247  2627 PPPSPSPAANEPDPHPPPTVPPPERPRDDPApgrvsrprrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPP 2706
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  389 TsmpelqgPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATavpelPGPSVTPVPQLSQELPGLPA---------- 458
Cdd:PHA03247  2707 T-------PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAV-----PAGPATPGGPARPARPPTTAgppapappaa 2774
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  459 PSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQ-PAVTVAMELTEQPVTtteleqPVGMTTVEHPGHPevTTATGLL 537
Cdd:PHA03247  2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAG------PLPPPTSAQPTAP--PPPPGPP 2846
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  538 GQPEATMVLELPGQPVATTAlelPGQPSVTgVPELPGLPSATR----ALELSGQPVATGALELPGPLMAAGALEFSGQSg 613
Cdd:PHA03247  2847 PPSLPLGGSVAPGGDVRRRP---PSRSPAA-KPAAPARPPVRRlarpAVSRSTESFALPPDQPERPPQPQAPPPPQPQP- 2921
                          330       340
                   ....*....|....*....|....*....
gi 1622920918  614 aagalELLGQPLATGVLELPGQPGAPELP 642
Cdd:PHA03247  2922 -----QPPPPPQPQPPPPPPPRPQPPLAP 2945
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
82-579 1.50e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 50.54  E-value: 1.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918   82 DLKEASRKSRCVSV--QTDPTDEIPTKKSKKHKKHKNKKKKKKKEKEKKYKRQPEESE----AKTKSHHDGNIDLESDSF 155
Cdd:pfam03154   38 DLRSSGRNSPSAAStsSNDSKAESMKKSSKKIKEEAPSPLKSAKRQREKGASDTEEPErataKKSKTQEISRPNSPSEGE 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  156 LKfDSEPSAMALELPTRAFGLSETNESpavvleppvVSVEVPEPHILETlkPATKTAELSVASTSVISEQSEQSVAVTPE 235
Cdd:pfam03154  118 GE-SSDGRSVNDEGSSDPKDIDQDNRS---------TSPSIPSPQDNES--DSDSSAQQQILQTQPPVLQAQSGAASPPS 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  236 PSMTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSV--LKSVEST-SPEPSKIMLVEPPVAKVLEPSEtlvvSSET 312
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAapHTLIQQTpTLHPQRLPSPHPPLQPMTQPPP----PSQV 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  313 PTEVYPEPSTSTTMDfPESSAIEA------LRLPEQPVDVPSEIADS------SMTRPQELPELPKTTALELQESSVASA 380
Cdd:pfam03154  262 SPQPLPQPSLHGQMP-PMPHSLQTgpshmqHPVPPQPFPLTPQSSQSqvppgpSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  381 MELPGPPA-TSMPELQGPPVTPVPELPGPSATPVP---ELPGPLSTPvPELPGPPA----TAVPELPGPSVTPVP-QL-- 449
Cdd:pfam03154  341 REQPLPPApLSMPHIKPPPTTPIPQLPNPQSHKHPphlSGPSPFQMN-SNLPPPPAlkplSSLSTHHPPSAHPPPlQLmp 419
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  450 -SQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPvTTTELEQPVGMTTVEHPGHP 528
Cdd:pfam03154  420 qSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGIQPPSSA 498
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1622920918  529 EVTTATGLLGQPEATmvleLPGQPVATTALELPGQPSVTGVPELPGLPSAT 579
Cdd:pfam03154  499 SVSSSGPVPAAVSCP----LPPVQIKEEALDEAEEPESPPPPPRSPSPEPT 545
rne PRK10811
ribonuclease E; Reviewed
1220-1381 8.90e-05

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 48.11  E-value: 8.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1220 ISEPSAVPTDYSMSASDPSVLVSEATVTVPEPPPEpessiTSTPVESAVVAEEHEVVPERPVtcmVSETPTVSAEPTVVA 1299
Cdd:PRK10811   843 IRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAA-----AVEPVVSAPVVEAVAEVVEEPV---VVAEPQPEEVVVVET 914
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1300 SEPPVLSETA-ETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLES 1378
Cdd:PRK10811   915 THPEVIAAPVtEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETV 994

                   ...
gi 1622920918 1379 STV 1381
Cdd:PRK10811   995 TAV 997
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
387-558 1.45e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 47.17  E-value: 1.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  387 PATSMPELQGPPVTPVPeLPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPSMGLEPP 466
Cdd:PRK07994   361 PAAPLPEPEVPPQSAAP-AASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAK 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  467 QEVPEPPVMAQELPGL-----PLVTAAVELPEQPAVTVAMELTEQPVTTTELEQPVGMT----TVEHPGHPEVTTATGLL 537
Cdd:PRK07994   440 KSEPAAASRARPVNSAlerlaSVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKalkkALEHEKTPELAAKLAAE 519
                          170       180
                   ....*....|....*....|....*.
gi 1622920918  538 GQPE---ATMV--LELPGqPVATTAL 558
Cdd:PRK07994   520 AIERdpwAALVsqLGLPG-LVEQLAL 544
PHA03247 PHA03247
large tegument protein UL36; Provisional
233-525 1.53e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 1.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  233 TPEPSMTKILdsfAAAPVPTTTVVLKSSEPVVTMSveyqmksvlksveSTSPEPSKIMLVEPPVAKVLEPSETLVVSSET 312
Cdd:PHA03247  2707 TPEPAPHALV---SATPLPPGPAAARQASPALPAA-------------PAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  313 PTEV---YPEPSTSTTMDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQEL----PELPKTTALElqessvASAMELPG 385
Cdd:PHA03247  2771 PPAApaaGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspagPLPPPTSAQP------TAPPPPPG 2844
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  386 PPATSMP-----------ELQGPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELP 454
Cdd:PHA03247  2845 PPPPSLPlggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP 2924
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622920918  455 GLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPA-----VTVAMELTEQPVTTTELEQPVGMTTVEHP 525
Cdd:PHA03247  2925 PPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAlvpgrVAVPRFRVPQPAPSREAPASSTPPLTGHS 3000
rne PRK10811
ribonuclease E; Reviewed
1196-1509 1.73e-04

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 46.96  E-value: 1.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1196 PAEVPSLPSEESVSQPEPPVSQSEISEP--------------SAVPTDYSMSASDPSVLVSEATVTVPEPPPEPESSITS 1261
Cdd:PRK10811   691 QQEAKALNVEEQSVQETEQEERVQQVQPrrkqrqlnqkvrieQSVAEEAVAPVVEETVAAEPVVQEVPAPRTELVKVPLP 770
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1262 TPVESAVVAEEH--------EVVPER----PVTCMVS----------ETPTVSAEPTVVA-------------SEPPVLS 1306
Cdd:PRK10811   771 VVAQTAPEQDEEnnaenrdnNGMPRRsrrsPRHLRVSgqrrrryrdeRYPTQSPMPLTVAcaspemasgkvwiRYPVVRP 850
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1307 ETAETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLESSTVTvlep 1386
Cdd:PRK10811   851 QDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTE---- 926
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1387 svvtvpePPVVAEPDYITIPVPVVSVLEPSVPVLEPAVSVLQPS----MIVSEPSVSVQESTVTVSEPAVTVSEQTQVIP 1462
Cdd:PRK10811   927 -------QPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAetaeVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEP 999
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 1622920918 1463 TEVAIESTPMILESSIMSSHVMKGinlPsgdqnlAPEIgMPEIPLHS 1509
Cdd:PRK10811  1000 EVAPAQVPEATVEHNHATAPMTRA---P------APEY-VPEAPRHS 1036
PHA03378 PHA03378
EBNA-3B; Provisional
176-480 2.98e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.21  E-value: 2.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  176 LSETNESPAVVLEPPVVSVEVPEPhILETLKPATKTAELSVASTSViseqseqsvavtpEPSMTKILDSFAAAPVPTTTV 255
Cdd:PHA03378   481 LPHPQVTPVILHQPPAQGVQAHGS-MLDLLEKDDEDMEQRVMATLL-------------PPSPPQPRAGRRAPCVYTEDL 546
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  256 VLKSSEPVVTMSVEYQMKSV-----LKSVESTSPEPSKIMLVEPPVAKVLEPSETLVVSSETPTEVYPEPSTSTTMDFPE 330
Cdd:PHA03378   547 DIESDEPASTEPVHDQLLPApglgpLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPM 626
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  331 SSAIEALRL---------------PEQPVDVPSEIADSSMTRPQELPELPKTT--ALELQESSVASAMELPGPPATSMPE 393
Cdd:PHA03378   627 PLRPIPMRPlrmqpitfnvlvfptPHQPPQVEITPYKPTWTQIGHIPYQPSPTgaNTMLPIQWAPGTMQPPPRAPTPMRP 706
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  394 LQGPPV-------TPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSqelPGLPAPsmglepp 466
Cdd:PHA03378   707 PAAPPGraqrpaaATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA---PGAPTP------- 776
                          330
                   ....*....|....
gi 1622920918  467 QEVPEPPVMAQELP 480
Cdd:PHA03378   777 QPPPQAPPAPQQRP 790
rne PRK10811
ribonuclease E; Reviewed
1180-1345 5.51e-04

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 45.42  E-value: 5.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1180 PALPTEQSALTAENTWPAEVpslpSEESVSQPEPPVSQSEISEPSAVPtdysMSASDPSVL---VSEATVTVPEpppepe 1256
Cdd:PRK10811   868 PVVAEVPVAAAVEPVVSAPV----VEAVAEVVEEPVVVAEPQPEEVVV----VETTHPEVIaapVTEQPQVITE------ 933
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918 1257 ssiTSTPVESAVVAEEHEVVPERPVTcmVSETPTVSAEPTVVAsEPPVLSETAETFESMRAsgyvASEVSTSLLEPAVTT 1336
Cdd:PRK10811   934 ---SDVAVAQEVAEHAEPVVEPQDET--ADIEEAAETAEVVVA-EPEVVAQPAAPVVAEVA----AEVETVTAVEPEVAP 1003

                   ....*....
gi 1622920918 1337 PVLAESILE 1345
Cdd:PRK10811  1004 AQVPEATVE 1012
DUF3729 pfam12526
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ...
369-452 5.66e-04

Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.


Pssm-ID: 372164 [Multi-domain]  Cd Length: 115  Bit Score: 41.60  E-value: 5.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  369 ALELQESSVASAMELPGPPATSMPelqgPPVTPVPELPGPSATPVPELPGPlsTPVPELPGPPATAVPELPGPSVTPVPQ 448
Cdd:pfam12526   31 PPESAHPDPPPPVGDPRPPVVDTP----PPVSAVWVLPPPSEPAAPEPDLV--PPVTGPAGPPSPLAPPAPAQKPPLPPP 104

                   ....
gi 1622920918  449 LSQE 452
Cdd:pfam12526  105 RPQR 108
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
319-677 5.89e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 5.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  319 EPSTSTTMDFPESSAIEALRLPeQPVDVPSEIADSSMTRPQEL---PELPKTTALELQESSVASAMELP-GPPATSMPEL 394
Cdd:PHA03307     1 SDNAPDLYDLIEAAAEGGEFFP-RPPATPGDAADDLLSGSQGQlvsDSAELAAVTVVAGAAACDRFEPPtGPPPGPGTEA 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  395 QGPPVTPVPELPGPSATPVPELPGPLSTP--VPELPGPPATAVPELPGPSvtPVPQLSQELPGLPAPSMGLEPPQEVPEP 472
Cdd:PHA03307    80 PANESRSTPTWSLSTLAPASPAREGSPTPpgPSSPDPPPPTPPPASPPPS--PAPDLSEMLRPVGSPGPPPAASPPAAGA 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  473 PVMAqelpglplVTAAVELPEQPAVTVAM-ELTEQPVTTTELEQPVGMTTVEHPGHPEVttatglLGQPEATMVLELPGQ 551
Cdd:PHA03307   158 SPAA--------VASDAASSRQAALPLSSpEETARAPSSPPAEPPPSTPPAAASPRPPR------RSSPISASASSPAPA 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  552 PVATTALELPGQPSVTGVPELPGLPSATRALELSGQPvatGALELPGPLMAA-----GALEFSGQSGAAGALELLGQPla 626
Cdd:PHA03307   224 PGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRP---APITLPTRIWEAsgwngPSSRPGPASSSSSPRERSPSP-- 298
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1622920918  627 tgvleLPGQPGAPELPGQPVAtVALEISVQSVVTTSELSTMTVSQSLEVPS 677
Cdd:PHA03307   299 -----SPSSPGSGPAPSSPRA-SSSSSSSRESSSSSTSSSSESSRGAAVSP 343
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
379-445 1.27e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.90  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  379 SAMELPG--------PPATSMPELQGP-----PVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTP 445
Cdd:PRK14959   384 SAAEGPAsggaatipTPGTQGPQGTAPaagmtPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVP 463
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
249-430 1.68e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.60  E-value: 1.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  249 PVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPE-PSKIMLVEPPVAKVLE-PSETLVVSSETPT-EVYPEPSTSTT 325
Cdd:NF033839   306 EKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEvKPQLETPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPETPKP 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  326 MDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTalelqeSSVASAMELPGPPATSMPELQGPPVTPVPEL 405
Cdd:NF033839   386 EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPK------PEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 459
                          170       180
                   ....*....|....*....|....*
gi 1622920918  406 PGPSATPVPELPGPLSTPVPELPGP 430
Cdd:NF033839   460 PKPEVKPQPEKPKPEVKPQPEKPKP 484
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
332-520 2.07e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 2.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  332 SAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTALELQESSVASAMELPGP---PATSMPELQGPPVTPVPElPGP 408
Cdd:PRK12323   376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPealAAARQASARGPGGAPAPA-PAP 454
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  409 SATPVPELPGPLSTPVPelPGPPATAVPELPGPSVTPVPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAA 488
Cdd:PRK12323   455 AAAPAAAARPAAAGPRP--VAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATA 532
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1622920918  489 VELPEQPAVTVAMELTEQPVTTTELEQPVGMT 520
Cdd:PRK12323   533 DPDDAFETLAPAPAAAPAPRAAAATEPVVAPR 564
PHA03377 PHA03377
EBNA-3C; Provisional
159-445 3.18e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 42.73  E-value: 3.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  159 DSEPSAMALELPTRAFGLSETNESPAVVleppVVSVEVPEPHILETLKPATKTAELSVASTSVISEQSEQSVAVTPEPSM 238
Cdd:PHA03377   380 DVELESSDDELPYIDPNMEPVQQRPVMF----VSRVPWRKPRTLPWPTPKTHPVKRTLVKTSGRSDEAEQAQSTPERPGP 455
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  239 TKildsfaAAPVPTttvvlkssEPVVTMSVEYQMKSVLKSVESTSPEPskimlVEPPVAKVLEPSETLVVSSETPTEVyp 318
Cdd:PHA03377   456 SD------QPSVPV--------EPAHLTPVEHTTVILHQPPQSPPTVA-----IKPAPPPSRRRRGACVVYDDDIIEV-- 514
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  319 epststtMDFPESSAIEALRLPEQPVDVPSEIADSSmTRPQELPELPKTTALElqessvasamelPGPPATSmPELQGPP 398
Cdd:PHA03377   515 -------IDVETTEEEESVTQPAKPHRKVQDGFQRS-GRRQKRATPPKVSPSD------------RGPPKAS-PPVMAPP 573
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 1622920918  399 VTPVPELPGPSATPvPELPGPLSTP---VPELPGPPATAVPELPGPSVTP 445
Cdd:PHA03377   574 STGPRVMATPSTGP-RDMAPPSTGPrqqAKCKDGPPASGPHEKQPPSSAP 622
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
312-524 3.62e-03

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 42.36  E-value: 3.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  312 TPTEVYPEPSTSTTMdfPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPkttalelqESSVASAMELPG--PPAT 389
Cdd:TIGR01645  283 TPPDALLQPATVSAI--PAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATP--------SSSLPTDIGNKAvvSSAK 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  390 SMPELQG--PPVTPVPELPGPSATPVPELPGPLstPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPA------PSM 461
Cdd:TIGR01645  353 KEAEEVPplPQAAPAVVKPGPMEIPTPVPPPGL--AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVtfgaldDTL 430
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622920918  462 GLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPVTTTElEQPVGMTTVEH 524
Cdd:TIGR01645  431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMN-SEDASLASQEG 492
PHA03379 PHA03379
EBNA-3A; Provisional
161-462 4.10e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 42.35  E-value: 4.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  161 EPSAMALELPTRAFGLSETNESPAVVLEPPVVSVEVPEP--HILETLKPAtktaeLSVASTSVISEQSEQSVAVTPEPSM 238
Cdd:PHA03379   463 APCPVAQLPPGPLQDLEPGDQLPGVVQDGRPACAPVPAPagPIVRPWEAS-----LSQVPGVAFAPVMPQPMPVEPVPVP 537
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  239 TKILDSfAAAPVPTTTVVLKSSEPvvTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKV------------LEPSETL 306
Cdd:PHA03379   538 TVALER-PVCPAPPLIAMQGPGET--SGIVRVRERWRPAPWTPNPPRSPSQMSVRDRLARLraeaqpyqasveVQPPQLT 614
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  307 VVSSETPTEVYPEPSTSTTMDFPESSAIEALRLPEQPVDVPSEIaDSSMTRPQE--------------LPELPKTTALEL 372
Cdd:PHA03379   615 QVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGVPAMQPQYF-DLPLQQPISqgaplaplrasmgpVPPVPATQPQYF 693
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  373 Q-------ESSVASAMELPGPPATS--MPELQGPPVTPVPELPGPSATPVPELPGPLSTPVPE-LPGPPATAVPELPGPS 442
Cdd:PHA03379   694 DipltepiNQGASAAHFLPQQPMEGplVPERWMFQGATLSQSVRPGVAQSQYFDLPLTQPINHgAPAAHFLHQPPMEGPW 773
                          330       340
                   ....*....|....*....|
gi 1622920918  443 VtPVPQLSQELPGLPAPSMG 462
Cdd:PHA03379   774 V-PEQWMFQGAPPSQGTDVV 792
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
378-711 4.37e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 4.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  378 ASAMELPGPPATSMPELQGPPVTPVPELPGPSATPV-----------PELPGPLS--TPVPELPGPPATAVPELP----- 439
Cdd:pfam03154  193 QAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHtliqqtptlhpQRLPSPHPplQPMTQPPPPSQVSPQPLPqpslh 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  440 ------------GPSVTPVPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELP----EQPAVTVAMEL 503
Cdd:pfam03154  273 gqmppmphslqtGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQqpprEQPLPPAPLSM 352
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  504 TE-QPVTTTELEQPVGMTTVEHPGHpevttatgLLGQPEATMVLELPGQPVATTALELPG-QPSVTGVPELPGLPSATRA 581
Cdd:pfam03154  353 PHiKPPPTTPIPQLPNPQSHKHPPH--------LSGPSPFQMNSNLPPPPALKPLSSLSThHPPSAHPPPLQLMPQSQQL 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  582 LELSGQ-PVATGALELPGPLMAAgalefsgqsgaagalellgqPLATGVLELPGQPGAPELPGQPVATVAleISVQSVVT 660
Cdd:pfam03154  425 PPPPAQpPVLTQSQSLPPPAASH--------------------PPTSGLHQVPSQSPFPQHPFVPGGPPP--ITPPSGPP 482
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1622920918  661 TSELSTMTVSQS-LEVPSTTALESYNTVAQELPTTLVGETSVTVGVDPLMAP 711
Cdd:pfam03154  483 TSTSSAMPGIQPpSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPP 534
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
340-460 5.65e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 5.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  340 PEQPVDVPSEIADSSMTRPQELPELPKTTALELQESSVASAMELPGPPATSMPE-LQGPPVTPVPELPGPSATPVPELPG 418
Cdd:PRK14951   371 EAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPApVAAPAAAAPAAAPAAAPAAVALAPA 450
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1622920918  419 PLSTPVPELPGPPATAVPELPGPSVTPVPqlsqelPGLPAPS 460
Cdd:PRK14951   451 PPAQAAPETVAIPVRVAPEPAVASAAPAP------AAAPAAA 486
PHA02030 PHA02030
hypothetical protein
326-440 5.97e-03

hypothetical protein


Pssm-ID: 222843 [Multi-domain]  Cd Length: 336  Bit Score: 41.50  E-value: 5.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  326 MDFPeSSAIEALRLPEQPVDVPSEIADSSMTrpqeLPELPKTTAlelqessvaSAMELPGPPATSMPELQGPPVTPVPEL 405
Cdd:PHA02030   236 TDFP-GSALHILLGGGEDLIIKPKSKAAGSN----LPAVPNVAA---------DAGSAAAPAVPAAAAAVAQAAPSVPQV 301
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1622920918  406 PGPSATPVPELPGPLSTP-VPELPGPPatAVPELPG 440
Cdd:PHA02030   302 PNVAVLPDVPQVAPVAAPaAPEVPAVP--VVPAAPQ 335
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
205-446 5.98e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 5.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  205 LKPATKTAELSVASTSVISEQSEQSVAVTPEPSMTKILDSFAAAPVPTTTVVLKSSEPVVTMsveyqmksvLKSVESTSP 284
Cdd:pfam05109  396 LGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTN---------LTAPASTGP 466
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  285 EPSkimlveppVAKVLEPSETLVVSSETPTEVYPEPSTSTTmdfpESSAIEaLRLPEQPVDVPSEIADSSMtrPQELPEL 364
Cdd:pfam05109  467 TVS--------TADVTSPTPAGTTSGASPVTPSPSPRDNGT----ESKAPD-MTSPTSAVTTPTPNATSPT--PAVTTPT 531
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  365 PKTTALELQESSVASAMELPGPPATS-MPELQGP-PVTPVPELPGPSATPVPELPGPLSTP--VPELPGPPATAVPELPG 440
Cdd:pfam05109  532 PNATSPTLGKTSPTSAVTTPTPNATSpTPAVTTPtPNATIPTLGKTSPTSAVTTPTPNATSptVGETSPQANTTNHTLGG 611

                   ....*.
gi 1622920918  441 PSVTPV 446
Cdd:pfam05109  612 TSSTPV 617
PHA03291 PHA03291
envelope glycoprotein I; Provisional
362-451 8.50e-03



Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 41.09  E-value: 8.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  362 PELPkTTALELQESSV-ASAMELPGPPATSMPELQGPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVP---E 437
Cdd:PHA03291   188 PALP-LSAPRLGPADVfVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPgggE 266
                           90
                   ....*....|....
gi 1622920918  438 LPGPSVTPVPQLSQ 451
Cdd:PHA03291   267 APPANATPAPEASR 280
PHA03247 PHA03247
large tegument protein UL36; Provisional
401-706 8.95e-03



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 8.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  401 PVPELPGPSATPVPELPGPLSTPVPELPGPPATA---VPELPGPSVTP-VPQLSQELPGLPAPSMGLEPPQEV-----PE 471
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSrarRPDAPPQSARPrAPVDDRGDPRGPAPPSPLPPDTHApdpppPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  472 PPVMAQELPGLPlvTAAVELPEQPAVTVAMELTEQPVTTTELEQPVGMT-TVEHPGHPEVTTATGLL---GQPEATMVLE 547
Cdd:PHA03247  2631 PSPAANEPDPHP--PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASsPPQRPRRRAARPTVGSLtslADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  548 LPGQPVATTALELPGQPSVTGvPELPGLPSATRALELSGQPVATGALELPG-PLMAAGAlefsgQSGAAGALELLGQPLA 626
Cdd:PHA03247  2709 EPAPHALVSATPLPPGPAAAR-QASPALPAAPAPPAVPAGPATPGGPARPArPPTTAGP-----PAPAPPAAPAAGPPRR 2782
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  627 TGVleLPGQPGAPELPGQPVATVALEISVQSVVTTSELSTMTVSQSLEVPSTTALESYNTVAQE-LPTTLVGETSVTVGV 705
Cdd:PHA03247  2783 LTR--PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGpPPPSLPLGGSVAPGG 2860

                   .
gi 1622920918  706 D 706
Cdd:PHA03247  2861 D 2861
PHA03247 PHA03247
large tegument protein UL36; Provisional
386-509 9.02e-03



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 9.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  386 PPATSMPELQGPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATA--VPELPGPSVTPVPQLSQELPGLPAPSMGL 463
Cdd:PHA03247   379 SLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAppPPATPLPSAEPGSDDGPAPPPERQPPAPA 458
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1622920918  464 EPPQEVPEPPVMAQELPGLplvtAAVELPEQPAVTVAMELTEQPVT 509
Cdd:PHA03247   459 TEPAPDDPDDATRKALDAL----RERRPPEPPGADLAELLGRHPDT 500
dnaA PRK14086
chromosomal replication initiator protein DnaA;
369-460 9.72e-03



Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 40.96  E-value: 9.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920918  369 ALELQESSVASAMELPGPPATSMPELQGPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQ 448
Cdd:PRK14086    85 AITVDPSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADD 164
                           90
                   ....*....|..
gi 1622920918  449 LSQELPGLPAPS 460
Cdd:PRK14086   165 YGWQQQRLGFPP 176
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
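The per-hit summary lines in this report (Pssm-ID, Cd Length, Bit Score, E-value) can be checked against the E-value threshold listed above. The following is a minimal sketch, not part of any NCBI tool: the names `SUMMARY_RE`, `parse_summary`, and `hits_within_threshold` are hypothetical, the regular expression is inferred only from the line layout shown in this report, and treating the threshold as inclusive (`<=`) is an assumption.

```python
import re

# Regular expression inferred from the summary lines in this report, e.g.
# "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 5.98e-03"
# (assumption: other CD-Search reports use the same layout).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\]\s*)?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def hits_within_threshold(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (assumed inclusive)."""
    parsed = (parse_summary(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] <= threshold]


# Three summary lines copied from this report.
lines = [
    "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 5.98e-03",
    "Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 41.09  E-value: 8.50e-03",
    "Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 40.96  E-value: 9.72e-03",
]

for hit in hits_within_threshold(lines):
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Passing a stricter `threshold` to `hits_within_threshold` narrows the list to the stronger hits; lines that do not match the pattern are skipped rather than raising an error.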
