|
Name |
Accession |
Description |
Interval |
E-value |
| DUF4795 |
pfam16043 |
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ... |
729-919 |
3.70e-53 |
|
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.
Pssm-ID: 464990 [Multi-domain] Cd Length: 181 Bit Score: 184.04 E-value: 3.70e-53
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 729 TTVDILQKKIGSLQksrlkeEELERIWGNQIEMMKDryitldkavenLQIRMDEFKTLQAQIKRLEMNKVNKSTMEEELR 808
Cdd:pfam16043 7 ELLDQLQALILDLQ------EELEKLSETTSELSER-----------LQQRQKHLEALYQQIEKLEKVKADKEVVEEELD 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 809 EKADRSALAGKASRVDLETVALELNEMIQGILFKVTIHEDSWKKAMEELSKDVNTKLVHSDLDPLKKEMEEVWKIVRKLL 888
Cdd:pfam16043 70 EKADKEALASKVSRDQFDETLEELNQMLQELLDKLEGQEDAWKKALETLSEELDTKLDRLELDPLKELLERRIKALQKLL 149
|
170 180 190
....*....|....*....|....*....|..
gi 1908918739 889 IEGLRLDPD-SAAGFRRKLFKRVKCISCDRPV 919
Cdd:pfam16043 150 QEGSEELDEaEAAGFRKKLLERFHCISCDRPV 181
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
286-436 |
7.24e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 60.34 E-value: 7.24e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 286 PELLPEGSSAQAVSLSRaQEPAQPPALTP--ESAPGCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVTPGSL 363
Cdd:PHA03247 2848 PSLPLGGSVAPGGDVRR-RPPSRSPAAKPaaPARPPVRRLARPAVSRSTESFA--------LPPDQPERPPQPQAPPPPQ 2918
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 364 PAPWPVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRvGSWPLWDLGVLRP----------TQPQPSR---APPPATEF 430
Cdd:PHA03247 2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS-GAVPQPWLGALVPgrvavprfrvPQPAPSReapASSTPPLT 2997
|
....*.
gi 1908918739 431 GSLWPR 436
Cdd:PHA03247 2998 GHSLSR 3003
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
290-444 |
6.54e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.26 E-value: 6.54e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 290 PEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPV 369
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 370 LGPVPAPGAQPPPLGDW------------PALP-----RRWPLPQGWPRVGSWPLWDLGVLRPTQPQ-PSRAPPPATEFG 431
Cdd:PHA03247 2846 PPPSLPLGGSVAPGGDVrrrppsrspaakPAAParppvRRLARPAVSRSTESFALPPDQPERPPQPQaPPPPQPQPQPPP 2925
|
170
....*....|...
gi 1908918739 432 SLWPRPLQPYQSR 444
Cdd:PHA03247 2926 PPQPQPPPPPPPR 2938
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
293-478 |
1.30e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 56.15 E-value: 1.30e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 293 SSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPgslPAPWPVLGP 372
Cdd:PRK07764 617 APAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA---PAPAAPAAP 693
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 373 VPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLW------DLGVLRPTQPQPSRAPPPATEFGslwPRPLQPYQSRQG 446
Cdd:PRK07764 694 AGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASapspaaDDPVPLPPEPDDPPDPAGAPAQP---PPPPAPAPAAAP 770
|
170 180 190
....*....|....*....|....*....|..
gi 1908918739 447 EALQLAAVQVKGEENDVPSLRGLRERARKDGA 478
Cdd:PRK07764 771 AAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAE 802
|
|
| PHA03201 |
PHA03201 |
uracil DNA glycosylase; Provisional |
301-399 |
2.62e-07 |
|
uracil DNA glycosylase; Provisional
Pssm-ID: 165468 Cd Length: 318 Bit Score: 53.74 E-value: 2.62e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 301 SRAQEPAQ---PPALTPESAPGCTTEFAPG--PAPGTEPVPGLELGlelEPVPALGPVPGPS-VTPGSLPAPWPVLGPVP 374
Cdd:PHA03201 6 SRSPSPPRrpsPPRPTPPRSPDASPEETPPspPGPGAEPPPGRAAG---PAAPRRRPRGCPAgVTFSSSAPPRPPLGLDD 82
|
90 100
....*....|....*....|....*
gi 1908918739 375 APGAQPPPLgDWPALPRRWPLPQGW 399
Cdd:PHA03201 83 APAATPPPL-DWTEFRRRFLVGDAW 106
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
285-487 |
2.63e-07 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 54.88 E-value: 2.63e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 285 VPELLPEGSSAQAVSLSRAQ---EPAQPPALTPESAPGCTTEFAPGPAPgtEPVPGLELGLELEPVPALGPVPGPSVTPG 361
Cdd:PRK12323 382 VAQPAPAAAAPAAAAPAPAAppaAPAAAPAAAAAARAVAAAPARRSPAP--EALAAARQASARGPGGAPAPAPAPAAAPA 459
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 362 SLPAPwpvlgpvPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGVLRPTQPQPSR-APPPATEFGSLWPRPLQP 440
Cdd:PRK12323 460 AAARP-------AAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQpDAAPAGWVAESIPDPATA 532
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1908918739 441 YQSRQGEALQLAAVQVKGEENDVPSLRGLRERARKDGAPKDRTRKDG 487
Cdd:PRK12323 533 DPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDG 579
|
|
| MISS |
pfam15822 |
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ... |
283-427 |
3.50e-07 |
|
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK-substrate that is stabilized in MII that specifically regulates MII spindle integrity during the CSF arrest.
Pssm-ID: 318115 [Multi-domain] Cd Length: 238 Bit Score: 52.68 E-value: 3.50e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 283 YEVPELLPEGSSAQ--AVSLSRAQEPAQ-----PPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLeLEPVPALGPVPG 355
Cdd:pfam15822 1 FSLADALPEQSPAKtsAVSNPKPGQPPQgwpgsNPWNNPSAPPAVPSGLPPSTAPSTVPFGPAPTGM-YPSIPLTGPSPG 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 356 P---------SVTPGSLPAPWPVL-GPVPaPGAQPPPLGDWPALPRRW--------PLPQG-WPRVGSWPlWDLGV---- 412
Cdd:pfam15822 80 PpapfppsgpSCPPPGGPYPAPTVpGPGP-IGPYPTPNMPFPELPRPYgaptdpaaAAPSGpWGSMSSGP-WAPGMggqy 157
|
170
....*....|....*
gi 1908918739 413 LRPTQPQPSRAPPPA 427
Cdd:pfam15822 158 PAPNMPYPSPGPYPA 172
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
286-422 |
5.15e-07 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 54.30 E-value: 5.15e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 286 PELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGS-LP 364
Cdd:PHA03378 697 PPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGApTP 776
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1908918739 365 APWPVLGPVP------APGAQPPPLGDWPAL---PRRWPLPQGWPRVGSWPLWDLGVL--RPTQPQPSR 422
Cdd:PHA03378 777 QPPPQAPPAPqqrprgAPTPQPPPQAGPTSMqlmPRAAPGQQGPTKQILRQLLTGGVKrgRPSLKKPAA 845
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
288-467 |
6.15e-07 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 53.92 E-value: 6.15e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 288 LLPEGSSAQAVSLSRAQEPAQPPALTPESAPgcttefAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPW 367
Cdd:PHA03378 685 LPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQ------RPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPA 758
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 368 PVLGPVPAPGAQPPPLGDWPAlPRRWPLPQGWPRVGswplwdlgvlrPTQPQPSRAPPPATEFGslwprPLQPYQSRQGE 447
Cdd:PHA03378 759 AAPGRARPPAAAPGAPTPQPP-PQAPPAPQQRPRGA-----------PTPQPPPQAGPTSMQLM-----PRAAPGQQGPT 821
|
170 180
....*....|....*....|
gi 1908918739 448 ALQLAAVQVKGEENDVPSLR 467
Cdd:PHA03378 822 KQILRQLLTGGVKRGRPSLK 841
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
306-442 |
7.74e-07 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 53.53 E-value: 7.74e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 306 PAQPPALTPESAPGCTTEFAPGPA------PGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQ 379
Cdd:PHA03378 651 PHQPPQVEITPYKPTWTQIGHIPYqpsptgANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAA 730
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1908918739 380 PPPLGDWPALPRRWPLPQGWPrvgswplwdlGVLRPTQPQPSRAPPPATEFGSlwPRPLQPYQ 442
Cdd:PHA03378 731 PGRARPPAAAPGRARPPAAAP----------GRARPPAAAPGRARPPAAAPGA--PTPQPPPQ 781
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
307-482 |
1.37e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 52.76 E-value: 1.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 307 AQPPALTPESAP-GCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPPPLGD 385
Cdd:PHA03378 667 TQIGHIPYQPSPtGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARP 746
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 386 WPALPRRWPLPQGWPRVGSWPLWDLGvlRPTQPQPSRAPPPATEFGSLWPRPLQPYQSRQGeALQLAAVQVKGEENDVPS 465
Cdd:PHA03378 747 PAAAPGRARPPAAAPGRARPPAAAPG--APTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPT-SMQLMPRAAPGQQGPTKQ 823
|
170
....*....|....*...
gi 1908918739 466 -LRGLRERARKDGAPKDR 482
Cdd:PHA03378 824 iLRQLLTGGVKRGRPSLK 841
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
249-463 |
3.84e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 51.14 E-value: 3.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 249 EAALAQTTKYLEatRAIQVSEPVQNPQllqtvwhyevpellPEGSSAQAVSLSRAQEPAQP-PALTPESAPGCTTEFAPG 327
Cdd:PRK07764 371 ERGLLARLERLE--RRLGVAGGAGAPA--------------AAAPSAAAAAPAAAPAPAAAaPAAAAAPAPAAAPQPAPA 434
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 328 PAPGTEPvPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPpplgdwpalprrWPLPQGWPRVgswpl 407
Cdd:PRK07764 435 PAPAPAP-PSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPA------------APAPAAAPAA----- 496
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918739 408 wdlgvlrPTQPQPSRAPPPATEFGSLWPRPLQ--PYQSRQGEALQLAAVQVKGEENDV 463
Cdd:PRK07764 497 -------PAAPAAPAGADDAATLRERWPEILAavPKRSRKTWAILLPEATVLGVRGDT 547
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
226-418 |
4.65e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 4.65e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 226 DAMFTSEIGSSPLDLWQSVEQLPEAALAQTTKYLEATRAIQVSEPVQNPQLlqtvwHYEV-------PELLPEGSSAQAV 298
Cdd:PHA03247 295 DGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAMEVVSPLPRPRQ-----HYPLgfpkrrrPTWTPPSSLEDLS 369
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 299 SLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGlelglelEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPG- 377
Cdd:PHA03247 370 AGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPA-------APVPASVPTPAPTPVPASAPPPPATPLPSAEPGs 442
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 1908918739 378 --AQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGVLRPTQP 418
Cdd:PHA03247 443 ddGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEP 485
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
292-434 |
5.32e-06 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 50.48 E-value: 5.32e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 292 GSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPgtePVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLG 371
Cdd:PRK14951 367 AAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAA---APAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA 443
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1908918739 372 PVPAPGAQPPPLGDWPALPRRWPLPQGWPRvgswplwDLGVLRPTQPQPsrAPPPATEFGSLW 434
Cdd:PRK14951 444 AVALAPAPPAQAAPETVAIPVRVAPEPAVA-------SAAPAPAAAPAA--ARLTPTEEGDVW 497
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
290-447 |
1.06e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.60 E-value: 1.06e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 290 PEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLElglelEPVPALGPVPGPSVTPGSLPAPWPV 369
Cdd:PRK07764 632 AAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAP-----PPAPAPAAPAAPAGAAPAQPAPAPA 706
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918739 370 LGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGvLRPTQPQPSRAPPPATEFGSLWPRPLQPYQSRQGE 447
Cdd:PRK07764 707 ATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP-PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
270-446 |
2.53e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 2.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 270 PVQNPQLLQTVWHYEVPeLLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPA 349
Cdd:PHA03247 2704 PPPTPEPAPHALVSATP-LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 350 LGPVPGPSVTPG--SLPAPWpvlGPVPAPGAQPPPLgdwPALPrrwplPQGWPRVGSWPlwdlgvlrPTQPQPSRAPPPa 427
Cdd:PHA03247 2783 LTRPAVASLSESreSLPSPW---DPADPPAAVLAPA---AALP-----PAASPAGPLPP--------PTSAQPTAPPPP- 2842
|
170
....*....|....*....
gi 1908918739 428 tefgslwPRPLQPYQSRQG 446
Cdd:PHA03247 2843 -------PGPPPPSLPLGG 2854
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
248-446 |
2.83e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 48.33 E-value: 2.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 248 PEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYEVPELLPEGSSAQAVSLSRAQEPAQPPALTPESApgcttefAPG 327
Cdd:PRK12323 392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAA-------ARP 464
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 328 PAPGTEPVPGLELGLELEPVPALGPVPGPSVTPgslpaPW---PVLGPVPAPGAQPPPLGDWPAlprrwplpQGWPRVGS 404
Cdd:PRK12323 465 AAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP-----PWeelPPEFASPAPAQPDAAPAGWVA--------ESIPDPAT 531
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1908918739 405 WPLWDLGVLRPTQPQPSRAPPPATEFGSLWPrPLQPYQSRQG 446
Cdd:PRK12323 532 ADPDDAFETLAPAPAAAPAPRAAAATEPVVA-PRPPRASASG 572
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
290-395 |
3.95e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 47.95 E-value: 3.95e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 290 PEGSSAQAVSLSRAQEPAQPPALTPESAP---GCTTEFA-PGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPA 365
Cdd:PRK12323 469 PRPVAAAAAAAPARAAPAAAPAPADDDPPpweELPPEFAsPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAA 548
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1908918739 366 PWPV--LGPVPAPGAQPPPL----------GDWPALPRRWPL 395
Cdd:PRK12323 549 PAPRaaAATEPVVAPRPPRAsasglpdmfdGDWPALAARLPV 590
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
296-427 |
5.73e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.63 E-value: 5.73e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 296 QAVSLSRAQEPAQPP-ALTPESAP---GCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVT-PGSLPAPWPVL 370
Cdd:PHA03247 2666 RARRLGRAAQASSPPqRPRRRAARptvGSLTSLADPPPPPPTPEP--------APHALVSATPLPPGPaAARQASPALPA 2737
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 1908918739 371 GPVPAPGAQPPPLGDWPALPRRWPLPQGWPRvgSWPLWDlgvlrPTQPQPSRAPPPA 427
Cdd:PHA03247 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPA--PAPPAA-----PAAGPPRRLTRPA 2787
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
301-445 |
9.34e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.86 E-value: 9.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 301 SRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVTPGSLPAPwpvlGPVPAPgaqP 380
Cdd:PHA03247 2584 SRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPP--------DTHAPDPPPPSPSPAANEPDPH----PPPTVP---P 2648
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 381 PPLGDWPALPRRWPL----------------PQGW------PRVGswPLWDLGVLRPTQPQPSRAPPPATefgSLWPRPL 438
Cdd:PHA03247 2649 PERPRDDPAPGRVSRprrarrlgraaqasspPQRPrrraarPTVG--SLTSLADPPPPPPTPEPAPHALV---SATPLPP 2723
|
....*..
gi 1908918739 439 QPYQSRQ 445
Cdd:PHA03247 2724 GPAAARQ 2730
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
306-445 |
9.81e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 46.68 E-value: 9.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 306 PAQPPALTPESA-----PGCTTEFAPGPAPGTEPVPglelgleLEPVPALGPVP----GPSVTPGSLPAPWPVLGPVPAP 376
Cdd:pfam03154 183 PPSPPPPGTTQAatagpTPSAPSVPPQGSPATSQPP-------NQTQSTAAPHTliqqTPTLHPQRLPSPHPPLQPMTQP 255
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1908918739 377 G--------AQPPPLGDWPALPRRWPLPQGwPRVGSWPLWDLGVLRPTQPQPSRAPPPatefgslwPRPLQPYQSRQ 445
Cdd:pfam03154 256 PppsqvspqPLPQPSLHGQMPPMPHSLQTG-PSHMQHPVPPQPFPLTPQSSQSQVPPG--------PSPAAPGQSQQ 323
|
|
| DUF4813 |
pfam16072 |
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. ... |
291-396 |
1.90e-04 |
|
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 345 and 672 amino acids in length.
Pssm-ID: 435117 [Multi-domain] Cd Length: 288 Bit Score: 44.75 E-value: 1.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 291 EGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEP-VPGLELGlELEPVPALGPVPGPSVTPG--SLPAPW 367
Cdd:pfam16072 153 SAGSGTTVINAGGQQPAAPAAPAYPVAPAAYPAQAPAAAPAPAPgAPQTPLA-PLNPVAAAPAAAAGAAAAPvvAAAAPA 231
|
90 100 110
....*....|....*....|....*....|....*
gi 1908918739 368 PVLGPVPAPGAqPPPLGDWPA------LPRRWPLP 396
Cdd:pfam16072 232 AAAPPPPAPAA-PPADAAPPApggiicVPVRVPEP 265
|
|
| FAP |
pfam07174 |
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ... |
292-399 |
2.99e-04 |
|
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
Pssm-ID: 429334 Cd Length: 301 Bit Score: 44.15 E-value: 2.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 292 GSSAQAVSL---SRAQEPAQPPALTPESAPgctteFAPGPAPgtePVPGlelglelEPVPAlgPVPGPSVTPGSLPAPWP 368
Cdd:pfam07174 25 GASAVAVALpavAHADPEPAPPPPSTATAP-----PAPPPPP---PAPA-------APAPP--PPPAAPNAPNAPPPPAD 87
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1908918739 369 VLGPVPAPG--AQPPPLGDWPALPR-----------RWPLPQGW 399
Cdd:pfam07174 88 PNAPPPPPAdpNAPPPPAVDPNAPEpgridnavggfSYVVPAGW 131
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
292-498 |
3.27e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 44.98 E-value: 3.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 292 GSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSV---TPGSLPAPWP 368
Cdd:PRK07764 591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVavpDASDGGDGWP 670
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 369 VLGPVPAPGAQPPPLGDWP-----------ALPRRWPLPQGWPRVGSWPLWdlgvlrPTQPQPSRAPPPATEFGSLWPRP 437
Cdd:PRK07764 671 AKAGGAAPAAPPPAPAPAApaapagaapaqPAPAPAATPPAGQADDPAAQP------PQAAQGASAPSPAADDPVPLPPE 744
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1908918739 438 LQPYQSRQGEALQLAAVQVKGEENDVPSLRGLRERARKDGAPKDRTRKDGVPKDRGGKDVD 498
Cdd:PRK07764 745 PDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVA 805
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
301-549 |
3.36e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 45.16 E-value: 3.36e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 301 SRAQEPAQPPALTPESAPGCTTEFAPGPAP-GTEPVPGLELGLELEPVPALgpvPGPSVTPGSLPAPWPVLGPVPAPGAQ 379
Cdd:PHA03307 75 PGTEAPANESRSTPTWSLSTLAPASPAREGsPTPPGPSSPDPPPPTPPPAS---PPPSPAPDLSEMLRPVGSPGPPPAAS 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 380 PPPLGDWPALPRRwplpqgwprvGSWPLWDLGVLRPTQPQPSRAP--PPATEFGSLWPRPLQPYQSRQGEALQLAAVqvk 457
Cdd:PHA03307 152 PPAAGASPAAVAS----------DAASSRQAALPLSSPEETARAPssPPAEPPPSTPPAAASPRPPRRSSPISASAS--- 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 458 geenDVPSLRGLRERARKDGAPKDRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDVDPKDRAHKDDVPKDRGGKDG 537
Cdd:PHA03307 219 ----SPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRER 294
|
250
....*....|..
gi 1908918739 538 DPKDRVGKDGAP 549
Cdd:PHA03307 295 SPSPSPSSPGSG 306
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
285-429 |
3.55e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 44.84 E-value: 3.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 285 VPELLPEGSSAQAVSLSRAQEPAQPP-------ALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPS 357
Cdd:PRK07003 376 VAGAVPAPGARAAAAVGASAVPAVTAvtgaagaALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANA 455
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1908918739 358 VTPGSLPAPWPVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGVLRPTQPQPSRAPPPATE 429
Cdd:PRK07003 456 RASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAP 527
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
321-474 |
4.33e-04 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 44.66 E-value: 4.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 321 TTEFAPGPAPGTE----PVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPV---LGPVPAPGAQPPPLGDWPALP-RR 392
Cdd:PHA03379 404 ALEKASEPTYGTPrppvEKPRPEVPQSLETATSHGSAQVPEPPPVHDLEPGPLhdqHSMAPCPVAQLPPGPLQDLEPgDQ 483
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 393 WPLPQGWPRVGSWPLWDLG--VLRPTQPQPSRAP--PPATEF-----GSLWPRPLQPYQSRQGEALQLAAVQVKGEENdv 463
Cdd:PHA03379 484 LPGVVQDGRPACAPVPAPAgpIVRPWEASLSQVPgvAFAPVMpqpmpVEPVPVPTVALERPVCPAPPLIAMQGPGETS-- 561
|
170
....*....|.
gi 1908918739 464 pSLRGLRERAR 474
Cdd:PHA03379 562 -GIVRVRERWR 571
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
274-428 |
5.39e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.37 E-value: 5.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 274 PQLLQTVWHYEVPELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPG-------LELGLELEP 346
Cdd:pfam03154 313 PSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPhlsgpspFQMNSNLPP 392
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 347 VPALGPVPGPSV--TPGSLPAP---WPVLGPVPAPGAQPPPLGDWPALP---RRWPLPQGWPRVGSWPLWDLGVLRPTQP 418
Cdd:pfam03154 393 PPALKPLSSLSThhPPSAHPPPlqlMPQSQQLPPPPAQPPVLTQSQSLPppaASHPPTSGLHQVPSQSPFPQHPFVPGGP 472
|
170
....*....|...
gi 1908918739 419 Q---PSRAPPPAT 428
Cdd:pfam03154 473 PpitPPSGPPTST 485
|
|
| RRM_RBM27 |
cd12517 |
RNA recognition motif (RRM) found in vertebrate RNA-binding protein 27 (RBM27); This subgroup ... |
248-284 |
5.76e-04 |
|
RNA recognition motif (RRM) found in vertebrate RNA-binding protein 27 (RBM27); This subgroup corresponds to the RRM of RBM27 which contains a single RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain). Although the specific function of the RRM in RBM27 remains unclear, it shows high sequence similarity with RRM1of RBM26, which functions as a cutaneous lymphoma (CL)-associated antigen.
Pssm-ID: 409939 [Multi-domain] Cd Length: 76 Bit Score: 39.65 E-value: 5.76e-04
10 20 30
....*....|....*....|....*....|....*..
gi 1908918739 248 PEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYE 284
Cdd:cd12517 40 PEAALIQYTTNEEARRAISSTEAVLNNRFIRVLWHRE 76
|
|
| Pro-rich |
pfam15240 |
Proline-rich protein; This family includes several eukaryotic proline-rich proteins. |
293-439 |
7.20e-04 |
|
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
Pssm-ID: 464580 [Multi-domain] Cd Length: 167 Bit Score: 41.56 E-value: 7.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 293 SSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLP-APWPVLG 371
Cdd:pfam15240 14 SSAQSSSEDVSQEDSPSLISEEEGQSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPqGPPPQGG 93
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1908918739 372 PVPAPGAQ---PPPLGDWPALPRRWPLPQGWPRVGSWPLWDLG-VLRPTQPQPSR--APPPATEFGSLWPRPLQ 439
Cdd:pfam15240 94 PRPPPGKPqgpPPQGGNQQQGPPPPGKPQGPPPQGGGPPPQGGnQQGPPPPPPGNpqGPPQRPPQPGNPQGPPQ 167
|
|
| RRM1_RBM26 |
cd12516 |
RNA recognition motif 1 (RRM1) found in vertebrate RNA-binding protein 26 (RBM26); This ... |
248-284 |
9.91e-04 |
|
RNA recognition motif 1 (RRM1) found in vertebrate RNA-binding protein 26 (RBM26); This subgroup corresponds to the RRM1 of RBM26, also known as cutaneous T-cell lymphoma (CTCL) tumor antigen se70-2, which represents a cutaneous lymphoma (CL)-associated antigen. It contains two RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The RRMs may play some functional roles in RNA-binding or protein-protein interactions.
Pssm-ID: 409938 [Multi-domain] Cd Length: 76 Bit Score: 38.84 E-value: 9.91e-04
10 20 30
....*....|....*....|....*....|....*..
gi 1908918739 248 PEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYE 284
Cdd:cd12516 40 PEGALIQFATHEEAKRAISSTEAVLNNRFIKVYWHRE 76
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
309-440 |
1.52e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 42.78 E-value: 1.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 309 PPALTPESAPGCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVtPGSLPAPWPVLGPVPAPGAQPPPLGDWPA 388
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAAPAAAP--------VAQAAAAPAPAAAP-AAAASAPAAPPAAAPPAPVAAPAAAAPAA 436
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1908918739 389 LPRRWPLPQGWPRVGSWPL--WDLGVLRPTQPQPSRAPPPATEFGSLWPRPLQP 440
Cdd:PRK14951 437 APAAAPAAVALAPAPPAQAapETVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
272-423 |
1.62e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 42.46 E-value: 1.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 272 QNPQLLQTVWHYEVPELLPEGSSAqavslSRAQEPAQP--PALT-PESAPGCTTEFAPGPAPGTEPVPglelglelEPVP 348
Cdd:PRK14971 346 KNKRLLVELTLIQLAQLTQKGDDA-----SGGRGPKQHikPVFTqPAAAPQPSAAAAASPSPSQSSAA--------AQPS 412
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1908918739 349 ALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRVGswplwdLGVLRPTQPQPSRA 423
Cdd:PRK14971 413 APQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLG------PSTLRPIQEKAEQA 481
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
346-682 |
2.98e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 2.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 346 PVPALGP-VPGPSVTPGSLPAPWPvlGPVPAPGAQPP------PLGDwPALPRRWPLPQGWPRVGSWPLWDlgvlrPTQP 418
Cdd:PHA03247 2483 PAEARFPfAAGAAPDPGGGGPPDP--DAPPAPSRLAPailpdePVGE-PVHPRMLTWIRGLEELASDDAGD-----PPPP 2554
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 419 QPSRAPPPATEFG----SLWPRPLQP----YQSRQGEALQLAAVQVKGEENDVPslrglRERARKDGAPKDRTRKDGVPK 490
Cdd:PHA03247 2555 LPPAAPPAAPDRSvpppRPAPRPSEPavtsRARRPDAPPQSARPRAPVDDRGDP-----RGPAPPSPLPPDTHAPDPPPP 2629
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 491 DRGGKDVDPKDRAHKDDVPKDRGGKDVDP-KDRAHKDDVPKDRGGKDGDPKDRVGKDGAPKE-----------AQPKAPQ 558
Cdd:PHA03247 2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPgRVSRPRRARRLGRAAQASSPPQRPRRRAARPTvgsltsladppPPPPTPE 2709
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 559 SALHRLKTTAAIAAAAAAAYAAATSSAAQAAKVAAKFVKDAPATKMAAIATDTAAAGPLGVFADVLGAGPSRGATESQIL 638
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVA 2789
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1908918739 639 GddseiyeiLSPSYSAASIGPDPALSQAMVATKQAMSPEDKKRA 682
Cdd:PHA03247 2790 S--------LSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
|
|
| penta_MxKDx |
TIGR02953 |
pentapeptide MXKDX repeat protein; Members of this protein family are small bacterial proteins, ... |
501-552 |
3.14e-03 |
|
pentapeptide MXKDX repeat protein; Members of this protein family are small bacterial proteins, each with an N-terminal signal sequence followed by up to 11 imperfect repeats of a pentapeptide. The pentapeptide repeat usually follows the form Met-Xaa-Lys-Asp-Xaa.
Pssm-ID: 131998 [Multi-domain] Cd Length: 75 Bit Score: 37.52 E-value: 3.14e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 1908918739 501 DRAHKDDVPKDRGGKDVDPKDRAHKDDVPKDRGGKDGDPKDRVGKDGAPKEA 552
Cdd:TIGR02953 23 DAMKKDTMKKDAMGKDAMAKDAMSKDAMKKDAMKKDAMKKDGMKKDAMKKDA 74
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
245-562 |
3.20e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 41.59 E-value: 3.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 245 EQLPEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYEVPELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGctTEF 324
Cdd:COG5180 212 EEPPDLTGGADHPRPEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAG--SEP 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 325 APGPAPGTEPVPGLELGLELEPvPALGPV-PGPSVTPGSLPAPwpvLGPVPAPGAQPPPLGDWPALPRRWPLPQGwprvg 403
Cdd:COG5180 290 QSDAPEAETARPIDVKGVASAP-PATRPVrPPGGARDPGTPRP---GQPTERPAGVPEAASDAGQPPSAYPPAEE----- 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 404 swplwdlgvLRPTQPQPSRAPPPATEFGSLWP--RPLQPYQSRQGEALQLAAVQVKGEENDVPSLRGLRERARKDGA--- 478
Cdd:COG5180 361 ---------AVPGKPLEQGAPRPGSSGGDGAPfqPPNGAPQPGLGRRGAPGPPMGAGDLVQAALDGGGRETASLGGAagg 431
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 479 ----PKDRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDVD--PKDRAHKDDVPKDR-GGKDGDPKDRVGKDGAPKE 551
Cdd:COG5180 432 agqgPKADFVPGDAESVSGPAGLADQAGAAASTAMADFVAPVTDatPVDVADVLGVRPDAiLGGNVAPASGLDAETRIIE 511
|
330
....*....|.
gi 1908918739 552 AQPKAPQSALH 562
Cdd:COG5180 512 AEGAPATEDFV 522
|
|
| PRK05641 |
PRK05641 |
putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated |
332-385 |
3.88e-03 |
|
putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated
Pssm-ID: 235540 [Multi-domain] Cd Length: 153 Bit Score: 39.08 E-value: 3.88e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1908918739 332 TEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGaqPPPLGD 385
Cdd:PRK05641 33 TYEVEAKGLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPA--PASAGE 84
|
|
| FAP |
pfam07174 |
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ... |
326-435 |
4.79e-03 |
|
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
Pssm-ID: 429334 Cd Length: 301 Bit Score: 40.29 E-value: 4.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 326 PGPAPgtePVPglelglelePVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPPplgdwPALPRRWPLPQGWPRVGSW 405
Cdd:pfam07174 41 PEPAP---PPP---------STATAPPAPPPPPPAPAAPAPPPPPAAPNAPNAPPP-----PADPNAPPPPPADPNAPPP 103
|
90 100 110
....*....|....*....|....*....|
gi 1908918739 406 PLWDlgvlrPTQPQPSRAPPPATEFGSLWP 435
Cdd:pfam07174 104 PAVD-----PNAPEPGRIDNAVGGFSYVVP 128
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
735-887 |
4.88e-03 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 41.04 E-value: 4.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 735 QKKIGSLQKSRLKEEELERIWGNQIEMMKDRYITLDKAVENLQIRMDEFKTLQAQIKRLEMN-------------KVNK- 800
Cdd:PRK01156 196 NLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEiktaesdlsmeleKNNYy 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 801 STMEEELREKADRSALAGKASRVDLETVA---LELNEMIQGILFKVTIHEDSWKKAmEELSKDvntklvHSDLDPLKKEM 877
Cdd:PRK01156 276 KELEERHMKIINDPVYKNRNYINDYFKYKndiENKKQILSNIDAEINKYHAIIKKL-SVLQKD------YNDYIKKKSRY 348
|
170
....*....|
gi 1908918739 878 EEVWKIVRKL 887
Cdd:PRK01156 349 DDLNNQILEL 358
|
|
| penta_MxKDx |
TIGR02953 |
pentapeptide MXKDX repeat protein; Members of this protein family are small bacterial proteins, ... |
481-531 |
5.55e-03 |
|
pentapeptide MXKDX repeat protein; Members of this protein family are small bacterial proteins, each with an N-terminal signal sequence followed by up to 11 imperfect repeats of a pentapeptide. The pentapeptide repeat usually follows the form Met-Xaa-Lys-Asp-Xaa.
Pssm-ID: 131998 [Multi-domain] Cd Length: 75 Bit Score: 36.75 E-value: 5.55e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 1908918739 481 DRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDVDPKDRAHKDDVPKD 531
Cdd:TIGR02953 23 DAMKKDTMKKDAMGKDAMAKDAMSKDAMKKDAMKKDAMKKDGMKKDAMKKD 73
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
274-435 |
5.56e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.08 E-value: 5.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 274 PQLLQTVWHYEVPELLPE---GSSAQAVSLSRAQEPAQP---PALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPV 347
Cdd:PHA03247 2889 PAVSRSTESFALPPDQPErppQPQAPPPPQPQPQPPPPPqpqPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALV 2968
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 348 PALGPVPGPSVTPgslPAPwpvlgPVPAPGAQPPPLGDWPAlprrwplpqgwPRVGSWPLwDLGVLRPTqpqpsrAPPPA 427
Cdd:PHA03247 2969 PGRVAVPRFRVPQ---PAP-----SREAPASSTPPLTGHSL-----------SRVSSWAS-SLALHEET------DPPPV 3022
|
....*...
gi 1908918739 428 TEFGSLWP 435
Cdd:PHA03247 3023 SLKQTLWP 3030
|
|
| PRK11633 |
PRK11633 |
cell division protein DedD; Provisional |
286-379 |
5.90e-03 |
|
cell division protein DedD; Provisional
Pssm-ID: 236940 [Multi-domain] Cd Length: 226 Bit Score: 39.60 E-value: 5.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 286 PELLPEGSSAQAVSLSRAQEPAQPPALTPESAPgCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVTPGSLPA 365
Cdd:PRK11633 64 PTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTP-VEPEPAPVEPPKPKPVE--------KPKPKPKPQQKVEAPPAPKPE 134
|
90
....*....|....
gi 1908918739 366 PWPVLGPVPAPGAQ 379
Cdd:PRK11633 135 PKPVVEEKAAPTGK 148
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
272-440 |
6.24e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.91 E-value: 6.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 272 QNPQLLQTVWHYEVPELLPEGSSAQAVSlsrAQEPAQPPALTPESAPGcTTEFAPGPAPGTEPVPGLELGLELEP----- 346
Cdd:pfam03154 169 TQPPVLQAQSGAASPPSPPPPGTTQAAT---AGPTPSAPSVPPQGSPA-TSQPPNQTQSTAAPHTLIQQTPTLHPqrlps 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 347 -----VPALGPVPGPSVTPGSLPAPW--PVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGVLRPTQpQ 419
Cdd:pfam03154 245 phpplQPMTQPPPPSQVSPQPLPQPSlhGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQ-Q 323
|
170 180
....*....|....*....|.
gi 1908918739 420 PSRAPPPATEFGSLWPRPLQP 440
Cdd:pfam03154 324 RIHTPPSQSQLQSQQPPREQP 344
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
325-509 |
7.50e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 40.35 E-value: 7.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 325 APGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPPPlgdwpALPRRWPLPQGWPRVGS 404
Cdd:PRK07764 594 AAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPE-----HHPKHVAVPDASDGGDG 668
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 405 WPLWdlgvlrPTQPQPSRAPPPATEFGSLWPRPLQPYQSRQGEALQLAAVQVKGEENDVPSLRglRERARKDGAPKDRTR 484
Cdd:PRK07764 669 WPAK------AGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAA--QGASAPSPAADDPVP 740
|
170 180
....*....|....*....|....*
gi 1908918739 485 KDGVPKDRGGKDVDPKDRAHKDDVP 509
Cdd:PRK07764 741 LPPEPDDPPDPAGAPAQPPPPPAPA 765
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
286-484 |
8.46e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 40.44 E-value: 8.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 286 PELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPG-PAPGTEPVPGLELG---LELEPVP---ALGPVPG--P 356
Cdd:PHA03378 576 PLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPEtSAPRQWPMPLRPIPmrpLRMQPITfnvLVFPTPHqpP 655
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 357 SVTPGSLPAPWPVLGPVPApgaQPPPLGDWPALPRRWPLpqgwprvgswplwdlGVLRPTQPQPSRAPPPATEFGSLWPR 436
Cdd:PHA03378 656 QVEITPYKPTWTQIGHIPY---QPSPTGANTMLPIQWAP---------------GTMQPPPRAPTPMRPPAAPPGRAQRP 717
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 1908918739 437 PLQPYQSRQGEALQLAAVQVKGEENDVPSLRGLRERARKDGAPKDRTR 484
Cdd:PHA03378 718 AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 765
|
|
| Drf_FH1 |
pfam06346 |
Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs) ... |
286-431 |
8.61e-03 |
|
Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs). It consists of low complexity repeats of around 12 residues.
Pssm-ID: 461881 [Multi-domain] Cd Length: 157 Bit Score: 38.31 E-value: 8.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 286 PELLPEGSSAQAVSLSRAQEPAQPPALtpesaPGCTTEFAPGPAPGTEPVPglelglELEPVPALGPVPGPSVTPGS--L 363
Cdd:pfam06346 25 PPLPGGGGPPPPPPLPGSAAIPPPPPL-----PGGTSIPPPPPLPGAASIP------PPPPLPGSTGIPPPPPLPGGagI 93
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918739 364 PAPWPVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPrvgswplwdlgvLRPTQPQPSRAPPPATEFG 431
Cdd:pfam06346 94 PPPPPPLPGGAGVPPPPPPLPGGPGIPPPPPFPGGPG------------IPPPPPGMGMPPPPPFGFG 149
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
305-560 |
9.24e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 40.15 E-value: 9.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 305 EPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPvpalGPVPGPSVTPGS----LPAPWPVLGPVPAPGAQP 380
Cdd:PHA03307 118 PPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASP----AAVASDAASSRQaalpLSSPEETARAPSSPPAEP 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 381 PPLGDWPALPRRWPLPQGWPRVGSwplwdlGVLRPTQPQPSRAPPPATEFGSLWPRPLQPYQSRQGEALQLAAVQVkgee 460
Cdd:PHA03307 194 PPSTPPAAASPRPPRRSSPISASA------SSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPI---- 263
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918739 461 nDVPSLRGLRERARKDGAPKDRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDVDPKD-----RAHKDDVPKDRGGK 535
Cdd:PHA03307 264 -TLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSREsssssTSSSSESSRGAAVS 342
|
250 260
....*....|....*....|....*
gi 1908918739 536 DGDPKDRVGKDGAPKEAQPKAPQSA 560
Cdd:PHA03307 343 PGPSPSRSPSPSRPPPPADPSSPRK 367
|
|
|