|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
88-161 |
6.51e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. :
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.51e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586 88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
228-289 |
9.12e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain. :
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.12e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586 228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
762-1080 |
1.64e-06 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 1.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 762 PKPSTTPTSPRPQAQPSPsmVG--------HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 833
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 834 KVPNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 912
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 913 APPAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAA 974
Cdd:PHA03247 2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQP 2905
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 975 LHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfp 1054
Cdd:PHA03247 2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA-- 2983
|
330 340
....*....|....*....|....*.
gi 1720412586 1055 aaqqtvftihPSHVQPAYTTPPHMAH 1080
Cdd:PHA03247 2984 ----------PSREAPASSTPPLTGH 2999
|
|
| PRK12323 super family |
cl46901 |
DNA polymerase III subunit gamma/tau; |
399-606 |
9.20e-05 |
|
DNA polymerase III subunit gamma/tau; The actual alignment was detected with superfamily member PRK12323:
Pssm-ID: 481241 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 9.20e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
743-758 |
4.85e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains. :
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.67 E-value: 4.85e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
88-161 |
6.51e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.51e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586 88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
228-289 |
9.12e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.12e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586 228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
762-1080 |
1.64e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 1.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 762 PKPSTTPTSPRPQAQPSPsmVG--------HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 833
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 834 KVPNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 912
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 913 APPAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAA 974
Cdd:PHA03247 2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQP 2905
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 975 LHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfp 1054
Cdd:PHA03247 2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA-- 2983
|
330 340
....*....|....*....|....*.
gi 1720412586 1055 aaqqtvftihPSHVQPAYTTPPHMAH 1080
Cdd:PHA03247 2984 ----------PSREAPASSTPPLTGH 2999
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
734-901 |
2.83e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 48.26 E-value: 2.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 814 LypiPMTPMPVNQaktyrAGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHY 893
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQM 504
|
170
....*....|....
gi 1720412586 894 QSQ------HPHVY 901
Cdd:TIGR01628 505 QKQvlgerlFPLVE 518
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
843-1107 |
6.03e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 47.34 E-value: 6.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 843 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPAH 917
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 918 AQPGLVSSSAAQFGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNAALHPHTPHPQPSATPTG 990
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQ 247
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 991 QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQP 1070
Cdd:pfam09770 248 QQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQP 326
|
250 260 270
....*....|....*....|....*....|....*..
gi 1720412586 1071 AyttPPHMAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1107
Cdd:pfam09770 327 A---PAHQAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
399-606 |
9.20e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 9.20e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
468-617 |
3.44e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 41.28 E-value: 3.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412586 615 SPV 617
Cdd:pfam12004 354 SPV 356
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
92-159 |
4.17e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 4.17e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586 92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
743-758 |
4.85e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.67 E-value: 4.85e-03
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
693-823 |
5.45e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.22 E-value: 5.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263 741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412586 772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
88-161 |
6.51e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.51e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586 88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
228-289 |
9.12e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.12e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586 228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
762-1080 |
1.64e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 1.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 762 PKPSTTPTSPRPQAQPSPsmVG--------HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 833
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 834 KVPNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 912
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 913 APPAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAA 974
Cdd:PHA03247 2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQP 2905
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 975 LHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfp 1054
Cdd:PHA03247 2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA-- 2983
|
330 340
....*....|....*....|....*.
gi 1720412586 1055 aaqqtvftihPSHVQPAYTTPPHMAH 1080
Cdd:PHA03247 2984 ----------PSREAPASSTPPLTGH 2999
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
763-1153 |
7.36e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 7.36e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 763 KPSTTPTSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQr 842
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 843 qdqhhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQP-- 920
Cdd:PHA03247 2667 -----ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPap 2741
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 921 -----GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSfyFAISTGSLAQQYAHPNAALHPHTPHPQPSATPtGQQQSQ 995
Cdd:PHA03247 2742 pavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVL-APAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 996 HGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQSSFPAAQQtvftihP 1065
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLARPAV------S 2892
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 1066 SHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALA---QSALQPIPVSTTAHFPYMTHPSGEA 1142
Cdd:PHA03247 2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApttDPAGAGEPSGAVPQPWLGALVPGRV 2972
|
410
....*....|.
gi 1720412586 1143 CVCRGRRGTPS 1153
Cdd:PHA03247 2973 AVPRFRVPQPA 2983
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
756-1125 |
1.89e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.17 E-value: 1.89e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 756 PRSFSQPKPSTTPTSPRPQAQPSPSmvghqQPAPVYTQPVCFAPNmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 835
Cdd:PHA03247 2593 PQSARPRAPVDDRGDPRGPAPPSPL-----PPDTHAPDPPPPSPS---PAANEPDPHPPPTVPPPERPRDDPAPGRVSRP 2664
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 836 PNMPQQRQDQHHQSTMMHPASAAGPPIVATppaystqyVAYSPQQFPNQPLVQHVPHYQSqhPHVYSPVIQGNARMMAPP 915
Cdd:PHA03247 2665 RRARRLGRAAQASSPPQRPRRRAARPTVGS--------LTSLADPPPPPPTPEPAPHALV--SATPLPPGPAAARQASPA 2734
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 916 AHAQP-------GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSFyfAISTGSLAQQYAHPNAALHPHTPHPQPSATP 988
Cdd:PHA03247 2735 LPAAPappavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPADPPAAVL 2812
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 989 tGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQSSF--PAA 1056
Cdd:PHA03247 2813 -APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLarPAV 2891
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720412586 1057 QQTV--FTIHPSHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIP 1125
Cdd:PHA03247 2892 SRSTesFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
734-901 |
2.83e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 48.26 E-value: 2.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 814 LypiPMTPMPVNQaktyrAGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHY 893
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQM 504
|
170
....*....|....
gi 1720412586 894 QSQ------HPHVY 901
Cdd:TIGR01628 505 QKQvlgerlFPLVE 518
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
843-1107 |
6.03e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 47.34 E-value: 6.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 843 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPAH 917
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 918 AQPGLVSSSAAQFGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNAALHPHTPHPQPSATPTG 990
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQ 247
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 991 QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQP 1070
Cdd:pfam09770 248 QQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQP 326
|
250 260 270
....*....|....*....|....*....|....*..
gi 1720412586 1071 AyttPPHMAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1107
Cdd:pfam09770 327 A---PAHQAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
704-1133 |
8.44e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.07 E-value: 8.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 704 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 783
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 784 HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYP-------IPMTPMPvnqaktyragkVPNMPQQRQDQHHQSTMMHPAS 856
Cdd:pfam03154 207 PPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPqrlpsphPPLQPMT-----------QPPPPSQVSPQPLPQPSLHGQM 275
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 857 AAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAppahAQPGLVSSSAAQFGAHEQT 936
Cdd:pfam03154 276 PPMPHSLQTGP-------SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRI----HTPPSQSQLQSQQPPREQP 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 937 HAMYACPkLPYNKetsPSFYFAISTGSLAQQYAHPnaalhPHTPHPQPSATPTgqqqsqhggSHPAPSPVQHHQHQAAQA 1016
Cdd:pfam03154 345 LPPAPLS-MPHIK---PPPTTPIPQLPNPQSHKHP-----PHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHH 406
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 1017 LHLASPQQQSAIYHAGLAPTPPSMTPASnTQSPQSSFPAAQQ-TVFTIHPSHVQPAYTTPPHMAHVPqahvQSGMVPSHP 1095
Cdd:pfam03154 407 PPSAHPPPLQLMPQSQQLPPPPAQPPVL-TQSQSLPPPAASHpPTSGLHQVPSQSPFPQHPFVPGGP----PPITPPSGP 481
|
410 420 430
....*....|....*....|....*....|....*...
gi 1720412586 1096 TAHAPMMLMTTQPPggpqAALAQSALQPIPVSTTAHFP 1133
Cdd:pfam03154 482 PTSTSSAMPGIQPP----SSASVSSSGPVPAAVSCPLP 515
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
399-606 |
9.20e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 9.20e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
745-1100 |
1.33e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 46.21 E-value: 1.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 745 STLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPVytQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPV 824
Cdd:PHA03378 583 SQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPL--RPIPMRPLRMQPITFNVLVFPTPHQPPQVEIT 660
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 825 NQAKTYraGKVPNMPQQRQDQHHqSTMMHPASAagPPIVATPPAYSTQYvaySPQQFPNQPlvqhvphyqSQHPHvyspv 904
Cdd:PHA03378 661 PYKPTW--TQIGHIPYQPSPTGA-NTMLPIQWA--PGTMQPPPRAPTPM---RPPAAPPGR---------AQRPA----- 718
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 905 iqgNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYACPKLPynketspsfyfAISTGSLAQQYAHPNAAlhphTPHPQP 984
Cdd:PHA03378 719 ---AATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPP-----------AAAPGRARPPAAAPGAP----TPQPPP 780
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 985 SATPTGQQQSQHGgshPAPSPVQHHQHQAAQALHLASPQQQSAIYH----------------------------AGLAPT 1036
Cdd:PHA03378 781 QAPPAPQQRPRGA---PTPQPPPQAGPTSMQLMPRAAPGQQGPTKQilrqlltggvkrgrpslkkpaalerqaaAGPTPS 857
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720412586 1037 PPSMTPASNTQSPQSSFPAAQqtvftihPSHV--QPAYTTPPHMAHVPQAHVQ-----SGMVPSHPTAHAP 1100
Cdd:PHA03378 858 PGSGTSDKIVQAPVFYPPVLQ-------PIQVmrQLGSVRAAAASTVTQAPTEytgerRGVGPMHPTDIPP 921
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
760-1050 |
3.01e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.46 E-value: 3.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 760 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLypipmtPMPVNQAKTYRAGKVPNM 838
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPL------QQPVQPQQPYYAPAAEQP 415
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 839 PQQRQDQHHQSTmmhPASAAGPPIVATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgnarmmaPPAHA 918
Cdd:PRK10263 416 AQQPYYAPAPEQ---PAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-------EPLYQ 479
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 919 QPGLVsssaaqfgahEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQYAHPNAALhpHTPHPQPSATPTGQQQSQHGG 998
Cdd:PRK10263 480 QPQPV----------EQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAW--YQPIPEPVKEPEPIKSSLKAP 547
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1720412586 999 SHPAPSPVQHHQHQAAQALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1050
Cdd:PRK10263 548 SVAAVPPVEAAAAVSPLASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
398-1006 |
3.95e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.93 E-value: 3.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 398 YQSGPNSLPPRAATPtrpPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSpkaqrhprnhrvsagrgSMSSG 477
Cdd:PHA03247 2480 YRRPAEARFPFAAGA---APDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML-----------------TWIRG 2539
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 478 LEFVSHN------PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP---SGPVLASPQAGIIPA 548
Cdd:PHA03247 2540 LEELASDdagdppPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPvddRGDPRGPAPPSPLPP 2619
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 549 EAVSMPVPAASPTPAS---PASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGMSPVVSEHRKQI 625
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAAnepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLA 2699
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 626 DDLKKFKNDFRLQPSSTSESMDQLLSKNREGEKSRDLIKDKTEASAKDSFIDSSSSSSNCTSGSSKTNSPSiSPSMLSNA 705
Cdd:PHA03247 2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA-PPAAPAAG 2778
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 706 EHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEqvRKSTLNPNAKefnPRSFSQPKPSTTPTSPRPQAQPSPSMVG-- 783
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA--PAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPlg 2853
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 784 --------------HQQPAPVYTQPVCFAPNMMY--PVPVSPGVQPLYPIPMTPMPVNQAKTyRAGKVPNMPQQRQDQHH 847
Cdd:PHA03247 2854 gsvapggdvrrrppSRSPAAKPAAPARPPVRRLArpAVSRSTESFALPPDQPERPPQPQAPP-PPQPQPQPPPPPQPQPP 2932
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 848 QSTMMHPASAAgPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPAHAQPGLVSSSA 927
Cdd:PHA03247 2933 PPPPPRPQPPL-APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWA 3008
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 928 AQFGAHEQTHAMYACPK----LPYNKETSPSFYFAISTGSLAQQYA---HPNAALHPHTPHPQPSATPTGQQQSQHggSH 1000
Cdd:PHA03247 3009 SSLALHEETDPPPVSLKqtlwPPDDTEDSDADSLFDSDSERSDLEAldpLPPEPHDPFAHEPDPATPEAGARESPS--SQ 3086
|
....*.
gi 1720412586 1001 PAPSPV 1006
Cdd:PHA03247 3087 FGPPPL 3092
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
740-1006 |
5.25e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.26 E-value: 5.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 740 EQVRKSTLNPNAKEFNP--RSFSQPKPSTTPTSPRPQAQPSPSMVG---HQQPAPVytqpvcfaPNM-----MYPVPVSP 809
Cdd:pfam09770 98 EQVRFNRQQPAARAAQSsaQPPASSLPQYQYASQQSQQPSKPVRTGyekYKEPEPI--------PDLqvdasLWGVAPKK 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 810 GVQPLYPIPMTPMPVNQAKTYRagKV--------------PNMPQQRQDQHHQstmmhpasaagpPIVATPPAYSTQYVA 875
Cdd:pfam09770 170 AAAPAPAPQPAAQPASLPAPSR--KMmsleeveaamraqaKKPAQQPAPAPAQ------------PPAAPPAQQAQQQQQ 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 876 YSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQgnARMMAPPahAQPGLVSSSAAQFGAHEQTHAMYACPklpynketspsf 955
Cdd:pfam09770 236 FPPQIQQQQQPQQQPQQPQQHPGQGHPVTIL--QRPQSPQ--PDPAQPSIQPQAQQFHQQPPPVPVQP------------ 299
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1720412586 956 yfaistgslAQQYAHPN---AALHPHTPHPQPSATPT-GQQQSQHGGSHPAPSPV 1006
Cdd:pfam09770 300 ---------TQILQNPNrlsAARVGYPQNPQPGVQPApAHQAHRQQGSFGRQAPI 345
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
756-1130 |
8.68e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 43.44 E-value: 8.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 756 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 835
Cdd:PRK07764 400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 836 PNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAySTQYVAYSPQQFPNqpLVQHVPHYQ-------SQHPHVYSpvIQGN 908
Cdd:PRK07764 472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA-GADDAATLRERWPE--ILAAVPKRSrktwailLPEATVLG--VRGD 546
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 909 -----------ARMMAPPAHAQpGLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPsfyfaistgslaqqyAHPNAALHP 977
Cdd:PRK07764 547 tlvlgfstgglARRFASPGNAE-VLVTALAEELGGDWQVEAVVGPAPGAAGGEGPP---------------APASSGPPE 610
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 978 HTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNTQ 1047
Cdd:PRK07764 611 EAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAAP 690
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 1048 SPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQSA 1120
Cdd:PRK07764 691 AAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAAP 770
|
410
....*....|
gi 1720412586 1121 LQPIPVSTTA 1130
Cdd:PRK07764 771 AAAPPPSPPS 780
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
757-1133 |
2.06e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 2.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 757 RSFSQPKPSTTPTS---PRPQAQPSPsmvGHQQPAPVYTQPVCFA--PNMMY---------------PVPVSP------- 809
Cdd:PHA03247 2487 RFPFAAGAAPDPGGggpPDPDAPPAP---SRLAPAILPDEPVGEPvhPRMLTwirgleelasddagdPPPPLPpaappaa 2563
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 810 ---GVQPLYPIPMTPMPVNQAKTYR--------AGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSP 878
Cdd:PHA03247 2564 pdrSVPPPRPAPRPSEPAVTSRARRpdappqsaRPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPP 2643
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 879 QQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMyacPKLPYNKETSPSFYFA 958
Cdd:PHA03247 2644 PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP---PPTPEPAPHALVSATP 2720
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 959 ISTGSLAQQYAHPNAALHPHTP----HPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLA 1034
Cdd:PHA03247 2721 LPPGPAAARQASPALPAAPAPPavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 1035 PTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQA 1114
Cdd:PHA03247 2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPAR 2880
|
410
....*....|....*....
gi 1720412586 1115 ALAQSALQPIPVSTTAHFP 1133
Cdd:PHA03247 2881 PPVRRLARPAVSRSTESFA 2899
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
468-617 |
3.44e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 41.28 E-value: 3.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412586 615 SPV 617
Cdd:pfam12004 354 SPV 356
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
92-159 |
4.17e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 4.17e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586 92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
743-758 |
4.85e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.67 E-value: 4.85e-03
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
693-823 |
5.45e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.22 E-value: 5.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263 741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412586 772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
|