Conserved domains on  [gi|1720412586|ref|XP_030110094|]

ataxin-2 isoform X1 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
88-161 6.51e-21

Pssm-ID: 464173  Cd Length: 78  Bit Score: 87.99  E-value: 6.51e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586   88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
228-289 9.12e-16

Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 9.12e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586  228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PHA03247 super family cl33720
large tegument protein UL36; Provisional
762-1080 1.64e-06

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 1.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  762 PKPSTTPTSPRPQAQPSPsmVG--------HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 833
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  834 KVPNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 912
Cdd:PHA03247  2752 GGPARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  913 APPAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAA 974
Cdd:PHA03247  2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQP 2905
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  975 LHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfp 1054
Cdd:PHA03247  2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA-- 2983
                          330       340
                   ....*....|....*....|....*.
gi 1720412586 1055 aaqqtvftihPSHVQPAYTTPPHMAH 1080
Cdd:PHA03247  2984 ----------PSREAPASSTPPLTGH 2999
PRK12323 super family cl46901
DNA polymerase III subunit gamma/tau;
399-606 9.20e-05

The actual alignment was detected with superfamily member PRK12323:

Pssm-ID: 481241 [Multi-domain]  Cd Length: 700  Bit Score: 46.79  E-value: 9.20e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323   447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323   520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
743-758 4.85e-03

Pssm-ID: 429316  Cd Length: 17  Bit Score: 35.67  E-value: 4.85e-03
                           10
                   ....*....|....*.
gi 1720412586  743 RKSTLNPNAKEFNPRS 758
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
88-161 6.51e-21

Pssm-ID: 464173  Cd Length: 78  Bit Score: 87.99  E-value: 6.51e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586   88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
228-289 9.12e-16

Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 9.12e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586  228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PHA03247 PHA03247
large tegument protein UL36; Provisional
762-1080 1.64e-06

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 1.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  762 PKPSTTPTSPRPQAQPSPsmVG--------HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAG 833
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPT--VGsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  834 KVPNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMM 912
Cdd:PHA03247  2752 GGPARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPL 2828
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  913 APPAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAA 974
Cdd:PHA03247  2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQP 2905
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  975 LHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfp 1054
Cdd:PHA03247  2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA-- 2983
                          330       340
                   ....*....|....*....|....*.
gi 1720412586 1055 aaqqtvftihPSHVQPAYTTPPHMAH 1080
Cdd:PHA03247  2984 ----------PSREAPASSTPPLTGH 2999
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
734-901 2.83e-05

Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 48.26  E-value: 2.83e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628  362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  814 LypiPMTPMPVNQaktyrAGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHY 893
Cdd:TIGR01628  433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQM 504
                          170
                   ....*....|....
gi 1720412586  894 QSQ------HPHVY 901
Cdd:TIGR01628  505 QKQvlgerlFPLVE 518
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
843-1107 6.03e-05

Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.34  E-value: 6.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  843 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPAH 917
Cdd:pfam09770   96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPA 174
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  918 AQPGLVSSSAAQFGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNAALHPHTPHPQPSATPTG 990
Cdd:pfam09770  175 PAPQPAAQPASLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQ 247
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  991 QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQP 1070
Cdd:pfam09770  248 QQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQP 326
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 1720412586 1071 AyttPPHMAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1107
Cdd:pfam09770  327 A---PAHQAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
399-606 9.20e-05

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.79  E-value: 9.20e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323   447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323   520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
DUF3498 pfam12004
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 and 538 amino acids in length. This domain is found associated with pfam00616 and pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
468-617 3.44e-03

Pssm-ID: 463427 [Multi-domain]  Cd Length: 511  Bit Score: 41.28  E-value: 3.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004  196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004  274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353

                   ...
gi 1720412586  615 SPV 617
Cdd:pfam12004  354 SPV 356
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
92-159 4.17e-03

Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.84  E-value: 4.17e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586   92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
743-758 4.85e-03

Pssm-ID: 429316  Cd Length: 17  Bit Score: 35.67  E-value: 4.85e-03
                           10
                   ....*....|....*.
gi 1720412586  743 RKSTLNPNAKEFNPRS 758
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
PRK10263 PRK10263
DNA translocase FtsK; Provisional
693-823 5.45e-03

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 5.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263   741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412586  772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263   821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
763-1153 7.36e-06

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 7.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  763 KPSTTPTSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKVPNMPQQr 842
Cdd:PHA03247  2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  843 qdqhhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQP-- 920
Cdd:PHA03247  2667 -----ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPap 2741
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  921 -----GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSfyFAISTGSLAQQYAHPNAALHPHTPHPQPSATPtGQQQSQ 995
Cdd:PHA03247  2742 pavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVL-APAAAL 2818
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  996 HGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQSSFPAAQQtvftihP 1065
Cdd:PHA03247  2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLARPAV------S 2892
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 1066 SHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALA---QSALQPIPVSTTAHFPYMTHPSGEA 1142
Cdd:PHA03247  2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApttDPAGAGEPSGAVPQPWLGALVPGRV 2972
                          410
                   ....*....|.
gi 1720412586 1143 CVCRGRRGTPS 1153
Cdd:PHA03247  2973 AVPRFRVPQPA 2983
PHA03247 PHA03247
large tegument protein UL36; Provisional
756-1125 1.89e-05

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 1.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  756 PRSFSQPKPSTTPTSPRPQAQPSPSmvghqQPAPVYTQPVCFAPNmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 835
Cdd:PHA03247  2593 PQSARPRAPVDDRGDPRGPAPPSPL-----PPDTHAPDPPPPSPS---PAANEPDPHPPPTVPPPERPRDDPAPGRVSRP 2664
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  836 PNMPQQRQDQHHQSTMMHPASAAGPPIVATppaystqyVAYSPQQFPNQPLVQHVPHYQSqhPHVYSPVIQGNARMMAPP 915
Cdd:PHA03247  2665 RRARRLGRAAQASSPPQRPRRRAARPTVGS--------LTSLADPPPPPPTPEPAPHALV--SATPLPPGPAAARQASPA 2734
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  916 AHAQP-------GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSFyfAISTGSLAQQYAHPNAALHPHTPHPQPSATP 988
Cdd:PHA03247  2735 LPAAPappavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPADPPAAVL 2812
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  989 tGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQSSF--PAA 1056
Cdd:PHA03247  2813 -APAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLarPAV 2891
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720412586 1057 QQTV--FTIHPSHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIP 1125
Cdd:PHA03247  2892 SRSTesFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
704-1133 8.44e-05

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.07  E-value: 8.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  704 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 783
Cdd:pfam03154  127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  784 HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYP-------IPMTPMPvnqaktyragkVPNMPQQRQDQHHQSTMMHPAS 856
Cdd:pfam03154  207 PPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPqrlpsphPPLQPMT-----------QPPPPSQVSPQPLPQPSLHGQM 275
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  857 AAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAppahAQPGLVSSSAAQFGAHEQT 936
Cdd:pfam03154  276 PPMPHSLQTGP-------SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRI----HTPPSQSQLQSQQPPREQP 344
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  937 HAMYACPkLPYNKetsPSFYFAISTGSLAQQYAHPnaalhPHTPHPQPSATPTgqqqsqhggSHPAPSPVQHHQHQAAQA 1016
Cdd:pfam03154  345 LPPAPLS-MPHIK---PPPTTPIPQLPNPQSHKHP-----PHLSGPSPFQMNS---------NLPPPPALKPLSSLSTHH 406
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 1017 LHLASPQQQSAIYHAGLAPTPPSMTPASnTQSPQSSFPAAQQ-TVFTIHPSHVQPAYTTPPHMAHVPqahvQSGMVPSHP 1095
Cdd:pfam03154  407 PPSAHPPPLQLMPQSQQLPPPPAQPPVL-TQSQSLPPPAASHpPTSGLHQVPSQSPFPQHPFVPGGP----PPITPPSGP 481
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 1720412586 1096 TAHAPMMLMTTQPPggpqAALAQSALQPIPVSTTAHFP 1133
Cdd:pfam03154  482 PTSTSSAMPGIQPP----SSASVSSSGPVPAAVSCPLP 515
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
399-606 9.20e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.79  E-value: 9.20e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323   447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323   520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
PHA03378 PHA03378
EBNA-3B; Provisional
745-1100 1.33e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.21  E-value: 1.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  745 STLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPVytQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPV 824
Cdd:PHA03378   583 SQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPL--RPIPMRPLRMQPITFNVLVFPTPHQPPQVEIT 660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  825 NQAKTYraGKVPNMPQQRQDQHHqSTMMHPASAagPPIVATPPAYSTQYvaySPQQFPNQPlvqhvphyqSQHPHvyspv 904
Cdd:PHA03378   661 PYKPTW--TQIGHIPYQPSPTGA-NTMLPIQWA--PGTMQPPPRAPTPM---RPPAAPPGR---------AQRPA----- 718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  905 iqgNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYACPKLPynketspsfyfAISTGSLAQQYAHPNAAlhphTPHPQP 984
Cdd:PHA03378   719 ---AATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPP-----------AAAPGRARPPAAAPGAP----TPQPPP 780
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  985 SATPTGQQQSQHGgshPAPSPVQHHQHQAAQALHLASPQQQSAIYH----------------------------AGLAPT 1036
Cdd:PHA03378   781 QAPPAPQQRPRGA---PTPQPPPQAGPTSMQLMPRAAPGQQGPTKQilrqlltggvkrgrpslkkpaalerqaaAGPTPS 857
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720412586 1037 PPSMTPASNTQSPQSSFPAAQqtvftihPSHV--QPAYTTPPHMAHVPQAHVQ-----SGMVPSHPTAHAP 1100
Cdd:PHA03378   858 PGSGTSDKIVQAPVFYPPVLQ-------PIQVmrQLGSVRAAAASTVTQAPTEytgerRGVGPMHPTDIPP 921
PRK10263 PRK10263
DNA translocase FtsK; Provisional
760-1050 3.01e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.46  E-value: 3.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  760 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLypipmtPMPVNQAKTYRAGKVPNM 838
Cdd:PRK10263   345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPL------QQPVQPQQPYYAPAAEQP 415
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  839 PQQRQDQHHQSTmmhPASAAGPPIVATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgnarmmaPPAHA 918
Cdd:PRK10263   416 AQQPYYAPAPEQ---PAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-------EPLYQ 479
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  919 QPGLVsssaaqfgahEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQYAHPNAALhpHTPHPQPSATPTGQQQSQHGG 998
Cdd:PRK10263   480 QPQPV----------EQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAW--YQPIPEPVKEPEPIKSSLKAP 547
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720412586  999 SHPAPSPVQHHQHQAAQALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1050
Cdd:PRK10263   548 SVAAVPPVEAAAAVSPLASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
PHA03247 PHA03247
large tegument protein UL36; Provisional
398-1006 3.95e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 3.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  398 YQSGPNSLPPRAATPtrpPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSpkaqrhprnhrvsagrgSMSSG 477
Cdd:PHA03247  2480 YRRPAEARFPFAAGA---APDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML-----------------TWIRG 2539
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  478 LEFVSHN------PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP---SGPVLASPQAGIIPA 548
Cdd:PHA03247  2540 LEELASDdagdppPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPvddRGDPRGPAPPSPLPP 2619
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  549 EAVSMPVPAASPTPAS---PASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGMSPVVSEHRKQI 625
Cdd:PHA03247  2620 DTHAPDPPPPSPSPAAnepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLA 2699
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  626 DDLKKFKNDFRLQPSSTSESMDQLLSKNREGEKSRDLIKDKTEASAKDSFIDSSSSSSNCTSGSSKTNSPSiSPSMLSNA 705
Cdd:PHA03247  2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA-PPAAPAAG 2778
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  706 EHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEqvRKSTLNPNAKefnPRSFSQPKPSTTPTSPRPQAQPSPSMVG-- 783
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA--PAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPlg 2853
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  784 --------------HQQPAPVYTQPVCFAPNMMY--PVPVSPGVQPLYPIPMTPMPVNQAKTyRAGKVPNMPQQRQDQHH 847
Cdd:PHA03247  2854 gsvapggdvrrrppSRSPAAKPAAPARPPVRRLArpAVSRSTESFALPPDQPERPPQPQAPP-PPQPQPQPPPPPQPQPP 2932
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  848 QSTMMHPASAAgPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPAHAQPGLVSSSA 927
Cdd:PHA03247  2933 PPPPPRPQPPL-APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWA 3008
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  928 AQFGAHEQTHAMYACPK----LPYNKETSPSFYFAISTGSLAQQYA---HPNAALHPHTPHPQPSATPTGQQQSQHggSH 1000
Cdd:PHA03247  3009 SSLALHEETDPPPVSLKqtlwPPDDTEDSDADSLFDSDSERSDLEAldpLPPEPHDPFAHEPDPATPEAGARESPS--SQ 3086

                   ....*.
gi 1720412586 1001 PAPSPV 1006
Cdd:PHA03247  3087 FGPPPL 3092
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
740-1006 5.25e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 44.26  E-value: 5.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  740 EQVRKSTLNPNAKEFNP--RSFSQPKPSTTPTSPRPQAQPSPSMVG---HQQPAPVytqpvcfaPNM-----MYPVPVSP 809
Cdd:pfam09770   98 EQVRFNRQQPAARAAQSsaQPPASSLPQYQYASQQSQQPSKPVRTGyekYKEPEPI--------PDLqvdasLWGVAPKK 169
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  810 GVQPLYPIPMTPMPVNQAKTYRagKV--------------PNMPQQRQDQHHQstmmhpasaagpPIVATPPAYSTQYVA 875
Cdd:pfam09770  170 AAAPAPAPQPAAQPASLPAPSR--KMmsleeveaamraqaKKPAQQPAPAPAQ------------PPAAPPAQQAQQQQQ 235
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  876 YSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQgnARMMAPPahAQPGLVSSSAAQFGAHEQTHAMYACPklpynketspsf 955
Cdd:pfam09770  236 FPPQIQQQQQPQQQPQQPQQHPGQGHPVTIL--QRPQSPQ--PDPAQPSIQPQAQQFHQQPPPVPVQP------------ 299
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720412586  956 yfaistgslAQQYAHPN---AALHPHTPHPQPSATPT-GQQQSQHGGSHPAPSPV 1006
Cdd:pfam09770  300 ---------TQILQNPNrlsAARVGYPQNPQPGVQPApAHQAHRQQGSFGRQAPI 345
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
756-1130 8.68e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.44  E-value: 8.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  756 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 835
Cdd:PRK07764   400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  836 PNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAySTQYVAYSPQQFPNqpLVQHVPHYQ-------SQHPHVYSpvIQGN 908
Cdd:PRK07764   472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPA-GADDAATLRERWPE--ILAAVPKRSrktwailLPEATVLG--VRGD 546
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  909 -----------ARMMAPPAHAQpGLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPsfyfaistgslaqqyAHPNAALHP 977
Cdd:PRK07764   547 tlvlgfstgglARRFASPGNAE-VLVTALAEELGGDWQVEAVVGPAPGAAGGEGPP---------------APASSGPPE 610
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  978 HTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNTQ 1047
Cdd:PRK07764   611 EAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAAP 690
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 1048 SPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQSA 1120
Cdd:PRK07764   691 AAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAAP 770
                          410
                   ....*....|
gi 1720412586 1121 LQPIPVSTTA 1130
Cdd:PRK07764   771 AAAPPPSPPS 780
PHA03247 PHA03247
large tegument protein UL36; Provisional
757-1133 2.06e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 2.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  757 RSFSQPKPSTTPTS---PRPQAQPSPsmvGHQQPAPVYTQPVCFA--PNMMY---------------PVPVSP------- 809
Cdd:PHA03247  2487 RFPFAAGAAPDPGGggpPDPDAPPAP---SRLAPAILPDEPVGEPvhPRMLTwirgleelasddagdPPPPLPpaappaa 2563
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  810 ---GVQPLYPIPMTPMPVNQAKTYR--------AGKVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSP 878
Cdd:PHA03247  2564 pdrSVPPPRPAPRPSEPAVTSRARRpdappqsaRPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPP 2643
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  879 QQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMyacPKLPYNKETSPSFYFA 958
Cdd:PHA03247  2644 PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP---PPTPEPAPHALVSATP 2720
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  959 ISTGSLAQQYAHPNAALHPHTP----HPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLA 1034
Cdd:PHA03247  2721 LPPGPAAARQASPALPAAPAPPavpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586 1035 PTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQA 1114
Cdd:PHA03247  2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPAR 2880
                          410
                   ....*....|....*....
gi 1720412586 1115 ALAQSALQPIPVSTTAHFP 1133
Cdd:PHA03247  2881 PPVRRLARPAVSRSTESFA 2899
DUF3498 pfam12004
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ...
468-617 3.44e-03

Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.


Pssm-ID: 463427 [Multi-domain]  Cd Length: 511  Bit Score: 41.28  E-value: 3.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004  196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004  274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353

                   ...
gi 1720412586  615 SPV 617
Cdd:pfam12004  354 SPV 356
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ...
92-159 4.17e-03

Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.


Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.84  E-value: 4.17e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412586   92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ...
743-758 4.85e-03

Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.


Pssm-ID: 429316  Cd Length: 17  Bit Score: 35.67  E-value: 4.85e-03
                           10
                   ....*....|....*.
gi 1720412586  743 RKSTLNPNAKEFNPRS 758
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
PRK10263 PRK10263
DNA translocase FtsK; Provisional
693-823 5.45e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 5.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412586  693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263   741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412586  772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263   821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01
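
Each hit above carries a summary line (Pssm-ID, Cd Length, Bit Score, E-value) with a regular layout, so it can be checked against the E-value threshold listed in the preset options. The following is a minimal Python sketch, assuming the plain-text layout shown on this page. The names `parse_hit_summary` and `passes_threshold` and the regular expression are illustrative and not part of any NCBI tool, and treating the threshold as inclusive is an assumption.

```python
import re

# Matches summary lines such as:
#   "Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.07  E-value: 8.44e-05"
# The layout is inferred from this page's text, not from a documented format.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold (inclusive by assumption)."""
    return hit["e_value"] <= threshold
```

For example, passing the pfam03154 summary line from this page to `parse_hit_summary` yields a dict whose `e_value` is 8.44e-05, which `passes_threshold` accepts at the default threshold of 0.01.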

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.