NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|967490223|ref|XP_014977839|]
View 

adenomatous polyposis coli protein 2 isoform X2 [Macaca mulatta]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1810-2136 9.55e-93

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


:

Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 304.88  E-value: 9.55e-93
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  1810 AVLRGRTVIYVPSpAPRAQPKGTPGPRATPRKvappclaqSAAPAKVPSPGQQRSRSLHRPGKTSELGTLSQPPRSATPP 1889
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPK--------TDAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  1890 ARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGAS------PVPKTPARTLLAKQ--HKTQRSPVRIPFMQKPA 1961
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  1962 R-----RGPPPLARAVPEpgPRGRAGTEAGPGARGGRLGLVRVAsALSSGSESSDRSGFRRQLTFIKESPG-LRRRRSEL 2035
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPE--PRSESASKGLRSLPGKRLDLVRMS-SARSSGSESDRSGFLRQLTFIKESPSlLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  2036 SSAESAASAPQGTSPRRGRPALPAVFLCSSRCEELRAAPRQAPA--RQRPPAARPGPGERPARRTSSESPSRLPVRAPAA 2113
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNpnSRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|...
gi 967490223  2114 RPETVKRYASLPHISVARRPDGT 2136
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSS 331
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
381-455 1.35e-35

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


:

Pssm-ID: 465870  Cd Length: 74  Bit Score: 130.36  E-value: 1.35e-35
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 967490223   381 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDGAPEgGGASGVPVPIEPQICQATCAVMKLSFDEEYRRAM 455
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPE-GDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
30-81 1.09e-25

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


:

Pssm-ID: 435517  Cd Length: 52  Bit Score: 101.60  E-value: 1.09e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 967490223    30 APYEQLVRQVEALKAENSHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 81
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Arm_APC_u3 super family cl25003
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
721-968 2.57e-20

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


The actual alignment was detected with superfamily member pfam16629:

Pssm-ID: 435476  Cd Length: 293  Bit Score: 93.88  E-value: 2.57e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   721 HRPAKHQAAaTAVSPGSCVPSLYVRKQRALEAELDARHLAQALEHLEKHRPPAAEATskKPlpplRHLDGLAQDYASDSG 800
Cdd:pfam16629    1 NRPAKYKDA-NIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRN--KQ----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   801 CFDDDDAPSSlaaaaaTGEPASPAALSLFLGSPFLQGQAlARTPPTRRGGKEAEKDASGEAAVAAKAKA----------- 869
Cdd:pfam16629   74 RHDDSVCRSD------NFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDRSLDRERGAGLSNfhpatensgns 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   870 ------KLALAVARIDQLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGPEG-GRREAGSRAHPLLRL 939
Cdd:pfam16629  147 skrigmQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYM 226
                          250       260
                   ....*....|....*....|....*....
gi 967490223   940 KAAHASLSNDSLNSGSASDGYCPREHMLP 968
Cdd:pfam16629  227 KMEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
148-229 4.29e-20

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


:

Pssm-ID: 463275  Cd Length: 82  Bit Score: 86.54  E-value: 4.29e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   148 SRATIRLLEELDRERCLLLNEIEKEEKEKLWYYSQLQGLSKRLDELPHVETQFSMQMDLIRQQLEFEAQHIRSLMEERFG 227
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 967490223   228 TS 229
Cdd:pfam11414   81 LI 82
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1538-1992 1.47e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.27  E-value: 1.47e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1538 SDEEPPAAAPTPTHRRTSAIPRALTRERLQGR-------------KEAPAPSKAAPSAPPPTRAQPSLiadETPPCYSLS 1604
Cdd:PHA03247 2501 GGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRmltwirgleelasDDAGDPPPPLPPAAPPAAPDRSV---PPPRPAPRP 2577
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1605 SSASSLSEPEPPEPPSVSPRGQEPAVTKDPgPRGGRDSSPSPRAAeellqrcissalPRRRPPVSGlRRRKPRATRLDER 1684
Cdd:PHA03247 2578 SEPAVTSRARRPDAPPQSARPRAPVDDRGD-PRGPAPPSPLPPDT------------HAPDPPPPS-PSPAANEPDPHPP 2643
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1685 PAEGSREHGEEAAGSDRASDLDSVEWRAIQEGANSIVT-WLHQAAAATREassesdsilsfvsglSVGSTLQPPKHRKGR 1763
Cdd:PHA03247 2644 PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQrPRRRAARPTVG---------------SLTSLADPPPPPPTP 2708
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1764 QAGGEMGSARRPEKRGAASAKTS--GSPRSPAGPEKPRGTqkTTPGVPA-VLRGRTVIYVPSPAPRAQPKGTPGPRATPR 1840
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQAspALPAAPAPPAVPAGP--ATPGGPArPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1841 KVAPPCLAQSAAP-----AKVPSPGQQRSRSL---HRPGKTSELGTLSQPPRSATPPARLAKTPSS------------SS 1900
Cdd:PHA03247 2787 AVASLSESRESLPspwdpADPPAAVLAPAAALppaASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapggdvrrRP 2866
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1901 SQTSPASQPLPRKRPLVTQAAGPLPGPGASPVPKTPARTLLAKQHKTQRSPVRIPFMQKPARRGPPPLARAVPEPGPRGR 1980
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                         490
                  ....*....|..
gi 967490223 1981 AGTEAGPGARGG 1992
Cdd:PHA03247 2947 TDPAGAGEPSGA 2958
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
639-678 1.24e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


:

Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.06  E-value: 1.24e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 967490223   639 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 678
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PRK12323 super family cl46901
DNA polymerase III subunit gamma/tau;
1373-1599 4.86e-04

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK12323:

Pssm-ID: 481241 [Multi-domain]  Cd Length: 700  Bit Score: 45.25  E-value: 4.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1373 AAVPARLRKVASALVPGRRALPVPVYMLVPAPARAQEDDsctdsaegtpvnfSSAASLSDETLQGPPRDQPGGPEGRQRP 1452
Cdd:PRK12323  384 QPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAA-------------PARRSPAPEALAAARQASARGPGGAPAP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1453 TGRPTSArqavghrhKAGGAGRSVEQARGTGkNRAGLELPLGRPPSAPADKDDSKPGRTRGDGALQSLCLttpteeavyc 1532
Cdd:PRK12323  451 APAPAAA--------PAAAARPAAAGPRPVA-AAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAP---------- 511
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 967490223 1533 fygNDSDEEPPAAAPTPTHRRTSAIPRALTRERLQGRKEAPAPSKAAPSAPPPTRAQPSLIADETPP 1599
Cdd:PRK12323  512 ---AQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
PRK07764 super family cl35613
DNA polymerase III subunits gamma and tau; Validated
2072-2321 1.08e-03

DNA polymerase III subunits gamma and tau; Validated


The actual alignment was detected with superfamily member PRK07764:

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 1.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2072 AAPRQAPARQRPPAARPGPGERPARRTSSESPSRLPVRAPAARPETVKRYASLPHISVARRPDGTVPAAPAPADAARRSS 2151
Cdd:PRK07764  419 AAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA 498
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2152 DGEPRSLPR-VAAPGTTWRRIRdEDVPHILRSTLP-----ATALPLRGST------------------------------ 2195
Cdd:PRK07764  499 APAAPAGADdAATLRERWPEIL-AAVPKRSRKTWAillpeATVLGVRGDTlvlgfstgglarrfaspgnaevlvtalaee 577
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2196 ------------PEDAPAGPPPRKTSDAVVQTEEVAAPKTNSSTSPSLESREPPGAPASGqlsllgsdvdGPSLAKAPIS 2263
Cdd:PRK07764  578 lggdwqveavvgPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA----------EASAAPAPGV 647
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 967490223 2264 APFVHEGLGVAVGGFPASRHGSPSRSARVPPFNYVPSP---MVVAATTDSAAEKAPATSSA 2321
Cdd:PRK07764  648 AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPapaAPAAPAGAAPAQPAPAPAAT 708
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1284-1305 2.45e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 37.36  E-value: 2.45e-03
                           10        20
                   ....*....|....*....|..
gi 967490223  1284 SVRFTVEKPDENFSCASSLSAL 1305
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
549-586 9.47e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


:

Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 35.89  E-value: 9.47e-03
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 967490223   549 KKVLREAGSVTALVQCvLRATKESTLKSVLSALWNLSA 586
Cdd:pfam00514    5 KQAVIEAGAVPPLVRL-LSSPDEEVQEEAAWALSNLAA 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
685-720 9.95e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


:

Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 35.89  E-value: 9.95e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 967490223   685 ELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 720
Cdd:pfam00514    6 QAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
 
Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1810-2136 9.55e-93

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 304.88  E-value: 9.55e-93
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  1810 AVLRGRTVIYVPSpAPRAQPKGTPGPRATPRKvappclaqSAAPAKVPSPGQQRSRSLHRPGKTSELGTLSQPPRSATPP 1889
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPK--------TDAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  1890 ARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGAS------PVPKTPARTLLAKQ--HKTQRSPVRIPFMQKPA 1961
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  1962 R-----RGPPPLARAVPEpgPRGRAGTEAGPGARGGRLGLVRVAsALSSGSESSDRSGFRRQLTFIKESPG-LRRRRSEL 2035
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPE--PRSESASKGLRSLPGKRLDLVRMS-SARSSGSESDRSGFLRQLTFIKESPSlLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  2036 SSAESAASAPQGTSPRRGRPALPAVFLCSSRCEELRAAPRQAPA--RQRPPAARPGPGERPARRTSSESPSRLPVRAPAA 2113
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNpnSRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|...
gi 967490223  2114 RPETVKRYASLPHISVARRPDGT 2136
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSS 331
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
381-455 1.35e-35

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 130.36  E-value: 1.35e-35
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 967490223   381 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDGAPEgGGASGVPVPIEPQICQATCAVMKLSFDEEYRRAM 455
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPE-GDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
30-81 1.09e-25

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 101.60  E-value: 1.09e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 967490223    30 APYEQLVRQVEALKAENSHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 81
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
721-968 2.57e-20

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 93.88  E-value: 2.57e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   721 HRPAKHQAAaTAVSPGSCVPSLYVRKQRALEAELDARHLAQALEHLEKHRPPAAEATskKPlpplRHLDGLAQDYASDSG 800
Cdd:pfam16629    1 NRPAKYKDA-NIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRN--KQ----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   801 CFDDDDAPSSlaaaaaTGEPASPAALSLFLGSPFLQGQAlARTPPTRRGGKEAEKDASGEAAVAAKAKA----------- 869
Cdd:pfam16629   74 RHDDSVCRSD------NFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDRSLDRERGAGLSNfhpatensgns 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   870 ------KLALAVARIDQLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGPEG-GRREAGSRAHPLLRL 939
Cdd:pfam16629  147 skrigmQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYM 226
                          250       260
                   ....*....|....*....|....*....
gi 967490223   940 KAAHASLSNDSLNSGSASDGYCPREHMLP 968
Cdd:pfam16629  227 KMEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
148-229 4.29e-20

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 86.54  E-value: 4.29e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   148 SRATIRLLEELDRERCLLLNEIEKEEKEKLWYYSQLQGLSKRLDELPHVETQFSMQMDLIRQQLEFEAQHIRSLMEERFG 227
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 967490223   228 TS 229
Cdd:pfam11414   81 LI 82
PHA03247 PHA03247
large tegument protein UL36; Provisional
1538-1992 1.47e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.27  E-value: 1.47e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1538 SDEEPPAAAPTPTHRRTSAIPRALTRERLQGR-------------KEAPAPSKAAPSAPPPTRAQPSLiadETPPCYSLS 1604
Cdd:PHA03247 2501 GGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRmltwirgleelasDDAGDPPPPLPPAAPPAAPDRSV---PPPRPAPRP 2577
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1605 SSASSLSEPEPPEPPSVSPRGQEPAVTKDPgPRGGRDSSPSPRAAeellqrcissalPRRRPPVSGlRRRKPRATRLDER 1684
Cdd:PHA03247 2578 SEPAVTSRARRPDAPPQSARPRAPVDDRGD-PRGPAPPSPLPPDT------------HAPDPPPPS-PSPAANEPDPHPP 2643
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1685 PAEGSREHGEEAAGSDRASDLDSVEWRAIQEGANSIVT-WLHQAAAATREassesdsilsfvsglSVGSTLQPPKHRKGR 1763
Cdd:PHA03247 2644 PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQrPRRRAARPTVG---------------SLTSLADPPPPPPTP 2708
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1764 QAGGEMGSARRPEKRGAASAKTS--GSPRSPAGPEKPRGTqkTTPGVPA-VLRGRTVIYVPSPAPRAQPKGTPGPRATPR 1840
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQAspALPAAPAPPAVPAGP--ATPGGPArPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1841 KVAPPCLAQSAAP-----AKVPSPGQQRSRSL---HRPGKTSELGTLSQPPRSATPPARLAKTPSS------------SS 1900
Cdd:PHA03247 2787 AVASLSESRESLPspwdpADPPAAVLAPAAALppaASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapggdvrrRP 2866
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1901 SQTSPASQPLPRKRPLVTQAAGPLPGPGASPVPKTPARTLLAKQHKTQRSPVRIPFMQKPARRGPPPLARAVPEPGPRGR 1980
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                         490
                  ....*....|..
gi 967490223 1981 AGTEAGPGARGG 1992
Cdd:PHA03247 2947 TDPAGAGEPSGA 2958
PHA03247 PHA03247
large tegument protein UL36; Provisional
1772-2325 1.59e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 1.59e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1772 ARRPEKRGAASAKTSGSPRSPAGPEKPRGTQKT-----TPGVPAVLRGRTVIY-----------------VPSPAPRAQP 1829
Cdd:PHA03247 2486 ARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdePVGEPVHPRMLTWIRgleelasddagdpppplPPAAPPAAPD 2565
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1830 KGTPGPRATPRKVAPPCLAQSAAPAKVPSPGQQRSrslhrPGKTSELGTLSQPPrSATPPArlaktpsssssqTSPASQP 1909
Cdd:PHA03247 2566 RSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRA-----PVDDRGDPRGPAPP-SPLPPD------------THAPDPP 2627
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1910 LPRKRPLVTQAAGPLPGPGASPV-PKTPARTLLAKQHKTQRSPVRIPFMQKPARRGPPPLARAV--------------PE 1974
Cdd:PHA03247 2628 PPSPSPAANEPDPHPPPTVPPPErPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTvgsltsladpppppPT 2707
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1975 PGPRGRAGTEAGPGARGGRLGLVRVASALSSGSESSDRSGfrrQLTFIKESPGLRRRRSELSSAESAASAPQGTSPRRGR 2054
Cdd:PHA03247 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG---PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT 2784
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2055 PALPAVFLCSSRCEELRAAPRQAPARQRPPAARPGPGERPARrTSSESPSRLPVrAPAARPETVKRYASLPHiSVArrPD 2134
Cdd:PHA03247 2785 RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAG-PLPPPTSAQPT-APPPPPGPPPPSLPLGG-SVA--PG 2859
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2135 GTVPaapapadaaRRSSDGEPRSLPrvAAPgttwrrirdedvPHILRSTLPATALPLRGSTPEDAPAGPPPRKTSDAVVQ 2214
Cdd:PHA03247 2860 GDVR---------RRPPSRSPAAKP--AAP------------ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPP 2916
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2215 TEEVAAPKTNSSTSPSLESREPPGAPASGQLSLLGSDVDGPSlAKAPISAPFVHEGLGVAVGGFPASRHGSPSRSARVPP 2294
Cdd:PHA03247 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGA-VPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
                         570       580       590
                  ....*....|....*....|....*....|.
gi 967490223 2295 FNYVPSPMVVAATTDSAAEKAPATSSATLLE 2325
Cdd:PHA03247 2996 LTGHSLSRVSSWASSLALHEETDPPPVSLKQ 3026
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
639-678 1.24e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.06  E-value: 1.24e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 967490223   639 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 678
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
638-678 1.88e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 1.88e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 967490223    638 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 678
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1373-1599 4.86e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.25  E-value: 4.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1373 AAVPARLRKVASALVPGRRALPVPVYMLVPAPARAQEDDsctdsaegtpvnfSSAASLSDETLQGPPRDQPGGPEGRQRP 1452
Cdd:PRK12323  384 QPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAA-------------PARRSPAPEALAAARQASARGPGGAPAP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1453 TGRPTSArqavghrhKAGGAGRSVEQARGTGkNRAGLELPLGRPPSAPADKDDSKPGRTRGDGALQSLCLttpteeavyc 1532
Cdd:PRK12323  451 APAPAAA--------PAAAARPAAAGPRPVA-AAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAP---------- 511
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 967490223 1533 fygNDSDEEPPAAAPTPTHRRTSAIPRALTRERLQGRKEAPAPSKAAPSAPPPTRAQPSLIADETPP 1599
Cdd:PRK12323  512 ---AQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
2072-2321 1.08e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 1.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2072 AAPRQAPARQRPPAARPGPGERPARRTSSESPSRLPVRAPAARPETVKRYASLPHISVARRPDGTVPAAPAPADAARRSS 2151
Cdd:PRK07764  419 AAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA 498
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2152 DGEPRSLPR-VAAPGTTWRRIRdEDVPHILRSTLP-----ATALPLRGST------------------------------ 2195
Cdd:PRK07764  499 APAAPAGADdAATLRERWPEIL-AAVPKRSRKTWAillpeATVLGVRGDTlvlgfstgglarrfaspgnaevlvtalaee 577
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2196 ------------PEDAPAGPPPRKTSDAVVQTEEVAAPKTNSSTSPSLESREPPGAPASGqlsllgsdvdGPSLAKAPIS 2263
Cdd:PRK07764  578 lggdwqveavvgPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA----------EASAAPAPGV 647
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 967490223 2264 APFVHEGLGVAVGGFPASRHGSPSRSARVPPFNYVPSP---MVVAATTDSAAEKAPATSSA 2321
Cdd:PRK07764  648 AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPapaAPAAPAGAAPAQPAPAPAAT 708
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1645-1666 1.62e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 37.57  E-value: 1.62e-03
                           10        20
                   ....*....|....*....|..
gi 967490223  1645 SPRAAEELLQRCISSALPRRRP 1666
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1410-1432 2.33e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 37.36  E-value: 2.33e-03
                           10        20
                   ....*....|....*....|...
gi 967490223  1410 DDSCTDSAEGTPVNFSSAASLSD 1432
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1284-1305 2.45e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 37.36  E-value: 2.45e-03
                           10        20
                   ....*....|....*....|..
gi 967490223  1284 SVRFTVEKPDENFSCASSLSAL 1305
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
549-586 9.47e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 35.89  E-value: 9.47e-03
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 967490223   549 KKVLREAGSVTALVQCvLRATKESTLKSVLSALWNLSA 586
Cdd:pfam00514    5 KQAVIEAGAVPPLVRL-LSSPDEEVQEEAAWALSNLAA 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
685-720 9.95e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 35.89  E-value: 9.95e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 967490223   685 ELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 720
Cdd:pfam00514    6 QAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
 
Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1810-2136 9.55e-93

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 304.88  E-value: 9.55e-93
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  1810 AVLRGRTVIYVPSpAPRAQPKGTPGPRATPRKvappclaqSAAPAKVPSPGQQRSRSLHRPGKTSELGTLSQPPRSATPP 1889
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPK--------TDAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  1890 ARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGAS------PVPKTPARTLLAKQ--HKTQRSPVRIPFMQKPA 1961
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRnklsplPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  1962 R-----RGPPPLARAVPEpgPRGRAGTEAGPGARGGRLGLVRVAsALSSGSESSDRSGFRRQLTFIKESPG-LRRRRSEL 2035
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPE--PRSESASKGLRSLPGKRLDLVRMS-SARSSGSESDRSGFLRQLTFIKESPSlLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223  2036 SSAESAASAPQGTSPRRGRPALPAVFLCSSRCEELRAAPRQAPA--RQRPPAARPGPGERPARRTSSESPSRLPVRAPAA 2113
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNpnSRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|...
gi 967490223  2114 RPETVKRYASLPHISVARRPDGT 2136
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSS 331
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
381-455 1.35e-35

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 130.36  E-value: 1.35e-35
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 967490223   381 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDGAPEgGGASGVPVPIEPQICQATCAVMKLSFDEEYRRAM 455
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPE-GDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
30-81 1.09e-25

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 101.60  E-value: 1.09e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 967490223    30 APYEQLVRQVEALKAENSHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 81
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
721-968 2.57e-20

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 93.88  E-value: 2.57e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   721 HRPAKHQAAaTAVSPGSCVPSLYVRKQRALEAELDARHLAQALEHLEKHRPPAAEATskKPlpplRHLDGLAQDYASDSG 800
Cdd:pfam16629    1 NRPAKYKDA-NIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRN--KQ----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   801 CFDDDDAPSSlaaaaaTGEPASPAALSLFLGSPFLQGQAlARTPPTRRGGKEAEKDASGEAAVAAKAKA----------- 869
Cdd:pfam16629   74 RHDDSVCRSD------NFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDRSLDRERGAGLSNfhpatensgns 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   870 ------KLALAVARIDQLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGPEG-GRREAGSRAHPLLRL 939
Cdd:pfam16629  147 skrigmQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYM 226
                          250       260
                   ....*....|....*....|....*....
gi 967490223   940 KAAHASLSNDSLNSGSASDGYCPREHMLP 968
Cdd:pfam16629  227 KMEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
148-229 4.29e-20

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 86.54  E-value: 4.29e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223   148 SRATIRLLEELDRERCLLLNEIEKEEKEKLWYYSQLQGLSKRLDELPHVETQFSMQMDLIRQQLEFEAQHIRSLMEERFG 227
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGTYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 967490223   228 TS 229
Cdd:pfam11414   81 LI 82
PHA03247 PHA03247
large tegument protein UL36; Provisional
1538-1992 1.47e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.27  E-value: 1.47e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1538 SDEEPPAAAPTPTHRRTSAIPRALTRERLQGR-------------KEAPAPSKAAPSAPPPTRAQPSLiadETPPCYSLS 1604
Cdd:PHA03247 2501 GGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRmltwirgleelasDDAGDPPPPLPPAAPPAAPDRSV---PPPRPAPRP 2577
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1605 SSASSLSEPEPPEPPSVSPRGQEPAVTKDPgPRGGRDSSPSPRAAeellqrcissalPRRRPPVSGlRRRKPRATRLDER 1684
Cdd:PHA03247 2578 SEPAVTSRARRPDAPPQSARPRAPVDDRGD-PRGPAPPSPLPPDT------------HAPDPPPPS-PSPAANEPDPHPP 2643
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1685 PAEGSREHGEEAAGSDRASDLDSVEWRAIQEGANSIVT-WLHQAAAATREassesdsilsfvsglSVGSTLQPPKHRKGR 1763
Cdd:PHA03247 2644 PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQrPRRRAARPTVG---------------SLTSLADPPPPPPTP 2708
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1764 QAGGEMGSARRPEKRGAASAKTS--GSPRSPAGPEKPRGTqkTTPGVPA-VLRGRTVIYVPSPAPRAQPKGTPGPRATPR 1840
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQAspALPAAPAPPAVPAGP--ATPGGPArPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1841 KVAPPCLAQSAAP-----AKVPSPGQQRSRSL---HRPGKTSELGTLSQPPRSATPPARLAKTPSS------------SS 1900
Cdd:PHA03247 2787 AVASLSESRESLPspwdpADPPAAVLAPAAALppaASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapggdvrrRP 2866
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1901 SQTSPASQPLPRKRPLVTQAAGPLPGPGASPVPKTPARTLLAKQHKTQRSPVRIPFMQKPARRGPPPLARAVPEPGPRGR 1980
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
                         490
                  ....*....|..
gi 967490223 1981 AGTEAGPGARGG 1992
Cdd:PHA03247 2947 TDPAGAGEPSGA 2958
PHA03247 PHA03247
large tegument protein UL36; Provisional
1438-1973 7.60e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 7.60e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1438 PPRDQPGGPEgRQRPTGR--PTSARQAVGHRHKAGGAGRSVEQARGTGKNRAGlelPLGRPPSAPADKDDSKPGRTRGDG 1515
Cdd:PHA03247 2556 PPAAPPAAPD-RSVPPPRpaPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD---PRGPAPPSPLPPDTHAPDPPPPSP 2631
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1516 ALQslclttpteeavycfyGNDSDEEPPAAAPTPTHRRTSAIPRALTRERLQGRKEAPAPSKAAPSAPPPTRAQPSL--- 1592
Cdd:PHA03247 2632 SPA----------------ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVgsl 2695
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1593 --IADETPPCYSLSSSASSLSEPEPPEPPSVSPRGQEPAVTKDPGPRGGRDSSPSPRAAEELLQRCISSALPRRRPPVSG 1670
Cdd:PHA03247 2696 tsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1671 LRRRKPRATRlderPAEGSREHGEEAAGSDRASdldsvewraiqegANSIVTWLHQAAAATREASSesdsilsfvsglsv 1750
Cdd:PHA03247 2776 AAGPPRRLTR----PAVASLSESRESLPSPWDP-------------ADPPAAVLAPAAALPPAASP-------------- 2824
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1751 GSTLQPPkhrkgrqaggemgsarrPEKRGAASAKTSGSPRSPAGPEKprgtqKTTPGVPAVLRGrtviyvPSPAPRAQPK 1830
Cdd:PHA03247 2825 AGPLPPP-----------------TSAQPTAPPPPPGPPPPSLPLGG-----SVAPGGDVRRRP------PSRSPAAKPA 2876
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1831 GTPGPRAtpRKVAPPCLAQSAAPAKVPSPGQQRSRSLHRPGKTSELGTLSQPPRSATPParlaktpsssssqtspasQPL 1910
Cdd:PHA03247 2877 APARPPV--RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPP------------------PPP 2936
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 967490223 1911 PRKRPLVTQAAGPLPGPGASPVPKTPARTLLAKQHKT---QRSPVRIPFMQKPARRGPPPLARAVP 1973
Cdd:PHA03247 2937 PRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAvprFRVPQPAPSREAPASSTPPLTGHSLS 3002
PHA03247 PHA03247
large tegument protein UL36; Provisional
1772-2325 1.59e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 1.59e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1772 ARRPEKRGAASAKTSGSPRSPAGPEKPRGTQKT-----TPGVPAVLRGRTVIY-----------------VPSPAPRAQP 1829
Cdd:PHA03247 2486 ARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdePVGEPVHPRMLTWIRgleelasddagdpppplPPAAPPAAPD 2565
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1830 KGTPGPRATPRKVAPPCLAQSAAPAKVPSPGQQRSrslhrPGKTSELGTLSQPPrSATPPArlaktpsssssqTSPASQP 1909
Cdd:PHA03247 2566 RSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRA-----PVDDRGDPRGPAPP-SPLPPD------------THAPDPP 2627
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1910 LPRKRPLVTQAAGPLPGPGASPV-PKTPARTLLAKQHKTQRSPVRIPFMQKPARRGPPPLARAV--------------PE 1974
Cdd:PHA03247 2628 PPSPSPAANEPDPHPPPTVPPPErPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTvgsltsladpppppPT 2707
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1975 PGPRGRAGTEAGPGARGGRLGLVRVASALSSGSESSDRSGfrrQLTFIKESPGLRRRRSELSSAESAASAPQGTSPRRGR 2054
Cdd:PHA03247 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG---PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT 2784
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2055 PALPAVFLCSSRCEELRAAPRQAPARQRPPAARPGPGERPARrTSSESPSRLPVrAPAARPETVKRYASLPHiSVArrPD 2134
Cdd:PHA03247 2785 RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAG-PLPPPTSAQPT-APPPPPGPPPPSLPLGG-SVA--PG 2859
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2135 GTVPaapapadaaRRSSDGEPRSLPrvAAPgttwrrirdedvPHILRSTLPATALPLRGSTPEDAPAGPPPRKTSDAVVQ 2214
Cdd:PHA03247 2860 GDVR---------RRPPSRSPAAKP--AAP------------ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPP 2916
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2215 TEEVAAPKTNSSTSPSLESREPPGAPASGQLSLLGSDVDGPSlAKAPISAPFVHEGLGVAVGGFPASRHGSPSRSARVPP 2294
Cdd:PHA03247 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGA-VPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
                         570       580       590
                  ....*....|....*....|....*....|.
gi 967490223 2295 FNYVPSPMVVAATTDSAAEKAPATSSATLLE 2325
Cdd:PHA03247 2996 LTGHSLSRVSSWASSLALHEETDPPPVSLKQ 3026
PHA03247 PHA03247
large tegument protein UL36; Provisional
1756-2243 2.30e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.95  E-value: 2.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1756 PPKHRKGRQAGGEMGS-ARRPEkrgaaSAKTSGSPRSPAGP-EKPRGTQKTTPGVPAVLRGRTViyVPSPAPRAQPKGTP 1833
Cdd:PHA03247 2569 PPPRPAPRPSEPAVTSrARRPD-----APPQSARPRAPVDDrGDPRGPAPPSPLPPDTHAPDPP--PPSPSPAANEPDPH 2641
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1834 GPRATPrkvAPPCLAQSAAPAKVPSPgqQRSRSLHRPGKTSelgtlsQPPRSATPPARLAKTPSSSSSQTSPASQPLPRK 1913
Cdd:PHA03247 2642 PPPTVP---PPERPRDDPAPGRVSRP--RRARRLGRAAQAS------SPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1914 RPLVTQAAGPLPGPGASPVPKTPARTLLAKQHKTQRSPVripFMQKPARRGPPPLARAVPEPGPRgrAGTEAGPGARGGR 1993
Cdd:PHA03247 2711 APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA---TPGGPARPARPPTTAGPPAPAPP--AAPAAGPPRRLTR 2785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1994 lglvrvasalssgsessdrsgfrrqltfikespglrrrrselssaesaasaPQGTSPRRGRPALPavflcSSRCEELRAA 2073
Cdd:PHA03247 2786 ---------------------------------------------------PAVASLSESRESLP-----SPWDPADPPA 2809
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2074 PRQAPARQRPPAARPGPGERParrTSSESPSRLPVRAPAARPETVKRYASLPHISVARRPDG----TVPAAPAPADAARR 2149
Cdd:PHA03247 2810 AVLAPAAALPPAASPAGPLPP---PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSrspaAKPAAPARPPVRRL 2886
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2150 SSDGEPRSLPRVAAPGTTWRRIRDEDVPHILRSTLPATALPLRGSTPEDAP-AGPPPRKTSDAVVQTE------------ 2216
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPrPQPPLAPTTDPAGAGEpsgavpqpwlga 2966
                         490       500       510
                  ....*....|....*....|....*....|..
gi 967490223 2217 ----EVAAPKTN-SSTSPSLESREPPGAPASG 2243
Cdd:PHA03247 2967 lvpgRVAVPRFRvPQPAPSREAPASSTPPLTG 2998
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
639-678 1.24e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.06  E-value: 1.24e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 967490223   639 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 678
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
638-678 1.88e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 1.88e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 967490223    638 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 678
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1542-1972 2.81e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 52.68  E-value: 2.81e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1542 PPAAAPTPTHRRTSAIPRALTRERLQGRKEaPAPSKAAPSAPPPTRAQPSLIADETPPCYSLSSSASSLSEPEPPEPPSV 1621
Cdd:PRK07764  401 AAAAAPAAAPAPAAAAPAAAAAPAPAAAPQ-PAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAA 479
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1622 SPRGQEPAVTKDPGPRGGRDSSPS--PRAAEELLQR--CISSALPRRRPPVSGLRrrKPRATRLDERPAEGSREHGEEAA 1697
Cdd:PRK07764  480 PAPAPPAAPAPAAAPAAPAAPAAPagADDAATLRERwpEILAAVPKRSRKTWAIL--LPEATVLGVRGDTLVLGFSTGGL 557
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1698 GSDRASdldsvewraiQEGANSIVTWLHQaaaatreassesdsilsfVSGLSVGSTLQPPKHRKGRQAGGEMGSARRPEK 1777
Cdd:PRK07764  558 ARRFAS----------PGNAEVLVTALAE------------------ELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPP 609
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1778 RGAASAKTSGSPRSPAGPekPRGTQKTTPGVPAVLRGRTVIY-------VPSPAPRAQPKGTPGPRATPRKVAPPCLAQS 1850
Cdd:PRK07764  610 EEAARPAAPAAPAAPAAP--APAGAAAAPAEASAAPAPGVAApehhpkhVAVPDASDGGDGWPAKAGGAAPAAPPPAPAP 687
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1851 AAPAkvPSPGQQRSRSLHRPGKTSELGTlSQPPRSATPPARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGAS 1930
Cdd:PRK07764  688 AAPA--APAGAAPAQPAPAPAATPPAGQ-ADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAP 764
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....
gi 967490223 1931 PVPKTPARTLLAKQHKTQRSPVR--IPFMQKPARRGPPPLARAV 1972
Cdd:PRK07764  765 APAAAPAAAPPPSPPSEEEEMAEddAPSMDDEDRRDAEEVAMEL 808
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1543-1990 1.12e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.75  E-value: 1.12e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1543 PAAAPTPTHRRTSAIPRAltrerlqgrkEAPAPSKAAPSAPPPTRAQPSLIADETPPcyslsSSASSLSEPEPPEPPSVS 1622
Cdd:PRK07764  390 GAGAPAAAAPSAAAAAPA----------AAPAPAAAAPAAAAAPAPAAAPQPAPAPA-----PAPAPPSPAGNAPAGGAP 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1623 PRGQEPAVTKDPGPRGGRDSSPSPRAAEEllqrcissalprrrppvsglrrrkPRATRLDERPAEGSREHGEEAAGSDRA 1702
Cdd:PRK07764  455 SPPPAAAPSAQPAPAPAAAPEPTAAPAPA------------------------PPAAPAPAAAPAAPAAPAAPAGADDAA 510
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1703 SDLDSveWRAIQE--GANSIVTWLHQAAAATREASSESDSILSFVSGLSVGSTLQPPKHRKGRQA-GGEMGSARRPEKRG 1779
Cdd:PRK07764  511 TLRER--WPEILAavPKRSRKTWAILLPEATVLGVRGDTLVLGFSTGGLARRFASPGNAEVLVTAlAEELGGDWQVEAVV 588
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1780 AASAKTSGSPRSPAGPEKPRGTQKTTPGVPAVlrgrtviyvPSPAPRAQPKGTPGPRATPRKVAPPCLAQSAAPAKVPSP 1859
Cdd:PRK07764  589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAA---------PAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1860 GQQRSRSLHRPGKTSELGTLSQPPRSATPPARlaktpsssssqtspASQPLPRKRPLVTQAAGPLPGPGASPVPKTPART 1939
Cdd:PRK07764  660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPA--------------APAGAAPAQPAPAPAATPPAGQADDPAAQPPQAA 725
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|.
gi 967490223 1940 llakQHKTQRSPVRIPFMQKPARRGPPPLARAVPEPGPRGRAGTEAGPGAR 1990
Cdd:PRK07764  726 ----QGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAA 772
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1539-1985 1.49e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.37  E-value: 1.49e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1539 DEEPPAAAPTPTHRRTSAIPRALTRERLQGRKEAPAPSKAAPSAPPPTRAQPSLIADETPPCYSLSSSASSLSEPEPPEP 1618
Cdd:PRK07764  391 AGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAP 470
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1619 PSVSPRGQEPAvtkdPGPRGGRDSSPSPRAAEEllqrcisSALPRRRPPVSGLRRRKPRAtrldeRPAEGSREHGEEAAG 1698
Cdd:PRK07764  471 AAAPEPTAAPA----PAPPAAPAPAAAPAAPAA-------PAAPAGADDAATLRERWPEI-----LAAVPKRSRKTWAIL 534
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1699 SDRAsdldsvewRAIQEGANSIVTWLHQAAAATREASSESDSILSFVSGLSVGSTLQPPKhrkgrQAGGEMGSARRPEKR 1778
Cdd:PRK07764  535 LPEA--------TVLGVRGDTLVLGFSTGGLARRFASPGNAEVLVTALAEELGGDWQVEA-----VVGPAPGAAGGEGPP 601
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1779 GAASAKTSGSPRSPAGPEKPRGTQKTTPGVPAvlRGRTVIYVPSPAPRAQPKGTPGPRATPRKVAPPCLAQSAAPAKVPS 1858
Cdd:PRK07764  602 APASSGPPEEAARPAAPAAPAAPAAPAPAGAA--AAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPA 679
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1859 PGQQRSRSLHRPGKTSELGTLSQPPRSATPPARLAKTPSSSSSQTSPASQPLPRkrplVTQAAGPLPG-PGASPVPKTPA 1937
Cdd:PRK07764  680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSP----AADDPVPLPPePDDPPDPAGAP 755
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*...
gi 967490223 1938 RTLLAKQHKTQRSPVRIPFMQKPARRGPPPLARAVPEPGPRGRAGTEA 1985
Cdd:PRK07764  756 AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEE 803
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1756-2074 4.19e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.01  E-value: 4.19e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1756 PPKHRKGRQAGGEMGSARRPEKRGAASAKTSGSPRSPAGPEKPRGTQKTTPGVPAVLRGrtviyvPSPAPRAQPKGTPGP 1835
Cdd:PHA03307  127 PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARA------PSSPPAEPPPSTPPA 200
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1836 RATPRKVAPPCLAQSAAPAKVPSPG--------QQRSRSLHRPGKTSELGTLSQPPRSATPPARLAKTPSSSSSQTSPAS 1907
Cdd:PHA03307  201 AASPRPPRRSSPISASASSPAPAPGrsaaddagASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS 280
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1908 QPLPRKRPLVTQAAGPLPGPGASPVPKTPARTllakQHKTQRSPVRIPFMQKPARRGPPPLARAVPEPGPRGRAGTEAGP 1987
Cdd:PHA03307  281 RPGPASSSSSPRERSPSPSPSSPGSGPAPSSP----RASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP 356
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1988 GARggrlglvRVASALSSGSESSDRSGFRRQltfikesPGLRRRRSELSSAESAASAPQGTSPRRGRPALPAVFLCSSRC 2067
Cdd:PHA03307  357 PPP-------ADPSSPRKRPRPSRAPSSPAA-------SAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAA 422

                  ....*..
gi 967490223 2068 EELRAAP 2074
Cdd:PHA03307  423 SGAFYAR 429
PHA03247 PHA03247
large tegument protein UL36; Provisional
1815-2293 5.55e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 5.55e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1815 RTVIYVPSPAPRAQPKGTPGPRATPRkvaPPCLAQSAAPAKVPSPGQQRSRSLHRPGKTSELGTL---SQPPRSATPPAR 1891
Cdd:PHA03247 2457 RTILGAPFSLSLLLGELFPGAPVYRR---PAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpDEPVGEPVHPRM 2533
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1892 LAKTPSSSSSQTSPASQPLPRKRPLVTQAAG--PLPGPGASPVPKTPARTLLAK------QHKTQRSPV--RIPFMQKPA 1961
Cdd:PHA03247 2534 LTWIRGLEELASDDAGDPPPPLPPAAPPAAPdrSVPPPRPAPRPSEPAVTSRARrpdappQSARPRAPVddRGDPRGPAP 2613
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1962 RRGPPPLARAVPEPGPrgragteaGPGARGGRLGLVRVASALSSGSESSDRSGFRRQLtfikespglRRRRSELSSAESA 2041
Cdd:PHA03247 2614 PSPLPPDTHAPDPPPP--------SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR---------PRRARRLGRAAQA 2676
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2042 ASAPQGTSPRRGRPALPAVflcssrceelraaprqaPARQRPPAARPGPGERPARRTSSESPSRLPVRAPAARPETVKRY 2121
Cdd:PHA03247 2677 SSPPQRPRRRAARPTVGSL-----------------TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2122 ASLPhisvarRPDGTVPaapapadaarrssdgePRSLPRVAAPgttwrrirdedvphilrstlPATALPLRgSTPEDAPA 2201
Cdd:PHA03247 2740 APPA------VPAGPAT----------------PGGPARPARP--------------------PTTAGPPA-PAPPAAPA 2776
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2202 GPPPRKTSDAVVQTEEVAAPKTNSSTSPSLESREPPG---------APASGQLSLLGSDVDGPSLAKAPISAPFVHEGlG 2272
Cdd:PHA03247 2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApaaalppaaSPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG-S 2855
                         490       500
                  ....*....|....*....|.
gi 967490223 2273 VAVGGfPASRHGSPSRSARVP 2293
Cdd:PHA03247 2856 VAPGG-DVRRRPPSRSPAAKP 2875
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1746-2136 6.78e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.44  E-value: 6.78e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1746 SGLSVGSTLQPPkhrkGRQAGGEMGSARRPEKRGAASAKTSGSPRSPAGPEKPRGTQKTTPGVPAVLRGRTVIYVPSPAP 1825
Cdd:PRK07764  383 RRLGVAGGAGAP----AAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP 458
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1826 RAQPKGTPGPRATPRKVAPPCLAQSAAPAKVPSPGQQRSrslhRPGKTSELGTLSQPPRSATPPAR--LAKTPSSSSSQT 1903
Cdd:PRK07764  459 AAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAP----AAPAAPAGADDAATLRERWPEILaaVPKRSRKTWAIL 534
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1904 SPASQPLPRKRPLVTQA------AGPLPGPGASPVPKTPARTLLAKQHK------TQRSPVRIPFMQKPARRGPPP-LAR 1970
Cdd:PRK07764  535 LPEATVLGVRGDTLVLGfstgglARRFASPGNAEVLVTALAEELGGDWQveavvgPAPGAAGGEGPPAPASSGPPEeAAR 614
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1971 AVPEPGPRGRAGTEAGPGARGGRLGLVRVASALSSGSESSDRSGFrrqlTFIKESPGLRRRRSELSSAESAASAPQGTSP 2050
Cdd:PRK07764  615 PAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV----PDASDGGDGWPAKAGGAAPAAPPPAPAPAAP 690
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2051 RRGRPALPAvflcssRCEELRAAPRQAPARQRPPAARPGPGERPARRTSSESPSRLPVRAPAARPETVKRYASLPHISVA 2130
Cdd:PRK07764  691 AAPAGAAPA------QPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAP 764

                  ....*.
gi 967490223 2131 RRPDGT 2136
Cdd:PRK07764  765 APAAAP 770
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1538-1936 1.57e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 1.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1538 SDEEPPAAAPTPTHRRTSAIPRALTRERLQGRKEAPAPSKAAPSAPPPTRAQPSLIADETPPCYSLSSSASSLSEPEPPE 1617
Cdd:PHA03307   62 CDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPV 141
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1618 PPSVSPRGQEPAVTKDPGPRGGRDSSPSPRAAeellqRCISSALPRRRPPVSGLRRRKPRATRLDeRPAEGSREHGEEAA 1697
Cdd:PHA03307  142 GSPGPPPAASPPAAGASPAAVASDAASSRQAA-----LPLSSPEETARAPSSPPAEPPPSTPPAA-ASPRPPRRSSPISA 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1698 GSDRASDLDsvewraiqegansivtwLHQAAAATREASSESDSILSFVSGLSVGSTLQPPKH----RKGRQAGGEMGSAR 1773
Cdd:PHA03307  216 SASSPAPAP-----------------GRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPapitLPTRIWEASGWNGP 278
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1774 RPEKRGAASAKTSGSPRSPAGPEKPRGTQKTTPGVPAVLRGR-TVIYVPSPAPRAQPKGTPGPRATPRKVAPPCLAQSAA 1852
Cdd:PHA03307  279 SSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSsRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPP 358
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1853 PAKVPSPgQQRSRSLHRPGKTselgtlSQPPRSATPPARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGASPV 1932
Cdd:PHA03307  359 PADPSSP-RKRPRPSRAPSSP------AASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYP 431

                  ....
gi 967490223 1933 PKTP 1936
Cdd:PHA03307  432 LLTP 435
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1800-1995 4.62e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.64  E-value: 4.62e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1800 GTQKTTPGVPAVLRGRTVIYVPSPAPRAqPKGTPGPRATPRKVAPPCLAQSAAPAKVPSPGQQRSRSLHRPGKTSELGTL 1879
Cdd:PRK12323  371 GAGPATAAAAPVAQPAPAAAAPAAAAPA-PAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPA 449
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1880 SQPPRSATPPARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGASPVPKTPARTLLAKQHKTQRSPVRIPFMQK 1959
Cdd:PRK12323  450 PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDP 529
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 967490223 1960 PARRGPPPLARAVPEPG----PRGRAGTEAGPGARGGRLG 1995
Cdd:PRK12323  530 ATADPDDAFETLAPAPAaapaPRAAAATEPVVAPRPPRAS 569
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1373-1599 4.86e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.25  E-value: 4.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1373 AAVPARLRKVASALVPGRRALPVPVYMLVPAPARAQEDDsctdsaegtpvnfSSAASLSDETLQGPPRDQPGGPEGRQRP 1452
Cdd:PRK12323  384 QPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAA-------------PARRSPAPEALAAARQASARGPGGAPAP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1453 TGRPTSArqavghrhKAGGAGRSVEQARGTGkNRAGLELPLGRPPSAPADKDDSKPGRTRGDGALQSLCLttpteeavyc 1532
Cdd:PRK12323  451 APAPAAA--------PAAAARPAAAGPRPVA-AAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAP---------- 511
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 967490223 1533 fygNDSDEEPPAAAPTPTHRRTSAIPRALTRERLQGRKEAPAPSKAAPSAPPPTRAQPSLIADETPP 1599
Cdd:PRK12323  512 ---AQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
PHA03378 PHA03378
EBNA-3B; Provisional
1792-1982 4.96e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.44  E-value: 4.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1792 PAGPEKPRGTQKTTPGVPAVLRGRTVIYVPSPAPRAQPKGTPGPRATPRKVAPPCLAQSAAPAKVPSPGQQRSRSLHRPG 1871
Cdd:PHA03378  670 GHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAA 749
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1872 KTSELGTLSQPPRSATPParlAKTPSSSSSQTSPASQPLPRKRPlvTQAAGPLPGPGASPVPKTPARTLLAKQHKTQRSP 1951
Cdd:PHA03378  750 APGRARPPAAAPGRARPP---AAAPGAPTPQPPPQAPPAPQQRP--RGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQI 824
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 967490223 1952 VRIPFMQ-----KPARRGPPPLAR---AVPEPGPRGRAG 1982
Cdd:PHA03378  825 LRQLLTGgvkrgRPSLKKPAALERqaaAGPTPSPGSGTS 863
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1787-2164 5.47e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 5.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1787 GSPRSPAGPEKPRGTQKTTPGVPAVLRGRTVIYVPSPAPRAQPKGTPGPRATPRKVAPPCLAQSAAPAKVPSPGQQRSRS 1866
Cdd:PHA03307   25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1867 LHRPGKTSELGTLSQPPRSATPPARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGASP--------VPKTPAR 1938
Cdd:PHA03307  105 SPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSrqaalplsSPEETAR 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1939 TLLAKQHKTQRSPVRIPFMQKPARRGpPPLARAVPEPGPRG--RAGTEAGPGARGGRLGLVRVASALSSGSESSDRSGFR 2016
Cdd:PHA03307  185 APSSPPAEPPPSTPPAAASPRPPRRS-SPISASASSPAPAPgrSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPI 263
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2017 RQLTFIKE-SPGLRRRRSELSSAESAASAPQGTSPRRGRPALP-------AVFLCSSRCEELRAAPRQAPARQRPPAARP 2088
Cdd:PHA03307  264 TLPTRIWEaSGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGpapssprASSSSSSSRESSSSSTSSSSESSRGAAVSP 343
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 967490223 2089 GPGERPARRTSSESPSRLPvrAPAARPETVKRYASLPHISVARRPDGTVPAAPAPADAARRSSDGEPRSLPRVAAP 2164
Cdd:PHA03307  344 GPSPSRSPSPSRPPPPADP--SSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPL 417
PLN03237 PLN03237
DNA topoisomerase 2; Provisional
1647-1860 6.70e-04

DNA topoisomerase 2; Provisional


Pssm-ID: 215641 [Multi-domain]  Cd Length: 1465  Bit Score: 45.24  E-value: 6.70e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1647 RAAEELLQRCISSALPRRRPPVSGLRRRKPRATRLDERPAEGSREHGEEAAGSDRASDLDSVEWRAIQEGANSivtwlHQ 1726
Cdd:PLN03237 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGA-----KK 1248
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1727 AAAATREASSESDSILSFvsglsvgstlqppkhrKGRQAGGEMGSArrPEKRgaaSAKTSGSPRSPAGPEKPRGTQKTTP 1806
Cdd:PLN03237 1249 KAPAAAKEKEEEDEILDL----------------KDRLAAYNLDSA--PAQS---AKMEETVKAVPARRAAARKKPLASV 1307
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 967490223 1807 GVPAVLRGRTVIYVP--SPAPRAQPKGTPGPRATPRKVA-PPCLAQSAAPAKVPSPG 1860
Cdd:PLN03237 1308 SVISDSDDDDDDFAVevSLAERLKKKGGRKPAAANKKAAkPPAAAKKRGPATVQSGQ 1364
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
2072-2321 1.08e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 1.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2072 AAPRQAPARQRPPAARPGPGERPARRTSSESPSRLPVRAPAARPETVKRYASLPHISVARRPDGTVPAAPAPADAARRSS 2151
Cdd:PRK07764  419 AAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA 498
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2152 DGEPRSLPR-VAAPGTTWRRIRdEDVPHILRSTLP-----ATALPLRGST------------------------------ 2195
Cdd:PRK07764  499 APAAPAGADdAATLRERWPEIL-AAVPKRSRKTWAillpeATVLGVRGDTlvlgfstgglarrfaspgnaevlvtalaee 577
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2196 ------------PEDAPAGPPPRKTSDAVVQTEEVAAPKTNSSTSPSLESREPPGAPASGqlsllgsdvdGPSLAKAPIS 2263
Cdd:PRK07764  578 lggdwqveavvgPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA----------EASAAPAPGV 647
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 967490223 2264 APFVHEGLGVAVGGFPASRHGSPSRSARVPPFNYVPSP---MVVAATTDSAAEKAPATSSA 2321
Cdd:PRK07764  648 AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPapaAPAAPAGAAPAQPAPAPAAT 708
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1628-1991 1.11e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.39  E-value: 1.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1628 PAVTKDPGPRGGRDSSPSPRA-AEELLQRCISSALPRRRPPVSGLRRRKPRATR-----LDERPAEGSREHGEEAAGSDR 1701
Cdd:PHA03307   55 VVAGAAACDRFEPPTGPPPGPgTEAPANESRSTPTWSLSTLAPASPAREGSPTPpgpssPDPPPPTPPPASPPPSPAPDL 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1702 ASDLDSVEWRAIQEGANSIVTWLHQAAAATREASSESDSILSFVSGLSVGSTLQPPKHRKGRQAGGEMGSARRPEKRGAA 1781
Cdd:PHA03307  135 SEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPIS 214
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1782 SAKTSGSPRSPAGPEKPRG------TQKTTPGVPAVLRGRTVIYVPSPAPRAQPKGTPGPRATPRKVAPPCLAQSAAPAK 1855
Cdd:PHA03307  215 ASASSPAPAPGRSAADDAGasssdsSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRER 294
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1856 VPSPGQQRSRSLHRPGKTSELGTLSQPPRSATPPARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGASPVPKT 1935
Cdd:PHA03307  295 SPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRA 374
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 967490223 1936 PARTLLAKQHKTQRSPVRIPFMQKPARRGPPPLARAVPEPGPRGRAGTEAGPGARG 1991
Cdd:PHA03307  375 PSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARY 430
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1880-2294 1.62e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 1.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1880 SQPPRSATPPARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGASPVPKTPARtllAKQHKTQRSPVRIPFMQK 1959
Cdd:PRK07764  396 AAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAP---SPPPAAAPSAQPAPAPAA 472
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1960 PARRGPPPLARAVPEPGPRGRAGTEAGPGARGGrlglvrvasalssgseSSDRSGFRRQLTFIKES-PGLRRRRSELSSA 2038
Cdd:PRK07764  473 APEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG----------------ADDAATLRERWPEILAAvPKRSRKTWAILLP 536
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2039 ESAASAPQGTSPRRG--RPALPAVFLCSSRCEELRAAPRQAPARQRPPAARPGPGERPARRTSSESPSRLPVRAPAARPE 2116
Cdd:PRK07764  537 EATVLGVRGDTLVLGfsTGGLARRFASPGNAEVLVTALAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPA 616
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2117 TVKRYASlphisvarrpdgtvPAAPAPADAARRSSDGEPRSLPRVAAPGTTWRRIRDEDVPHILRSTLPATALPLRGSTP 2196
Cdd:PRK07764  617 APAAPAA--------------PAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPP 682
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2197 EDAPAGPPPRKTSDAvvQTEEVAAPKTNSSTSPSLESREPPGAPASGQLSLLGSDVDGPSLAKAPISAPFVHEGLGVAVG 2276
Cdd:PRK07764  683 PAPAPAAPAAPAGAA--PAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPP 760
                         410
                  ....*....|....*...
gi 967490223 2277 GFPASRHGSPSRSARVPP 2294
Cdd:PRK07764  761 PPAPAPAAAPAAAPPPSP 778
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1645-1666 1.62e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 37.57  E-value: 1.62e-03
                           10        20
                   ....*....|....*....|..
gi 967490223  1645 SPRAAEELLQRCISSALPRRRP 1666
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1347-1689 1.82e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.62  E-value: 1.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1347 EGPAPTGSRPRGAADQELELLRECLGAAVPARLRKVASalVPGRRALPVPVYMLVPAPARAQEDDSCTDSAEGTPVNFSS 1426
Cdd:PHA03307   49 ELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRS--TPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP 126
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1427 AASLSDETLQGPPRDQPGGPEGRQRPTGRPTSARQAVGHRHKAGGA------GRSVEQARGTGKNRAGLELPLGRPP--- 1497
Cdd:PHA03307  127 PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAalplssPEETARAPSSPPAEPPPSTPPAAASprp 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1498 -------SAPADKDDSKPGRTRGDGALQSLCLTTPTEEAVyCFYGNDSDEEPPAAAPTPTHRRTSAIPRALTRERLQGRK 1570
Cdd:PHA03307  207 prrsspiSASASSPAPAPGRSAADDAGASSSDSSSSESSG-CGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPA 285
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1571 EAPAPSKAAPSAPPPTRAQPSLIADETPPCYSLSSSASSLSEPEPPEPPSVSPRGQEPAVTKDPGPRGGRDSSPSPRAAE 1650
Cdd:PHA03307  286 SSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSP 365
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 967490223 1651 ELLQRCISSALPRRRPPVSGLRRRKPRATRLDERPAEGS 1689
Cdd:PHA03307  366 RKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDAT 404
PHA03378 PHA03378
EBNA-3B; Provisional
2047-2301 1.95e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.52  E-value: 1.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2047 GTSPRRGRPALPAVFLCSSRCEELRAAPRQAPARQRPPAARPGPGERPARRTSsespsrlPVRAPAARPETVKRYASLPh 2126
Cdd:PHA03378  670 GHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATG-------RARPPAAAPGRARPPAAAP- 741
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2127 iSVARRPDGTvpaapapADAARRSSDGEPRSLPRVAAPGTtwrrirdedvphilrstlpatalplrgSTPEDAPAGPPpr 2206
Cdd:PHA03378  742 -GRARPPAAA-------PGRARPPAAAPGRARPPAAAPGA---------------------------PTPQPPPQAPP-- 784
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2207 ktsdavvqteevaAPKTNSSTSPSlESREPPGAPASGQLSLlgsdvDGPSLAKAPISAPFVHEGLGVAVGGFPASRhgSP 2286
Cdd:PHA03378  785 -------------APQQRPRGAPT-PQPPPQAGPTSMQLMP-----RAAPGQQGPTKQILRQLLTGGVKRGRPSLK--KP 843
                         250
                  ....*....|....*
gi 967490223 2287 SRSARVPPFNYVPSP 2301
Cdd:PHA03378  844 AALERQAAAGPTPSP 858
PHA03378 PHA03378
EBNA-3B; Provisional
1756-1993 1.98e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.52  E-value: 1.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1756 PPKHRKGRQAGGEMGSARRPEKRGAASAKTSGSPRSPAGPEKPRGTQKTTPGVPAVLRGRTVIYVPSPAPRAQPKGTPGP 1835
Cdd:PHA03378  529 PPQPRAGRRAPCVYTEDLDIESDEPASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEP 608
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1836 RATpRKVAPpclaQSAAPAKVPSPGQQRS----------------RSLHRPGK---TSELGTLSQPPRSATPPARLAKTP 1896
Cdd:PHA03378  609 PTT-QSHIP----ETSAPRQWPMPLRPIPmrplrmqpitfnvlvfPTPHQPPQveiTPYKPTWTQIGHIPYQPSPTGANT 683
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1897 SSSSSQTSPASQPLPRkrplvtqAAGPLPGPGASPVPKTPARTLLAKQHKTQRSPVRIPFMQKPARRGPPPLARAVPEPG 1976
Cdd:PHA03378  684 MLPIQWAPGTMQPPPR-------APTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARP 756
                         250
                  ....*....|....*..
gi 967490223 1977 PRGRAGTEAGPGARGGR 1993
Cdd:PHA03378  757 PAAAPGRARPPAAAPGA 773
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1469-1649 2.21e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 43.30  E-value: 2.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1469 AGGAGRSVEQARGTGKNRAGLELPLG---RPPSAPADKDDSKPGRTRGDGALQSLCLTTPTEEAVYCFYGNDSDEEPPAA 1545
Cdd:PRK07003  367 APGGGVPARVAGAVPAPGARAAAAVGasaVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGD 446
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1546 APTPTHRRTSAIPRALTRERLQGRKEAPAPSKAAPS-APPPTRAQPSLIADETPPcyslsssaSSLSEPEPPEPPSVSPR 1624
Cdd:PRK07003  447 APVPAKANARASADSRCDERDAQPPADSGSASAPASdAPPDAAFEPAPRAAAPSA--------ATPAAVPDARAPAAASR 518
                         170       180
                  ....*....|....*....|....*
gi 967490223 1625 GQEPAVTKDPGPRGgrdSSPSPRAA 1649
Cdd:PRK07003  519 EDAPAAAAPPAPEA---RPPTPAAA 540
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1410-1432 2.33e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 37.36  E-value: 2.33e-03
                           10        20
                   ....*....|....*....|...
gi 967490223  1410 DDSCTDSAEGTPVNFSSAASLSD 1432
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1284-1305 2.45e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 37.36  E-value: 2.45e-03
                           10        20
                   ....*....|....*....|..
gi 967490223  1284 SVRFTVEKPDENFSCASSLSAL 1305
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
2076-2302 2.49e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.14  E-value: 2.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2076 QAPARQRPP--AARPGPGERPARRTSSESPSRLPVRAPAARPETVKRYASLP---HISVARRPDGTVPAAPAPADAARRS 2150
Cdd:PTZ00449  586 KHPKDPEEPkkPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPppqRPSSPERPEGPKIIKSPKPPKSPKP 665
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2151 S----------DGEPRSLPRVAAPGTTwrRIRDEDVPHILRSTLPATAlplrgSTPEDAPAGPPPRKTSDAVVQTEEVAA 2220
Cdd:PTZ00449  666 PfdpkfkekfyDDYLDAAAKSKETKTT--VVLDESFESILKETLPETP-----GTPFTTPRPLPPKLPRDEEFPFEPIGD 738
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2221 PkTNSSTSPSlESREPPgAPASGQLSLLGSDVDGPSLAKAPISAPFVHEGLGVAvgGFPASRHGSPSRSARVPPFNYVPS 2300
Cdd:PTZ00449  739 P-DAEQPDDI-EFFTPP-EEERTFFHETPADTPLPDILAEEFKEEDIHAETGEP--DEAMKRPDSPSEHEDKPPGDHPSL 813

                  ..
gi 967490223 2301 PM 2302
Cdd:PTZ00449  814 PK 815
ZapB pfam06005
Cell division protein ZapB; ZapB is a non-essential, abundant cell division factor that is ...
22-83 3.48e-03

Cell division protein ZapB; ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation.


Pssm-ID: 428718 [Multi-domain]  Cd Length: 71  Bit Score: 38.02  E-value: 3.48e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 967490223    22 ELKMASSVAPYEQLVRQVEALKAENSHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQ 83
Cdd:pfam06005   10 ETKIQAAVDTIALLQMENEELKEENEELKEEANELEEENQQLKQERNQWQERIRGLLGKLDE 71
PHA03247 PHA03247
large tegument protein UL36; Provisional
1207-1705 3.91e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1207 DPCSGQGSGTISPSELPDSP-GQTMPP----------------SRSKTPPLAPAPqgPPEASQFSLQWESYVKRFLDIAd 1269
Cdd:PHA03247 2505 DPDAPPAPSRLAPAILPDEPvGEPVHPrmltwirgleelasddAGDPPPPLPPAA--PPAAPDRSVPPPRPAPRPSEPA- 2581
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1270 CRERCRLPSELDAGSVRFTVEKPDENFSCASSLSALALHEHYVQQDVelrllPSACPERGGGTGGAGLHFAGHRRREEGP 1349
Cdd:PHA03247 2582 VTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPP-----PSPSPAANEPDPHPPPTVPPPERPRDDP 2656
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1350 APT-GSRPRGAADQElellRECLGAAVPARlrkvasalvPGRRALPVPVYMLV-----PAPARAQED-----DSCTDSAE 1418
Cdd:PHA03247 2657 APGrVSRPRRARRLG----RAAQASSPPQR---------PRRRAARPTVGSLTsladpPPPPPTPEPaphalVSATPLPP 2723
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1419 GTPVNFSSAASLSDETLQGPPRDQPGGPEGRQRPTGRPTSA--RQAVGHRHKAGGAGRSVEQARGTGKNRAGLELPLgrp 1496
Cdd:PHA03247 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS--- 2800
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1497 PSAPADKDDSKPGRTRGDGALQSLCLTTPTEEAVYcfygNDSDEEPPAAAPTPTHRRTSAIPRALTRERlqGRKEAPAPS 1576
Cdd:PHA03247 2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ----PTAPPPPPGPPPPSLPLGGSVAPGGDVRRR--PPSRSPAAK 2874
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1577 KAAPSAPPPTRAQPSLIADETPPCYSLSSSASSLSEPEPPEPPSVSPRGQEPAVTKDPGPRGGRDSSPSPRAAEELLQRC 1656
Cdd:PHA03247 2875 PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGE 2954
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 967490223 1657 ISSALPRR--------RPPVSGLRRRKPRATRLDERPAEGSREHGEEAAGSDRASDL 1705
Cdd:PHA03247 2955 PSGAVPQPwlgalvpgRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSL 3011
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1779-2125 5.71e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 5.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1779 GAASAKTSGSPRSPAGpEKPRGTQKTTPGVPAVLRgrtviyvpspAPRAQPKGTPGPRATPRKVAPPCLAQSAAPAKVPS 1858
Cdd:PRK07003  363 TGGGAPGGGVPARVAG-AVPAPGARAAAAVGASAV----------PAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPA 431
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1859 PGQQrsrslhrpgktselGTLSQPPRSATPPARLAKTPSSSSSQTSPASQPLPRKRPLVTQAAGPLPGPGASPVPKTPAr 1938
Cdd:PRK07003  432 PPAT--------------ADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRA- 496
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1939 tllAKQHKTQRSPVRIPFMQKPARRGPPPLARAVPEPGPRGRAGTEAGPGARGGrlglvrvasalssgsessdrsGFRRQ 2018
Cdd:PRK07003  497 ---AAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAG---------------------GAAAA 552
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 2019 LTFIKeSPGLRrrrselssaesaasapqgTSPRRGRPALPAvflcssrceelrAAPRQAPARQRPPAARPGPGERPARRT 2098
Cdd:PRK07003  553 LDVLR-NAGMR------------------VSSDRGARAAAA------------AKPAAAPAAAPKPAAPRVAVQVPTPRA 601
                         330       340
                  ....*....|....*....|....*..
gi 967490223 2099 SSESPSRLPVRAPAARPETVKRYASLP 2125
Cdd:PRK07003  602 RAATGDAPPNGAARAEQAAESRGAPPP 628
PHA03378 PHA03378
EBNA-3B; Provisional
1755-1977 7.71e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.59  E-value: 7.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1755 QPPKHRKGRQ--AGGEMGSARRPEK---RGAASAKTSGSPRSPAGPEKPRGTQKTTPGVPAVLRGRTVIYVPSPAPRAQP 1829
Cdd:PHA03378  715 QRPAAATGRArpPAAAPGRARPPAAapgRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAP 794
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967490223 1830 KGTPGPRA--TPRKVAPPclaqsaAPAKVPSPGQQRSRSLHRPGKTSELGTLSQPPRSATPPARLAKTPSSSSSQTSPAS 1907
Cdd:PHA03378  795 TPQPPPQAgpTSMQLMPR------AAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQ 868
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 967490223 1908 QPL---PRKRPLVTQAAGPLPGP-GASPVPKTPARTLLAKQHKTQRSPVRIPfmqkPARRGPpplARAVPEPGP 1977
Cdd:PHA03378  869 APVfypPVLQPIQVMRQLGSVRAaAASTVTQAPTEYTGERRGVGPMHPTDIP----PSKRAK---TDAYVESQP 935
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
549-586 9.47e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 35.89  E-value: 9.47e-03
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 967490223   549 KKVLREAGSVTALVQCvLRATKESTLKSVLSALWNLSA 586
Cdd:pfam00514    5 KQAVIEAGAVPPLVRL-LSSPDEEVQEEAAWALSNLAA 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
685-720 9.95e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 35.89  E-value: 9.95e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 967490223   685 ELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 720
Cdd:pfam00514    6 QAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH