Conserved domains on  [gi|110225370|ref|NP_031488|]

adenomatous polyposis coli protein isoform 3 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
730-1017 1.03e-156

Pssm-ID: 435476  Cd Length: 293  Bit Score: 486.79  E-value: 1.03e-156
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   730 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQNLYGDYAFDANRHDDS-- 807
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   808 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLSAYHPTTENAGTSSKR-GLQITTTAAQ 884
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   885 IAKVMEEVSAIHTSQDDRSSASTTEFHCVADDRSAARRSSASHTHSNTYNFTKSENSNRTCSMPYAKVEYKRSSNDSLNS 964
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 110225370   965 VTSSDGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 1017
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
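Each alignment block pairs the query row (gi 110225370) with the domain model row (Cdd:...) column by column, with "-" marking gaps. As a rough sketch of how one such pair of rows could be summarized, the hypothetical helper below counts identical residues over ungapped columns. The function name and the choice to skip gapped columns are assumptions made for illustration; they are not part of the CD-Search output.

```python
def identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) counts for two equal-length aligned rows.

    Columns where either row has a gap ('-') are skipped, and residues are
    compared case-insensitively.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # gapped column: not counted as aligned
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


# Final row of the Arm_APC_u3 alignment (query 965-1017, model 241-293).
query_row = "VTSSDGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG"
model_row = "VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG"
same, total = identity(query_row, model_row)
print(f"{same}/{total} identical ({100 * same / total:.1f}%)")
```

Applying the helper to every row of a hit and summing the two counts would give a per-hit figure; it is a descriptive statistic only and does not replace the reported bit score or E-value.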
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
2223-2568 1.85e-95

Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 312.97  E-value: 1.85e-95
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2223 SISRGRTMIHIPGLRNSSSSTSPVSKKGPPLKTPASKSPSEGPGATTS-PRGTKPAGKSELSPITRQTSQISGSNKGSSR 2301
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2302 SGSRDSTPSRPTQQPLSRPMQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQLSQQNLTKQ 2381
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2382 ASLSKNASSI------PRSESASKGLNqmsngNGSNKKVELSRMSSTKSSGSESDRSerpALVRQSTFIKEAPSPTLRRK 2455
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2456 LEESASfESLSPSSRPDSPTRSQaqtpvlsPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDGRPTKRHDIARSHSES 2535
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 110225370  2536 PSRLPInRAGTWKREHSKHSSSLPRVSTWRRTG 2568
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding super family cl05480
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.
2670-2842 7.47e-76

The actual alignment was detected with superfamily member pfam05937:

Pssm-ID: 399141  Cd Length: 174  Bit Score: 249.91  E-value: 7.47e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2670 RSGRSPTGNTPPVIDSVSEKGSSSIKDSKDTHGKQSVGSGS-PVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIA 2748
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNvPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2749 ETAETCIAERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTDSTESSGAQ 2828
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 110225370  2829 SPKRHSGSYLVTSV 2842
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
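Every hit carries a statistics line of the same shape (Pssm-ID, an optional [Multi-domain] flag, Cd Length, Bit Score, E-value). A minimal sketch of turning that line into a record, assuming the field labels appear exactly as on this page; the names STATS and parse_stats are hypothetical and belong to no NCBI tool:

```python
import re

# Pattern for the per-hit statistics line as it appears on this page.
STATS = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_stats(line: str) -> dict:
    """Parse one statistics line into typed fields."""
    m = STATS.search(line)
    if m is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["multi"] is not None,
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


eb1 = parse_stats("Pssm-ID: 399141  Cd Length: 174  Bit Score: 249.91  E-value: 7.47e-76")
basic = parse_stats("Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 312.97  E-value: 1.85e-95")
print(eb1)
print(basic)
```

With the fields typed, the hits can be filtered by E-value or compared by Cd Length without re-reading the text.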
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
1034-1133 3.48e-54

Pssm-ID: 406923  Cd Length: 100  Bit Score: 184.72  E-value: 3.48e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  1034 LNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENTDDKHLKFQPHFGQQECVSPYRSRGTSGSETNRM 1113
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 110225370  1114 GSSHAINQNVNQSLCQEDDY 1133
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
391-464 2.40e-44

Pssm-ID: 465870  Cd Length: 74  Bit Score: 155.78  E-value: 2.40e-44
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 110225370   391 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 464
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP, pfam05924, and the fifth cysteine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
1744-1837 1.49e-41

Pssm-ID: 435479  Cd Length: 94  Bit Score: 148.45  E-value: 1.49e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  1744 IMDQVQQASSTSSGANKNQVDTKKKKPTSPVKPMPQNTEYRTRVRKNTDSKVNVNTEETFSDNKDSKKPSLQTNAKAFNE 1823
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 110225370  1824 KLPNNEDRVRGSFA 1837
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_u9 pfam16633
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
1281-1367 1.08e-30

Pssm-ID: 435478  Cd Length: 89  Bit Score: 117.28  E-value: 1.08e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  1281 ADDEI-GCDQTTQEADSANTLQTAEVKENDVTRSAEDPATEVPAVSQNARAKPSRLQASGLS-SESTRHnKAVEFSSGAK 1358
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRH-KAVEFSSGAK 80

                   ....*....
gi 110225370  1359 SPSKSGAQT 1367
Cdd:pfam16633   81 SPSKSGAQT 89
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth cysteine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
1871-1945 1.63e-30

Pssm-ID: 435480  Cd Length: 81  Bit Score: 116.50  E-value: 1.63e-30
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 110225370  1871 DLSREKAELRKGKESKDSEAKVTCRPEPNSSQQAASKSQASIKHPANRAQSKPVLQKQPTFPQSSKDGPDRGAAT 1945
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
1660-1713 6.42e-25

Pssm-ID: 406927  Cd Length: 54  Bit Score: 99.48  E-value: 6.42e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 110225370  1660 IESPPNELATGDGVRAGIQSGEFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDD 1713
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.
4-55 4.66e-22

Pssm-ID: 435517  Cd Length: 52  Bit Score: 91.20  E-value: 4.66e-22
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 110225370     4 ASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI 55
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
125-205 7.27e-22

Pssm-ID: 463275  Cd Length: 82  Bit Score: 91.93  E-value: 7.27e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   125 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRAAMEEQLG 203
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 110225370   204 TC 205
Cdd:pfam11414   81 LI 82
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis coli proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
1635-1658 2.42e-07

Pssm-ID: 461781  Cd Length: 24  Bit Score: 48.92  E-value: 2.42e-07
                           10        20
                   ....*....|....*....|....
gi 110225370  1635 DVPRVYCVEGTPINFSTATSLSDL 1658
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
647-687 4.65e-07

Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.65e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 110225370    647 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 687
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis coli proteins (APCs). This motif binds axin.
2032-2051 1.07e-05

Pssm-ID: 461782  Cd Length: 22  Bit Score: 44.12  E-value: 1.07e-05
                           10        20
                   ....*....|....*....|
gi 110225370  2032 DSEDDLLQECISSAMPKKKR 2051
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
510-551 5.30e-04

Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 5.30e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 110225370    510 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 551
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
689-729 7.00e-04

Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 7.00e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 110225370   689 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 729
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis coli proteins (APCs). This motif binds axin.
1714-1735 1.55e-03

Pssm-ID: 461782  Cd Length: 22  Bit Score: 37.96  E-value: 1.55e-03
                           10        20
                   ....*....|....*....|..
gi 110225370  1714 NKAEEGDILAECINSAMPKGKS 1735
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
457-508 5.00e-03

Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 37.02  E-value: 5.00e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 110225370    457 DEEHRHAMNELGGLQAIAELLQvdcemygltndHYSVTLRRYAGMALTNLTF 508
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK-----------SEDEEVVKEAAWALSNLSS 41
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
338-390 8.10e-03

Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 8.10e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 110225370    338 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 390
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis coli proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
1255-1272 8.47e-03

Pssm-ID: 461781  Cd Length: 24  Bit Score: 35.82  E-value: 8.47e-03
                           10
                   ....*....|....*...
gi 110225370  1255 ETIQTYCVEDTPICFSRC 1272
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
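The hits above are listed by increasing E-value, not by position on the protein. As an illustrative sketch, the intervals reported in the list can be sorted by start residue to read off the domain order from N- to C-terminus and to flag adjacent hits whose intervals overlap. The tuple layout and variable names are assumptions for this example; the names and coordinates are copied from the hit list.

```python
# (name, start, end) for each hit, in the order listed on the page.
hits = [
    ("Arm_APC_u3", 730, 1017), ("APC_basic", 2223, 2568),
    ("EB1_binding", 2670, 2842), ("APC_u5", 1034, 1133),
    ("APC_rep", 391, 464), ("APC_u14", 1744, 1837),
    ("APC_u9", 1281, 1367), ("APC_u15", 1871, 1945),
    ("APC_u13", 1660, 1713), ("APC_N_CC", 4, 55),
    ("Suppressor_APC", 125, 205), ("APC_r", 1635, 1658),
    ("ARM", 647, 687), ("SAMP", 2032, 2051),
    ("ARM", 510, 551), ("Arm", 689, 729),
    ("SAMP", 1714, 1735), ("ARM", 457, 508),
    ("ARM", 338, 390), ("APC_r", 1255, 1272),
]

# Order hits along the protein by their start residue.
ordered = sorted(hits, key=lambda h: h[1])

# Adjacent hits overlap when the next one starts at or before the previous end.
overlaps = [
    (prev[0], nxt[0])
    for prev, nxt in zip(ordered, ordered[1:])
    if nxt[1] <= prev[2]
]

for name, start, end in ordered:
    print(f"{start:>5}-{end:<5} {name}")
print("overlapping neighbours:", overlaps)
```

Sorting by position makes it easier to compare the hit list with the residue ranges quoted in the individual alignments.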
 


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.65e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 110225370    647 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 687
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
647-687 6.26e-07

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.83  E-value: 6.26e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 110225370   647 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 687
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
2201-2564 3.15e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 3.15e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2201 GKIRSNSEISSQMKQPLPTNMPSISRGrtmihiPGLRNSSSSTSPVSKKGPPLKTPASKSPSEGPGATTSP-----RGTK 2275
Cdd:PHA03247 2659 GRVSRPRRARRLGRAAQASSPPQRPRR------RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPgpaaaRQAS 2732
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2276 PAGKSELSPITRQTSQISGSNKGSSRSGSRDSTPSRPTqqPLSRPMQSPGRNSISPGRNGISPPNKLSQLPRTSSPSTAS 2355
Cdd:PHA03247 2733 PALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA--PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA 2810
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2356 TKSSGSGKMSYTSPGRQLSQQNLTKQASLSKNASSIPRSESAskglnqmsnGNGSNKKVELSRMSSTKSSGSESDRSERP 2435
Cdd:PHA03247 2811 VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL---------GGSVAPGGDVRRRPPSRSPAAKPAAPARP 2881
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2436 ALVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPDSPTRSQAQTPVLSPSLPDMSLSTHPSVQAggwrKLPPNLSPT 2514
Cdd:PHA03247 2882 PVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP----PLAPTTDPA 2950
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2515 ieyNDGRPTKRHDIARSHSESPSRLPINRAGTWK----RE------HSKHSSSLPRVSTW 2564
Cdd:PHA03247 2951 ---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPLTGHSLSRVSSW 3007
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
2032-2051 1.07e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 44.12  E-value: 1.07e-05
                           10        20
                   ....*....|....*....|
gi 110225370  2032 DSEDDLLQECISSAMPKKKR 2051
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
132-245 7.33e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 48.23  E-value: 7.33e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  132 LEELEKERSLLLADLDKEEKEKDWYY---------AQLQNLTKRIDSLPLTENFSLQTDMTRRQLEYEARQIRAAMEEQL 202
Cdd:COG4717   104 LEELEAELEELREELEKLEKLLQLLPlyqelealeAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELL 183
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 110225370  203 -GTCQDMEKRAQRRIARIQQIEKDILRVRQLLQSQAAEAERSSQ 245
Cdd:COG4717   184 eQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEE 227
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
510-551 5.30e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 5.30e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 110225370    510 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 551
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
510-550 6.04e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.04e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 110225370   510 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 550
Cdd:pfam00514    1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
132-241 6.88e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.44  E-value: 6.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   132 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSL------------PLTENFSLQTDMTRRQLEYEARQIRAAME 199
Cdd:TIGR02169  232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIeqlleelnkkikDLGEEEQLRVKEKIGELEAEIASLERSIA 311
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 110225370   200 EQLGTCQDMEKRAQRRIARIQQIEKDILRVRQLLQSQAAEAE 241
Cdd:TIGR02169  312 EKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRD 353
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
689-729 7.00e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 7.00e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 110225370   689 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 729
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1714-1735 1.55e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 37.96  E-value: 1.55e-03
                           10        20
                   ....*....|....*....|..
gi 110225370  1714 NKAEEGDILAECINSAMPKGKS 1735
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
457-508 5.00e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 37.02  E-value: 5.00e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 110225370    457 DEEHRHAMNELGGLQAIAELLQvdcemygltndHYSVTLRRYAGMALTNLTF 508
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK-----------SEDEEVVKEAAWALSNLSS 41
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
338-390 8.10e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 8.10e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 110225370    338 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 390
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1255-1272 8.47e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 35.82  E-value: 8.47e-03
                           10
                   ....*....|....*...
gi 110225370  1255 ETIQTYCVEDTPICFSRC 1272
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
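
The summary line and alignment rows of each hit above follow a regular layout, so they can be read programmatically. The following is a minimal sketch in Python, assuming the plain-text layout shown on this page; the function names (parse_summary, parse_alignment_row, model_coverage) are hypothetical helpers written for illustration and are not part of any NCBI tool. It extracts the Pssm-ID, Cd Length, Bit Score and E-value, and estimates how much of the domain model a hit covers.

```python
import re

# Layout assumed from the summary lines on this page, e.g.
# "Pssm-ID: 461781  Cd Length: 24  Bit Score: 35.82  E-value: 8.47e-03"
# The "[Multi-domain]" tag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>\S+)"
)

# Alignment rows look like "<label> <start> <residues> <end>", where the
# label is either "gi <number>" (query) or "Cdd:<accession>" (model).
ALIGN_RE = re.compile(r"^(?:gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")


def parse_summary(line):
    """Return the numeric fields of one hit summary line as a dict."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not a summary line: %r" % line)
    return {
        "pssm_id": int(match.group("pssm_id")),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def parse_alignment_row(line):
    """Return (start, residues, end) for a query or model alignment row."""
    match = ALIGN_RE.match(line.strip())
    if match is None:
        raise ValueError("not an alignment row: %r" % line)
    return int(match.group(1)), match.group(2), int(match.group(3))


def model_coverage(cdd_start, cdd_end, cd_length):
    """Fraction of the domain model spanned by the aligned region."""
    return (cdd_end - cdd_start + 1) / cd_length


if __name__ == "__main__":
    # Values copied from the last APC_r hit above (interval 1255-1272).
    hit = parse_summary(
        "Pssm-ID: 461781  Cd Length: 24  Bit Score: 35.82  E-value: 8.47e-03"
    )
    start, residues, end = parse_alignment_row(
        "Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18"
    )
    print(hit, model_coverage(start, end, hit["cd_length"]))
```

For the APC_r hit at 1255-1272, the model row runs from position 1 to 18 of a 24-residue model, so the hit covers 18/24 = 0.75 of the model. Partial coverage of this kind is one reason a short repeat can score a weak E-value even when the aligned residues match well.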
 
Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
730-1017 1.03e-156

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 486.79  E-value: 1.03e-156
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   730 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQNLYGDYAFDANRHDDS-- 807
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   808 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLSAYHPTTENAGTSSKR-GLQITTTAAQ 884
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   885 IAKVMEEVSAIHTSQDDRSSASTTEFHCVADDRSAARRSSASHTHSNTYNFTKSENSNRTCSMPYAKVEYKRSSNDSLNS 964
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 110225370   965 VTSSDGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 1017
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
2223-2568 1.85e-95

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 312.97  E-value: 1.85e-95
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2223 SISRGRTMIHIPGLRNSSSSTSPVSKKGPPLKTPASKSPSEGPGATTS-PRGTKPAGKSELSPITRQTSQISGSNKGSSR 2301
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2302 SGSRDSTPSRPTQQPLSRPMQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQLSQQNLTKQ 2381
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2382 ASLSKNASSI------PRSESASKGLNqmsngNGSNKKVELSRMSSTKSSGSESDRSerpALVRQSTFIKEAPSPTLRRK 2455
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2456 LEESASfESLSPSSRPDSPTRSQaqtpvlsPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDGRPTKRHDIARSHSES 2535
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 110225370  2536 PSRLPInRAGTWKREHSKHSSSLPRVSTWRRTG 2568
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding pfam05937
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ...
2670-2842 7.47e-76

EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.


Pssm-ID: 399141  Cd Length: 174  Bit Score: 249.91  E-value: 7.47e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2670 RSGRSPTGNTPPVIDSVSEKGSSSIKDSKDTHGKQSVGSGS-PVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIA 2748
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNvPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  2749 ETAETCIAERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTDSTESSGAQ 2828
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 110225370  2829 SPKRHSGSYLVTSV 2842
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ...
1034-1133 3.48e-54

Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406923  Cd Length: 100  Bit Score: 184.72  E-value: 3.48e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  1034 LNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENTDDKHLKFQPHFGQQECVSPYRSRGTSGSETNRM 1113
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 110225370  1114 GSSHAINQNVNQSLCQEDDY 1133
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
391-464 2.40e-44

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 155.78  E-value: 2.40e-44
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 110225370   391 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 464
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ...
1744-1837 1.49e-41

Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP, pfam05924, and the fifth cysteine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435479  Cd Length: 94  Bit Score: 148.45  E-value: 1.49e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  1744 IMDQVQQASSTSSGANKNQVDTKKKKPTSPVKPMPQNTEYRTRVRKNTDSKVNVNTEETFSDNKDSKKPSLQTNAKAFNE 1823
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 110225370  1824 KLPNNEDRVRGSFA 1837
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_u9 pfam16633
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of ...
1281-1367 1.08e-30

Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435478  Cd Length: 89  Bit Score: 117.28  E-value: 1.08e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  1281 ADDEI-GCDQTTQEADSANTLQTAEVKENDVTRSAEDPATEVPAVSQNARAKPSRLQASGLS-SESTRHnKAVEFSSGAK 1358
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRH-KAVEFSSGAK 80

                   ....*....
gi 110225370  1359 SPSKSGAQT 1367
Cdd:pfam16633   81 SPSKSGAQT 89
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ...
1871-1945 1.63e-30

Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth cysteine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435480  Cd Length: 81  Bit Score: 116.50  E-value: 1.63e-30
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 110225370  1871 DLSREKAELRKGKESKDSEAKVTCRPEPNSSQQAASKSQASIKHPANRAQSKPVLQKQPTFPQSSKDGPDRGAAT 1945
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ...
1660-1713 6.42e-25

Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406927  Cd Length: 54  Bit Score: 99.48  E-value: 6.42e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 110225370  1660 IESPPNELATGDGVRAGIQSGEFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDD 1713
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
4-55 4.66e-22

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 91.20  E-value: 4.66e-22
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 110225370     4 ASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI 55
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
125-205 7.27e-22

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 91.93  E-value: 7.27e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   125 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRAAMEEQLG 203
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 110225370   204 TC 205
Cdd:pfam11414   81 LI 82
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1635-1658 2.42e-07

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 48.92  E-value: 2.42e-07
                           10        20
                   ....*....|....*....|....
gi 110225370  1635 DVPRVYCVEGTPINFSTATSLSDL 1658
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
647-687 4.65e-07

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.65e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 110225370    647 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 687
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
647-687 6.26e-07

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.83  E-value: 6.26e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 110225370   647 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 687
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
2201-2564 3.15e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 3.15e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2201 GKIRSNSEISSQMKQPLPTNMPSISRGrtmihiPGLRNSSSSTSPVSKKGPPLKTPASKSPSEGPGATTSP-----RGTK 2275
Cdd:PHA03247 2659 GRVSRPRRARRLGRAAQASSPPQRPRR------RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPgpaaaRQAS 2732
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2276 PAGKSELSPITRQTSQISGSNKGSSRSGSRDSTPSRPTqqPLSRPMQSPGRNSISPGRNGISPPNKLSQLPRTSSPSTAS 2355
Cdd:PHA03247 2733 PALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA--PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA 2810
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2356 TKSSGSGKMSYTSPGRQLSQQNLTKQASLSKNASSIPRSESAskglnqmsnGNGSNKKVELSRMSSTKSSGSESDRSERP 2435
Cdd:PHA03247 2811 VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL---------GGSVAPGGDVRRRPPSRSPAAKPAAPARP 2881
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2436 ALVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPDSPTRSQAQTPVLSPSLPDMSLSTHPSVQAggwrKLPPNLSPT 2514
Cdd:PHA03247 2882 PVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP----PLAPTTDPA 2950
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2515 ieyNDGRPTKRHDIARSHSESPSRLPINRAGTWK----RE------HSKHSSSLPRVSTW 2564
Cdd:PHA03247 2951 ---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPLTGHSLSRVSSW 3007
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
2032-2051 1.07e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 44.12  E-value: 1.07e-05
                           10        20
                   ....*....|....*....|
gi 110225370  2032 DSEDDLLQECISSAMPKKKR 2051
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2249-2544 2.10e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 2.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2249 KGPPLKTPASKSPSEGPGATTSPRGTKPAGKSELSPITRQTSQISGSNKGSSRSGSRDSTPSR---PTQQPLSRPMQSPG 2325
Cdd:PHA03307  101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVAsdaASSRQAALPLSSPE 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2326 RNSISPGRNGISPPNKLSQLPRTSSP----STASTKSSGSGKMSYTSPGRQLSQQNLTKQASLSKNASSIPRSE----SA 2397
Cdd:PHA03307  181 ETARAPSSPPAEPPPSTPPAAASPRPprrsSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEcplpRP 260
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2398 SKGLNQMSNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQStfiKEAPSPTLRRKLEESASFESLSPSSRPDSPTRS 2477
Cdd:PHA03307  261 APITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG---SGPAPSSPRASSSSSSSRESSSSSTSSSSESSR 337
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 110225370 2478 QAQTPVLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEY---NDGRPT---KRHDIARSH--SESPSRLPINRA 2544
Cdd:PHA03307  338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSpaaSAGRPTrrrARAAVAGRArrRDATGRFPAGRP 412
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
132-245 7.33e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 48.23  E-value: 7.33e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  132 LEELEKERSLLLADLDKEEKEKDWYY---------AQLQNLTKRIDSLPLTENFSLQTDMTRRQLEYEARQIRAAMEEQL 202
Cdd:COG4717   104 LEELEAELEELREELEKLEKLLQLLPlyqelealeAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELL 183
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 110225370  203 -GTCQDMEKRAQRRIARIQQIEKDILRVRQLLQSQAAEAERSSQ 245
Cdd:COG4717   184 eQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEE 227
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
133-260 1.28e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 47.62  E-value: 1.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  133 EELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslQTDMTRRQLEYE-ARQIRAAMEEQLgtcQDMEKR 211
Cdd:COG1196   221 ELKELEAELLLLKLRELEAELEELEAELEELEAELEEL--------EAELAELEAELEeLRLELEELELEL---EEAQAE 289
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 110225370  212 AQRRIARIQQIEKDILRVRQLLQSQAAEAERSSQSRHDAASHEAGRQHE 260
Cdd:COG1196   290 EYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEE 338
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
510-551 5.30e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 5.30e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 110225370    510 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 551
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
510-550 6.04e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.04e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 110225370   510 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 550
Cdd:pfam00514    1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
132-241 6.88e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.44  E-value: 6.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   132 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSL------------PLTENFSLQTDMTRRQLEYEARQIRAAME 199
Cdd:TIGR02169  232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIeqlleelnkkikDLGEEEQLRVKEKIGELEAEIASLERSIA 311
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 110225370   200 EQLGTCQDMEKRAQRRIARIQQIEKDILRVRQLLQSQAAEAE 241
Cdd:TIGR02169  312 EKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRD 353
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
689-729 7.00e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 7.00e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 110225370   689 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 729
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2254-2545 1.00e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.00e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2254 KTPASKSPSEG---PGATTSPRGTKPAGKSELSPITRQTSQISGSNKGSSRSGSRDSTPSRPTQQPLSRPMQSPGRNSIS 2330
Cdd:PHA03307   59 AAACDRFEPPTgppPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEML 138
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2331 PGRNGISPPNKLSQLPRTSSPS-TASTKSSGSGKMSYTSPGRQLSQQNLTKQASLSKNASSIPRS---------ESASKG 2400
Cdd:PHA03307  139 RPVGSPGPPPAASPPAAGASPAaVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASprpprrsspISASAS 218
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2401 LNQMSNGNGSNKKVELSRMSSTKSSGSESDRSER-------PALVRQSTFIKEAPSPTLRRKL----EESASFESLSPSS 2469
Cdd:PHA03307  219 SPAPAPGRSAADDAGASSSDSSSSESSGCGWGPEnecplprPAPITLPTRIWEASGWNGPSSRpgpaSSSSSPRERSPSP 298
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2470 RPDSPTRSQAQTP-------------VLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDGRPTKR----HDIARSH 2532
Cdd:PHA03307  299 SPSSPGSGPAPSSprasssssssresSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKrprpSRAPSSP 378
                         330
                  ....*....|...
gi 110225370 2533 SESPSRLPINRAG 2545
Cdd:PHA03307  379 AASAGRPTRRRAR 391
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
2248-2489 1.21e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 44.68  E-value: 1.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2248 KKGP-----PLKTPASKSPSEGPGATTSPRGTKPAGKSElSPITRQtsqisgsnkgssrsgsRDSTPSRPTQQPLSRPMQ 2322
Cdd:PTZ00449  560 KPGPakehkPSKIPTLSKKPEFPKDPKHPKDPEEPKKPK-RPRSAQ----------------RPTRPKSPKLPELLDIPK 622
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2323 SPGRNSISPGRNGISPPNKLSQLPRTSSP-STASTKSSGSGKMSYTSPGRQLSQQNLTKQASLSKNA-SSIPRSESASKG 2400
Cdd:PTZ00449  623 SPKRPESPKSPKRPPPPQRPSSPERPEGPkIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETkTTVVLDESFESI 702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2401 LNQMSNGNGSNKKVE-------LSRMSSTKSSGSESDRSERPALVR-------QSTFIKEAPSPTLRRKL------EESA 2460
Cdd:PTZ00449  703 LKETLPETPGTPFTTprplppkLPRDEEFPFEPIGDPDAEQPDDIEfftppeeERTFFHETPADTPLPDIlaeefkEEDI 782
                         250       260       270
                  ....*....|....*....|....*....|..
gi 110225370 2461 SFESLSPSS---RPDSPTRSQAQTPVLSPSLP 2489
Cdd:PTZ00449  783 HAETGEPDEamkRPDSPSEHEDKPPGDHPSLP 814
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1714-1735 1.55e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 37.96  E-value: 1.55e-03
                           10        20
                   ....*....|....*....|..
gi 110225370  1714 NKAEEGDILAECINSAMPKGKS 1735
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
132-245 2.95e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.83  E-value: 2.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  132 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdmtrrqleyeARQIRAAmEEQLgtcQDMEKR 211
Cdd:COG4942    29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAAL--------------------ARRIRAL-EQEL---AALEAE 84
                          90       100       110
                  ....*....|....*....|....*....|....
gi 110225370  212 AQRRIARIQQIEKDILRVRQLLQSQAAEAERSSQ 245
Cdd:COG4942    85 LAELEKEIAELRAELEAQKEELAELLRALYRLGR 118
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
132-260 3.16e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 43.00  E-value: 3.16e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370  132 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdmtrRQLEYEARQIRAAMEEQLGTCQDMEKR 211
Cdd:COG1196   248 LEELEAELEELEAELAELEAELEELRLELEELELELEEA--------------QAEEYELLAELARLEQDIARLEERRRE 313
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 110225370  212 AQRRIARIQQIEKDILRVRQLLQSQAAEAERSSQSRHDAASHEAGRQHE 260
Cdd:COG1196   314 LEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
457-508 5.00e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 37.02  E-value: 5.00e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 110225370    457 DEEHRHAMNELGGLQAIAELLQvdcemygltndHYSVTLRRYAGMALTNLTF 508
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK-----------SEDEEVVKEAAWALSNLSS 41
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
132-260 6.65e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 42.26  E-value: 6.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370   132 LEELEKErsllLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfSLQTDMTRRQLEyEARQIRAAMEEQlgtcqdMEKR 211
Cdd:TIGR00618  224 LEKELKH----LREALQQTQQSHAYLTQKREAQEEQLKK------QQLLKQLRARIE-ELRAQEAVLEET------QERI 286
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 110225370   212 AQRR--------IARIQQIEKDILRVRQLLQSQAAEAERSSQSRHDAASHEAGRQHE 260
Cdd:TIGR00618  287 NRARkaaplaahIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQ 343
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2256-2400 7.29e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 7.29e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 110225370 2256 PASKSPSEGPGATTSPRGTKPA----GKSELSPITRQTSQISGSNKGSSRSGSRDSTPSRPTQQPLSRPMQSPGRNSISP 2331
Cdd:PHA03307  278 PSSRPGPASSSSSPRERSPSPSpsspGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 110225370 2332 GRNGISPPNKLSQlPRTSSPSTASTKSSGSGKMSytspGRQLSQQNLTKQASLSKNASSIPRSESASKG 2400
Cdd:PHA03307  358 PPADPSSPRKRPR-PSRAPSSPAASAGRPTRRRA----RAAVAGRARRRDATGRFPAGRPRPSPLDAGA 421
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
338-390 8.10e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 8.10e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 110225370    338 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 390
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1255-1272 8.47e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 35.82  E-value: 8.47e-03
                           10
                   ....*....|....*...
gi 110225370  1255 ETIQTYCVEDTPICFSRC 1272
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
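
The per-hit summary lines in this report (Pssm-ID, Cd Length, Bit Score, E-value) can be read programmatically. The following is a minimal sketch, assuming the line layout stays exactly as printed above. The names SUMMARY_RE, parse_summary and passes_threshold are hypothetical helpers, not part of any NCBI tool, and treating the 0.01 threshold as inclusive is an assumption.

```python
import re

# Layout of the per-hit summary line as printed in this report, e.g.
# "Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 2.10e-05"
# The bracketed flag is optional (hypothetical pattern, derived only from this page).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 461782  Cd Length: 22  Bit Score: 44.12  E-value: 1.07e-05"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Every hit listed on this page has an E-value below 0.01, so each would pass passes_threshold with the default argument.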

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.