|
Name |
Accession |
Description |
Interval |
E-value |
| WH2 super family |
cl41728 |
Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This ... |
963-988 |
8.91e-08 |
|
Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This family contains the Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2) as well as thymosin-beta (Tbeta; also called beta-thymosin or betaT) domains that are small, widespread intrinsically disordered actin-binding peptides displaying significant sequence variability and different regulations of actin self-assembly in motile and morphogenetic processes. These WH2/betaT peptides are identified by a central consensus actin-binding motif LKKT/V flanked by variable N-terminal and C-terminal extensions; the betaT shares a more extended and conserved C-terminal half than WH2. These single or repeated domains are found in actin-binding proteins (ABPs) such as the hematopoietic-specific protein WASP, its ubiquitously expressed ortholog neural-WASP (N-WASP), WASP-interacting protein (WAS/WASL-interacting protein family members 1 and 2), and WASP-family verprolin homologous protein (WAVE/SCAR) isoforms: WAVE1, WAVE2, and WAVE3. Also included are the WH2 domains found in inverted formin FH2 domain-containing protein (INF2), Cordon bleu (Cobl) protein, vasodilator-stimulated phosphoprotein (VASP) homology protein and actobindin (found in amoebae). These ABPs are commonly multidomain proteins that contain signaling domains and structurally conserved actin-binding motifs, the most important being the WH2 domain motif through which they bind actin in order to direct the location, rate, and timing for actin assembly in the cell into different structures, such as filopodia, lamellipodia, stress fibers, and focal adhesions. The WH2 domain motif is one of the most abundant actin-binding motifs in Wiskott-Aldrich syndrome proteins (WASPs) where they activate Arp2/3-dependent actin nucleation and branching in response to signals mediated by Rho-family GTPases. The thymosin beta (Tbeta) domains in metazoans act in cells as major actin-sequestering peptides; their complex with monomeric ATP-actin (G-ATP-actin) cannot polymerize at either filament (F-actin) end. The actual alignment was detected with superfamily member cd21801:
Pssm-ID: 425359 Cd Length: 26 Bit Score: 48.84 E-value: 8.91e-08
10 20
....*....|....*....|....*.
gi 1720399543 963 DPEHVRQSLLTAIRSGEAAAKLKRVT 988
Cdd:cd21801 1 NPEQARQALLEAIRSGEGAARLKKVP 26
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
646-917 |
1.54e-07 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 55.71 E-value: 1.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 646 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 725
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 726 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 792
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 793 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 872
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1720399543 873 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 917
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WH2_Wc_Cobl |
cd21801 |
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ... |
963-988 |
8.91e-08 |
|
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.
Pssm-ID: 409199 Cd Length: 26 Bit Score: 48.84 E-value: 8.91e-08
10 20
....*....|....*....|....*.
gi 1720399543 963 DPEHVRQSLLTAIRSGEAAAKLKRVT 988
Cdd:cd21801 1 NPEQARQALLEAIRSGEGAARLKKVP 26
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
646-917 |
1.54e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 55.71 E-value: 1.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 646 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 725
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 726 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 792
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 793 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 872
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1720399543 873 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 917
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
607-873 |
2.50e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.14 E-value: 2.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 607 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 684
Cdd:pfam03154 212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 685 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 764
Cdd:pfam03154 284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 765 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 842
Cdd:pfam03154 357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
|
250 260 270
....*....|....*....|....*....|.
gi 1720399543 843 ALPGSQGTSLNLKTLKTFGAPRPYSSSGPSP 873
Cdd:pfam03154 433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPFP 463
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
765-845 |
2.74e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 41.43 E-value: 2.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 765 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 844
Cdd:NF040983 86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163
|
.
gi 1720399543 845 P 845
Cdd:NF040983 164 P 164
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
715-845 |
8.30e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 40.14 E-value: 8.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 715 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 794
Cdd:NF033839 345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 1720399543 795 TLPA------ETSPPPVFPKPMTLP-AETSLPLVFPKPMTLRAETSPPPVAAKPVALP 845
Cdd:NF033839 420 VKPQpekpkpEVKPQPEKPKPEVKPqPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WH2_Wc_Cobl |
cd21801 |
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ... |
963-988 |
8.91e-08 |
|
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.
Pssm-ID: 409199 Cd Length: 26 Bit Score: 48.84 E-value: 8.91e-08
10 20
....*....|....*....|....*.
gi 1720399543 963 DPEHVRQSLLTAIRSGEAAAKLKRVT 988
Cdd:cd21801 1 NPEQARQALLEAIRSGEGAARLKKVP 26
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
646-917 |
1.54e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 55.71 E-value: 1.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 646 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 725
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 726 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 792
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 793 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 872
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1720399543 873 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 917
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
607-953 |
2.03e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 2.03e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 607 PATSKSSQQPQPdlKPKPSSGTERHLHRTLSSPTgtETNPPKAPRVTTDTGTIPFAPNLEDINNileskfrSRASNPQAK 686
Cdd:PHA03247 2562 AAPDRSVPPPRP--APRPSEPAVTSRARRPDAPP--QSARPRAPVDDRGDPRGPAPPSPLPPDT-------HAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 687 PSSfflqmqkRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPA 766
Cdd:PHA03247 2631 PSP-------AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 767 ETSPPPVAPKPMTlraetSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP-------PVAA 839
Cdd:PHA03247 2704 PPPTPEPAPHALV-----SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapappaaPAAG 2778
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 840 KPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSPFALAVVKRSQSFSK--ACPESASEGSSALPPAATQDekthtvNKPT 917
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplPPPTSAQPTAPPPPPGPPPP------SLPL 2852
|
330 340 350
....*....|....*....|....*....|....*.
gi 1720399543 918 VGSQHGDGDKQNNPVQNEHSSQVLTPADGPSFTLKR 953
Cdd:PHA03247 2853 GGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR 2888
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
676-907 |
5.12e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 50.64 E-value: 5.12e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 676 FRSRASNPQAKPSSfflqmqkRASGHYVTSAAAKSVHTAPGPAPKEPTiKEVQRDPQLSPEQHPSSLSERTHSAPLPNIS 755
Cdd:PRK12323 363 FRPGQSGGGAGPAT-------AAAAPVAQPAPAAAAPAAAAPAPAAPP-AAPAAAPAAAAAARAVAAAPARRSPAPEALA 434
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 756 KADDDIIQKPAETSPPPVAPKPMTLrAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETS------LPLVFPKPMTLR 829
Cdd:PRK12323 435 AARQASARGPGGAPAPAPAPAAAPA-AAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpppweeLPPEFASPAPAQ 513
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720399543 830 AETSPPPVAAKPVALPGSQGTSlnlktlktfgAPRPYSSSGPSPfALAVVKRSQSFSKACPESASEGSSALPPAATQD 907
Cdd:PRK12323 514 PDAAPAGWVAESIPDPATADPD----------DAFETLAPAPAA-APAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
607-902 |
9.92e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.86 E-value: 9.92e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 607 PATSKSSQQPQPDLKPKPSSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIpfapnledinnilESKFRSRASNPQAK 686
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-------------SRPRRARRLGRAAQ 2675
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 687 PSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQlspeqhPSSLSERTHSAPLPNISKADDDIIQKPA 766
Cdd:PHA03247 2676 ASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPL------PPGPAAARQASPALPAAPAPPAVPAGPA 2749
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 767 ETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFP------------KPMTLPAETSLPLVFPKPMTLRAETSP 834
Cdd:PHA03247 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlsesreslpspwDPADPPAAVLAPAAALPPAASPAGPLP 2829
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720399543 835 PPVAAKPVALPGSQGTSLNLKTLKTFGAP------RPYSSSGPSPFALAVVKRSQSFSKACPESASEgSSALPP 902
Cdd:PHA03247 2830 PPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE-SFALPP 2902
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
497-873 |
1.51e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 1.51e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 497 EKEPACTYGNNVPLSPVDGSNKNPAASylKNFPLYRQDSNPKPKPSNEITREYIPKIGMTTYKIVPPKSLEMAKDWESeA 576
Cdd:PHA03247 2586 ARRPDAPPQSARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-S 2662
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 577 MGRKDDQKMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSSgteRHLHRTLSSPTGTETNPPKAPRVTTDT 656
Cdd:PHA03247 2663 RPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP---HALVSATPLPPGPAAARQASPALPAAP 2739
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 657 GT--IPFAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLS 734
Cdd:PHA03247 2740 APpaVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP 2819
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 735 PEQHPSSLSerthsAPLPNISKADDDIIQKPAETSPPP---VAPK-PMTLRAETSPPPvfpkpmTLPAETSPPPVfpKPM 810
Cdd:PHA03247 2820 PAASPAGPL-----PPPTSAQPTAPPPPPGPPPPSLPLggsVAPGgDVRRRPPSRSPA------AKPAAPARPPV--RRL 2886
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720399543 811 TLPAETSLPLVFPKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSP 873
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
584-869 |
1.87e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 45.44 E-value: 1.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 584 KMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSsgterhlhrTLSSPTGTETNPPK--APR-VTTDTGTIP 660
Cdd:PHA03378 562 QLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPS---------QTPEPPTTQSHIPEtsAPRqWPMPLRPIP 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 661 FAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKepTIKEVQRDP-QLSPEQHP 739
Cdd:PHA03378 633 MRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPG--TMQPPPRAPtPMRPPAAP 710
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 740 SSLSERTHSAPLPNISKADDDIIQKPAETSPPPvAPKPMTLRAETSPPPVFPKPMTLPAETSPPPV-FPKPMTLPAETSL 818
Cdd:PHA03378 711 PGRAQRPAAATGRARPPAAAPGRARPPAAAPGR-ARPPAAAPGRARPPAAAPGRARPPAAAPGAPTpQPPPQAPPAPQQR 789
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1720399543 819 PLVFPKPMTlRAETSPPPVAAKPVALPGSQG-TSLNLKTLKTFGAPRPYSSS 869
Cdd:PHA03378 790 PRGAPTPQP-PPQAGPTSMQLMPRAAPGQQGpTKQILRQLLTGGVKRGRPSL 840
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
731-850 |
2.12e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 45.15 E-value: 2.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 731 PQLSPEQHPSSLSERTHSAPLPNiSKADDDIIQKPAETSPPPVAPKPMTlRAETSPPPVfPKPMTLPAETSPPPVFPKPM 810
Cdd:PRK14971 371 GGRGPKQHIKPVFTQPAAAPQPS-AAAAASPSPSQSSAAAQPSAPQSAT-QPAGTPPTV-SVDPPAAVPVNPPSTAPQAV 447
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 1720399543 811 TLPAETSlplvfPKPMTLRAETSPPPVAAKPVALPGSQGT 850
Cdd:PRK14971 448 RPAQFKE-----EKKIPVSKVSSLGPSTLRPIQEKAEQAT 482
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
607-873 |
2.50e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.14 E-value: 2.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 607 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 684
Cdd:pfam03154 212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 685 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 764
Cdd:pfam03154 284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 765 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 842
Cdd:pfam03154 357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
|
250 260 270
....*....|....*....|....*....|.
gi 1720399543 843 ALPGSQGTSLNLKTLKTFGAPRPYSSSGPSP 873
Cdd:pfam03154 433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPFP 463
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
599-873 |
3.69e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 44.69 E-value: 3.69e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 599 ETSMQT-EVPATSKSSQQPQPDLKPKPSSGTerhlhrtlssptgteTNPPKAPRVTTDTGTIPFAPNLEDINNILESKFr 677
Cdd:PRK10263 338 EPVTQTpPVASVDVPPAQPTVAWQPVPGPQT---------------GEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPV- 401
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 678 srasnPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEV-QRDPQLSPEQH-PSSLSERTHSAPLPnis 755
Cdd:PRK10263 402 -----QPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAwQAEEQQSTFAPqSTYQTEQTYQQPAA--- 473
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 756 kADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVF-----------------------PKPMTLPAETSPPPVFPKPMTL 812
Cdd:PRK10263 474 -QEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYyfeeveekrarereqlaawyqpiPEPVKEPEPIKSSLKAPSVAAV 552
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720399543 813 PAETSLPLVFPKPMTLRAETSPPPVAAKpVALPgsqgtslnLKTLKTFGAPRPYSSSGPSP 873
Cdd:PRK10263 553 PPVEAAAAVSPLASGVKKATLATGAAAT-VAAP--------VFSLANSGGPRPQVKEGIGP 604
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
764-853 |
1.10e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 42.87 E-value: 1.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 764 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPmtlPAETSLPLVFPKPMTLRAETSPPPVAAK--P 841
Cdd:PRK14950 370 KPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR---PVAPPVPHTPESAPKLTRAAIPVDEKPKytP 446
|
90
....*....|..
gi 1720399543 842 VALPGSQGTSLN 853
Cdd:PRK14950 447 PAPPKEEEKALI 458
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
634-835 |
1.47e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.56 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 634 RTLSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDiNNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHT 713
Cdd:PRK12323 376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAA-PAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAP 454
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 714 APGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVFPKP 793
Cdd:PRK12323 455 AAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADP 534
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1720399543 794 mTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP 835
Cdd:PRK12323 535 -DDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
765-845 |
2.74e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 41.43 E-value: 2.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 765 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 844
Cdd:NF040983 86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163
|
.
gi 1720399543 845 P 845
Cdd:NF040983 164 P 164
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
763-858 |
2.86e-03 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 41.80 E-value: 2.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 763 QKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPlvfpkpmtlrAETSPPPVAAKPV 842
Cdd:PRK12270 36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA----------AAAPAAPPAAAAA 105
|
90
....*....|....*.
gi 1720399543 843 ALPGSQGTSLNLKTLK 858
Cdd:PRK12270 106 AAPAAAAVEDEVTPLR 121
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
716-877 |
5.92e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 40.82 E-value: 5.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 716 GPAPKEPtikevqrdPQLSPEQHPSSLSERTH--SAPLPNISKADDDIIQKPAETSPPPVAPKPMtlRAETSPPPVFPKP 793
Cdd:PHA03378 648 FPTPHQP--------PQVEITPYKPTWTQIGHipYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM--RPPAAPPGRAQRP 717
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 794 MTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAE-------TSPPPVAA--KPVALPGSQGTSLNLKTLKTFGAPR 864
Cdd:PHA03378 718 AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARppaaapgRARPPAAApgAPTPQPPPQAPPAPQQRPRGAPTPQ 797
|
170
....*....|...
gi 1720399543 865 PYSSSGPSPFALA 877
Cdd:PHA03378 798 PPPQAGPTSMQLM 810
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
764-910 |
6.23e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 40.62 E-value: 6.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 764 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPM------TLPAETSLPLVFPKPMTLRAETSPPPV 837
Cdd:PRK07994 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASApqqapaVPLPETTSQLLAARQQLQRAQGATKAK 439
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720399543 838 AAKPVALPGSQGTSLNLKTLKTFgAPRPYSSSGPSPFALAVVKRSQSfskacPESASEGSSALPPAATQ---DEKT 910
Cdd:PRK07994 440 KSEPAAASRARPVNSALERLASV-RPAPSALEKAPAKKEAYRWKATN-----PVEVKKEPVATPKALKKaleHEKT 509
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
703-910 |
6.25e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 40.54 E-value: 6.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 703 VTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSL---SERTHSAPLPNISKADDDI--IQKPAETSPPPVAPKP 777
Cdd:PHA03307 56 VAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLapaSPAREGSPTPPGPSSPDPPppTPPPASPPPSPAPDLS 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 778 MTLRAETSPPPVfPKPMTLPAETSPPPVfpkpmTLPAETSLPLVFPKPMTLRAETSPPPVAAKPV---ALPGSQGTSLNL 854
Cdd:PHA03307 136 EMLRPVGSPGPP-PAASPPAAGASPAAV-----ASDAASSRQAALPLSSPEETARAPSSPPAEPPpstPPAAASPRPPRR 209
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 1720399543 855 KTLKTFGAPRPYSSSGPSPFALAVVKRSQSFSKACPESAS--EGSSALPPAATQDEKT 910
Cdd:PHA03307 210 SSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWgpENECPLPRPAPITLPT 267
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
607-854 |
7.07e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 40.69 E-value: 7.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 607 PATSKSSQQPQPDLKPKPSSGTERHLHRTlSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDINNILESKFRSRASNPQAK 686
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPA-ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP 2871
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 687 PSSfflqmqkrasghyVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPniskadddiiQKPA 766
Cdd:PHA03247 2872 AAK-------------PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP----------PPPQ 2928
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 767 ETSPPPVAPKPmtlraetsPPPVFPKPMTLPAeTSPPPVFPKPMT---LPAETSLPLVF---PKPMTLRAETSPPPVAAK 840
Cdd:PHA03247 2929 PQPPPPPPPRP--------QPPLAPTTDPAGA-GEPSGAVPQPWLgalVPGRVAVPRFRvpqPAPSREAPASSTPPLTGH 2999
|
250
....*....|....
gi 1720399543 841 PVALPGSQGTSLNL 854
Cdd:PHA03247 3000 SLSRVSSWASSLAL 3013
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
715-845 |
8.30e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 40.14 E-value: 8.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399543 715 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 794
Cdd:NF033839 345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 1720399543 795 TLPA------ETSPPPVFPKPMTLP-AETSLPLVFPKPMTLRAETSPPPVAAKPVALP 845
Cdd:NF033839 420 VKPQpekpkpEVKPQPEKPKPEVKPqPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
|
|
|