|
Name |
Accession |
Description |
Interval |
E-value |
| Cobl |
pfam09469 |
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ... |
176-252 |
3.59e-41 |
|
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold. :
Pssm-ID: 462810 Cd Length: 79 Bit Score: 145.81 E-value: 3.59e-41
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755499890 176 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISR 252
Cdd:pfam09469 1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNR 77
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
906-1177 |
6.76e-08 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 6.76e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 906 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 985
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 986 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 1052
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1053 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1132
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 755499890 1133 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1177
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
|
|
| WH2 super family |
cl41728 |
Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This ... |
1223-1247 |
4.65e-07 |
|
Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This family contains the Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2) as well as thymosin-beta (Tbeta; also called beta-thymosin or betaT) domains that are small, widespread intrinsically disordered actin-binding peptides displaying significant sequence variability and different regulations of actin self-assembly in motile and morphogenetic processes. These WH2/betaT peptides are identified by a central consensus actin-binding motif LKKT/V flanked by variable N-terminal and C-terminal extensions; the betaT shares a more extended and conserved C-terminal half than WH2. These single or repeated domains are found in actin-binding proteins (ABPs) such as the hematopoietic-specific protein WASP, its ubiquitously expressed ortholog neural-WASP (N-WASP), WASP-interacting protein (WAS/WASL-interacting protein family members 1 and 2), and WASP-family verprolin homologous protein (WAVE/SCAR) isoforms: WAVE1, WAVE2, and WAVE3. Also included are the WH2 domains found in inverted formin FH2 domain-containing protein (INF2), Cordon bleu (Cobl) protein, vasodilator-stimulated phosphoprotein (VASP) homology protein and actobindin (found in amoebae). These ABPs are commonly multidomain proteins that contain signaling domains and structurally conserved actin-binding motifs, the most important being the WH2 domain motif through which they bind actin in order to direct the location, rate, and timing for actin assembly in the cell into different structures, such as filopodia, lamellipodia, stress fibers, and focal adhesions. The WH2 domain motif is one of the most abundant actin-binding motifs in Wiskott-Aldrich syndrome proteins (WASPs) where they activate Arp2/3-dependent actin nucleation and branching in response to signals mediated by Rho-family GTPases. The thymosin beta (Tbeta) domains in metazoans act in cells as major actin-sequestering peptides; their complex with monomeric ATP-actin (G-ATP-actin) cannot polymerize at either filament (F-actin) end. The actual alignment was detected with superfamily member cd21801:
Pssm-ID: 425359 Cd Length: 26 Bit Score: 46.91 E-value: 4.65e-07
10 20
....*....|....*....|....*
gi 755499890 1223 DPEHVRQSLLTAIRSGEAAAKLKRA 1247
Cdd:cd21801 1 NPEQARQALLEAIRSGEGAARLKKV 25
|
|
| RBD super family |
cl46342 |
Raf-like Ras-binding domain; |
92-158 |
9.58e-03 |
|
Raf-like Ras-binding domain; The actual alignment was detected with superfamily member pfam02196:
Pssm-ID: 460485 Cd Length: 69 Bit Score: 35.96 E-value: 9.58e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755499890 92 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 158
Cdd:pfam02196 2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Cobl |
pfam09469 |
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ... |
176-252 |
3.59e-41 |
|
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.
Pssm-ID: 462810 Cd Length: 79 Bit Score: 145.81 E-value: 3.59e-41
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755499890 176 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISR 252
Cdd:pfam09469 1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNR 77
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
906-1177 |
6.76e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 6.76e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 906 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 985
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 986 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 1052
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1053 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1132
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 755499890 1133 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1177
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
|
|
| WH2_Wc_Cobl |
cd21801 |
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ... |
1223-1247 |
4.65e-07 |
|
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.
Pssm-ID: 409199 Cd Length: 26 Bit Score: 46.91 E-value: 4.65e-07
10 20
....*....|....*....|....*
gi 755499890 1223 DPEHVRQSLLTAIRSGEAAAKLKRA 1247
Cdd:cd21801 1 NPEQARQALLEAIRSGEGAARLKKV 25
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
867-1164 |
3.52e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 3.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 867 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 944
Cdd:pfam03154 212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 945 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 1024
Cdd:pfam03154 284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1025 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 1102
Cdd:pfam03154 357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755499890 1103 ALPGSQGTSLNLKTLKTFGAPRPYSSSGP---SPFALAVVKRSQSFSKACPESASEGSSALPPAA 1164
Cdd:pfam03154 433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSS 497
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
1025-1105 |
1.28e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 42.58 E-value: 1.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1025 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 1104
Cdd:NF040983 86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163
|
.
gi 755499890 1105 P 1105
Cdd:NF040983 164 P 164
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
975-1105 |
3.60e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 41.68 E-value: 3.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 975 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 1054
Cdd:NF033839 345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 755499890 1055 TLPA------ETSPPPVFPKPMTLP-AETSLPLVFPKPMTLRAETSPPPVAAKPVALP 1105
Cdd:NF033839 420 VKPQpekpkpEVKPQPEKPKPEVKPqPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
1022-1107 |
8.33e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 40.27 E-value: 8.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1022 IQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVfPKPMTLRAETSPPPVAAKP 1101
Cdd:NF040983 89 VPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST-TTPTPSMHPIQPTQLPSIP 167
|
....*.
gi 755499890 1102 VALPGS 1107
Cdd:NF040983 168 NATPTS 173
|
|
| RBD |
pfam02196 |
Raf-like Ras-binding domain; |
92-158 |
9.58e-03 |
|
Raf-like Ras-binding domain;
Pssm-ID: 460485 Cd Length: 69 Bit Score: 35.96 E-value: 9.58e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755499890 92 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 158
Cdd:pfam02196 2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Cobl |
pfam09469 |
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ... |
176-252 |
3.59e-41 |
|
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.
Pssm-ID: 462810 Cd Length: 79 Bit Score: 145.81 E-value: 3.59e-41
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755499890 176 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISR 252
Cdd:pfam09469 1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNR 77
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
906-1177 |
6.76e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 6.76e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 906 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 985
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 986 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 1052
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1053 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1132
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 755499890 1133 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1177
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
|
|
| WH2_Wc_Cobl |
cd21801 |
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ... |
1223-1247 |
4.65e-07 |
|
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.
Pssm-ID: 409199 Cd Length: 26 Bit Score: 46.91 E-value: 4.65e-07
10 20
....*....|....*....|....*
gi 755499890 1223 DPEHVRQSLLTAIRSGEAAAKLKRA 1247
Cdd:cd21801 1 NPEQARQALLEAIRSGEGAARLKKV 25
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
867-1213 |
8.17e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 8.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 867 PATSKSSQQPQPdlKPKPSSGTERHLHRTLSSPTgtETNPPKAPRVTTDTGTIPFAPnledinnileSKFRSRASNPQAK 946
Cdd:PHA03247 2562 AAPDRSVPPPRP--APRPSEPAVTSRARRPDAPP--QSARPRAPVDDRGDPRGPAPP----------SPLPPDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 947 PSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPA 1026
Cdd:PHA03247 2628 PPS----PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1027 ETSPPPVAPKPMTlraetSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP-------PVAA 1099
Cdd:PHA03247 2704 PPPTPEPAPHALV-----SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapappaaPAAG 2778
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1100 KPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSPFALAVVKRSQSFSK--ACPESASEGSSALPPAATQDekthtvNKPT 1177
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplPPPTSAQPTAPPPPPGPPPP------SLPL 2852
|
330 340 350
....*....|....*....|....*....|....*.
gi 755499890 1178 VGSQHGDGDKQNNPVQNEHSSQVLTPADGPSFTLKR 1213
Cdd:PHA03247 2853 GGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR 2888
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
936-1167 |
3.73e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 51.42 E-value: 3.73e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 936 FRSRASNPQAKPSSfflqmqkRASGHYVTSAAAKSVHTAPGPAPKEPTiKEVQRDPQLSPEQHPSSLSERTHSAPLPNIS 1015
Cdd:PRK12323 363 FRPGQSGGGAGPAT-------AAAAPVAQPAPAAAAPAAAAPAPAAPP-AAPAAAPAAAAAARAVAAAPARRSPAPEALA 434
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1016 KADDDIIQKPAETSPPPVAPKPMTLrAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETS------LPLVFPKPMTLR 1089
Cdd:PRK12323 435 AARQASARGPGGAPAPAPAPAAAPA-AAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpppweeLPPEFASPAPAQ 513
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 755499890 1090 AETSPPPVAAKPVALPGSQGTSlnlktlktfgAPRPYSSSGPSPfALAVVKRSQSFSKACPESASEGSSALPPAATQD 1167
Cdd:PRK12323 514 PDAAPAGWVAESIPDPATADPD----------DAFETLAPAPAA-APAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
867-1164 |
3.52e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 3.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 867 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 944
Cdd:pfam03154 212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 945 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 1024
Cdd:pfam03154 284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1025 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 1102
Cdd:pfam03154 357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755499890 1103 ALPGSQGTSLNLKTLKTFGAPRPYSSSGP---SPFALAVVKRSQSFSKACPESASEGSSALPPAA 1164
Cdd:pfam03154 433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSS 497
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
844-1129 |
5.60e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 47.75 E-value: 5.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 844 KMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSsgterhlhrTLSSPTGTETNPPK--APR-VTTDTGTIP 920
Cdd:PHA03378 562 QLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPS---------QTPEPPTTQSHIPEtsAPRqWPMPLRPIP 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 921 FAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKepTIKEVQRDP-QLSPEQHP 999
Cdd:PHA03378 633 MRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPG--TMQPPPRAPtPMRPPAAP 710
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1000 SSLSERTHSAPLPNISKADDDIIQKPAETSPPPvAPKPMTLRAETSPPPVFPKPMTLPAETSPPPV-FPKPMTLPAETSL 1078
Cdd:PHA03378 711 PGRAQRPAAATGRARPPAAAPGRARPPAAAPGR-ARPPAAAPGRARPPAAAPGRARPPAAAPGAPTpQPPPQAPPAPQQR 789
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 755499890 1079 PLVFPKPMTlRAETSPPPVAAKPVALPGSQG-TSLNLKTLKTFGAPRPYSSS 1129
Cdd:PHA03378 790 PRGAPTPQP-PPQAGPTSMQLMPRAAPGQQGpTKQILRQLLTGGVKRGRPSL 840
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
867-1162 |
5.93e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.63 E-value: 5.93e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 867 PATSKSSQQPQPDLKPKPSSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIpfapnledinnilESKFRSRASNPQAK 946
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-------------SRPRRARRLGRAAQ 2675
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 947 PSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQlspeqhPSSLSERTHSAPLPNISKADDDIIQKPA 1026
Cdd:PHA03247 2676 ASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPL------PPGPAAARQASPALPAAPAPPAVPAGPA 2749
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1027 ETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFP------------KPMTLPAETSLPLVFPKPMTLRAETSP 1094
Cdd:PHA03247 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlsesreslpspwDPADPPAAVLAPAAALPPAASPAGPLP 2829
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 755499890 1095 PPVAAKPVALPGSQGTSLNLKTLKTFGAP------RPYSSSGPSPFALAVVKRSQSFSKACPESASEgSSALPP 1162
Cdd:PHA03247 2830 PPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE-SFALPP 2902
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
757-1133 |
7.64e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 7.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 757 EKEPACTYGNNVPLSPVDGSNKNPAASylKNFPLYRQDSNPKPKPSNEITREYIPKIGMTTYKIVPPKSLEMAKDWESeA 836
Cdd:PHA03247 2586 ARRPDAPPQSARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-S 2662
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 837 MGRKDDQKMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSSgteRHLHRTLSSPTGTETNPPKAPRVTTDT 916
Cdd:PHA03247 2663 RPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP---HALVSATPLPPGPAAARQASPALPAAP 2739
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 917 GT--IPFAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLS 994
Cdd:PHA03247 2740 APpaVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP 2819
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 995 PEQHPSSLSerthsAPLPNISKADDDIIQKPAETSPPP---VAPK-PMTLRAETSPPPvfpkpmTLPAETSPPPVfpKPM 1070
Cdd:PHA03247 2820 PAASPAGPL-----PPPTSAQPTAPPPPPGPPPPSLPLggsVAPGgDVRRRPPSRSPA------AKPAAPARPPV--RRL 2886
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755499890 1071 TLPAETSLPLVFPKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSP 1133
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
859-1133 |
1.75e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 46.23 E-value: 1.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 859 ETSMQT-EVPATSKSSQQPQPDLKPKPssgterhlhrtlssptGTETNPPK-APRVTTDTGTIPFAPNLEDINNILESKF 936
Cdd:PRK10263 338 EPVTQTpPVASVDVPPAQPTVAWQPVP----------------GPQTGEPViAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 937 rsrasnPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEV-QRDPQLSPEQH-PSSLSERTHSAPLPni 1014
Cdd:PRK10263 402 ------QPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAwQAEEQQSTFAPqSTYQTEQTYQQPAA-- 473
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1015 skADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVF-----------------------PKPMTLPAETSPPPVFPKPMT 1071
Cdd:PRK10263 474 --QEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYyfeeveekrarereqlaawyqpiPEPVKEPEPIKSSLKAPSVAA 551
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 755499890 1072 LPAETSLPLVFPKPMTLRAETSPPPVAAKpVALPgsqgtslnLKTLKTFGAPRPYSSSGPSP 1133
Cdd:PRK10263 552 VPPVEAAAAVSPLASGVKKATLATGAAAT-VAAP--------VFSLANSGGPRPQVKEGIGP 604
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
991-1110 |
2.47e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 45.54 E-value: 2.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 991 PQLSPEQHPSSLSERTHSAPLPNiSKADDDIIQKPAETSPPPVAPKPMTlRAETSPPPVfPKPMTLPAETSPPPVFPKPM 1070
Cdd:PRK14971 371 GGRGPKQHIKPVFTQPAAAPQPS-AAAAASPSPSQSSAAAQPSAPQSAT-QPAGTPPTV-SVDPPAAVPVNPPSTAPQAV 447
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 755499890 1071 TLPAETSlplvfPKPMTLRAETSPPPVAAKPVALPGSQGT 1110
Cdd:PRK14971 448 RPAQFKE-----EKKIPVSKVSSLGPSTLRPIQEKAEQAT 482
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
1024-1113 |
1.15e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 43.26 E-value: 1.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1024 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPmtlPAETSLPLVFPKPMTLRAETSPPPVAAK--P 1101
Cdd:PRK14950 370 KPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR---PVAPPVPHTPESAPKLTRAAIPVDEKPKytP 446
|
90
....*....|..
gi 755499890 1102 VALPGSQGTSLN 1113
Cdd:PRK14950 447 PAPPKEEEKALI 458
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
1025-1105 |
1.28e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 42.58 E-value: 1.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1025 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 1104
Cdd:NF040983 86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163
|
.
gi 755499890 1105 P 1105
Cdd:NF040983 164 P 164
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
894-1095 |
1.43e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.94 E-value: 1.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 894 RTLSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDiNNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHT 973
Cdd:PRK12323 376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAA-PAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAP 454
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 974 APGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVFPKP 1053
Cdd:PRK12323 455 AAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADP 534
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 755499890 1054 mTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP 1095
Cdd:PRK12323 535 -DDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
963-1170 |
2.14e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.47 E-value: 2.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 963 VTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSL---SERTHSAPLPNISKADDDI--IQKPAETSPPPVAPKP 1037
Cdd:PHA03307 56 VAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLapaSPAREGSPTPPGPSSPDPPppTPPPASPPPSPAPDLS 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1038 MTLRAETSPPPVfPKPMTLPAETSPPPVfpkpmTLPAETSLPLVFPKPMTLRAETSPPPVAAKPV---ALPGSQGTSLNL 1114
Cdd:PHA03307 136 EMLRPVGSPGPP-PAASPPAAGASPAAV-----ASDAASSRQAALPLSSPEETARAPSSPPAEPPpstPPAAASPRPPRR 209
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 755499890 1115 KTLKTFGAPRPYSSSGPSPFALAVVKRSQSFSKACPESAS--EGSSALPPAATQDEKT 1170
Cdd:PHA03307 210 SSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWgpENECPLPRPAPITLPT 267
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
976-1137 |
2.35e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.36 E-value: 2.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 976 GPAPKEPtikevqrdPQLSPEQHPSSLSERTH--SAPLPNISKADDDIIQKPAETSPPPVAPKPMtlRAETSPPPVFPKP 1053
Cdd:PHA03378 648 FPTPHQP--------PQVEITPYKPTWTQIGHipYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM--RPPAAPPGRAQRP 717
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1054 MTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAET-------SPPPVAA--KPVALPGSQGTSLNLKTLKTFGAPR 1124
Cdd:PHA03378 718 AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPpaaapgrARPPAAApgAPTPQPPPQAPPAPQQRPRGAPTPQ 797
|
170
....*....|...
gi 755499890 1125 PYSSSGPSPFALA 1137
Cdd:PHA03378 798 PPPQAGPTSMQLM 810
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
977-1210 |
2.43e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.45 E-value: 2.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 977 PAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTL 1056
Cdd:pfam03154 221 TQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPL 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1057 PAETSPPPVFPKPMT-LPAETSLPLVFPKPMTLRAETSPP---PVAAKPVALPgsqgtslNLKTLKTFGAPR-------- 1124
Cdd:pfam03154 301 TPQSSQSQVPPGPSPaAPGQSQQRIHTPPSQSQLQSQQPPreqPLPPAPLSMP-------HIKPPPTTPIPQlpnpqshk 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1125 -PYSSSGPSPFAL-------AVVKRSQSFSKACPESA--------SEGSSALPPAATQDEKTHTVNKPTVGSQH-GDGDK 1187
Cdd:pfam03154 374 hPPHLSGPSPFQMnsnlpppPALKPLSSLSTHHPPSAhppplqlmPQSQQLPPPPAQPPVLTQSQSLPPPAASHpPTSGL 453
|
250 260
....*....|....*....|...
gi 755499890 1188 QNNPVQNEHSSQVLTPADGPSFT 1210
Cdd:pfam03154 454 HQVPSQSPFPQHPFVPGGPPPIT 476
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
1023-1118 |
3.22e-03 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 41.80 E-value: 3.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1023 QKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPlvfpkpmtlrAETSPPPVAAKPV 1102
Cdd:PRK12270 36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA----------AAAPAAPPAAAAA 105
|
90
....*....|....*.
gi 755499890 1103 ALPGSQGTSLNLKTLK 1118
Cdd:PRK12270 106 AAPAAAAVEDEVTPLR 121
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
867-1127 |
3.52e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 3.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 867 PATSKSSQQPQPDLKPKPSSGTERHLHRTlSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDINNILESKFRSRASNPQAK 946
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPA-ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP 2871
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 947 PSSfflqmqkrasghyVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPniskadddiiQKPA 1026
Cdd:PHA03247 2872 AAK-------------PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP----------PPPQ 2928
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1027 ETSPPPVAPKPmtlraetsPPPVFPKPMTLPAeTSPPPVFPKP---MTLPAETSLPLVF---PKPMTLRAETSPPPVAAK 1100
Cdd:PHA03247 2929 PQPPPPPPPRP--------QPPLAPTTDPAGA-GEPSGAVPQPwlgALVPGRVAVPRFRvpqPAPSREAPASSTPPLTGH 2999
|
250 260
....*....|....*....|....*..
gi 755499890 1101 PVALPGSQGTSLnlkTLKTFGAPRPYS 1127
Cdd:PHA03247 3000 SLSRVSSWASSL---ALHEETDPPPVS 3023
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
975-1105 |
3.60e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 41.68 E-value: 3.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 975 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 1054
Cdd:NF033839 345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 755499890 1055 TLPA------ETSPPPVFPKPMTLP-AETSLPLVFPKPMTLRAETSPPPVAAKPVALP 1105
Cdd:NF033839 420 VKPQpekpkpEVKPQPEKPKPEVKPqPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
1024-1170 |
8.08e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 40.62 E-value: 8.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1024 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPM------TLPAETSLPLVFPKPMTLRAETSPPPV 1097
Cdd:PRK07994 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASApqqapaVPLPETTSQLLAARQQLQRAQGATKAK 439
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755499890 1098 AAKPVALPGSQGTSLNLKTLKTFgAPRPYSSSGPSPFALAVVKRSQSfskacPESASEGSSALPPAATQ---DEKT 1170
Cdd:PRK07994 440 KSEPAAASRARPVNSALERLASV-RPAPSALEKAPAKKEAYRWKATN-----PVEVKKEPVATPKALKKaleHEKT 509
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
1022-1107 |
8.33e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 40.27 E-value: 8.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755499890 1022 IQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVfPKPMTLRAETSPPPVAAKP 1101
Cdd:NF040983 89 VPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST-TTPTPSMHPIQPTQLPSIP 167
|
....*.
gi 755499890 1102 VALPGS 1107
Cdd:NF040983 168 NATPTS 173
|
|
| RBD |
pfam02196 |
Raf-like Ras-binding domain; |
92-158 |
9.58e-03 |
|
Raf-like Ras-binding domain;
Pssm-ID: 460485 Cd Length: 69 Bit Score: 35.96 E-value: 9.58e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755499890 92 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 158
Cdd:pfam02196 2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
|
|
|