Conserved domains on  [gi|1958765288|ref|XP_038962258|]

cordon-bleu protein-like 1 isoform X6 [Rattus norvegicus]

List of domain hits

Name Accession Description Interval E-value
Cobl pfam09469
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ...
176-254 1.18e-41

Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.


Pssm-ID: 462810  Cd Length: 79  Bit Score: 147.35  E-value: 1.18e-41
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958765288  176 EKTVRVVINFKKTQKTIVRVSPHSPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 254
Cdd:pfam09469    1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
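
As a rough cross-check on the pairwise block above, the number of identical columns can be counted directly. The sketch below is illustrative only: `percent_identity` is a hypothetical helper, not part of any NCBI tool, and it assumes that both rows are copied at equal length, that `-` marks a gap, and that gapped columns are left out of the denominator.

```python
def percent_identity(query: str, subject: str) -> tuple:
    """Return (identical, compared, percent) for two equal-length aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    compared = 0
    for q, s in zip(query, subject):
        # Assumption: a column with a gap in either row is not scored.
        if q == "-" or s == "-":
            continue
        compared += 1
        if q.upper() == s.upper():
            identical += 1
    percent = 100.0 * identical / compared if compared else 0.0
    return identical, compared, percent


# Rows copied from the pfam09469 alignment (query residues 176-254, model 1-79).
QUERY = "EKTVRVVINFKKTQKTIVRVSPHSPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES"
MODEL = "EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES"

identical, compared, percent = percent_identity(QUERY, MODEL)
print(f"{identical}/{compared} identical columns ({percent:.1f}%)")
```

Identity against a domain model's representative row is only a coarse indicator; the bit score and E-value reported by CD-Search come from the position-specific scoring matrix, not from this count.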
PHA03247 super family cl33720
large tegument protein UL36; Provisional
887-1162 1.51e-14

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.51e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  887 PPKPPRMTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPTPKEPTIK 966
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP 2631
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  967 EVQRDPQLPPEQCLSPLSERTHSAPLPNISKADD--NRIQKPAETSPPPVAPKPMALP---------AETSPPAVAPKPm 1035
Cdd:PHA03247  2632 SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRraRRLGRAAQASSPPQRPRRRAARptvgsltslADPPPPPPTPEP- 2710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1036 AFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEA---SPLPVAPKPMAFPAETSLPPVAP 1112
Cdd:PHA03247  2711 APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgppAPAPPAAPAAGPPRRLTRPAVAS 2790
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1113 KPMALPTEASPPPVAPKPLALPGSQGASLNLKTLKTFGAPRPYNSSAPSP 1162
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
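
The summary line that precedes each alignment has a regular layout, so it can be parsed mechanically, and the model coordinates at the ends of the `Cdd:` rows show how much of the domain model the hit actually spans. The following is a minimal sketch under stated assumptions: `parse_summary` and `model_coverage` are hypothetical names, and the regular expression only reflects the line format as it appears on this page.

```python
import re

# Pattern for lines such as:
# "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.51e-14"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line: str) -> dict:
    """Extract the numeric fields from one CD-Search summary line."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def model_coverage(model_from: int, model_to: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned region (inclusive ends)."""
    return (model_to - model_from + 1) / cd_length


hit = parse_summary(
    "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.51e-14"
)
# Model coordinates 2556 and 2840 are read from the Cdd:PHA03247 rows above.
coverage = model_coverage(2556, 2840, hit["cd_length"])
print(f"model coverage: {coverage:.1%}")
```

For this hit the aligned region spans only a small part of a 3151-position multi-domain model, which is worth keeping in mind when reading matches to proline-rich, low-complexity sequence.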
WH2 super family cl41728
Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This ...
1253-1278 4.97e-08

Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This family contains the Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2) as well as thymosin-beta (Tbeta; also called beta-thymosin or betaT) domains that are small, widespread intrinsically disordered actin-binding peptides displaying significant sequence variability and different regulations of actin self-assembly in motile and morphogenetic processes. These WH2/betaT peptides are identified by a central consensus actin-binding motif LKKT/V flanked by variable N-terminal and C-terminal extensions; the betaT shares a more extended and conserved C-terminal half than WH2. These single or repeated domains are found in actin-binding proteins (ABPs) such as the hematopoietic-specific protein WASP, its ubiquitously expressed ortholog neural-WASP (N-WASP), WASP-interacting protein (WAS/WASL-interacting protein family members 1 and 2), and WASP-family verprolin homologous protein (WAVE/SCAR) isoforms: WAVE1, WAVE2, and WAVE3. Also included are the WH2 domains found in inverted formin FH2 domain-containing protein (INF2), Cordon bleu (Cobl) protein, vasodilator-stimulated phosphoprotein (VASP) homology protein and actobindin (found in amoebae). These ABPs are commonly multidomain proteins that contain signaling domains and structurally conserved actin-binding motifs, the most important being the WH2 domain motif through which they bind actin in order to direct the location, rate, and timing for actin assembly in the cell into different structures, such as filopodia, lamellipodia, stress fibers, and focal adhesions. The WH2 domain motif is one of the most abundant actin-binding motifs in Wiskott-Aldrich syndrome proteins (WASPs) where they activate Arp2/3-dependent actin nucleation and branching in response to signals mediated by Rho-family GTPases. 
The thymosin beta (Tbeta) domains in metazoans act in cells as major actin-sequestering peptides; their complex with monomeric ATP-actin (G-ATP-actin) cannot polymerize at either filament (F-actin) end.


The actual alignment was detected with superfamily member cd21801:

Pssm-ID: 425359  Cd Length: 26  Bit Score: 50.00  E-value: 4.97e-08
                           10        20
                   ....*....|....*....|....*.
gi 1958765288 1253 DPEHIRQSLLTAIRSGEAAAKLKRVT 1278
Cdd:cd21801      1 NPEQARQALLEAIRSGEGAARLKKVP 26
RBD super family cl46342
Raf-like Ras-binding domain;
92-158 1.42e-03

Raf-like Ras-binding domain;


The actual alignment was detected with superfamily member pfam02196:

Pssm-ID: 460485  Cd Length: 69  Bit Score: 38.27  E-value: 1.42e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958765288   92 LSVVLPGDVLKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTINLLSAEENLIkfKPNTPIGMLEVEKV 158
Cdd:pfam02196    2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
 
Name Accession Description Interval E-value
Cobl pfam09469
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ...
176-254 1.18e-41

Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.


Pssm-ID: 462810  Cd Length: 79  Bit Score: 147.35  E-value: 1.18e-41
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958765288  176 EKTVRVVINFKKTQKTIVRVSPHSPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 254
Cdd:pfam09469    1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
PHA03247 PHA03247
large tegument protein UL36; Provisional
887-1162 1.51e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.51e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  887 PPKPPRMTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPTPKEPTIK 966
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP 2631
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  967 EVQRDPQLPPEQCLSPLSERTHSAPLPNISKADD--NRIQKPAETSPPPVAPKPMALP---------AETSPPAVAPKPm 1035
Cdd:PHA03247  2632 SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRraRRLGRAAQASSPPQRPRRRAARptvgsltslADPPPPPPTPEP- 2710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1036 AFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEA---SPLPVAPKPMAFPAETSLPPVAP 1112
Cdd:PHA03247  2711 APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgppAPAPPAAPAAGPPRRLTRPAVAS 2790
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1113 KPMALPTEASPPPVAPKPLALPGSQGASLNLKTLKTFGAPRPYNSSAPSP 1162
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
958-1134 1.52e-08

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 59.01  E-value: 1.52e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  958 PTPkEPTIKEVQRDPQLP-PEQCLSPLSERTHSAPLPNISKADdnrIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMA 1036
Cdd:NF033839   301 PSP-QPEKKEVKPEPETPkPEVKPQLEKPKPEVKPQPEKPKPE---VKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPEV 376
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1037 FPA-ETSLPPVSPKPMALPTEASSSPISPKP-MAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPA-ETSLPPVAPK 1113
Cdd:NF033839   377 KPQpETPKPEVKPQPEKPKPEVKPQPEKPKPeVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQpEKPKPEVKPQ 456
                          170       180
                   ....*....|....*....|.
gi 1958765288 1114 PMALPTEASPPPVAPKPLALP 1134
Cdd:NF033839   457 PETPKPEVKPQPEKPKPEVKP 477
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
914-1211 4.38e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.85  E-value: 4.38e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  914 ESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPTPKEPTIKEVQRDP-QLPPEQCLSPLSERTHSAPL 992
Cdd:pfam03154  155 ESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPAtSQPPNQTQSTAAPHTLIQQT 234
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  993 PNISKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAP-KPMAFPAETSLP----PVSPKPMALPTEASSS--PISPK 1065
Cdd:pfam03154  235 PTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQmPPMPHSLQTGPShmqhPVPPQPFPLTPQSSQSqvPPGPS 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1066 PMAPPAEAS---IPPVVPKPMAP-PAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGASL 1141
Cdd:pfam03154  315 PAAPGQSQQrihTPPSQSQLQSQqPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPP 394
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1142 NLKTLKTFGAPRPyNSSAPSPfaLAVVKRSQSFskaspespsedssaQPPAAIQDGKTQTVNQPTVGSQH 1211
Cdd:pfam03154  395 ALKPLSSLSTHHP-PSAHPPP--LQLMPQSQQL--------------PPPPAQPPVLTQSQSLPPPAASH 447
WH2_Wc_Cobl cd21801
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ...
1253-1278 4.97e-08

third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the third WH2 domain.


Pssm-ID: 409199  Cd Length: 26  Bit Score: 50.00  E-value: 4.97e-08
                           10        20
                   ....*....|....*....|....*.
gi 1958765288 1253 DPEHIRQSLLTAIRSGEAAAKLKRVT 1278
Cdd:cd21801      1 NPEQARQALLEAIRSGEGAARLKKVP 26
MXAN_5187_fam NF041620
MXAN_5187 family protein;
947-1112 8.57e-08

MXAN_5187 family protein;


Pssm-ID: 469504 [Multi-domain]  Cd Length: 597  Bit Score: 56.78  E-value: 8.57e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  947 AAAKSVHTAPGPTPkeptikEVQRDPQLPPEQcLSPLSERTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMALPAETS 1026
Cdd:NF041620   363 APAAILRAAPAAVA------EPAAKGEAAPRR-AADLEAVLGAIPAAPAPPAPSPPPAEPFAAPPPPPEPDPSEFAAPAP 435
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1027 PPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAE---ASIPPVVPKPMAPPA-EASPLPVAPKPMAFP 1102
Cdd:NF041620   436 TAPLPPAPPPRGAAFAFPDEPTAAYSLQQAADPPAAAAAQAPPPETtrvAAAPPPLLAASAPPTtAAPPAPPGAAAAAAA 515
                          170
                   ....*....|
gi 1958765288 1103 AETSLPPVAP 1112
Cdd:NF041620   516 AARAAAAVAL 525
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
958-1132 1.52e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 52.46  E-value: 1.52e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  958 PTPKEPTiKEVQRDPQLP-PEQCLSPLSERTHSAPLPNISKADdnrIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMA 1036
Cdd:NF033839   323 PQLEKPK-PEVKPQPEKPkPEVKPQLETPKPEVKPQPEKPKPE---VKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEV 398
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1037 FPA-ETSLPPVSPKPMALPTEASSSPISPKP-MAPPAEASIPPVVPKPMAPPAEASPLPVAPKPmafpaETSLPPVAPKP 1114
Cdd:NF033839   399 KPQpEKPKPEVKPQPEKPKPEVKPQPEKPKPeVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKP-----EVKPQPEKPKP 473
                          170
                   ....*....|....*...
gi 1958765288 1115 MALPTEASPPPVAPKPLA 1132
Cdd:NF033839   474 EVKPQPEKPKPDNSKPQA 491
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
956-1124 1.35e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 49.38  E-value: 1.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  956 PGPTPK-EPTIKEVQRDPQL---PPEQCLSPLSERTHSAPLPNISKADdnrIQKPAETSPPPVAPKPMALPAETSPPAVA 1031
Cdd:NF033839   328 PKPEVKpQPEKPKPEVKPQLetpKPEVKPQPEKPKPEVKPQPEKPKPE---VKPQPETPKPEVKPQPEKPKPEVKPQPEK 404
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1032 PKPMAFPA-ETSLPPVSPKPMALPTEASSSPISPKP-MAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAETSLPP 1109
Cdd:NF033839   405 PKPEVKPQpEKPKPEVKPQPEKPKPEVKPQPEKPKPeVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKP 484
                          170
                   ....*....|....*
gi 1958765288 1110 VAPKPMALPTEASPP 1124
Cdd:NF033839   485 DNSKPQADDKKPSTP 499
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
936-1134 2.02e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 49.00  E-value: 2.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  936 QKRASGHYVTSAAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQCLSPLSE------------RTHSAPLPNISKADDNRI 1003
Cdd:NF033839   163 QPENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKAKLAVATYMSKilddiqkhhlqkEKHRQIVALIKELDELKK 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1004 QKPAE--TSPPPVAPKPMALPAETSPPAVAPK-PMAFPAETSLPPVSPKPMALPTEASSSPISP-KPMAPPAEASIPPVV 1079
Cdd:NF033839   243 QALSEidNVNTKVEIENTVHKIFADMDAVVTKfKKGLTQDTPKEPGNKKPSAPKPGMQPSPQPEkKEVKPEPETPKPEVK 322
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1958765288 1080 PKPMAPPAEASPLPVAPKPMAFPA-ETSLPPVAPKPMALPTEASPPPVAPKPLALP 1134
Cdd:NF033839   323 PQLEKPKPEVKPQPEKPKPEVKPQlETPKPEVKPQPEKPKPEVKPQPEKPKPEVKP 378
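
The NF033839 model is reported four times above, each time over a largely overlapping stretch of the query (958-1134, 958-1132, 956-1124 and 936-1134). A quick way to see the combined footprint of such repeated hits is to merge the intervals. The sketch below is only an illustration: `merge_intervals` is a hypothetical helper, and it assumes inclusive residue coordinates, merging two intervals only when they share at least one residue.

```python
def merge_intervals(intervals):
    """Merge overlapping inclusive (start, end) intervals into a sorted list."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous interval: extend its end if needed.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# Query intervals of the four PspC_subgroup_2 (NF033839) hits listed above.
PSPC_HITS = [(958, 1134), (958, 1132), (956, 1124), (936, 1134)]

footprint = merge_intervals(PSPC_HITS)
print(footprint)
```

Collapsing the hits this way makes it clear that they all describe the same repetitive, proline-rich region of the query rather than four separate domains.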
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
983-1128 2.52e-04

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 44.76  E-value: 2.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  983 LSERTHSAPLP----NISKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMALPTEAS 1058
Cdd:NF040712   187 LIDPDFGRPLRplatVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAP 266
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1059 SSPisPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAP 1128
Cdd:NF040712   267 AAE--PDEATRDAGEPPAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASVP 334
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
1012-1139 4.52e-04

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 43.99  E-value: 4.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1012 PPVA--PKPMALPAETSPPAVAPKPmAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEA----SIPPVVPKPMAP 1085
Cdd:NF040712   197 RPLAtvPRLAREPADARPEEVEPAP-AAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEPvgpgAAPAAEPDEATR 275
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1086 PAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGA 1139
Cdd:NF040712   276 DAGEPPAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRR 329
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
1026-1108 7.85e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 43.45  E-value: 7.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1026 SPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPL-PVAPKPMAFPAE 1104
Cdd:NF041121    17 RAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAaPGAALPVRVPAP 96

                   ....
gi 1958765288 1105 TSLP 1108
Cdd:NF041121    97 PALP 100
COG5373 COG5373
Uncharacterized membrane protein [Function unknown];
1019-1092 1.11e-03

Uncharacterized membrane protein [Function unknown];


Pssm-ID: 444140 [Multi-domain]  Cd Length: 854  Bit Score: 43.45  E-value: 1.11e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1019 MALPAETSPPAVAPKPMAFPAETslpPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPL 1092
Cdd:COG5373     37 LAEAAEAASAPAEPEPEAAAAAT---AAAPEAAPAPVPEAPAAPPAAAEAPAPAAAAPPAEAEPAAAPAAASSF 107
RBD pfam02196
Raf-like Ras-binding domain;
92-158 1.42e-03

Raf-like Ras-binding domain;


Pssm-ID: 460485  Cd Length: 69  Bit Score: 38.27  E-value: 1.42e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958765288   92 LSVVLPGDVLKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTINLLSAEENLIkfKPNTPIGMLEVEKV 158
Cdd:pfam02196    2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
1044-1147 1.70e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 42.68  E-value: 1.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1044 PPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPmafpaetslPPVAPKPMALPTEASP 1123
Cdd:NF041121    19 AAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPP---------PPPPPGPAGAAPGAAL 89
                           90       100
                   ....*....|....*....|....
gi 1958765288 1124 PPVAPKPLALPGSQGASLNLKTLK 1147
Cdd:NF041121    90 PVRVPAPPALPNPLELARALRPLK 113
KLF10_11_N cd21974
N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily ...
967-1125 2.68e-03

N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily is composed of Kruppel-like factor or Krueppel-like factor (KLF) 10, KLF11, and similar proteins. KLF10 was first identified in human osteoblasts and plays a role in mediating estrogen (E2) signaling in bone and skeletal homeostasis and a regulatory role in tumor formation and metastasis. KLF11 is involved in cell growth, apoptosis, cellular inflammation and differentiation, endometriosis, and cholesterol, prostaglandin, neurotransmitter, fat, and sugar metabolism. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved α-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. KLF10/11 belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF10, KLF11, and similar proteins.


Pssm-ID: 409243 [Multi-domain]  Cd Length: 229  Bit Score: 41.07  E-value: 2.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  967 EVQRDPQLPPEQCLSP-----LSERTHSAPLPNIS--KADDNRIQKPAETSPPPVAPKPMALpaETS-------PPAVAP 1032
Cdd:cd21974     45 ESPKDFHSLSSLCMTPpysppFFEASHSPSVASLHppSAASSQPPPEPESSEPPAASPQRAQ--ATSvirhtadPVPVSP 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1033 KPMAFPaetSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMA--------PPAEASPLPVAPKPMAfPAE 1104
Cdd:cd21974    123 PPVLCQ---MLPVSSSSGVIVAFLKAPQQPSPQPQKPALPQPQVVLVGGQVPqgpvmlvvPQPAVPQPYVQPTVVT-PGG 198
                          170       180
                   ....*....|....*....|.
gi 1958765288 1105 TSLPPVAPKPMALPTEASPPP 1125
Cdd:cd21974    199 TKLLPIAPAPGFIPSGQSSAP 219
 
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1004-1197 1.08e-13

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 76.07  E-value: 1.08e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1004 QKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLP-----PVSPKPMALPTEASSSPISPKPMAPPAEAsiPPV 1078
Cdd:PRK12323   379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVaaapaRRSPAPEALAAARQASARGPGGAPAPAPA--PAA 456
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1079 VPKPMAPPAEASPLPVAPkpmafPAETSLPPVAPKPMALPTEASPPP--VAPKPLALPGSQGASLNLKTLKTFGAPRPYN 1156
Cdd:PRK12323   457 APAAAARPAAAGPRPVAA-----AAAAAPARAAPAAAPAPADDDPPPweELPPEFASPAPAQPDAAPAGWVAESIPDPAT 531
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1958765288 1157 SSAPSPFALAVVKRSQSFSKASPESPSEDSSAQPPAAIQDG 1197
Cdd:PRK12323   532 ADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
PHA03247 PHA03247
large tegument protein UL36; Provisional
859-1138 1.08e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 76.52  E-value: 1.08e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  859 PDRTPKPSSGTEHPLHRTVSSPVG--TEMNPPKPPRMTTDTGTIPFAPNLedinnileskfrSRASNPQAKPSSFFLQMQ 936
Cdd:PHA03247  2670 LGRAAQASSPPQRPRRRAARPTVGslTSLADPPPPPPTPEPAPHALVSAT------------PLPPGPAAARQASPALPA 2737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  937 KRASGHYVTSAAAKSVHTAPG--PTPKEPTIKEVQRDPQLPPEQCL-----SPLSERTHSAPLPNISKAddnriqKPAET 1009
Cdd:PHA03247  2738 APAPPAVPAGPATPGGPARPArpPTTAGPPAPAPPAAPAAGPPRRLtrpavASLSESRESLPSPWDPAD------PPAAV 2811
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1010 SPPPVAPKPMALPAETSPPAVAPKPMAFPaetslPPVSPKPMALPTEASSSPISPKPMAPPAEASiPPVVPKPMAPPAE- 1088
Cdd:PHA03247  2812 LAPAAALPPAASPAGPLPPPTSAQPTAPP-----PPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP-AAKPAAPARPPVRr 2885
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958765288 1089 --ASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQG 1138
Cdd:PHA03247  2886 laRPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP 2937
PHA03378 PHA03378
EBNA-3B; Provisional
957-1166 9.01e-12

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 70.10  E-value: 9.01e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  957 GPTPKEPtikevqrdPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQK--PAETSPPPVAPKPMalpaetSPPAVAPKP 1034
Cdd:PHA03378   648 FPTPHQP--------PQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQwaPGTMQPPPRAPTPM------RPPAAPPGR 713
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1035 MAFPAETSLPpvSPKPMALPTEASSSPISPKPMAPPA----EASIPPVVPKPMAPPAEASPLPV-APKPMAFPAETSLPP 1109
Cdd:PHA03378   714 AQRPAAATGR--ARPPAAAPGRARPPAAAPGRARPPAaapgRARPPAAAPGRARPPAAAPGAPTpQPPPQAPPAPQQRPR 791
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958765288 1110 VAPKPMAlPTEASPPPVAPKPLALPGSQGASLNLKTLKTFGAPRPYNSSAPSPFALA 1166
Cdd:PHA03378   792 GAPTPQP-PPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALE 847
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
917-1140 1.04e-11

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 69.52  E-value: 1.04e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  917 FRSRASNPQAKPSSfflqmqkRASGHYVTSAAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQCLSPLSERTHSAPLPnIS 996
Cdd:PRK12323   363 FRPGQSGGGAGPAT-------AAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEA-LA 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  997 KADDNRIQKPAETSPPpvAPKPMALPAETSPPAVAPkPMAFPAETSLPPVSPKPMALPTEASSSPiSPKPMAPPAEASIP 1076
Cdd:PRK12323   435 AARQASARGPGGAPAP--APAPAAAPAAAARPAAAG-PRPVAAAAAAAPARAAPAAAPAPADDDP-PPWEELPPEFASPA 510
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1077 PVVPKPMAPPAEASPLpvaPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGAS 1140
Cdd:PRK12323   511 PAQPDAAPAGWVAESI---PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASAS 571
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1013-1140 1.42e-11

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 68.97  E-value: 1.42e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1013 PVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPL 1092
Cdd:PRK14951   366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV 445
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1958765288 1093 PVAPKPMAfpaetslpPVAPKPMALPTEASPPPVAPKPLALPGSQGAS 1140
Cdd:PRK14951   446 ALAPAPPA--------QAAPETVAIPVRVAPEPAVASAAPAPAAAPAA 485
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1005-1133 1.65e-11

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 68.59  E-value: 1.65e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1005 KP--AETSPPPVAPKPMAlPAETSPPAVAPKPMAFPAETSLPP---VSPKPMALPTEASSSPISPKPMAPPAEAsiPPVV 1079
Cdd:PRK14951   365 KPaaAAEAAAPAEKKTPA-RPEAAAPAAAPVAQAAAAPAPAAApaaAASAPAAPPAAAPPAPVAAPAAAAPAAA--PAAA 441
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1080 PKPmAPPAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTeASPPPVAPKPLAL 1133
Cdd:PRK14951   442 PAA-VALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPA-AAPAAARLTPTEE 493
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
947-1193 1.03e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 60.18  E-value: 1.03e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  947 AAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMALPAETS 1026
Cdd:PHA03307    51 AAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSP 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1027 PPAVAPKPMAFPAETSLPPVSPKPMAL-PTEASSSPISPKPMAPPA---EASIPPVVPKPMAPPAEASPLPVAPKPMAFP 1102
Cdd:PHA03307   131 APDLSEMLRPVGSPGPPPAASPPAAGAsPAAVASDAASSRQAALPLsspEETARAPSSPPAEPPPSTPPAAASPRPPRRS 210
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1103 AETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGASLNLKTlkTFGAPRPYNSSAPSPFALAVVKRSQSFSKASPESP 1182
Cdd:PHA03307   211 SPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPEN--ECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS 288
                          250
                   ....*....|.
gi 1958765288 1183 SEDSSAQPPAA 1193
Cdd:PHA03307   289 SSPRERSPSPS 299
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
958-1134 1.52e-08

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 59.01  E-value: 1.52e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  958 PTPkEPTIKEVQRDPQLP-PEQCLSPLSERTHSAPLPNISKADdnrIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMA 1036
Cdd:NF033839   301 PSP-QPEKKEVKPEPETPkPEVKPQLEKPKPEVKPQPEKPKPE---VKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPEV 376
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1037 FPA-ETSLPPVSPKPMALPTEASSSPISPKP-MAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPA-ETSLPPVAPK 1113
Cdd:NF033839   377 KPQpETPKPEVKPQPEKPKPEVKPQPEKPKPeVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQpEKPKPEVKPQ 456
                          170       180
                   ....*....|....*....|.
gi 1958765288 1114 PMALPTEASPPPVAPKPLALP 1134
Cdd:NF033839   457 PETPKPEVKPQPEKPKPEVKP 477
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1023-1142 1.99e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 58.67  E-value: 1.99e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1023 AETSPPAVAPKPMAFpAETSLPPVSPKPMALPTEASSSPISPKPmAPPAEASIPPVVPKPMAPPAEASPLPvaPKPMAFP 1102
Cdd:PRK14950   341 LRTTSYGQLPLELAV-IEALLVPVPAPQPAKPTAAAPSPVRPTP-APSTRPKAAAAANIPPKEPVRETATP--PPVPPRP 416
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1958765288 1103 AETSLPPVAPKPMALPTEASPPPVAPK--PLALPGSQGASLN 1142
Cdd:PRK14950   417 VAPPVPHTPESAPKLTRAAIPVDEKPKytPPAPPKEEEKALI 458
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
914-1211 4.38e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.85  E-value: 4.38e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  914 ESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPTPKEPTIKEVQRDP-QLPPEQCLSPLSERTHSAPL 992
Cdd:pfam03154  155 ESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPAtSQPPNQTQSTAAPHTLIQQT 234
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  993 PNISKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAP-KPMAFPAETSLP----PVSPKPMALPTEASSS--PISPK 1065
Cdd:pfam03154  235 PTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQmPPMPHSLQTGPShmqhPVPPQPFPLTPQSSQSqvPPGPS 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1066 PMAPPAEAS---IPPVVPKPMAP-PAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGASL 1141
Cdd:pfam03154  315 PAAPGQSQQrihTPPSQSQLQSQqPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPP 394
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1142 NLKTLKTFGAPRPyNSSAPSPfaLAVVKRSQSFskaspespsedssaQPPAAIQDGKTQTVNQPTVGSQH 1211
Cdd:pfam03154  395 ALKPLSSLSTHHP-PSAHPPP--LQLMPQSQQL--------------PPPPAQPPVLTQSQSLPPPAASH 447
WH2_Wc_Cobl cd21801
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ...
1253-1278 4.97e-08

third Wiskott-Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the third WH2 domain.


Pssm-ID: 409199  Cd Length: 26  Bit Score: 50.00  E-value: 4.97e-08
                           10        20
                   ....*....|....*....|....*.
gi 1958765288 1253 DPEHIRQSLLTAIRSGEAAAKLKRVT 1278
Cdd:cd21801      1 NPEQARQALLEAIRSGEGAARLKKVP 26
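
The alignment blocks on this page pair a query row ("gi 1958765288") with a model row ("Cdd:..."). A minimal sketch of counting identical residues between two such rows follows. The function name `count_identities` is hypothetical. Skipping columns that contain a gap ("-") or a lowercase residue is an assumption about the display convention, in which lowercase letters mark unaligned residues.

```python
def count_identities(query_row, model_row):
    """Return (identical, aligned) column counts for two alignment rows.

    Columns with a gap or a lowercase residue in either row are treated
    as unaligned and skipped (an assumption about the display format).
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue  # unaligned column
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Rows copied from the WH2_Wc_Cobl (cd21801) alignment above.
wh2_identical, wh2_aligned = count_identities(
    "DPEHIRQSLLTAIRSGEAAAKLKRVT",
    "NPEQARQALLEAIRSGEGAARLKKVP",
)
wh2_percent_identity = 100.0 * wh2_identical / wh2_aligned
```

For hits that span several alignment blocks, the residue strings of each block would need to be concatenated per row before calling the function.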
MXAN_5187_fam NF041620
MXAN_5187 family protein;
947-1112 8.57e-08

MXAN_5187 family protein;


Pssm-ID: 469504 [Multi-domain]  Cd Length: 597  Bit Score: 56.78  E-value: 8.57e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  947 AAAKSVHTAPGPTPkeptikEVQRDPQLPPEQcLSPLSERTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMALPAETS 1026
Cdd:NF041620   363 APAAILRAAPAAVA------EPAAKGEAAPRR-AADLEAVLGAIPAAPAPPAPSPPPAEPFAAPPPPPEPDPSEFAAPAP 435
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1027 PPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAE---ASIPPVVPKPMAPPA-EASPLPVAPKPMAFP 1102
Cdd:NF041620   436 TAPLPPAPPPRGAAFAFPDEPTAAYSLQQAADPPAAAAAQAPPPETtrvAAAPPPLLAASAPPTtAAPPAPPGAAAAAAA 515
                          170
                   ....*....|
gi 1958765288 1103 AETSLPPVAP 1112
Cdd:NF041620   516 AARAAAAVAL 525
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1031-1166 1.47e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 55.88  E-value: 1.47e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1031 APKPMAfpAETSLPPVSPKPMALPTEASSSPispkpmAPPAEASIPPVvpkPMAPPAEASPLPVAPKPMAFPAETSLPPV 1110
Cdd:PRK14951   363 AFKPAA--AAEAAAPAEKKTPARPEAAAPAA------APVAQAAAAPA---PAAAPAAAASAPAAPPAAAPPAPVAAPAA 431
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1958765288 1111 A---PKPMALPTEASPPPVAPKPLAlpgSQGASLNLKTlktfgAPRPYNSSAPSPFALA 1166
Cdd:PRK14951   432 AapaAAPAAAPAAVALAPAPPAQAA---PETVAIPVRV-----APEPAVASAAPAPAAA 482
PHA03378 PHA03378
EBNA-3B; Provisional
954-1162 1.51e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 56.23  E-value: 1.51e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  954 TAPGPTPKEPTIKEVQRDPQLPPEQCLSPLSERTHSAPLP---------NISKADDNRIQKPAETSPPPVAPKPMAlPAE 1024
Cdd:PHA03378   588 SAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPlrpipmrplRMQPITFNVLVFPTPHQPPQVEITPYK-PTW 666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1025 TSPPAVAPKPMAFPAETSLPPvspkpMALPTEASSSPISPKPMAPPAEAsiPPVVPKPMAPPAEASPLPVAPKPMAFPAe 1104
Cdd:PHA03378   667 TQIGHIPYQPSPTGANTMLPI-----QWAPGTMQPPPRAPTPMRPPAAP--PGRAQRPAAATGRARPPAAAPGRARPPA- 738
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1958765288 1105 tSLPPVAPKPMALPTEASPPPVAPKPLALP-GSQGASLNLKTLKTFGAPRPYNSSAPSP 1162
Cdd:PHA03378   739 -AAPGRARPPAAAPGRARPPAAAPGRARPPaAAPGAPTPQPPPQAPPAPQQRPRGAPTP 796
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
972-1101 1.80e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 55.55  E-value: 1.80e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  972 PQLPPEQCLSPLSERTHSAPLPNiSKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAEtslPPVSPKPM 1051
Cdd:PRK14971   371 GGRGPKQHIKPVFTQPAAAPQPS-AAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVN---PPSTAPQA 446
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1958765288 1052 ALPteasSSPISPKPMAPPAEASIPPVVPKPMAPPAEA-----SPLPVAPKPMAF 1101
Cdd:PRK14971   447 VRP----AQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQatgniKEAPTGTQKEIF 497
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
1005-1130 2.08e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 55.26  E-value: 2.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1005 KPAETSPPPVAPKPMALPAETS--------PPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEAsiP 1076
Cdd:PRK07994   360 HPAAPLPEPEVPPQSAAPAASAqataaptaAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGAT--K 437
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1958765288 1077 PVVPKPmAPPAEASPLPVAPKPMAF--PAETSLPPVAPKPMALPTEASPPPVAPKP 1130
Cdd:PRK07994   438 AKKSEP-AAASRARPVNSALERLASvrPAPSALEKAPAKKEAYRWKATNPVEVKKE 492
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
850-1137 2.80e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 55.16  E-value: 2.80e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  850 TSKNSQQPQPDRTPKPSSGTEHPLHR---TVSSPVGTEMNPPKPPRMTTDTGTIPFAPNLEDINnileskfrSRASNPQA 926
Cdd:pfam03154  144 TSPSIPSPQDNESDSDSSAQQQILQTqppVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVP--------PQGSPATS 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  927 KPSSfflQMQKRASGHYVTSAAAkSVHTAPGPTPKEPTIKEVQRDPqlPPEQCLSPLSERTHSAPLPNISKAddnrIQKP 1006
Cdd:pfam03154  216 QPPN---QTQSTAAPHTLIQQTP-TLHPQRLPSPHPPLQPMTQPPP--PSQVSPQPLPQPSLHGQMPPMPHS----LQTG 285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1007 AETSPPPVAPKPMALPAETSPPAVAPKPM-AFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASI--PPVVPKPM 1083
Cdd:pfam03154  286 PSHMQHPVPPQPFPLTPQSSQSQVPPGPSpAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIkpPPTTPIPQ 365
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1958765288 1084 APPAEASPLP---VAPKPMAFPAETSlPPVAPKPMALPTEASPPPVAPKPLAL-PGSQ 1137
Cdd:pfam03154  366 LPNPQSHKHPphlSGPSPFQMNSNLP-PPPALKPLSSLSTHHPPSAHPPPLQLmPQSQ 422
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
989-1140 3.29e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 54.86  E-value: 3.29e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  989 SAPLPNISKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMA 1068
Cdd:PRK07003   383 PGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSR 462
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958765288 1069 PPAEASIPPVVPKPMAPPAEASPLPVA--PKPMAFPAETSLPPVAPKPMALPT----EASPPPVAPKPLALPGSQGAS 1140
Cdd:PRK07003   463 CDERDAQPPADSGSASAPASDAPPDAAfePAPRAAAPSAATPAAVPDARAPAAasreDAPAAAAPPAPEARPPTPAAA 540
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
955-1161 3.45e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 55.18  E-value: 3.45e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  955 APGPTPKEPTIKEVQRDPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAPKP 1034
Cdd:PHA03307    17 GGEFFPRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLA 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1035 MAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPmAFPAETSLPPVAPKP 1114
Cdd:PHA03307    97 PASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA-AVASDAASSRQAALP 175
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1958765288 1115 MALPTEASPPPVAPKPLALPGSQGASLnlktlkTFGAPRPYNSSAPS 1161
Cdd:PHA03307   176 LSSPEETARAPSSPPAEPPPSTPPAAA------SPRPPRRSSPISAS 216
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
938-1162 4.95e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 54.47  E-value: 4.95e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  938 RASGHYVTSAAAKSVHTAPGPTPKEPTikEVQRDPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQKPAETSPPPVAPK 1017
Cdd:PRK07003   381 PAPGARAAAAVGASAVPAVTAVTGAAG--AALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARAS 458
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1018 PMALPAETSP-PAVAPKPMAFPAETSlPPVSPKPMALPTEASSSPISPkpmaPPAEASIPPVVPKPMAPPAEASPLPVAP 1096
Cdd:PRK07003   459 ADSRCDERDAqPPADSGSASAPASDA-PPDAAFEPAPRAAAPSAATPA----AVPDARAPAAASREDAPAAAAPPAPEAR 533
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1097 KPMafPAETS----------------------------LPPVAPKPMALPTeASPPPVAPK---PLALPGSQGASLNLKT 1145
Cdd:PRK07003   534 PPT--PAAAApaaraggaaaaldvlrnagmrvssdrgaRAAAAAKPAAAPA-AAPKPAAPRvavQVPTPRARAATGDAPP 610
                          250
                   ....*....|....*..
gi 1958765288 1146 LKTFGAPRPYNSSAPSP 1162
Cdd:PRK07003   611 NGAARAEQAAESRGAPP 627
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
1006-1128 6.76e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 53.72  E-value: 6.76e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1006 PAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMALPteASSSPISPKPMAPPAEASIPPVVPKPMAP 1085
Cdd:PRK07994   385 AAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQ--GATKAKKSEPAAASRARPVNSALERLASV 462
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1086 PAEASPLPVAPK-----------PMAFPAETSLPPVAPKPmALPTEASPPPVAP 1128
Cdd:PRK07994   463 RPAPSALEKAPAkkeayrwkatnPVEVKKEPVATPKALKK-ALEHEKTPELAAK 515
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1023-1147 1.48e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 52.68  E-value: 1.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1023 AETSPPAVAPKPMAFPAETSLPPVSPKPMALPteASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVA-PKPMAF 1101
Cdd:PRK07764   390 GAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAA--APAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAaPSAQPA 467
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1958765288 1102 PAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGASLNLKTLK 1147
Cdd:PRK07764   468 PAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLR 513
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
958-1132 1.52e-06

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 52.46  E-value: 1.52e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  958 PTPKEPTiKEVQRDPQLP-PEQCLSPLSERTHSAPLPNISKADdnrIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMA 1036
Cdd:NF033839   323 PQLEKPK-PEVKPQPEKPkPEVKPQLETPKPEVKPQPEKPKPE---VKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEV 398
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1037 FPA-ETSLPPVSPKPMALPTEASSSPISPKP-MAPPAEASIPPVVPKPMAPPAEASPLPVAPKPmafpaETSLPPVAPKP 1114
Cdd:NF033839   399 KPQpEKPKPEVKPQPEKPKPEVKPQPEKPKPeVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKP-----EVKPQPEKPKP 473
                          170
                   ....*....|....*...
gi 1958765288 1115 MALPTEASPPPVAPKPLA 1132
Cdd:NF033839   474 EVKPQPEKPKPDNSKPQA 491
PHA03379 PHA03379
EBNA-3A; Provisional
919-1244 1.88e-06

Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 52.37  E-value: 1.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  919 SRASNPQAKPSSFFLQMQK----RASGhyVTSAAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQCLSPLSERTHSAPLPN 994
Cdd:PHA03379   370 SREGTKRKRPPIFLRRLHRlllmRAGK--LTERAREALEKASEPTYGTPRPPVEKPRPEVPQSLETATSHGSAQVPEPPP 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  995 ISKADDNRIQKPAETSPPPVAPKPmalpaetsppavaPKPmafpaetsLPPVSPKPMaLPTEASSSPISPKPMAPPAEAS 1074
Cdd:PHA03379   448 VHDLEPGPLHDQHSMAPCPVAQLP-------------PGP--------LQDLEPGDQ-LPGVVQDGRPACAPVPAPAGPI 505
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1075 IPPVVPKPMAPPAEAsPLPVAPKPMafpaetslpPVAPKPMalPTEASPPPVAPKP-LALPGSQGASLNLKTLKTFGAPR 1153
Cdd:PHA03379   506 VRPWEASLSQVPGVA-FAPVMPQPM---------PVEPVPV--PTVALERPVCPAPpLIAMQGPGETSGIVRVRERWRPA 573
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1154 PYNSSAPSPFALAVVKRSQSFSKASPESPSEDSSAQPPAAIQDGKTQTVNQPTVGSQHDGVDKQNKPVQNEHSSQRLtPA 1233
Cdd:PHA03379   574 PWTPNPPRSPSQMSVRDRLARLRAEAQPYQASVEVQPPQLTQVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGV-PA 652
                          330
                   ....*....|.
gi 1958765288 1234 DGPSSFTLQRQ 1244
Cdd:PHA03379   653 MQPQYFDLPLQ 663
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1002-1139 2.03e-06

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 52.30  E-value: 2.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1002 RIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSlPPVSPKPMALPTEAssspisPKPMAPPAEASIPPVVPK 1081
Cdd:PRK07764   380 RLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAA-AAPAPAAAPQPAPA------PAPAPAPPSPAGNAPAGG 452
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1958765288 1082 PMAPPAEASPlPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGA 1139
Cdd:PRK07764   453 APSPPPAAAP-SAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDA 509
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1006-1161 2.80e-06

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.77  E-value: 2.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1006 PAETSPPPVapkPMALPAetsPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPV------- 1078
Cdd:PRK07003   368 PGGGVPARV---AGAVPA---PGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPAtadrgdd 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1079 ---------VPKPMAPPAEASPLPVAPKPMAFPAETSLPPvAPKPMALPTEASPPPVAPKPLALPGSQGASlNLKTLKTF 1149
Cdd:PRK07003   442 aadgdapvpAKANARASADSRCDERDAQPPADSGSASAPA-SDAPPDAAFEPAPRAAAPSAATPAAVPDAR-APAAASRE 519
                          170
                   ....*....|..
gi 1958765288 1150 GAPRPYNSSAPS 1161
Cdd:PRK07003   520 DAPAAAAPPAPE 531
PRK10263 PRK10263
DNA translocase FtsK; Provisional
940-1126 3.02e-06

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 52.01  E-value: 3.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  940 SGHYVT--------SAAAKSVHTAPG----PTPKEPTIKEVQRDPQLPPEQCLSPlserthSAPLPNISKADDNRIQKPA 1007
Cdd:PRK10263   312 NGAPITepvavaaaATTATQSWAAPVepvtQTPPVASVDVPPAQPTVAWQPVPGP------QTGEPVIAPAPEGYPQQSQ 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1008 ETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPA 1087
Cdd:PRK10263   386 YAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1958765288 1088 EASPLPVAPKP--MAFPAETSLPPVAPKPMALPTEASPPPV 1126
Cdd:PRK10263   466 QTYQQPAAQEPlyQQPQPVEQQPVVEPEPVVEETKPARPPL 506
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
814-1210 3.68e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 3.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  814 DSEAAGKKDNQKTLAVAQKHAIETMTETAVQAEALVTSKNSQqpqPDRTPKPSSGTEHPLHRTVSSPVGTEMNPPKPPRM 893
Cdd:pfam03154  153 DNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQA---ATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHT 229
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  894 TTDTGTIPFAPNLEDINNILESKFRS---RASNPQAKPSSFfLQMQKRASGHYVTSAAAKSVH-TAPGPTPKEPTIKEVQ 969
Cdd:pfam03154  230 LIQQTPTLHPQRLPSPHPPLQPMTQPpppSQVSPQPLPQPS-LHGQMPPMPHSLQTGPSHMQHpVPPQPFPLTPQSSQSQ 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  970 RdPQLPPEQCLSPLSERTHSAPlpniSKADDNRIQKPAETsppPVAPKPMALPAETSPPAVAPKPMAFPAETSLPP--VS 1047
Cdd:pfam03154  309 V-PPGPSPAAPGQSQQRIHTPP----SQSQLQSQQPPREQ---PLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPhlSG 380
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1048 PKPMALPTEASSSPiSPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAP-KPMAFPAETSLPPVA---PKPMALPTEASP 1123
Cdd:pfam03154  381 PSPFQMNSNLPPPP-ALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPaQPPVLTQSQSLPPPAashPPTSGLHQVPSQ 459
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1124 PPVAPKPLaLPGSQGASLNLKTLKTFGAPR----------PYNSSAPSPFALAVVKRSQSFSKASPESPSEDSSAQPPAA 1193
Cdd:pfam03154  460 SPFPQHPF-VPGGPPPITPPSGPPTSTSSAmpgiqppssaSVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPR 538
                          410
                   ....*....|....*..
gi 1958765288 1194 IQDGKTQTVNQPTVGSQ 1210
Cdd:pfam03154  539 SPSPEPTVVNTPSHASQ 555
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
946-1098 5.26e-06

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 50.87  E-value: 5.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  946 SAAAKSVHTAPGPTPKEPTikevqrdpqlPPEQCLSPLSErTHSAPLPNISkaddnriqkPAETSPPPVAPKPMALPAET 1025
Cdd:PRK14951   367 AAAAEAAAPAEKKTPARPE----------AAAPAAAPVAQ-AAAAPAPAAA---------PAAAASAPAAPPAAAPPAPV 426
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958765288 1026 SPPAVAPKPmafpaetSLPPVSPKPMALPtEASSSPISPKPMAPPAEASIPPVVPKPMAPPAeASPLPVAPKP 1098
Cdd:PRK14951   427 AAPAAAAPA-------AAPAAAPAAVALA-PAPPAQAAPETVAIPVRVAPEPAVASAAPAPA-AAPAAARLTP 490
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1054-1204 7.21e-06

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 50.48  E-value: 7.21e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1054 PTEASSSPISPKPmAPPAEASIPPVVPKPMAPPAeASPLPVAPKPMAFPAETSLPPVAPKPMAlpteASPPPVAPKPLAL 1133
Cdd:PRK14951   366 PAAAAEAAAPAEK-KTPARPEAAAPAAAPVAQAA-AAPAPAAAPAAAASAPAAPPAAAPPAPV----AAPAAAAPAAAPA 439
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958765288 1134 PGSQGAslnlktlktfgAPRPYNSSAPSPFALAVVKRSQSfSKASPESPSEDSSAQPPAAIQDGKT-----QTVNQ 1204
Cdd:PRK14951   440 AAPAAV-----------ALAPAPPAQAAPETVAIPVRVAP-EPAVASAAPAPAAAPAAARLTPTEEgdvwhATVQQ 503
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
1004-1107 7.73e-06

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 50.66  E-value: 7.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1004 QKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSspisPKPMAPPAEASIPPVVPKPM 1083
Cdd:PRK12270    36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAA----AAAAPAAPPAAAAAAAPAAA 111
                           90       100
                   ....*....|....*....|....
gi 1958765288 1084 APPAEASPLPVAPKPMAFPAETSL 1107
Cdd:PRK12270   112 AVEDEVTPLRGAAAAVAKNMDASL 135
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
945-1140 7.82e-06

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.37  E-value: 7.82e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  945 TSAAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQclsplSERTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMALPAE 1024
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAA-----PAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD 664
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1025 TSPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAE 1104
Cdd:PRK07764   665 GGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPE 744
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1958765288 1105 TSLPPVAPKPmalPTEASPPPVAPKPLALPGSQGAS 1140
Cdd:PRK07764   745 PDDPPDPAGA---PAQPPPPPAPAPAAAPAAAPPPS 777
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1007-1133 7.84e-06

Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 50.16  E-value: 7.84e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1007 AETSPPPVAPKPMALPAETSP-----PAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPK 1081
Cdd:PRK14971   366 GDDASGGRGPKQHIKPVFTQPaaapqPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQ 445
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958765288 1082 PMAPPAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLAL 1133
Cdd:PRK14971   446 AVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQKEIF 497
PHA03378 PHA03378
EBNA-3B; Provisional
857-1133 7.89e-06

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 50.45  E-value: 7.89e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  857 PQPDRTPKPSSGTEHPLHRTV--SSPVGTEMNPPKPPRMTTDTGTIPFAPNLEDINNILESKFRSRASNPQAKPSsfflq 934
Cdd:PHA03378   600 PHPSQTPEPPTTQSHIPETSAprQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPY----- 674
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  935 mQKRASGHyvtsAAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQKPAETSPPPV 1014
Cdd:PHA03378   675 -QPSPTGA----NTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAA 749
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1015 APKPMalpaetSPPAVAPKPMAFPAETslpPVSPKPMalpteassspisPKPMAPPAEASIPPVVPKPMAPPaEASPLPV 1094
Cdd:PHA03378   750 APGRA------RPPAAAPGRARPPAAA---PGAPTPQ------------PPPQAPPAPQQRPRGAPTPQPPP-QAGPTSM 807
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1958765288 1095 APKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLAL 1133
Cdd:PHA03378   808 QLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAAL 846
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
946-1128 1.24e-05

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 1.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  946 SAAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQclsplserTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMalPAET 1025
Cdd:PRK07764   619 AAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPE--------HHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA--PAPA 688
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1026 SPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAET 1105
Cdd:PRK07764   689 APAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAA 768
                          170       180
                   ....*....|....*....|...
gi 1958765288 1106 slPPVAPKPmalPTEASPPPVAP 1128
Cdd:PRK07764   769 --APAAAPP---PSPPSEEEEMA 786
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1008-1117 1.31e-05

Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 49.42  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1008 ETSPPPVAPKPMALPAETSPPAVAPKpmafPAETSLPPVSPKPMALPTEASSSPISPKPMAP-PAEASIPPVVPKPMAPP 1086
Cdd:PRK14950   357 EALLVPVPAPQPAKPTAAAPSPVRPT----PAPSTRPKAAAAANIPPKEPVRETATPPPVPPrPVAPPVPHTPESAPKLT 432
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1958765288 1087 AEASPLPVAPKPMAFPAETSLPPVAPKPMAL 1117
Cdd:PRK14950   433 RAAIPVDEKPKYTPPAPPKEEEKALIADGDV 463
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
956-1124 1.35e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 49.38  E-value: 1.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  956 PGPTPK-EPTIKEVQRDPQL---PPEQCLSPLSERTHSAPLPNISKADdnrIQKPAETSPPPVAPKPMALPAETSPPAVA 1031
Cdd:NF033839   328 PKPEVKpQPEKPKPEVKPQLetpKPEVKPQPEKPKPEVKPQPEKPKPE---VKPQPETPKPEVKPQPEKPKPEVKPQPEK 404
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1032 PKPMAFPA-ETSLPPVSPKPMALPTEASSSPISPKP-MAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAETSLPP 1109
Cdd:NF033839   405 PKPEVKPQpEKPKPEVKPQPEKPKPEVKPQPEKPKPeVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKP 484
                          170
                   ....*....|....*
gi 1958765288 1110 VAPKPMALPTEASPP 1124
Cdd:NF033839   485 DNSKPQADDKKPSTP 499
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
955-1137 1.71e-05

Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 49.22  E-value: 1.71e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  955 APGPTPKEPTikevqrdPQLPPEQCLSPLSERTHSAPLPNISKADDNRI-----QKPAETSPPPVAPKPMALPAETSPPA 1029
Cdd:PRK12727    68 APAPAPQAPT-------KPAAPVHAPLKLSANANMSQRQRVASAAEDMIaamalRQPVSVPRQAPAAAPVRAASIPSPAA 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1030 VAPKPMAfpAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAP--------KPMAF 1101
Cdd:PRK12727   141 QALAHAA--AVRTAPRQEHALSAVPEQLFADFLTTAPVPRAPVQAPVVAAPAPVPAIAAALAAHAAYaqdddeqlDDDGF 218
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1958765288 1102 PAETSLPPVAPkPMALPT----EASPPPVAPKPLALPGSQ 1137
Cdd:PRK12727   219 DLDDALPQILP-PAALPPivvaPAAPAALAAVAAAAPAPQ 257
PRK10263 PRK10263
DNA translocase FtsK; Provisional
981-1130 1.82e-05

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 49.31  E-value: 1.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  981 SPLSERTHSAPLPniskADDNRIQKPA---ETSPPPVAPKPMALPA-ETSPPA-VAPKPMAFPAETSLPPVSPKPM---- 1051
Cdd:PRK10263   335 APVEPVTQTPPVA----SVDVPPAQPTvawQPVPGPQTGEPVIAPApEGYPQQsQYAQPAVQYNEPLQQPVQPQQPyyap 410
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1052 ALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPP---AEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAP 1128
Cdd:PRK10263   411 AAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNawqAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVV 490

                   ..
gi 1958765288 1129 KP 1130
Cdd:PRK10263   491 EP 492
PHA03378 PHA03378
EBNA-3B; Provisional
958-1275 2.00e-05

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 49.30  E-value: 2.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  958 PTPKEPTIKEVQRDPQLPPEQcLSPLSERThSAPLPNISKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAPKPmaf 1037
Cdd:PHA03378   553 PASTEPVHDQLLPAPGLGPLQ-IQPLTSPT-TSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMP--- 627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1038 paetsLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPM----AFPAETSLPPVAPK 1113
Cdd:PHA03378   628 -----LRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMlpiqWAPGTMQPPPRAPT 702
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1114 PMalpteaSPPPVAPKPLALPGSQGASLNLKTLKTFGAPRPYNSSAPSPFALAVVKRSQSFSKASPESpsedssaqPPAA 1193
Cdd:PHA03378   703 PM------RPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRA--------RPPA 768
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1194 IQDGKTQTVNQPTVGSQhdgvdKQNKPVQNEHSSQRltPADGPSSFTLQRQSSLNFQSSDPEHIRQSLLTAIRSGEAAAK 1273
Cdd:PHA03378   769 AAPGAPTPQPPPQAPPA-----PQQRPRGAPTPQPP--PQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLK 841

                   ..
gi 1958765288 1274 LK 1275
Cdd:PHA03378   842 KP 843
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
936-1134 2.02e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 49.00  E-value: 2.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  936 QKRASGHYVTSAAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQCLSPLSE------------RTHSAPLPNISKADDNRI 1003
Cdd:NF033839   163 QPENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKAKLAVATYMSKilddiqkhhlqkEKHRQIVALIKELDELKK 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1004 QKPAE--TSPPPVAPKPMALPAETSPPAVAPK-PMAFPAETSLPPVSPKPMALPTEASSSPISP-KPMAPPAEASIPPVV 1079
Cdd:NF033839   243 QALSEidNVNTKVEIENTVHKIFADMDAVVTKfKKGLTQDTPKEPGNKKPSAPKPGMQPSPQPEkKEVKPEPETPKPEVK 322
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1958765288 1080 PKPMAPPAEASPLPVAPKPMAFPA-ETSLPPVAPKPMALPTEASPPPVAPKPLALP 1134
Cdd:NF033839   323 PQLEKPKPEVKPQPEKPKPEVKPQlETPKPEVKPQPEKPKPEVKPQPEKPKPEVKP 378
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
946-1154 2.80e-05

Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 48.38  E-value: 2.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  946 SAAAKSVHTAPG-PTPKEPTIKEVQRDPQLPPEQCLSPLSERTH-------SAPLPNISKADDNR------IQKPAETSP 1011
Cdd:PLN03209   333 SDAADGPKPVPTkPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAyedlkppTSPIPTPPSSSPASsksvdaVAKPAEPDV 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1012 PPVAPKPMALPAetSPPAVAPKPMAFPA------ETSLPPVSPKPMA----LPTEASSSPISPKPMAPPAEASIPPVVPk 1081
Cdd:PLN03209   413 VPSPGSASNVPE--VEPAQVEAKKTRPLspyaryEDLKPPTSPSPTAptgvSPSVSSTSSVPAVPDTAPATAATDAAAP- 489
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958765288 1082 pmaPPAEASPLPVAPKPMAFPAETSLPPVAPKPMALP--TEASPPPVAPKPLALPGSQGASLNLKtlktfgaPRP 1154
Cdd:PLN03209   490 ---PPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPssTNEVVKVGNSAPPTALADEQHHAQPK-------PRP 554
rne PRK10811
ribonuclease E; Reviewed
1006-1138 4.39e-05

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 48.11  E-value: 4.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1006 PAETSPPPVAPKPMALPAETspPAVAPKPMAFPAETSLPPVS---PKPMALPTEASSSPISP--KPMAPPAEASIPPVVP 1080
Cdd:PRK10811   874 PVAAAVEPVVSAPVVEAVAE--VVEEPVVVAEPQPEEVVVVEtthPEVIAAPVTEQPQVITEsdVAVAQEVAEHAEPVVE 951
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1958765288 1081 KPMAPPAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQG 1138
Cdd:PRK10811   952 PQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVAPAQVPEA 1009
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
1035-1132 4.97e-05

Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 47.55  E-value: 4.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1035 MAF-PAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPM--AFPAETSLPPVA 1111
Cdd:PRK07994   357 LAFhPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLlaARQQLQRAQGAT 436
                           90       100
                   ....*....|....*....|.
gi 1958765288 1112 PKPMALPTEASPPPVAPKPLA 1132
Cdd:PRK07994   437 KAKKSEPAAASRARPVNSALE 457
PRK10263 PRK10263
DNA translocase FtsK; Provisional
953-1154 4.98e-05

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.77  E-value: 4.98e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  953 HTAPGPTPKEPTI--------------------KEVQRDPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQKPAETSPP 1012
Cdd:PRK10263   361 QPVPGPQTGEPVIapapegypqqsqyaqpavqyNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQ 440
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1013 PVAPKPMALPAETSPpaVAPKPMAFPAETSLPPVSPKP--MALPTEASSSPISPKPMAPPAEASIPPV------------ 1078
Cdd:PRK10263   441 PVAGNAWQAEEQQST--FAPQSTYQTEQTYQQPAAQEPlyQQPQPVEQQPVVEPEPVVEETKPARPPLyyfeeveekrar 518
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1079 -----------VPKPMAPPAEASPLPVAPKPMAFPAETSLPPVAP-----KPMALPTEASPPPVAPkplalpgsqgasln 1142
Cdd:PRK10263   519 ereqlaawyqpIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPlasgvKKATLATGAAATVAAP-------------- 584
                          250
                   ....*....|..
gi 1958765288 1143 LKTLKTFGAPRP 1154
Cdd:PRK10263   585 VFSLANSGGPRP 596
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1008-1134 6.10e-05

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.77  E-value: 6.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1008 ETSPPPVAPKPMALPA---ETSPPAVAPKPMAFPAETSLPPVS--PKPMALPTEASSSPISPKPM----APPAEASIPPV 1078
Cdd:PRK10263   342 QTPPVASVDVPPAQPTvawQPVPGPQTGEPVIAPAPEGYPQQSqyAQPAVQYNEPLQQPVQPQQPyyapAAEQPAQQPYY 421
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1958765288 1079 VPKPMAPPAEASPLPVAPKPMA---FPAETSLPPVAPKPMALPTEASPPPVAPKPLALP 1134
Cdd:PRK10263   422 APAPEQPAQQPYYAPAPEQPVAgnaWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQ 480
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1006-1188 6.47e-05

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 47.54  E-value: 6.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1006 PAETSPPPVAPKPMALPAETSPPAVAPkpmAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEAsipPVVPKPMAP 1085
Cdd:PRK07003   391 VGASAVPAVTAVTGAAGAALAPKAAAA---AAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANA---RASADSRCD 464
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1086 PAEASPlPVAPKPMAFPAETSLPPVApkpmalpTEASPPPVAPKPLALPGSQGASlnlktlKTFGAPRPYNSSAPSPFAl 1165
Cdd:PRK07003   465 ERDAQP-PADSGSASAPASDAPPDAA-------FEPAPRAAAPSAATPAAVPDAR------APAAASREDAPAAAAPPA- 529
                          170       180
                   ....*....|....*....|...
gi 1958765288 1166 avvkrsqsfSKASPESPSEDSSA 1188
Cdd:PRK07003   530 ---------PEARPPTPAAAAPA 543
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
1047-1131 7.70e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 47.19  E-value: 7.70e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1047 SPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPV 1126
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116

                   ....*
gi 1958765288 1127 ApKPL 1131
Cdd:PRK12270   117 V-TPL 120
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
1014-1138 1.16e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 46.57  E-value: 1.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1014 VAPKPmalpaeTSPPAVAPKPMAFPAetSLPPVSPKPMAL-PTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPL 1092
Cdd:pfam09770  165 VAPKK------AAAPAPAPQPAAQPA--SLPAPSRKMMSLeEVEAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQF 236
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958765288 1093 PVAPKPMAFPAETSLPPVAPKPMALPT-----EASPPPVAPKPLALPGSQG 1138
Cdd:pfam09770  237 PPQIQQQQQPQQQPQQPQQHPGQGHPVtilqrPQSPQPDPAQPSIQPQAQQ 287
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
1057-1147 1.21e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.81  E-value: 1.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1057 ASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPmafPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGS 1136
Cdd:PRK12270    34 ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPP---AAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAA 110
                           90
                   ....*....|.
gi 1958765288 1137 QGASLNLKTLK 1147
Cdd:PRK12270   111 AAVEDEVTPLR 121
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
1029-1132 1.48e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 46.01  E-value: 1.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1029 AVAPKPMAFPAETSLPPVSPKPM----ALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAE 1104
Cdd:PRK07994   356 MLAFHPAAPLPEPEVPPQSAAPAasaqATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGA 435
                           90       100
                   ....*....|....*....|....*...
gi 1958765288 1105 TSLPPVAPkpmALPTEASPPPVAPKPLA 1132
Cdd:PRK07994   436 TKAKKSEP---AAASRARPVNSALERLA 460
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
980-1177 1.77e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 45.69  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  980 LSPLSERTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMALPAETSPP---AVAPKPMA-FPAETSL-PPVSPKPM--- 1051
Cdd:PLN03209   313 LTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPqpkAVVPRPLSpYTAYEDLkPPTSPIPTpps 392
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1052 -----ALPTEASSSPISPKPMAPPAEASIPPVVP---------KPMAPPAE-------ASPLPVAPKPMAFPA-----ET 1105
Cdd:PLN03209   393 sspasSKSVDAVAKPAEPDVVPSPGSASNVPEVEpaqveakktRPLSPYARyedlkppTSPSPTAPTGVSPSVsstssVP 472
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958765288 1106 SLPPVAPkPMALPTEASPPPVAPKPLAlPGSQGASLNLKTLKTFGAPRPYNSSAPSPFALAVVKRSQSFSKA 1177
Cdd:PLN03209   473 AVPDTAP-ATAATDAAAPPPANMRPLS-PYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1007-1210 2.37e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.75  E-value: 2.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1007 AETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETslPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVpkpmAPP 1086
Cdd:PRK07764   588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPA--APAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVA----VPD 661
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1087 AEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGASLNLktlkTFGAPRPYNSSAPSPFALA 1166
Cdd:PRK07764   662 ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPA----AQPPQAAQGASAPSPAADD 737
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 1958765288 1167 VVKRSQSFSKASPESPSEDSSAQPPAAIQDGKTQTVNQPTVGSQ 1210
Cdd:PRK07764   738 PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
983-1128 2.52e-04

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 44.76  E-value: 2.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  983 LSERTHSAPLP----NISKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMALPTEAS 1058
Cdd:NF040712   187 LIDPDFGRPLRplatVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAP 266
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1059 SSPisPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAP 1128
Cdd:NF040712   267 AAE--PDEATRDAGEPPAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASVP 334
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
944-1125 2.58e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 45.61  E-value: 2.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  944 VTSAAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQCLSPLSErTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMALP- 1022
Cdd:PRK07003   428 AAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQ-PPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAa 506
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1023 -AETSPPAVAPKPMAFPAETSLPPVSPKPMalPTEASSS----------------------------PISPKPMAPPAeA 1073
Cdd:PRK07003   507 vPDARAPAAASREDAPAAAAPPAPEARPPT--PAAAAPAaraggaaaaldvlrnagmrvssdrgaraAAAAKPAAAPA-A 583
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958765288 1074 SIPPVVPKPMAPpaeaSPLPVAPkpmAFPAETSLPPVAPKPMALPTEASPPP 1125
Cdd:PRK07003   584 APKPAAPRVAVQ----VPTPRAR---AATGDAPPNGAARAEQAAESRGAPPP 628
PRK10263 PRK10263
DNA translocase FtsK; Provisional
855-1082 2.89e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.46  E-value: 2.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  855 QQPQPDRTPKPSSGTEHPLHRTVSSPVGTEMNPPKPPRMTTDTGTIPFAPNLEDINNILESKFRSRASNPQAKPSSFFLQ 934
Cdd:PRK10263   367 QTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNA 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  935 MQKRASG-HYVTSAAAKSVHTAPGPTPKEPTIKEVQRDPQlPPEQCLSPLSERTHSA--PLPNISKADDNRIQKPAETSP 1011
Cdd:PRK10263   447 WQAEEQQsTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQ-QPVVEPEPVVEETKPArpPLYYFEEVEEKRAREREQLAA 525
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1012 --PPVaPKPMALPAETSPPAVAPKPMAFPAETSLPPVSP-----KPMALPTEASSSPISP-------KPMAPPAEASIPP 1077
Cdd:PRK10263   526 wyQPI-PEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPlasgvKKATLATGAAATVAAPvfslansGGPRPQVKEGIGP 604

                   ....*
gi 1958765288 1078 VVPKP 1082
Cdd:PRK10263   605 QLPRP 609
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1006-1095 2.98e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.36  E-value: 2.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1006 PAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSL-----PPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVvp 1080
Cdd:PRK07764   419 AAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPppaaaPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAA-- 496
                           90
                   ....*....|....*
gi 1958765288 1081 kPMAPPAEASPLPVA 1095
Cdd:PRK07764   497 -PAAPAAPAGADDAA 510
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1023-1148 3.05e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.15  E-value: 3.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1023 AETSPPAVAPKPMAFPAET-----SLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPlPVAPK 1097
Cdd:PRK14971   366 GDDASGGRGPKQHIKPVFTqpaaaPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNP-PSTAP 444
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958765288 1098 PMAFPAetslPPVAPKPMALPTEASPPPVAPKPLAlPGSQGASLNLKTLKT 1148
Cdd:PRK14971   445 QAVRPA----QFKEEKKIPVSKVSSLGPSTLRPIQ-EKAEQATGNIKEAPT 490
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
857-1130 3.37e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 3.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  857 PQPDRTPKPSSGTEHPLHRTVssPVGTEMNPPKPPRMTTDTGTIPFAPNledinnileSKFRSRASNPQAKPSSFFLQMQ 936
Cdd:PHA03307   111 PSSPDPPPPTPPPASPPPSPA--PDLSEMLRPVGSPGPPPAASPPAAGA---------SPAAVASDAASSRQAALPLSSP 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  937 KRASghyvtsaaaksvhTAPGPTPKEPTIKevqRDPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQKPAETS---PPP 1013
Cdd:PHA03307   180 EETA-------------RAPSSPPAEPPPS---TPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSdssSSE 243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1014 VAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLP 1093
Cdd:PHA03307   244 SSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRE 323
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958765288 1094 V---APKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKP 1130
Cdd:PHA03307   324 SsssSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPS 363
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
973-1072 3.64e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.80  E-value: 3.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  973 QLPPEQCL--SPLSERTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMALPAETSP-PAVAPKPMAFPAETSLPPVSPK 1049
Cdd:PRK14950   348 QLPLELAVieALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVReTATPPPVPPRPVAPPVPHTPES 427
                           90       100
                   ....*....|....*....|....*.
gi 1958765288 1050 PMALPTEASSSPISPK---PMAPPAE 1072
Cdd:PRK14950   428 APKLTRAAIPVDEKPKytpPAPPKEE 453
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
974-1114 3.85e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.07  E-value: 3.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  974 LPPEQCLSPLSERTHSAPlpniSKADDNRIQ--KPAETSPPPVAPKPMalPAETSPPAVAP---KPMAFPAETSLP--PV 1046
Cdd:PTZ00449   519 LPPKAPGDKEGEEGEHED----SKESDEPKEggKPGETKEGEVGKKPG--PAKEHKPSKIPtlsKKPEFPKDPKHPkdPE 592
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1047 SPKPMALPTEASSsPISPKPMAPPAEASIPPVVPKPMAPPAEASPL----PVAPKPMAFPA--ETSLPPVAPKP 1114
Cdd:PTZ00449   593 EPKKPKRPRSAQR-PTRPKSPKLPELLDIPKSPKRPESPKSPKRPPppqrPSSPERPEGPKiiKSPKPPKSPKP 665
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
1012-1139 4.52e-04

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 43.99  E-value: 4.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1012 PPVA--PKPMALPAETSPPAVAPKPmAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEA----SIPPVVPKPMAP 1085
Cdd:NF040712   197 RPLAtvPRLAREPADARPEEVEPAP-AAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEPvgpgAAPAAEPDEATR 275
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1086 PAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGA 1139
Cdd:NF040712   276 DAGEPPAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRR 329
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
1038-1123 4.55e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.88  E-value: 4.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1038 PAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASplpvAPKPMAFPAETSLPPVAPKPMAL 1117
Cdd:PRK12270    38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAA----AAAAAPAAPPAAAAAAAPAAAAV 113

                   ....*.
gi 1958765288 1118 PTEASP 1123
Cdd:PRK12270   114 EDEVTP 119
PRK10819 PRK10819
transport protein TonB; Provisional
1003-1130 4.66e-04

transport protein TonB; Provisional


Pssm-ID: 236768 [Multi-domain]  Cd Length: 246  Bit Score: 43.52  E-value: 4.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1003 IQKPAETSPPPVApkpMALPAETSPPAVAPKPmafPAETSLPPVSPKPMALPTEassspisPKPMAPPAEASIPPVVPKP 1082
Cdd:PRK10819    39 IELPAPAQPISVT---MVAPADLEPPQAVQPP---PEPVVEPEPEPEPIPEPPK-------EAPVVIPKPEPKPKPKPKP 105
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1958765288 1083 MAPPAEAspLPVAPKPMAFPAETSLP-PVAPKPMALPTEASPPPVAPKP 1130
Cdd:PRK10819   106 KPKPVKK--VEEQPKREVKPVEPRPAsPFENTAPARPTSSTATAAASKP 152
flhF PRK06995
flagellar biosynthesis protein FlhF;
996-1114 5.26e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 235904 [Multi-domain]  Cd Length: 484  Bit Score: 44.19  E-value: 5.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  996 SKADDNRIQKPAETSPPPVAPKPMAlpaetsPPAVAPKPMAFPAETSLPPVSPKPmaLPTEASSSPISPKPMAPPAEASI 1075
Cdd:PRK06995    44 ADSDLAALAPPAAAAPAAAQPPPAA------APAAVSRPAAPAAEPAPWLVEHAK--RLTAQREQLVARAAAPAAPEAQA 115
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1958765288 1076 PPVVPKPMAPpaEASPLPVAPKPMAFPAETSLPPVAPKP 1114
Cdd:PRK06995   116 PAAPAERAAA--ENAARRLARAAAAAPRPRVPADAAAAV 152
PHA02682 PHA02682
ORF080 virion core protein; Provisional
958-1154 6.05e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 43.31  E-value: 6.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  958 PTPKEPTikevqrdpqlPPEQCLSPLSERThsapLPNISKADDNRIQKPAETSPPPVAPKPMAlpaeTSPPAVAPKPmaf 1037
Cdd:PHA02682    35 PAPAAPC----------PPDADVDPLDKYS----VKEAGRYYQSRLKANSACMQRPSGQSPLA----PSPACAAPAP--- 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1038 paetSLPPVSPkpmALPTEASSSPiSPKPMAPPAEAsippvvpkPMAPPAEASPLPVAPKPMAFPAETSLPPVAPKPMAL 1117
Cdd:PHA02682    94 ----ACPACAP---AAPAPAVTCP-APAPACPPATA--------PTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPK 157
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1958765288 1118 PTEASPPPVAPKPLALPGSQGASlnLKTLKTFGAPRP 1154
Cdd:PHA02682   158 PAPAAKPIFLHNQLPPPDYPAAS--CPTIETAPAASP 192
flhF PRK06995
flagellar biosynthesis protein FlhF;
1019-1130 6.36e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 235904 [Multi-domain]  Cd Length: 484  Bit Score: 43.80  E-value: 6.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1019 MALPAEtSPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIP--PVVPKPMAPPAeASPLPVAP 1096
Cdd:PRK06995    41 VALADS-DLAALAPPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQreQLVARAAAPAA-PEAQAPAA 118
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1958765288 1097 KPMAFPAETSLPPVAPKPMALPTEASPPPVAPKP 1130
Cdd:PRK06995   119 PAERAAAENAARRLARAAAAAPRPRVPADAAAAV 152
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
1026-1108 7.85e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 43.45  E-value: 7.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1026 SPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPL-PVAPKPMAFPAE 1104
Cdd:NF041121    17 RAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAaPGAALPVRVPAP 96

                   ....
gi 1958765288 1105 TSLP 1108
Cdd:NF041121    97 PALP 100
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
1051-1128 9.59e-04

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 42.99  E-value: 9.59e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958765288 1051 MALPTEASSSPiSPKPMAPPAEASIPPVVPKPMAPPAEA-SPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAP 1128
Cdd:pfam07174   31 VALPAVAHADP-EPAPPPPSTATAPPAPPPPPPAPAAPApPPPPAAPNAPNAPPPPADPNAPPPPPADPNAPPPPAVDP 108
COG5373 COG5373
Uncharacterized membrane protein [Function unknown];
1019-1092 1.11e-03

Uncharacterized membrane protein [Function unknown];


Pssm-ID: 444140 [Multi-domain]  Cd Length: 854  Bit Score: 43.45  E-value: 1.11e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1019 MALPAETSPPAVAPKPMAFPAETslpPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPL 1092
Cdd:COG5373     37 LAEAAEAASAPAEPEPEAAAAAT---AAAPEAAPAPVPEAPAAPPAAAEAPAPAAAAPPAEAEPAAAPAAASSF 107
PHA03269 PHA03269
envelope glycoprotein C; Provisional
991-1127 1.22e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 43.18  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  991 PLPNISKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPaeTSLPPVSPKPMALPTEASSSpiSPKPMAPP 1070
Cdd:PHA03269    27 PIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAP--TPAASEKFDPAPAPHQAASR--APDPAVAP 102
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958765288 1071 AEASIPpvVPKPMAPPAEASplpvAPKPMAFPAETSLPPVAPKPmALPTEASPPPVA 1127
Cdd:PHA03269   103 QLAAAP--KPDAAEAFTSAA----QAHEAPADAGTSAASKKPDP-AAHTQHSPPPFA 152
PHA03247 PHA03247
large tegument protein UL36; Provisional
976-1154 1.28e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 1.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  976 PEQCLSPLSERTHSAPLPNISKADDNRIQKPAETSPPP-------VAPKPMALPAETSPPAVAPK--------------- 1033
Cdd:PHA03247   257 PPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPdgvwgaaLAGAPLALPAPPDPPPPAPAgdaeeeddedgamev 336
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1034 -----------PMAFPAE---TSLPPVS----------PKPMALPTEASSSpiSPKPMAPPAEASIPPVVPKPMAPPAEA 1089
Cdd:PHA03247   337 vsplprprqhyPLGFPKRrrpTWTPPSSledlsagrhhPKRASLPTRKRRS--ARHAATPFARGPGGDDQTRPAAPVPAS 414
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958765288 1090 SPLPVAPK-PMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGASLNL--KTLKTFGAPRP 1154
Cdd:PHA03247   415 VPTPAPTPvPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDAtrKALDALRERRP 482
RBD pfam02196
Raf-like Ras-binding domain;
92-158 1.42e-03

Raf-like Ras-binding domain;


Pssm-ID: 460485  Cd Length: 69  Bit Score: 38.27  E-value: 1.42e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958765288   92 LSVVLPGDVLKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTINLLSAEENLIkfKPNTPIGMLEVEKV 158
Cdd:pfam02196    2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
PRK11633 PRK11633
cell division protein DedD; Provisional
1032-1132 1.56e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 41.53  E-value: 1.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1032 PKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEAsiPPVVPKPMAPPaEASPLPVAPkPMAFPAETslPPVA 1111
Cdd:PRK11633    45 PKPGDRDEPDMMPAATQALPTQPPEGAAEAVRAGDAAAPSLD--PATVAPPNTPV-EPEPAPVEP-PKPKPVEK--PKPK 118
                           90       100
                   ....*....|....*....|.
gi 1958765288 1112 PKPMALPTEASPPPVAPKPLA 1132
Cdd:PRK11633   119 PKPQQKVEAPPAPKPEPKPVV 139
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
1032-1109 1.59e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.96  E-value: 1.59e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958765288 1032 PKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAETSLPP 1109
Cdd:PRK12270    38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
dnaA PRK14086
chromosomal replication initiator protein DnaA;
970-1149 1.64e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.89  E-value: 1.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  970 RDPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQKPAETSPPPVAPKPM-ALPAETSPPAVAPKP-MAFPAETSLPPVS 1047
Cdd:PRK14086    94 EPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARpAYPAYQQRPEPGAWPrAADDYGWQQQRLG 173
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1048 PKPMALPTEASSSPISPKPMAPPAEASIP--------PVVPKPMA--PPAEASPLPVAPKPMAFPAETSLPPV----APK 1113
Cdd:PRK14086   174 FPPRAPYASPASYAPEQERDREPYDAGRPeydqrrrdYDHPRPDWdrPRRDRTDRPEPPPGAGHVHRGGPGPPerddAPV 253
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1958765288 1114 PMALPTEASPPPVAPKPLALPGSQGASLNLK-TLKTF 1149
Cdd:PRK14086   254 VPIRPSAPGPLAAQPAPAPGPGEPTARLNPKyTFDTF 290
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
1044-1147 1.70e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 42.68  E-value: 1.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1044 PPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPmafpaetslPPVAPKPMALPTEASP 1123
Cdd:NF041121    19 AAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPP---------PPPPPGPAGAAPGAAL 89
                           90       100
                   ....*....|....*....|....
gi 1958765288 1124 PPVAPKPLALPGSQGASLNLKTLK 1147
Cdd:NF041121    90 PVRVPAPPALPNPLELARALRPLK 113
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
991-1130 1.71e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 42.64  E-value: 1.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  991 PLPNISKADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMAlpTEASSSPISPKPMAPP 1070
Cdd:pfam17823  280 LSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVA--STNLAVVTTTKAQAKE 357
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1071 AEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKP 1130
Cdd:pfam17823  358 PSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGP 417
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
1052-1112 1.79e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 42.42  E-value: 1.79e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958765288 1052 ALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPL--PVAPKPMAFPAETSLPPVAP 1112
Cdd:PRK14965   382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPApaPAPPAAAAPPARSADPAAAA 444
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
997-1130 1.91e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 42.24  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  997 KADDNRIQKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVspKPMALPTEASSSPisPKPMAPPAEASIP 1076
Cdd:PTZ00436   202 KAAAKKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPA--KAAAPPAKAAAPP--AKAAAPPAKAAAP 277
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1077 PVvpKPMAPPAEASPLPVapKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKP 1130
Cdd:PTZ00436   278 PA--KAAAPPAKAAAPPA--KAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPP 327
rne PRK10811
ribonuclease E; Reviewed
1004-1130 2.03e-03

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 42.72  E-value: 2.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1004 QKPAETSPPPVAPKPMALPAETSPPAVAPKPMAFPAETSLP--PVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPK 1081
Cdd:PRK10811   905 QPEEVVVVETTHPEVIAAPVTEQPQVITESDVAVAQEVAEHaePVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVV 984
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1958765288 1082 PMAPPAEASPLPVAPKPMafPAETSLPPV----APKPMalpTEASPPPVAPKP 1130
Cdd:PRK10811   985 AEVAAEVETVTAVEPEVA--PAQVPEATVehnhATAPM---TRAPAPEYVPEA 1032
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
1006-1125 2.68e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 41.47  E-value: 2.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1006 PAETSPPPVapKPMALPAETSPPAVapKPMAFPAETSLPPVspKPMALPTEASSSPisPKPMAPPAEASIPPVvpKPMAP 1085
Cdd:PTZ00436   229 PAKAAAPPA--KAAAAPAKAAAAPA--KAAAPPAKAAAPPA--KAAAPPAKAAAPP--AKAAAPPAKAAAPPA--KAAAA 298
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1958765288 1086 PAEASPLPVapKPMAFPAETSLPPVapKPMALPTEASPPP 1125
Cdd:PTZ00436   299 PAKAAAAPA--KAAAAPAKAAAPPA--KAAAPPAKAATPP 334
KLF10_11_N cd21974
N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily ...
967-1125 2.68e-03

N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily is composed of Kruppel-like factor or Krueppel-like factor (KLF) 10, KLF11, and similar proteins. KLF10 was first identified in human osteoblasts and plays a role in mediating estrogen (E2) signaling in bone and skeletal homeostasis and a regulatory role in tumor formation and metastasis. KLF11 is involved in cell growth, apoptosis, cellular inflammation and differentiation, endometriosis, and cholesterol, prostaglandin, neurotransmitter, fat, and sugar metabolism. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. KLF10/11 belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF10, KLF11, and similar proteins.


Pssm-ID: 409243 [Multi-domain]  Cd Length: 229  Bit Score: 41.07  E-value: 2.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  967 EVQRDPQLPPEQCLSP-----LSERTHSAPLPNIS--KADDNRIQKPAETSPPPVAPKPMALpaETS-------PPAVAP 1032
Cdd:cd21974     45 ESPKDFHSLSSLCMTPpysppFFEASHSPSVASLHppSAASSQPPPEPESSEPPAASPQRAQ--ATSvirhtadPVPVSP 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1033 KPMAFPaetSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMA--------PPAEASPLPVAPKPMAfPAE 1104
Cdd:cd21974    123 PPVLCQ---MLPVSSSSGVIVAFLKAPQQPSPQPQKPALPQPQVVLVGGQVPqgpvmlvvPQPAVPQPYVQPTVVT-PGG 198
                          170       180
                   ....*....|....*....|.
gi 1958765288 1105 TSLPPVAPKPMALPTEASPPP 1125
Cdd:cd21974    199 TKLLPIAPAPGFIPSGQSSAP 219
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
946-1112 2.88e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.17  E-value: 2.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  946 SAAAKSVHTAPGPTPKEPTIKEVQRDPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQKPAETSPPPVAPKPMALPA-- 1023
Cdd:PRK12323   413 AAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPApa 492
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1024 -------ETSPPAVApkpmafpaetSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAP 1096
Cdd:PRK12323   493 dddpppwEELPPEFA----------SPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA 562
                          170
                   ....*....|....*.
gi 1958765288 1097 KPMAFPAETSLPPVAP 1112
Cdd:PRK12323   563 PRPPRASASGLPDMFD 578
PHA01929 PHA01929
putative scaffolding protein
1011-1091 3.08e-03

putative scaffolding protein


Pssm-ID: 177328  Cd Length: 306  Bit Score: 41.19  E-value: 3.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1011 PPPVAPKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMA----LPTEASSSPISPKPMAPPAEASIPPVVPKPMAPP 1086
Cdd:PHA01929    20 PPAAAPTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPVPtsamTPHVVQQAPAQPAPAAPPAAGAALPEALEVPPPP 99

                   ....*
gi 1958765288 1087 AEASP 1091
Cdd:PHA01929   100 AFTPN 104
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1019-1140 3.10e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 3.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1019 MALP-AETSPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPK 1097
Cdd:PRK07764   362 MLLPsASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAP 441
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1958765288 1098 PMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLALPGSQGAS 1140
Cdd:PRK07764   442 PSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAP 484
PHA03269 PHA03269
envelope glycoprotein C; Provisional
1003-1127 3.37e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 41.64  E-value: 3.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1003 IQKPAETSPPPVAPKPMALPAETSPPAVAPKPmafpaeTSLPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKP 1082
Cdd:PHA03269    15 INLIIANLNTNIPIPELHTSAATQKPDPAPAP------HQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAP 88
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1083 -----MAP-PAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEA---SPPPVA 1127
Cdd:PHA03269    89 hqaasRAPdPAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAaskKPDPAA 142
PHA01929 PHA01929
putative scaffolding protein
1006-1105 4.01e-03

putative scaffolding protein


Pssm-ID: 177328  Cd Length: 306  Bit Score: 40.81  E-value: 4.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1006 PAETSPPPVAPKPMALPAETSPPAVAPKPMAFP--AETSLPPVSPKPMALPTEASSsPISPKPMAPPAEASiPPVVPKPM 1083
Cdd:PHA01929     3 QNEQQLPPGLAGLVANVPPAAAPTPQPNPVIQPqaPVQPGQPGAPQQLAIPTQQPQ-PVPTSAMTPHVVQQ-APAQPAPA 80
                           90       100
                   ....*....|....*....|..
gi 1958765288 1084 APPAEASPLPVAPKPMAFPAET 1105
Cdd:PHA01929    81 APPAAGAALPEALEVPPPPAFT 102
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
1054-1139 4.16e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.80  E-value: 4.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1054 PTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAETSLPPVAPKPMALPTEASPPPVAPKPLAL 1133
Cdd:PRK12270    41 TAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPL 120

                   ....*.
gi 1958765288 1134 PGSQGA 1139
Cdd:PRK12270   121 RGAAAA 126
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
1004-1079 4.23e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 41.47  E-value: 4.23e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958765288 1004 QKPAETSPPPVAPK---PMALPAETSPPAVAPKPMAFPAETSLPPVsPKPMALPTEASSSPispkpmaPPAEASIPPVV 1079
Cdd:PRK14954   391 KKKAPEPDLPQPDRhpgPAKPEAPGARPAELPSPASAPTPEQQPPV-ARSAPLPPSPQASA-------PRNVASGKPGV 461
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
1026-1132 4.59e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 41.20  E-value: 4.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1026 SPP--AVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPaeasiPPVVPKPMAPPAEASPLP----VAPKPM 1099
Cdd:PRK14959   379 SAPsgSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPA-----PSAAPSPRVPWDDAPPAPprsgIPPRPA 453
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1958765288 1100 AFPAETSLPPVAPKPMALPTEASPPPVAPKPLA 1132
Cdd:PRK14959   454 PRMPEASPVPGAPDSVASASDAPPTLGDPSDTA 486
Gag_spuma pfam03276
Spumavirus gag protein;
981-1135 4.74e-03

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 41.27  E-value: 4.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  981 SPLSERTHSAPLPNISKADDNRIQKPAETSPPPvaPKPMALPAETsPPAVAPKPMAFP--AETSLPPVSPKPMALPTEAS 1058
Cdd:pfam03276  187 PPGASFSGLPSLPAIGGIHLPAIPGIHARAPPG--NIARSLGDDI-MPSLGDAGMPQPrfAFHPGNPFAEAEGHPFAEAE 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1059 SSPISPKPMAPPAEASIPPVVPKPM-APPAEASPLPVAPKPMAFPAETSLPPVAPK--PMALPTEAsPPPVAPKPLALPG 1135
Cdd:pfam03276  264 GERPRDIPRAPRIDAPSAPAIPAIQpIAPPMIPPIGAPIPIPHGASIPGEHIRNPReePIRLGREA-PAIDGRFAPAIDD 342
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
991-1093 5.42e-03

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 40.29  E-value: 5.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  991 PLPNISKADdnriQKPAETSPPPVAPKPMALPAETSPPAVAPKPmafpaetslPPVSPkpmalpteasSSPISPKPMAPP 1070
Cdd:pfam07174   32 ALPAVAHAD----PEPAPPPPSTATAPPAPPPPPPAPAAPAPPP---------PPAAP----------NAPNAPPPPADP 88
                           90       100
                   ....*....|....*....|...
gi 1958765288 1071 AEASIPPVVPKPMAPPAEASPLP 1093
Cdd:pfam07174   89 NAPPPPPADPNAPPPPAVDPNAP 111
PHA03377 PHA03377
EBNA-3C; Provisional
854-1113 5.51e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 41.19  E-value: 5.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  854 SQQPQPDRTPKPSSGTEHP---LHRTVSSPVGTEMNPPKPPRMTTDTGTIPFAPNLE---DINNILESKFRSRASNPQAK 927
Cdd:PHA03377   456 SDQPSVPVEPAHLTPVEHTtviLHQPPQSPPTVAIKPAPPPSRRRRGACVVYDDDIIeviDVETTEEEESVTQPAKPHRK 535
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  928 PSSFFlqmqkRASGHYVTSAAAKSVHTAPGPTPKeptikevQRDPQLPPEQCLSPLSERTHSAPLPNISKADDNRIQKPA 1007
Cdd:PHA03377   536 VQDGF-----QRSGRRQKRATPPKVSPSDRGPPK-------ASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKC 603
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1008 ETSPPPVAPKPMAlpaetsPPAVAPKPMA------FPAETSLP-PVSPKPMALpTEASSSPISPKPMAPPAEASIPPVVP 1080
Cdd:PHA03377   604 KDGPPASGPHEKQ------PPSSAPRDMApsvvrmFLRERLLEqSTGPKPKSF-WEMRAGRDGSGIQQEPSSRRQPATQS 676
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1958765288 1081 KPMAPPAEAS--PLPVAPKPMAFPAETS-LPPVAPK 1113
Cdd:PHA03377   677 TPPRPSWLPSvfVLPSVDAGRAQPSEEShLSSMSPT 712
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
1015-1123 6.28e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 40.82  E-value: 6.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1015 APKPMALPAETSPPAVAPKPMAFPAETSLPPVSPKPMALPTEASSSPISPKPMAPPAeasiPP---VVPKPMAPPAEASP 1091
Cdd:PRK14959   386 AEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPA----PPrsgIPPRPAPRMPEASP 461
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1958765288 1092 LPVAPKPMAFPAETSLPPVAPKPMALPTEASP 1123
Cdd:PRK14959   462 VPGAPDSVASASDAPPTLGDPSDTAEHTPSGP 493
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
1054-1129 6.48e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 40.70  E-value: 6.48e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958765288 1054 PTEASSSPISPKpmAPPAEASIPPVVPKPMAPPAEASPLPVAPKPMAFPAETSLPPVAPKPmALPT--EASPPPVAPK 1129
Cdd:PRK14954   382 PSPAGSPDVKKK--APEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSA-PLPPspQASAPRNVAS 456
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
938-1077 6.68e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.85  E-value: 6.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288  938 RASGHYVTSAAAKSVH--TAPGPTPKEPTIKEVQRDPQLPPeqclsPLSERTHSAPLPNISKAddnRIQKPAETSPPPVA 1015
Cdd:PRK14951   367 AAAAEAAAPAEKKTPArpEAAAPAAAPVAQAAAAPAPAAAP-----AAAASAPAAPPAAAPPA---PVAAPAAAAPAAAP 438
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958765288 1016 PK-PMALPAETSPPAV-APKPMAFPAETSLPPVSPKPmalpteassspiSPKPMAPPAEASIPP 1077
Cdd:PRK14951   439 AAaPAAVALAPAPPAQaAPETVAIPVRVAPEPAVASA------------APAPAAAPAAARLTP 490
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
1020-1177 8.07e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 39.93  E-value: 8.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1020 ALPAETSPPAVAPKPMAFPAETSLPPVSP-KPMALPTEASSSPisPKPMAPPAEASIPPVvpKPMAPPAEASPLPVapKP 1098
Cdd:PTZ00436   194 AAAAAAKQKAAAKKAAAPSGKKSAKAAAPaKAAAAPAKAAAPP--AKAAAAPAKAAAAPA--KAAAPPAKAAAPPA--KA 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1099 MAFPAETSLPPVapKPMALPTEASPPP----VAPKPLALPGSQGASLNLKTlktfGAPRPYNSSAPSPFALAVVKRSQSF 1174
Cdd:PTZ00436   268 AAPPAKAAAPPA--KAAAPPAKAAAPPakaaAAPAKAAAAPAKAAAAPAKA----AAPPAKAAAPPAKAATPPAKAAAPP 341

                   ...
gi 1958765288 1175 SKA 1177
Cdd:PTZ00436   342 AKA 344
PRK14963 PRK14963
DNA polymerase III subunits gamma and tau; Provisional
1020-1106 9.97e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184927 [Multi-domain]  Cd Length: 504  Bit Score: 40.21  E-value: 9.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958765288 1020 ALPAETSPPAVAPKPMAFPAETSlPPVSPKPMALPTEASSSPISPKPMAPPAEASIPPVVPKPMAPPAEASPLPVAPKPM 1099
Cdd:PRK14963   338 ALLALGGAPSEGVAAVAPPAPAP-ADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPL 416

                   ....*..
gi 1958765288 1100 AFPAETS 1106
Cdd:PRK14963   417 VAPAAAP 423
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
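
The E-value threshold listed above can be re-applied to the per-hit summary lines of this report when post-processing the text. The following is a minimal sketch, assuming summary lines in the `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...` layout shown in the hits above. The names `HIT_LINE`, `parse_hit_line`, and `filter_hits` are hypothetical helpers for illustration and are not part of any NCBI tool; treating the threshold as inclusive is also an assumption.

```python
import re

# Pattern for the per-hit summary line as it appears in this report, e.g.
# "Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 42.68  E-value: 1.70e-03"
# The bracketed tag is optional because some hits omit it.
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_line(line):
    """Return a dict of the numeric fields, or None if the line is not a hit summary."""
    m = HIT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["tag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def filter_hits(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (assumed inclusive)."""
    parsed = (parse_hit_line(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] <= threshold]
```

For example, passing the lines of this report to `filter_hits` with the default threshold of 0.01 would keep every hit listed here, since the report was already generated with that cutoff; a stricter value such as `threshold=1e-5` would keep only the strongest matches.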
