NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|16758006|ref|NP_445761|]
View 

homer protein homolog 2 [Rattus norvegicus]

Protein Classification

Homer/Vesl family EVH1 domain-containing protein( domain architecture ID 10100433)

Homer/Vesl family EVH1 (WH1, RanBP1-WASP) domain-containing protein is a synaptic scaffolding protein, required for long-term potentiation, a form of synaptic plasticity thought to underlie memory formation

CATH:  2.30.29.30
Gene Ontology:  GO:0007216|GO:0035256|GO:0005515
PubMed:  11911879|17316461
SCOP:  4002440

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
EVH1_Homer_Vesl cd01206
Homer/Vesl family proteins EVH1 domain; Homer/Vesl proteins are synaptic scaffolding proteins, ...
3-111 2.35e-77

Homer/Vesl family proteins EVH1 domain; Homer/Vesl proteins are synaptic scaffolding proteins, required for long-term potentiation, a form of synaptic plasticity thought to underlie memory formation. They contains an N-terminal EVH1 domain and bind to both neurotransmitter receptors, such as the metabotropic group 1 glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. These mGluRs possess a long C-terminal intracellular tail that may be important for subcellular localization of the receptor. The C-terminus is also the site of binding by the immediate early gene (IEG), Homer 1a. In contrast to Homer 1a, other Homer members additionally encode a C-terminal coiled-coil (CC) domain and form multivalent complexes that bind group 1 mGluRs. Homer 1a competes with constitutively expressed CC-Homers to modify the association of group 1 mGluRs with CC-Homer complexes. Since Homer proteins are strikingly enriched at the postsynaptic density (PSD), these observations suggest a role for the Homer family in regulating synaptic metabotropic receptor function. PSD-Zip45 (also named Homer 1c/Vesl-1L) has an EVH1 domain with a longer alpha-helix and its linking part included in the conserved region of Homer 1 (CRH1) interacts with the EVH1 domain of the neighbour CRH1 molecule in the crystal, suggesting that the EVH1 domain recognizes the PPXXF motif found in the binding partners, and the SPLTP sequence (P-motif) in the linking region of the CRH1. The two types of binding are partly overlapped in the EVH1 domain, implying a mechanism to regulate multimerization of Homer 1 family proteins. Homer 2 and Homer 3 are negative regulators of T cell activation. They bind the nuclear factor of activated T cells (NFAT) and compete with calcineurin binding. NFAT plays a critical role in calcium-dependent signaling in other cell types, including muscle and neurons. Homer-NFAT binding is also antagonized by active serine-threonine kinase AKT, enhancing TCR signaling via calcineurin-dependent dephosphorylation of NFAT resulting in changes in cytokine expression and an increase in effector-memory T cell populations in Homer-deficient mice. The EVH1 domains are part of the PH domain superamily. There are 5 EVH1 subfamilies: Enables/VASP, Homer/Vesl, WASP, Dcp1, and Spred. Ligands are known for three of the EVH1 subfamilies, all of which bind proline-rich sequences: the Enabled/VASP family binds to FPPPP peptides, the Homer/Vesl family binds PPxxF peptides, and the WASP family binds LPPPEP peptides. EVH1 has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.


:

Pssm-ID: 269917  Cd Length: 109  Bit Score: 232.62  E-value: 2.35e-77
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   3 EQPIFTTRAHVFQIDPSTKKNWVPASKQAVTVSYFYDVTRNSYRIISVDGAKVIINSTITPNMTFTKTSQKFGQWADSRA 82
Cdd:cd01206   1 EQPIFTTQAHVFQIDPQTKKSWIPASKQAVTVSFFYDSTRNTYRIISVEGSKAIINSTITPNMTFTKTSQKFGQWADSRA 80
                        90       100
                ....*....|....*....|....*....
gi 16758006  83 NTVFGLGFSSEQQLTKFAEKFQEVREAAR 111
Cdd:cd01206  81 NTVYGLGFASEAELTKFAEKFQEAKEAAK 109
SMC_N super family cl47134
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
93-326 1.27e-08

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


The actual alignment was detected with superfamily member TIGR02169:

Pssm-ID: 481474 [Multi-domain]  Cd Length: 1164  Bit Score: 56.61  E-value: 1.27e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006     93 EQQLTKFAEKFQEVR---EAARLARDKSQEKIETSSNHSQESGCETPSSTQA--SSVNGTDDEKASHASPADTHLK---S 164
Cdd:TIGR02169  250 EEELEKLTEEISELEkrlEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAeiASLERSIAEKERELEDAEERLAkleA 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    165 ENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKE 244
Cdd:TIGR02169  330 EIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELD 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    245 KntqLKRRIEELESEVREKEMELKDLRKQseiIPQLMSECEYVSEKLEAAER-------DNQNLEDKVRSLKTDIEESKY 317
Cdd:TIGR02169  410 R---LQEELQRLSEELADLNAAIAGIEAK---INELEEEKEDKALEIKKQEWkleqlaaDLSKYEQELYDLKEEYDRVEK 483

                   ....*....
gi 16758006    318 RQRHLKGEL 326
Cdd:TIGR02169  484 ELSKLQREL 492
 
Name Accession Description Interval E-value
EVH1_Homer_Vesl cd01206
Homer/Vesl family proteins EVH1 domain; Homer/Vesl proteins are synaptic scaffolding proteins, ...
3-111 2.35e-77

Homer/Vesl family proteins EVH1 domain; Homer/Vesl proteins are synaptic scaffolding proteins, required for long-term potentiation, a form of synaptic plasticity thought to underlie memory formation. They contains an N-terminal EVH1 domain and bind to both neurotransmitter receptors, such as the metabotropic group 1 glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. These mGluRs possess a long C-terminal intracellular tail that may be important for subcellular localization of the receptor. The C-terminus is also the site of binding by the immediate early gene (IEG), Homer 1a. In contrast to Homer 1a, other Homer members additionally encode a C-terminal coiled-coil (CC) domain and form multivalent complexes that bind group 1 mGluRs. Homer 1a competes with constitutively expressed CC-Homers to modify the association of group 1 mGluRs with CC-Homer complexes. Since Homer proteins are strikingly enriched at the postsynaptic density (PSD), these observations suggest a role for the Homer family in regulating synaptic metabotropic receptor function. PSD-Zip45 (also named Homer 1c/Vesl-1L) has an EVH1 domain with a longer alpha-helix and its linking part included in the conserved region of Homer 1 (CRH1) interacts with the EVH1 domain of the neighbour CRH1 molecule in the crystal, suggesting that the EVH1 domain recognizes the PPXXF motif found in the binding partners, and the SPLTP sequence (P-motif) in the linking region of the CRH1. The two types of binding are partly overlapped in the EVH1 domain, implying a mechanism to regulate multimerization of Homer 1 family proteins. Homer 2 and Homer 3 are negative regulators of T cell activation. They bind the nuclear factor of activated T cells (NFAT) and compete with calcineurin binding. NFAT plays a critical role in calcium-dependent signaling in other cell types, including muscle and neurons. Homer-NFAT binding is also antagonized by active serine-threonine kinase AKT, enhancing TCR signaling via calcineurin-dependent dephosphorylation of NFAT resulting in changes in cytokine expression and an increase in effector-memory T cell populations in Homer-deficient mice. The EVH1 domains are part of the PH domain superamily. There are 5 EVH1 subfamilies: Enables/VASP, Homer/Vesl, WASP, Dcp1, and Spred. Ligands are known for three of the EVH1 subfamilies, all of which bind proline-rich sequences: the Enabled/VASP family binds to FPPPP peptides, the Homer/Vesl family binds PPxxF peptides, and the WASP family binds LPPPEP peptides. EVH1 has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.


Pssm-ID: 269917  Cd Length: 109  Bit Score: 232.62  E-value: 2.35e-77
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   3 EQPIFTTRAHVFQIDPSTKKNWVPASKQAVTVSYFYDVTRNSYRIISVDGAKVIINSTITPNMTFTKTSQKFGQWADSRA 82
Cdd:cd01206   1 EQPIFTTQAHVFQIDPQTKKSWIPASKQAVTVSFFYDSTRNTYRIISVEGSKAIINSTITPNMTFTKTSQKFGQWADSRA 80
                        90       100
                ....*....|....*....|....*....
gi 16758006  83 NTVFGLGFSSEQQLTKFAEKFQEVREAAR 111
Cdd:cd01206  81 NTVYGLGFASEAELTKFAEKFQEAKEAAK 109
WH1 pfam00568
WH1 domain; WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in ...
4-106 1.16e-40

WH1 domain; WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGluR5 bind a protein called homer, which is a WH1 domain homolog. A subset of WH1 domains has been termed a "EVH1" domain and appear to bind a polyproline motif.


Pssm-ID: 395450  Cd Length: 111  Bit Score: 138.74  E-value: 1.16e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006     4 QPIFTTRAHVFQIDPSTKKNWVPAsKQAVTVSYFYDVTRNSYRIISVD--GAKVIINSTITPNMTFTKTSQKFGQWADSR 81
Cdd:pfam00568   9 QTICTAVAQVYLADPDNKRHWIKA-KHSGVVCFVKDSPQNSYFIRLVDiqDGKVIWNQEIYPNMEYNQARPFFHTFADSR 87
                          90       100
                  ....*....|....*....|....*
gi 16758006    82 AntVFGLGFSSEQQLTKFAEKFQEV 106
Cdd:pfam00568  88 C--VYGLNFASEEEATKFAKAVQEA 110
WH1 smart00461
WASP homology region 1; Region of the Wiskott-Aldrich syndrome protein (WASp) that contains ...
1-105 1.39e-29

WASP homology region 1; Region of the Wiskott-Aldrich syndrome protein (WASp) that contains point mutations in the majority of patients with WAS. Unknown function. Ena-like WH1 domains bind polyproline-containing peptides, and that Homer contains a WH1 domain.


Pssm-ID: 214674  Cd Length: 106  Bit Score: 109.37  E-value: 1.39e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006      1 MGEQPIFTTRAHVFQIDPSTKKnWVPASK-QAVTVSYFYDVTRNSYRIISVDG-AKVIINSTITPNMTFTKTSQKFGQWA 78
Cdd:smart00461   1 LGSQCIILARAVVQLYDADTKK-WVPTGEgGAANLVIDKNQRSYFFRIVGIKGqDKVIWNQELYKNFKYNQATPTFHQWA 79
                           90       100
                   ....*....|....*....|....*..
gi 16758006     79 DsrANTVFGLGFSSEQQLTKFAEKFQE 105
Cdd:smart00461  80 D--DKCVYGLNFASEEEAKKFRKKVLK 104
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
93-326 1.27e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 56.61  E-value: 1.27e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006     93 EQQLTKFAEKFQEVR---EAARLARDKSQEKIETSSNHSQESGCETPSSTQA--SSVNGTDDEKASHASPADTHLK---S 164
Cdd:TIGR02169  250 EEELEKLTEEISELEkrlEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAeiASLERSIAEKERELEDAEERLAkleA 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    165 ENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKE 244
Cdd:TIGR02169  330 EIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELD 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    245 KntqLKRRIEELESEVREKEMELKDLRKQseiIPQLMSECEYVSEKLEAAER-------DNQNLEDKVRSLKTDIEESKY 317
Cdd:TIGR02169  410 R---LQEELQRLSEELADLNAAIAGIEAK---INELEEEKEDKALEIKKQEWkleqlaaDLSKYEQELYDLKEEYDRVEK 483

                   ....*....
gi 16758006    318 RQRHLKGEL 326
Cdd:TIGR02169  484 ELSKLQREL 492
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
162-331 2.40e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 49.55  E-value: 2.40e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 162 LKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQE-------SAASVEQWKRQFSICRDENDRLRSKIEELEE 234
Cdd:COG1196 244 LEAELEELEAELEELEAELAELEAELEELRLELEELELELEEaqaeeyeLLAELARLEQDIARLEERRRELEERLEELEE 323
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 235 QCGEINREKEKNTQ----LKRRIEELESEVREKEMELKDLRKQSEIIPQLMSECEyvsEKLEAAERDNQNLEDKVRSLKT 310
Cdd:COG1196 324 ELAELEEELEELEEeleeLEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAE---EELEELAEELLEALRAAAELAA 400
                       170       180
                ....*....|....*....|.
gi 16758006 311 DIEESKYRQRHLKGELKSFLE 331
Cdd:COG1196 401 QLEELEEAEEALLERLERLEE 421
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
101-314 3.05e-06

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 48.88  E-value: 3.05e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  101 EKFQEVREAARLARDKSQEKIETSSNHSQE-SGCETPSSTQASSVNGTDDEKASHASPADTH------LKSENDKLKIAL 173
Cdd:PRK02224 223 ERYEEQREQARETRDEADEVLEEHEERREElETLEAEIEDLRETIAETEREREELAEEVRDLrerleeLEEERDDLLAEA 302
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  174 TQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKEKNtqlKRRI 253
Cdd:PRK02224 303 GLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEA---REAV 379
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 16758006  254 EELESEVREKEMELKDLRKQSEIIPQLMSECEYVSEKLeAAERDnqNLEDKVRSLKTDIEE 314
Cdd:PRK02224 380 EDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEEL-REERD--ELREREAELEATLRT 437
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
162-314 5.58e-05

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 45.20  E-value: 5.58e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   162 LKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSIcrdendrLRSKIEELEEQCgeinR 241
Cdd:pfam10174 343 LQTEVDALRLRLEEKESFLNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINV-------LQKKIENLQEQL----R 411
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   242 EKEKN-TQLKRRIEELESE--------------VREKEMELKDLRKQSEiipqlmSECEYVSEKLEAAERDNQNLEDKVR 306
Cdd:pfam10174 412 DKDKQlAGLKERVKSLQTDssntdtalttleeaLSEKERIIERLKEQRE------REDRERLEELESLKKENKDLKEKVS 485

                  ....*...
gi 16758006   307 SLKTDIEE 314
Cdd:pfam10174 486 ALQPELTE 493
 
Name Accession Description Interval E-value
EVH1_Homer_Vesl cd01206
Homer/Vesl family proteins EVH1 domain; Homer/Vesl proteins are synaptic scaffolding proteins, ...
3-111 2.35e-77

Homer/Vesl family proteins EVH1 domain; Homer/Vesl proteins are synaptic scaffolding proteins, required for long-term potentiation, a form of synaptic plasticity thought to underlie memory formation. They contains an N-terminal EVH1 domain and bind to both neurotransmitter receptors, such as the metabotropic group 1 glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. These mGluRs possess a long C-terminal intracellular tail that may be important for subcellular localization of the receptor. The C-terminus is also the site of binding by the immediate early gene (IEG), Homer 1a. In contrast to Homer 1a, other Homer members additionally encode a C-terminal coiled-coil (CC) domain and form multivalent complexes that bind group 1 mGluRs. Homer 1a competes with constitutively expressed CC-Homers to modify the association of group 1 mGluRs with CC-Homer complexes. Since Homer proteins are strikingly enriched at the postsynaptic density (PSD), these observations suggest a role for the Homer family in regulating synaptic metabotropic receptor function. PSD-Zip45 (also named Homer 1c/Vesl-1L) has an EVH1 domain with a longer alpha-helix and its linking part included in the conserved region of Homer 1 (CRH1) interacts with the EVH1 domain of the neighbour CRH1 molecule in the crystal, suggesting that the EVH1 domain recognizes the PPXXF motif found in the binding partners, and the SPLTP sequence (P-motif) in the linking region of the CRH1. The two types of binding are partly overlapped in the EVH1 domain, implying a mechanism to regulate multimerization of Homer 1 family proteins. Homer 2 and Homer 3 are negative regulators of T cell activation. They bind the nuclear factor of activated T cells (NFAT) and compete with calcineurin binding. NFAT plays a critical role in calcium-dependent signaling in other cell types, including muscle and neurons. Homer-NFAT binding is also antagonized by active serine-threonine kinase AKT, enhancing TCR signaling via calcineurin-dependent dephosphorylation of NFAT resulting in changes in cytokine expression and an increase in effector-memory T cell populations in Homer-deficient mice. The EVH1 domains are part of the PH domain superamily. There are 5 EVH1 subfamilies: Enables/VASP, Homer/Vesl, WASP, Dcp1, and Spred. Ligands are known for three of the EVH1 subfamilies, all of which bind proline-rich sequences: the Enabled/VASP family binds to FPPPP peptides, the Homer/Vesl family binds PPxxF peptides, and the WASP family binds LPPPEP peptides. EVH1 has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.


Pssm-ID: 269917  Cd Length: 109  Bit Score: 232.62  E-value: 2.35e-77
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   3 EQPIFTTRAHVFQIDPSTKKNWVPASKQAVTVSYFYDVTRNSYRIISVDGAKVIINSTITPNMTFTKTSQKFGQWADSRA 82
Cdd:cd01206   1 EQPIFTTQAHVFQIDPQTKKSWIPASKQAVTVSFFYDSTRNTYRIISVEGSKAIINSTITPNMTFTKTSQKFGQWADSRA 80
                        90       100
                ....*....|....*....|....*....
gi 16758006  83 NTVFGLGFSSEQQLTKFAEKFQEVREAAR 111
Cdd:cd01206  81 NTVYGLGFASEAELTKFAEKFQEAKEAAK 109
EVH1_family cd00837
EVH1 (Drosophila Enabled (Ena)/Vasodilator-stimulated phosphoprotein (VASP) homology 1) domain; ...
5-107 3.79e-43

EVH1 (Drosophila Enabled (Ena)/Vasodilator-stimulated phosphoprotein (VASP) homology 1) domain; The EVH1 domains are part of the PH domain superfamily. EVH1 subfamilies include Enables/VASP, Homer/Vesl, WASP, and Spred. Ligands are known for three of the EVH1 subfamilies, all of which bind proline-rich sequences: the Enabled/VASP family binds to FPPPP peptides, the Homer/Vesl family binds PPxxF peptides, and the WASP family binds LPPPEP peptides. EVH1 has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.


Pssm-ID: 269909  Cd Length: 103  Bit Score: 144.91  E-value: 3.79e-43
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   5 PIFTTRAHVFQIDPStKKNWVPAS-KQAVTVSYFYDVTRNSYRIISVDGA--KVIINSTITPNMTFTKTSQKFGQWADSR 81
Cdd:cd00837   1 SIFSARAHVMQIDDS-NKNWVPAGgKGASRVSYFKDTTRNSFRIIGVDIKdkKVVINCTITKNLVYNKATQTFHQWADDR 79
                        90       100
                ....*....|....*....|....*.
gi 16758006  82 anTVFGLGFSSEQQLTKFAEKFQEVR 107
Cdd:cd00837  80 --TVFGLNFASEEDATKFAEAVQEAL 103
WH1 pfam00568
WH1 domain; WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in ...
4-106 1.16e-40

WH1 domain; WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGluR5 bind a protein called homer, which is a WH1 domain homolog. A subset of WH1 domains has been termed a "EVH1" domain and appear to bind a polyproline motif.


Pssm-ID: 395450  Cd Length: 111  Bit Score: 138.74  E-value: 1.16e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006     4 QPIFTTRAHVFQIDPSTKKNWVPAsKQAVTVSYFYDVTRNSYRIISVD--GAKVIINSTITPNMTFTKTSQKFGQWADSR 81
Cdd:pfam00568   9 QTICTAVAQVYLADPDNKRHWIKA-KHSGVVCFVKDSPQNSYFIRLVDiqDGKVIWNQEIYPNMEYNQARPFFHTFADSR 87
                          90       100
                  ....*....|....*....|....*
gi 16758006    82 AntVFGLGFSSEQQLTKFAEKFQEV 106
Cdd:pfam00568  88 C--VYGLNFASEEEATKFAKAVQEA 110
WH1 smart00461
WASP homology region 1; Region of the Wiskott-Aldrich syndrome protein (WASp) that contains ...
1-105 1.39e-29

WASP homology region 1; Region of the Wiskott-Aldrich syndrome protein (WASp) that contains point mutations in the majority of patients with WAS. Unknown function. Ena-like WH1 domains bind polyproline-containing peptides, and that Homer contains a WH1 domain.


Pssm-ID: 214674  Cd Length: 106  Bit Score: 109.37  E-value: 1.39e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006      1 MGEQPIFTTRAHVFQIDPSTKKnWVPASK-QAVTVSYFYDVTRNSYRIISVDG-AKVIINSTITPNMTFTKTSQKFGQWA 78
Cdd:smart00461   1 LGSQCIILARAVVQLYDADTKK-WVPTGEgGAANLVIDKNQRSYFFRIVGIKGqDKVIWNQELYKNFKYNQATPTFHQWA 79
                           90       100
                   ....*....|....*....|....*..
gi 16758006     79 DsrANTVFGLGFSSEQQLTKFAEKFQE 105
Cdd:smart00461  80 D--DKCVYGLNFASEEEAKKFRKKVLK 104
EVH1_Ena_VASP-like cd01207
Enabled/VASP family EVH1 domain; Ena/VASP family includes proteins such as: ...
5-100 4.61e-13

Enabled/VASP family EVH1 domain; Ena/VASP family includes proteins such as: Vasodilator-stimulated phosphoprotein (VASP), enabled gene product from Drosophila (Ena), mammalian enabled (Mena) and Ena/VASP-Like protein (EVL) localize to focal adhesions and to sites of actin filament dynamics. These proteins share a common modular organization with a highly conserved N- and C-terminal domains, termed Ena/VASP homology domains 1 and 2 (EVH1 and EVH2), that are separated by a central proline-rich domain. The EVH1 domain binds to other proteins at proline rich sequences. The majority of Ena-VASP type EVH1 domains recognize FPPPP motifs such as in the focal adhesion proteins zyxin and vinculin, and the ActA surface protein of Listeria monocytogenes, however the LIM3 domain of Tes lacks the FPPPP motif but still binds the EVH1 domain of Mena. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains. EVH2 mediates oligomerization within the family. The proline-rich region binds SH3 and WW domains as well as profilin, a protein that regulates actin filament dynamics. The EVH1 domains are part of the PH domain superamily. There are 5 EVH1 subfamilies: Enables/VASP, Homer/Vesl, WASP, Dcp1, and Spred. Ligands are known for three of the EVH1 subfamilies, all of which bind proline-rich sequences: the Enabled/VASP family binds to FPPPP peptides, the Homer/Vesl family binds PPxxF peptides, and the WASP family binds LPPPEP peptides. EVH1 has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.


Pssm-ID: 269918  Cd Length: 108  Bit Score: 64.64  E-value: 4.61e-13
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   5 PIFTTRAHVFQIDPSTKKnWVPA-SKQAVTVSYFYDVTR-NSYRIISVDGA--KVIINSTITPNMTFTKTSQKFGQWADS 80
Cdd:cd01207   1 SVASARASVMVYDDENKR-WVPSgGSQGLSRVQIYHNTRnNTFRVVGRKLQdhEVVINCAILKGLKYNQATPTFHQWRDA 79
                        90       100
                ....*....|....*....|
gi 16758006  81 RanTVFGLGFSSEQQLTKFA 100
Cdd:cd01207  80 R--QVYGLNFASKEEATEFA 97
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
93-326 1.27e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 56.61  E-value: 1.27e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006     93 EQQLTKFAEKFQEVR---EAARLARDKSQEKIETSSNHSQESGCETPSSTQA--SSVNGTDDEKASHASPADTHLK---S 164
Cdd:TIGR02169  250 EEELEKLTEEISELEkrlEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAeiASLERSIAEKERELEDAEERLAkleA 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    165 ENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKE 244
Cdd:TIGR02169  330 EIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELD 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    245 KntqLKRRIEELESEVREKEMELKDLRKQseiIPQLMSECEYVSEKLEAAER-------DNQNLEDKVRSLKTDIEESKY 317
Cdd:TIGR02169  410 R---LQEELQRLSEELADLNAAIAGIEAK---INELEEEKEDKALEIKKQEWkleqlaaDLSKYEQELYDLKEEYDRVEK 483

                   ....*....
gi 16758006    318 RQRHLKGEL 326
Cdd:TIGR02169  484 ELSKLQREL 492
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
167-342 1.36e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 56.60  E-value: 1.36e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    167 DKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQC----GEINRE 242
Cdd:TIGR02168  326 EELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIaslnNEIERL 405
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    243 KEKNTQLKRRIEELESEVRE--KEMELKDLRKQSEIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQR 320
Cdd:TIGR02168  406 EARLERLEDRRERLQQEIEEllKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELA 485
                          170       180
                   ....*....|....*....|..
gi 16758006    321 HLKGELKSfLEVLDGKIDDLHD 342
Cdd:TIGR02168  486 QLQARLDS-LERLQENLEGFSE 506
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
160-349 1.70e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 56.23  E-value: 1.70e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    160 THLKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQcgeI 239
Cdd:TIGR02169  698 RRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEED---L 774
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    240 NREKEKNTQLKRRIeeLESEVREKEMELKDLRKQ----SEIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEES 315
Cdd:TIGR02169  775 HKLEEALNDLEARL--SHSRIPEIQAELSKLEEEvsriEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSI 852
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 16758006    316 KYRQRHLKGELKSFLEVLD---GKIDDLHDFRRGLSK 349
Cdd:TIGR02169  853 EKEIENLNGKKEELEEELEeleAALRDLESRLGDLKK 889
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
162-326 2.91e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 52.37  E-value: 2.91e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    162 LKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEeqcGEINR 241
Cdd:TIGR02168  787 LEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLA---AEIEE 863
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    242 EKEKNTQLKRRIEELESEVREKEMELKDLRKQSEiipQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRH 321
Cdd:TIGR02168  864 LEELIEELESELEALLNERASLEEALALLRSELE---ELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDN 940

                   ....*
gi 16758006    322 LKGEL 326
Cdd:TIGR02168  941 LQERL 945
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
162-353 2.96e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 52.37  E-value: 2.96e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    162 LKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQcgeINR 241
Cdd:TIGR02168  717 LRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQ---IEQ 793
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    242 EKEKNTQLKRRIEELESEVREKEmelKDLRKQSEIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRH 321
Cdd:TIGR02168  794 LKEELKALREALDELRAELTLLN---EEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEE 870
                          170       180       190
                   ....*....|....*....|....*....|..
gi 16758006    322 LKGELKSFLEVLDGKIDDLHDFRRGLSKLGTD 353
Cdd:TIGR02168  871 LESELEALLNERASLEEALALLRSELEELSEE 902
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
163-340 4.03e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 51.98  E-value: 4.03e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    163 KSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINRE 242
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKE 755
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    243 ----KEKNTQLKRRIEELESEVREKEMELKDLRKQseiIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYR 318
Cdd:TIGR02168  756 ltelEAEIEELEERLEEAEEELAEAEAEIEELEAQ---IEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERR 832
                          170       180
                   ....*....|....*....|..
gi 16758006    319 qrhlKGELKSFLEVLDGKIDDL 340
Cdd:TIGR02168  833 ----IAATERRLEDLEEQIEEL 850
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
186-351 4.59e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 51.61  E-value: 4.59e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    186 ELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQC-----GEINREKEKNTQLKRRIEELESEV 260
Cdd:TIGR02169  231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIkdlgeEEQLRVKEKIGELEAEIASLERSI 310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    261 REKEMELKDLRKQSEI----IPQLMSECEYVSEKLE--AAERDN-----QNLEDKVRSLKTDIEESKYRQRHLKGELKSF 329
Cdd:TIGR02169  311 AEKERELEDAEERLAKleaeIDKLLAEIEELEREIEeeRKRRDKlteeyAELKEELEDLRAELEEVDKEFAETRDELKDY 390
                          170       180
                   ....*....|....*....|..
gi 16758006    330 LEVLDGKIDDLHDFRRGLSKLG 351
Cdd:TIGR02169  391 REKLEKLKREINELKRELDRLQ 412
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
162-331 2.40e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 49.55  E-value: 2.40e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 162 LKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQE-------SAASVEQWKRQFSICRDENDRLRSKIEELEE 234
Cdd:COG1196 244 LEAELEELEAELEELEAELAELEAELEELRLELEELELELEEaqaeeyeLLAELARLEQDIARLEERRRELEERLEELEE 323
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 235 QCGEINREKEKNTQ----LKRRIEELESEVREKEMELKDLRKQSEIIPQLMSECEyvsEKLEAAERDNQNLEDKVRSLKT 310
Cdd:COG1196 324 ELAELEEELEELEEeleeLEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAE---EELEELAEELLEALRAAAELAA 400
                       170       180
                ....*....|....*....|.
gi 16758006 311 DIEESKYRQRHLKGELKSFLE 331
Cdd:COG1196 401 QLEELEEAEEALLERLERLEE 421
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
101-314 3.05e-06

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 48.88  E-value: 3.05e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  101 EKFQEVREAARLARDKSQEKIETSSNHSQE-SGCETPSSTQASSVNGTDDEKASHASPADTH------LKSENDKLKIAL 173
Cdd:PRK02224 223 ERYEEQREQARETRDEADEVLEEHEERREElETLEAEIEDLRETIAETEREREELAEEVRDLrerleeLEEERDDLLAEA 302
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  174 TQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKEKNtqlKRRI 253
Cdd:PRK02224 303 GLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEA---REAV 379
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 16758006  254 EELESEVREKEMELKDLRKQSEIIPQLMSECEYVSEKLeAAERDnqNLEDKVRSLKTDIEE 314
Cdd:PRK02224 380 EDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEEL-REERD--ELREREAELEATLRT 437
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
150-275 6.05e-06

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 47.93  E-value: 6.05e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 150 EKASHASPADTHLKSENDKLK--IALTQSAANVKKWEIELQTLRESNARLTTALQEsaasveqwkrqfsiCRDENDRLRS 227
Cdd:COG2433 383 EELIEKELPEEEPEAEREKEHeeRELTEEEEEIRRLEEQVERLEAEVEELEAELEE--------------KDERIERLER 448
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....*...
gi 16758006 228 KIEELEEQCGEINREKEKNTQLKRRIEELESEVREKEMELKDLRKQSE 275
Cdd:COG2433 449 ELSEARSEERREIRKDREISRLDREIERLERELEEERERIEELKRKLE 496
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
161-354 1.11e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 47.22  E-value: 1.11e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  161 HLKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQES---------AASVEQWKRQFSICRDENDRLRSKIEE 231
Cdd:COG4913  614 ALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSwdeidvasaEREIAELEAELERLDASSDDLAALEEQ 693
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  232 LEEQCGEINREKEKNTQLKRRIEELESEVREKEMELKDLRKQSEIIPQLmsECEYVSEKLEA--AERDNQNLEDKVR-SL 308
Cdd:COG4913  694 LEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDL--ARLELRALLEErfAAALGDAVERELReNL 771
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 16758006  309 KTDIEESKYRQRHLKGEL----KSFLEV-------LDGKIDDLHDFRRGLSKLGTDN 354
Cdd:COG4913  772 EERIDALRARLNRAEEELeramRAFNREwpaetadLDADLESLPEYLALLDRLEEDG 828
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
185-323 1.23e-05

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 47.16  E-value: 1.23e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 185 IELQTLRESNARLTTALQESAASVeqwkRQFSICRDENDRLRSKIEELEEQCGEInreKEKNTQLKRRIEELESEVREKE 264
Cdd:COG2433 382 LEELIEKELPEEEPEAEREKEHEE----RELTEEEEEIRRLEEQVERLEAEVEEL---EAELEEKDERIERLERELSEAR 454
                        90       100       110       120       130       140
                ....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 265 MELK-DLRKQSEIipqlmseceyvseklEAAERDNQNLEDKVRSLKTDIEESKYRQRHLK 323
Cdd:COG2433 455 SEERrEIRKDREI---------------SRLDREIERLERELEEERERIEELKRKLERLK 499
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
223-340 1.38e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 45.69  E-value: 1.38e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 223 DRLRSKIEELEEQCGEINREKEKN----TQLKRRIEELESEVREKEMELKDLRKQSEIIPQLMSEC----EY--VSEKLE 292
Cdd:COG1579  20 DRLEHRLKELPAELAELEDELAALearlEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVrnnkEYeaLQKEIE 99
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....*...
gi 16758006 293 AAERDNQNLEDKVRSLKTDIEESKYRQRHLKGELKSFLEVLDGKIDDL 340
Cdd:COG1579 100 SLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAEL 147
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
164-340 1.55e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 46.94  E-value: 1.55e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   164 SENDKLKIALTQSAANVKKweiELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQcgeINREK 243
Cdd:TIGR04523 331 SQNNKIISQLNEQISQLKK---ELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESK---IQNQE 404
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   244 EKNTQLKRRIEELESEVREKEMELKDLRKQseiIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRHLK 323
Cdd:TIGR04523 405 KLNQQKDEQIKKLQQEKELLEKEIERLKET---IIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIK 481
                         170
                  ....*....|....*..
gi 16758006   324 GELKSFLEVLDGKIDDL 340
Cdd:TIGR04523 482 QNLEQKQKELKSKEKEL 498
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
162-314 2.67e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 46.08  E-value: 2.67e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 162 LKSENDKLKIALTqsAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIE----------- 230
Cdd:COG1196 218 LKEELKELEAELL--LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEeaqaeeyella 295
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 231 ELEEQCGEINREKEKNTQLKRRIEELESEVREKEMELKDLRKQ-SEIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLK 309
Cdd:COG1196 296 ELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEElEELEEELEEAEEELEEAEAELAEAEEALLEAEAELA 375

                ....*
gi 16758006 310 TDIEE 314
Cdd:COG1196 376 EAEEE 380
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
201-350 4.01e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 45.44  E-value: 4.01e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  201 LQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKEKNTQLKRRIEELESEVREKEMELKDLRKQSEIIPQL 280
Cdd:PRK03918 195 IKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKE 274
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  281 MSECEYVSEKLEAAERDNQNLEdKVRSLKTDIEESKYRQRHLKGELKSFLEVLDGKIDDLHDFRRGLSKL 350
Cdd:PRK03918 275 IEELEEKVKELKELKEKAEEYI-KLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEEL 343
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
162-314 5.58e-05

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 45.20  E-value: 5.58e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   162 LKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSIcrdendrLRSKIEELEEQCgeinR 241
Cdd:pfam10174 343 LQTEVDALRLRLEEKESFLNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINV-------LQKKIENLQEQL----R 411
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   242 EKEKN-TQLKRRIEELESE--------------VREKEMELKDLRKQSEiipqlmSECEYVSEKLEAAERDNQNLEDKVR 306
Cdd:pfam10174 412 DKDKQlAGLKERVKSLQTDssntdtalttleeaLSEKERIIERLKEQRE------REDRERLEELESLKKENKDLKEKVS 485

                  ....*...
gi 16758006   307 SLKTDIEE 314
Cdd:pfam10174 486 ALQPELTE 493
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
219-349 5.76e-05

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.06  E-value: 5.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    219 RDENDRLRSKIEELEEQCG----EINREKEKNTQLKRRIEELESEVREKEMEL----KDLRKQSEIIPQLMSECEYVSEK 290
Cdd:TIGR02169  673 PAELQRLRERLEGLKRELSslqsELRRIENRLDELSQELSDASRKIGEIEKEIeqleQEEEKLKERLEELEEDLSSLEQE 752
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    291 LEAAERDNQNLEDKVRSLKTDIEESKYRQRHLKGEL-KSFLEVLDGKIDDLHDFRRGLSK 349
Cdd:TIGR02169  753 IENVKSELKELEARIEELEEDLHKLEEALNDLEARLsHSRIPEIQAELSKLEEEVSRIEA 812
PTZ00121 PTZ00121
MAEBL; Provisional
101-336 6.34e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.13  E-value: 6.34e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   101 EKFQEVREAARLARDKSQEKIETSSNHSQESGCETPSSTQASSVNGTDDEKASHASPADTHLKSENDKLKIALTQSAANV 180
Cdd:PTZ00121 1546 KKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEEL 1625
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   181 KKWEIElqtlRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKEKNTQLKRRIEELESEV 260
Cdd:PTZ00121 1626 KKAEEE----KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 16758006   261 REKEMELKDLRKQSEIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRHLKGELKSFLEVLDGK 336
Cdd:PTZ00121 1702 KKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEK 1777
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
168-320 6.45e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 44.76  E-value: 6.45e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 168 KLKIALTQSAANVKKWEIELQTLRESNARLTTALQ--ESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKEK 245
Cdd:COG4717  92 ELQEELEELEEELEELEAELEELREELEKLEKLLQllPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAE 171
                        90       100       110       120       130       140       150
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 16758006 246 NTQLKRRIEELESEVR-EKEMELKDLRKQSEiipQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQR 320
Cdd:COG4717 172 LAELQEELEELLEQLSlATEEELQDLAEELE---ELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEER 244
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
186-314 9.54e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.37  E-value: 9.54e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 186 ELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKEKN------TQLKRRIEELESE 259
Cdd:COG1579  32 ELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNKEYEalqkeiESLKRRISDLEDE 111
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*
gi 16758006 260 VREKEMELKDLRKQseiIPQLMSECEYVSEKLEAAERDnqnLEDKVRSLKTDIEE 314
Cdd:COG1579 112 ILELMERIEELEEE---LAELEAELAELEAELEEKKAE---LDEELAELEAELEE 160
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
186-334 1.26e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.89  E-value: 1.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    186 ELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEqcgEINREKEKNTQLKRRIEELESEVREKEM 265
Cdd:TIGR02168  289 ELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAE---ELAELEEKLEELKEELESLEAELEELEA 365
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 16758006    266 ELKDLRKQSEiipQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRHLKGELKSFLEVLD 334
Cdd:TIGR02168  366 ELEELESRLE---ELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE 431
PRK12704 PRK12704
phosphodiesterase; Provisional
224-333 1.54e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 43.61  E-value: 1.54e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  224 RLRSKIEELEEQCGEINREK--EKNTQLKRRIEELESEVREKEMELKDLRKQseiipqLMSECEYVSEKLEAAERDNQNL 301
Cdd:PRK12704  39 EAKRILEEAKKEAEAIKKEAllEAKEEIHKLRNEFEKELRERRNELQKLEKR------LLQKEENLDRKLELLEKREEEL 112
                         90       100       110
                 ....*....|....*....|....*....|..
gi 16758006  302 EDKVRSLKTDIEESKYRQRHLKGELKSFLEVL 333
Cdd:PRK12704 113 EKKEKELEQKQQELEKKEEELEELIEEQLQEL 144
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
185-349 1.78e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 42.22  E-value: 1.78e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 185 IELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKEKNTQLKRRIEELESEVReKE 264
Cdd:COG1579  10 LDLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVR-NN 88
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 265 MELKDLRKQSEIIPQLMSECEY----VSEKLEAAERDNQNLEDKVRSLKTDIEEskyrqrhLKGELKSFLEVLDGKIDDL 340
Cdd:COG1579  89 KEYEALQKEIESLKRRISDLEDeileLMERIEELEEELAELEAELAELEAELEE-------KKAELDEELAELEAELEEL 161

                ....*....
gi 16758006 341 HDFRRGLSK 349
Cdd:COG1579 162 EAEREELAA 170
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
114-340 1.96e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 43.57  E-value: 1.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    114 RDKSQEKIETSSNHSQESgCETPSSTQASSVNGTDdEKASHASPADTHLKSEndkLKIALTQSAANVKKWEIELQTLRES 193
Cdd:pfam15921  251 KSESQNKIELLLQQHQDR-IEQLISEHEVEITGLT-EKASSARSQANSIQSQ---LEIIQEQARNQNSMYMRQLSDLEST 325
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    194 NARLTTALQESAAS----VEQWKRQFSICRDE------------------NDRLRSKIEELEEQCGEINREKEKNTQLKR 251
Cdd:pfam15921  326 VSQLRSELREAKRMyedkIEELEKQLVLANSEltearterdqfsqesgnlDDQLQKLLADLHKREKELSLEKEQNKRLWD 405
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    252 R-------IEELESEVREKEMELKDLRKqseIIPQLMSECE-YVSEKLEAAERDNQNLEdKVRSLKTDIEESKYRQRHLK 323
Cdd:pfam15921  406 RdtgnsitIDHLRRELDDRNMEVQRLEA---LLKAMKSECQgQMERQMAAIQGKNESLE-KVSSLTAQLESTKEMLRKVV 481
                          250       260
                   ....*....|....*....|
gi 16758006    324 GEL---KSFLEVLDGKIDDL 340
Cdd:pfam15921  482 EELtakKMTLESSERTVSDL 501
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
213-340 2.30e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 2.30e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 213 RQFSICRDENDRLRSKIEELEEQCGEINREKEKNTQLKRRIEELESEVREKEMELKDLRKQSEIIPQLmseceyvsEKLE 292
Cdd:COG4717  64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLY--------QELE 135
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....*...
gi 16758006 293 AAERDNQNLEDKVRSLKTDIEESKYRQRHLKgELKSFLEVLDGKIDDL 340
Cdd:COG4717 136 ALEAELAELPERLEELEERLEELRELEEELE-ELEAELAELQEELEEL 182
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
162-349 2.36e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 43.11  E-value: 2.36e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  162 LKSENDKLKIALTQsAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINR 241
Cdd:PRK02224 487 LEEEVEEVEERLER-AEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEE 565
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  242 EKEKNTQLKRRIEELESEVREKEMELKDLRKQSEIIPQLMSECEYVSEKLEA-AERDNQNLE------DKVRSLKTDIEE 314
Cdd:PRK02224 566 EAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREAlAELNDERRErlaekrERKRELEAEFDE 645
                        170       180       190
                 ....*....|....*....|....*....|....*.
gi 16758006  315 SKYRQ-RHLKGELKSFLEVLDGKIDDLHDFRRGLSK 349
Cdd:PRK02224 646 ARIEEaREDKERAEEYLEQVEEKLDELREERDDLQA 681
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
219-350 2.81e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 42.74  E-value: 2.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    219 RDENDRLRSKIEELEEQCGEINRE-KEKNTQLKRRIEELESEVREKEMELKDLRKQSEIIPQLMSECEYVSEKLEAAERD 297
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKAlAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKE 755
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 16758006    298 NQNLEDKVRSLKTDIEESKYRQRHLKGELKSFLEVLDGKIDDLHDFRRGLSKL 350
Cdd:TIGR02168  756 LTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDEL 808
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
186-347 2.91e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.98  E-value: 2.91e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  186 ELQTLRESNARLTtALQESAASVEQWKRQFSICRDENDRLRS-----KIEELEEQCGEINREKEKNTQLKRRIEELESEV 260
Cdd:COG4913  243 ALEDAREQIELLE-PIRELAERYAAARERLAELEYLRAALRLwfaqrRLELLEAELEELRAELARLEAELERLEARLDAL 321
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  261 REKEMELKDLRKQS--EIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRHLKGELKSFLEVLDGKID 338
Cdd:COG4913  322 REELDELEAQIRGNggDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELE 401

                 ....*....
gi 16758006  339 DLHDFRRGL 347
Cdd:COG4913  402 ALEEALAEA 410
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
219-323 3.04e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 42.74  E-value: 3.04e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  219 RDENDRLRSKIEELEEQCGEIN---REKEKN-TQLKRRIEELESEVREKEMELKDLRKQSEIIPQLMSECEYVSEKLEAA 294
Cdd:PRK03918 171 IKEIKRRIERLEKFIKRTENIEeliKEKEKElEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESL 250
                         90       100
                 ....*....|....*....|....*....
gi 16758006  295 ERDNQNLEDKVRSLKTDIEESKYRQRHLK 323
Cdd:PRK03918 251 EGSKRKLEEKIRELEERIEELKKEIEELE 279
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
158-345 3.32e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.59  E-value: 3.32e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  158 ADTHLKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFsicrdenDRLRSKIEELEEQCG 237
Cdd:COG4913  261 AERYAAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELERLEARL-------DALREELDELEAQIR 333
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  238 EINREKEKntQLKRRIEELESEVREKEMELKDLRKQSEII--PQLMSECEYVSEKLEAAERDNQ------NLEDKVRSLK 309
Cdd:COG4913  334 GNGGDRLE--QLEREIERLERELEERERRRARLEALLAALglPLPASAEEFAALRAEAAALLEAleeeleALEEALAEAE 411
                        170       180       190
                 ....*....|....*....|....*....|....*..
gi 16758006  310 TDIEESKYRQRHLKGELKSfLEVLDGKID-DLHDFRR 345
Cdd:COG4913  412 AALRDLRRELRELEAEIAS-LERRKSNIPaRLLALRD 447
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
217-323 5.84e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 41.97  E-value: 5.84e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  217 ICRDEN--DRLRSKIEELEEQCGEINrekekntQLKRRIEELESEVREKEMELKDLRKQSEIIPQLMSECEYVSEKLEAA 294
Cdd:PRK03918 185 IKRTENieELIKEKEKELEEVLREIN-------EISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKL 257
                         90       100
                 ....*....|....*....|....*....
gi 16758006  295 ERDNQNLEDKVRSLKTDIEESKYRQRHLK 323
Cdd:PRK03918 258 EEKIRELEERIEELKKEIEELEEKVKELK 286
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
241-340 6.65e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 41.59  E-value: 6.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    241 REKEKNTQLKRRIEELEsevREKEMELKDLRKQSEIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQR 320
Cdd:TIGR02169  671 SEPAELQRLRERLEGLK---RELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLS 747
                           90       100
                   ....*....|....*....|...
gi 16758006    321 HLKGEL---KSFLEVLDGKIDDL 340
Cdd:TIGR02169  748 SLEQEIenvKSELKELEARIEEL 770
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
160-342 7.68e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 41.54  E-value: 7.68e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   160 THLKSENDKLKIALTQSaanvkkwEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEI 239
Cdd:TIGR04523 352 TNSESENSEKQRELEEK-------QNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELL 424
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   240 NRE----KEKNTQLKRRIEELESEVREKEMELKDLRKQSEIIPQLMS--ECEYVSEK----------------LEAAERD 297
Cdd:TIGR04523 425 EKEierlKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKvlSRSINKIKqnleqkqkelkskekeLKKLNEE 504
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 16758006   298 NQNLEDKVRSLKTDIEESKYRQRHLKGELKSflevLDGKIDDLHD 342
Cdd:TIGR04523 505 KKELEEKVKDLTKKISSLKEKIEKLESEKKE----KESKISDLED 545
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
181-336 7.79e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 41.59  E-value: 7.79e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  181 KKWEIELQTLRESNARLTTALQESAASVEQWKRQFSicRDENDRLRSKIEELEEqcgEINREKEKNTQLKRRIEELESEV 260
Cdd:PRK03918 622 KKLEEELDKAFEELAETEKRLEELRKELEELEKKYS--EEEYEELREEYLELSR---ELAGLRAELEELEKRREEIKKTL 696
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 16758006  261 REKEMELKDLRKQSEIIpqlmseceyvsEKLEAAERDNQNLEDKVRSLKTDIEESKYRQrhlKGELKS--FLEVLDGK 336
Cdd:PRK03918 697 EKLKEELEEREKAKKEL-----------EKLEKALERVEELREKVKKYKALLKERALSK---VGEIASeiFEELTEGK 760
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
191-345 8.41e-04

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 41.36  E-value: 8.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    191 RESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIE-ELEEQCGEINREKEkntQLKRRIEELESEVREKEMELKD 269
Cdd:pfam12128  375 AKYNRRRSKIKEQNNRDIAGIKDKLAKIREARDRQLAVAEdDLQALESELREQLE---AGKLEFNEEEYRLKSRLGELKL 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    270 LRKQSEIIPQLM-------SECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQR-------HLKGELKSFLEVLDG 335
Cdd:pfam12128  452 RLNQATATPELLlqlenfdERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRqasrrleERQSALDELELQLFP 531
                          170
                   ....*....|
gi 16758006    336 KIDDLHDFRR 345
Cdd:pfam12128  532 QAGTLLHFLR 541
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
184-314 1.11e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 40.82  E-value: 1.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    184 EIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINRE----KEKNTQLKRRIEELESE 259
Cdd:TIGR02169  818 EQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAlrdlESRLGDLKKERDELEAQ 897
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 16758006    260 VREKEMELKDLRKQSEIIPQLMSECEyvsEKLEAAERDNQNLEDKVRSLKTDIEE 314
Cdd:TIGR02169  898 LRELERKIEELEAQIEKKRKRLSELK---AKLEALEEELSEIEDPKGEDEEIPEE 949
ClpA COG0542
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein ...
223-314 1.45e-03

ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 440308 [Multi-domain]  Cd Length: 836  Bit Score: 40.45  E-value: 1.45e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 223 DRLRSKIEELEEqcgEINR-EKEKNTQLKRRIEELESEVREKEMELKDLRKQSEIIPQLMSECEYVSEKLEAAERDNQNL 301
Cdd:COG0542 414 DELERRLEQLEI---EKEAlKKEQDEASFERLAELRDELAELEEELEALKARWEAEKELIEEIQELKEELEQRYGKIPEL 490
                        90
                ....*....|...
gi 16758006 302 EDKVRSLKTDIEE 314
Cdd:COG0542 491 EKELAELEEELAE 503
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
93-314 1.59e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 40.43  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006     93 EQQLTKFAEKFQEVREAARLARDK-SQEKIETSSNHSQESGCETPSSTQASSVNGTDDEKAShASPADTHLKSENDKLKI 171
Cdd:TIGR02168  788 EAQIEQLKEELKALREALDELRAElTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEE-LSEDIESLAAEIEELEE 866
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    172 ALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWkrqfsicRDENDRLRSKIEELEEQCGEINREKEKntqLKR 251
Cdd:TIGR02168  867 LIEELESELEALLNERASLEEALALLRSELEELSEELREL-------ESKRSELRRELEELREKLAQLELRLEG---LEV 936
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 16758006    252 RIEELESEVREK-EMELkdlrkqseiipqlmsecEYVSEKLEAAERDNQNLEDKVRSLKTDIEE 314
Cdd:TIGR02168  937 RIDNLQERLSEEySLTL-----------------EEAEALENKIEDDEEEARRRLKRLENKIKE 983
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
184-308 2.21e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 39.75  E-value: 2.21e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 184 EIELQTLRESNARL-TTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKEKNT--QLKRRIEELESEV 260
Cdd:COG4717 362 ELQLEELEQEIAALlAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDeeELEEELEELEEEL 441
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....*...
gi 16758006 261 REKEMELKDLRKQSEIIPQLMSECEyVSEKLEAAERDNQNLEDKVRSL 308
Cdd:COG4717 442 EELEEELEELREELAELEAELEQLE-EDGELAELLQELEELKAELREL 488
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
181-333 2.50e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 39.66  E-value: 2.50e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  181 KKWEIELQTLRESNARLTtALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINREKEKNTQLKRRIEELESEV 260
Cdd:PRK03918 162 NAYKNLGEVIKEIKRRIE-RLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEI 240
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 16758006  261 REKEMELKDLRKQSEIIPQLMSECEYVSEKLEAAERDnqnLEDKVRSLKtDIEESKYRQRHLKGELKSFLEVL 333
Cdd:PRK03918 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE---LEEKVKELK-ELKEKAEEYIKLSEFYEEYLDEL 309
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
223-326 2.84e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 39.51  E-value: 2.84e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  223 DRLRSKIEELEEQCGEINREKEKNTQLKRRIEE--------------------LESEVREKEMELKDLRKQSEIIPQLMS 282
Cdd:COG4913  613 AALEAELAELEEELAEAEERLEALEAELDALQErrealqrlaeyswdeidvasAEREIAELEAELERLDASSDDLAALEE 692
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....
gi 16758006  283 ECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRHLKGEL 326
Cdd:COG4913  693 QLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRL 736
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
150-314 3.17e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 39.53  E-value: 3.17e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 150 EKASHASPADTHLKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKI 229
Cdd:COG1196 344 EELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEEL 423
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 230 EELEEQCGEINREKEKNTQLKRRIEELESEVREKEMELKDLRKQSeiipqlmsecEYVSEKLEAAERDNQNLEDKVRSLK 309
Cdd:COG1196 424 EELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAEL----------LEEAALLEAALAELLEELAEAAARL 493

                ....*
gi 16758006 310 TDIEE 314
Cdd:COG1196 494 LLLLE 498
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
149-314 3.22e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 39.12  E-value: 3.22e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 149 DEKASHASPADTHLKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSK 228
Cdd:COG4372   2 DRLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEE 81
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 229 IEELEEQcgeINREKEKNTQLKRRIEELESEVREKEMELKDLRKQSEiipQLMSECEYVSEKLEAAERDNQNLEDKVRSL 308
Cdd:COG4372  82 LEELNEQ---LQAAQAELAQAQEELESLQEEAEELQEELEELQKERQ---DLEQQRKQLEAQIAELQSEIAEREEELKEL 155

                ....*.
gi 16758006 309 KTDIEE 314
Cdd:COG4372 156 EEQLES 161
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
155-340 3.86e-03

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 39.25  E-value: 3.86e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  155 ASPADTHLKSENDKLKIALTQSAANVKkwEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEE 234
Cdd:PRK02224 171 ASDARLGVERVLSDQRGSLDQLKAQIE--EKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEE 248
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  235 QCGEINREKEKNTQLKRRIEELESEVREKEMELKDLRKQ----SEIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKT 310
Cdd:PRK02224 249 RREELETLEAEIEDLRETIAETEREREELAEEVRDLRERleelEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRD 328
                        170       180       190
                 ....*....|....*....|....*....|
gi 16758006  311 DIEESKYRQRHLKGELKSFLEvldgKIDDL 340
Cdd:PRK02224 329 RLEECRVAAQAHNEEAESLRE----DADDL 354
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
235-333 4.02e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 38.98  E-value: 4.02e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 235 QCGEINREKEKNTQLKRRIEELESEVREKEMELKDLRKQSEIIPQLMSEceyVSEKLEAAERDNQNLEDKVRSLKTDIEE 314
Cdd:COG4942  18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAA---LARRIRALEQELAALEAELAELEKEIAE 94
                        90
                ....*....|....*....
gi 16758006 315 SKYRQRHLKGELKSFLEVL 333
Cdd:COG4942  95 LRAELEAQKEELAELLRAL 113
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
249-333 4.49e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 38.84  E-value: 4.49e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 249 LKRRIEELESEVREKEMELKDLRKQSEIIPqlmseceyVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRHLKGELKS 328
Cdd:COG3206 180 LEEQLPELRKELEEAEAALEEFRQKNGLVD--------LSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGS 251

                ....*
gi 16758006 329 FLEVL 333
Cdd:COG3206 252 GPDAL 256
UPF0242 pfam06785
Uncharacterized protein family (UPF0242) N-terminus; This region includes an N-terminal ...
219-312 4.58e-03

Uncharacterized protein family (UPF0242) N-terminus; This region includes an N-terminal transmembrane region and a C-terminal coiled-coil.


Pssm-ID: 429117 [Multi-domain]  Cd Length: 194  Bit Score: 37.88  E-value: 4.58e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   219 RDENDRLRSKIEELEEqcgEINREKEKNTQLKRRIEELESEVREKEMELKD-LRKQSEIIPQLMSECEYVSEKLEAAERD 297
Cdd:pfam06785  96 QSEEERLEEELSQKEE---ELRRLTEENQQLQIQLQQISQDFAEFRLESEEqLAEKQLLINEYQQTIEEQRSVLEKRQDQ 172
                          90
                  ....*....|....*
gi 16758006   298 NQNLEDKVRSLKTDI 312
Cdd:pfam06785 173 IENLESKVRDLNYEI 187
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
162-299 4.66e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 38.73  E-value: 4.66e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 162 LKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEINR 241
Cdd:COG4372  50 LREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQ 129
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*...
gi 16758006 242 EKEkntQLKRRIEELESEVREKEMELKDLRKQSEIIPQLMSECEYVSEKLEAAERDNQ 299
Cdd:COG4372 130 QRK---QLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQA 184
PRK01156 PRK01156
chromosome segregation protein; Provisional
223-353 5.13e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 38.73  E-value: 5.13e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  223 DRLRSKIEELEEQCGEINREKEKNTQLKRRIEELESEVREKE-------MELKDLRKQSEIIPQLMSECEYVSEKLEAAE 295
Cdd:PRK01156 190 EKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMddynnlkSALNELSSLEDMKNRYESEIKTAESDLSMEL 269
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 16758006  296 RDNQNLEDKVRSLKTDIEESKYRQRH-------LKGEL---KSFLEVLDGKIDDLHDFRRGLSKLGTD 353
Cdd:PRK01156 270 EKNNYYKELEERHMKIINDPVYKNRNyindyfkYKNDIenkKQILSNIDAEINKYHAIIKKLSVLQKD 337
PLN02939 PLN02939
transferase, transferring glycosyl groups
159-316 5.20e-03

transferase, transferring glycosyl groups


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 38.73  E-value: 5.20e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  159 DTHLKSENDKLKIAlTQSAANVKKWEIELQTLRESnarLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGE 238
Cdd:PLN02939 176 EMRLSETDARIKLA-AQEKIHVEILEEQLEKLRNE---LLIRGATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAELIE 251
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  239 INREKEKNTQLKRRIEELESEVREKEMEL----KDLRKQSEI-IPQLMSECEYVSEKLEAAER----------DNQNLED 303
Cdd:PLN02939 252 VAETEERVFKLEKERSLLDASLRELESKFivaqEDVSKLSPLqYDCWWEKVENLQDLLDRATNqvekaalvldQNQDLRD 331
                        170
                 ....*....|...
gi 16758006  304 KVRSLKTDIEESK 316
Cdd:PLN02939 332 KVDKLEASLKEAN 344
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
192-350 5.64e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 38.88  E-value: 5.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    192 ESNARLTTALQESAASVEQWKRQFSIcrdenDRLRSKIEELEEQCGEINREKEKNTQLKRR-------IEELESEVREKE 264
Cdd:TIGR02168  206 ERQAEKAERYKELKAELRELELALLV-----LRLEELREELEELQEELKEAEEELEELTAElqeleekLEELRLEVSELE 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006    265 MELKDLRKQSEIIPQLMSECE-----------YVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRHLKGELKSFLEVL 333
Cdd:TIGR02168  281 EEIEELQKELYALANEISRLEqqkqilrerlaNLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAEL 360
                          170
                   ....*....|....*..
gi 16758006    334 DGKIDDLHDFRRGLSKL 350
Cdd:TIGR02168  361 EELEAELEELESRLEEL 377
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
162-327 5.81e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 38.51  E-value: 5.81e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  162 LKSENDKLKIALTQSAANVKKWEIELQTLRESNARLTTALQESAASVEQWKRQFSICRDENDRLRSKIEELEEQCGEinr 241
Cdd:PRK03918 194 LIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEE--- 270
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006  242 ekekntqLKRRIEELESEVREKEmELKDLRKQSEIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRH 321
Cdd:PRK03918 271 -------LKKEIEELEEKVKELK-ELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEE 342

                 ....*.
gi 16758006  322 LKGELK 327
Cdd:PRK03918 343 LKKKLK 348
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
184-334 5.82e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 38.57  E-value: 5.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006   184 EIELQTLRESNARLTTALQEsaasvEQWKRQFSIcrdenDRLRSKIEELEEQCGEINREKEKntqlKRRIEELESEVREK 263
Cdd:pfam17380 434 QREVRRLEEERAREMERVRL-----EEQERQQQV-----ERLRQQEEERKRKKLELEKEKRD----RKRAEEQRRKILEK 499
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 16758006   264 EMELKDLRKQSEIIPQLMSECEYVSEKLEAAERDNQNLEDKVRSLKTDIEESKYRQRHLK--GELKSFLEVLD 334
Cdd:pfam17380 500 ELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRkaTEERSRLEAME 572
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
186-276 6.58e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 38.21  E-value: 6.58e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16758006 186 ELQTLRESNARLTTALQESAASVEQWKRQFSIcrDENDRLRSKIEELEEQCGEINREKEKNTQLKRRIEELESEVREKEM 265
Cdd:COG4717 157 ELRELEEELEELEAELAELQEELEELLEQLSL--ATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLEN 234
                        90
                ....*....|.
gi 16758006 266 ELKDLRKQSEI 276
Cdd:COG4717 235 ELEAAALEERL 245
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH