NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1804072757|ref|NP_001365135|]
View 

microtubule cross-linking factor 1 isoform 2 [Homo sapiens]

Protein Classification

kinesin family protein( domain architecture ID 13530637)

kinesin family protein is a microtubule-dependent molecular motor that plays an important role in intracellular transport and in cell division and has ATPase-containing motor domain; similar to carboxy-terminal kinesins that contains a C-terminal domain responsible for the motor activity (it hydrolyzes ATP and binds microtubules)

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4482 pfam14818
Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately ...
1269-1392 2.15e-52

Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam11365.


:

Pssm-ID: 464333 [Multi-domain]  Cd Length: 138  Bit Score: 180.65  E-value: 2.15e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757 1269 MDLRWQIHHSEKNWNREKVELLDRLDRDRQEWERQKKEFLWRIEQLQKENSPRR------------GGSFLCDQKDGNVR 1336
Cdd:pfam14818    1 MDLRWQLQHTEKNWHREKMELLDRFDRERQEWESQKKIMQKKIEQLQREVSLRRkinmnerakvidGEKFVPDQKESSSP 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757 1337 PFPHQGSLRMPR--PVAMWPCADADSIPFEDRPLSKLKESDRCSASENLYLDALSLDD 1392
Cdd:pfam14818   81 PFPDSGQCEFPRmnHPGSLSKSDSDEESFLDEGNQKLKEQKRCKASENLFLDALSLDN 138
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
501-595 3.16e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


:

Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 138.58  E-value: 3.16e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  501 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI 580
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSDSSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1804072757  581 LGRKIVELEVENRGL 595
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
630-722 7.29e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


:

Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 137.43  E-value: 7.29e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  630 SSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA--GGGAPLQEELKSARLQISE 707
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSdsSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1804072757  708 LSGKVLKLQHENHAL 722
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
PHA03247 super family cl33720
large tegument protein UL36; Provisional
12-326 6.24e-15

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 81.52  E-value: 6.24e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   12 APDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFL----KDLHARPAAPGPAVP---------SSGRAPAPAAPRSP 78
Cdd:PHA03247  2563 APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRApvddRGDPRGPAPPSPLPPdthapdpppPSPSPAANEPDPHP 2642
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   79 NLAGKAPPSPGSLAAPGRLSR----RSGGVPGAKDKPPpgagaraaggakaALGSRRAARvAPAEPLSRAGKPPGAEPPS 154
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRprraRRLGRAAQASSPP-------------QRPRRRAAR-PTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  155 AAAKgRKAKRGSRAPP----ARTVGPPTPAARIPAVTLAVTSVAGSPARCSRiSHTDSSSDLSDCPSEPLSDEQRLLPAA 230
Cdd:PHA03247  2709 EPAP-HALVSATPLPPgpaaARQASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  231 SSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVS------GGFAGPGvaEDVRG 304
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPppslplGGSVAPG--GDVRR 2864
                          330       340
                   ....*....|....*....|....
gi 1804072757  305 RSPPERPV--PGTPKEPSLGEQSR 326
Cdd:PHA03247  2865 RPPSRSPAakPAAPARPPVRRLAR 2888
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-656 2.67e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 78.95  E-value: 2.67e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:TIGR02168  689 LEEKIAELEKALAELRKE-LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:TIGR02168  768 ERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  510 QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDsplptgeaggppsTREAELKLRLKLVEEEANILGRKIVELE 589
Cdd:TIGR02168  848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLE-------------EALALLRSELEELSEELRELESKRSELR 914
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1804072757  590 VENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIE 656
Cdd:TIGR02168  915 RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLE---EAEALENKIEDDEEEARRRLKRLE 978
 
Name Accession Description Interval E-value
DUF4482 pfam14818
Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately ...
1269-1392 2.15e-52

Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam11365.


Pssm-ID: 464333 [Multi-domain]  Cd Length: 138  Bit Score: 180.65  E-value: 2.15e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757 1269 MDLRWQIHHSEKNWNREKVELLDRLDRDRQEWERQKKEFLWRIEQLQKENSPRR------------GGSFLCDQKDGNVR 1336
Cdd:pfam14818    1 MDLRWQLQHTEKNWHREKMELLDRFDRERQEWESQKKIMQKKIEQLQREVSLRRkinmnerakvidGEKFVPDQKESSSP 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757 1337 PFPHQGSLRMPR--PVAMWPCADADSIPFEDRPLSKLKESDRCSASENLYLDALSLDD 1392
Cdd:pfam14818   81 PFPDSGQCEFPRmnHPGSLSKSDSDEESFLDEGNQKLKEQKRCKASENLFLDALSLDN 138
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
501-595 3.16e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 138.58  E-value: 3.16e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  501 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI 580
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSDSSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1804072757  581 LGRKIVELEVENRGL 595
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
630-722 7.29e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 137.43  E-value: 7.29e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  630 SSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA--GGGAPLQEELKSARLQISE 707
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSdsSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1804072757  708 LSGKVLKLQHENHAL 722
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
PHA03247 PHA03247
large tegument protein UL36; Provisional
12-326 6.24e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 81.52  E-value: 6.24e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   12 APDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFL----KDLHARPAAPGPAVP---------SSGRAPAPAAPRSP 78
Cdd:PHA03247  2563 APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRApvddRGDPRGPAPPSPLPPdthapdpppPSPSPAANEPDPHP 2642
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   79 NLAGKAPPSPGSLAAPGRLSR----RSGGVPGAKDKPPpgagaraaggakaALGSRRAARvAPAEPLSRAGKPPGAEPPS 154
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRprraRRLGRAAQASSPP-------------QRPRRRAAR-PTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  155 AAAKgRKAKRGSRAPP----ARTVGPPTPAARIPAVTLAVTSVAGSPARCSRiSHTDSSSDLSDCPSEPLSDEQRLLPAA 230
Cdd:PHA03247  2709 EPAP-HALVSATPLPPgpaaARQASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  231 SSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVS------GGFAGPGvaEDVRG 304
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPppslplGGSVAPG--GDVRR 2864
                          330       340
                   ....*....|....*....|....
gi 1804072757  305 RSPPERPV--PGTPKEPSLGEQSR 326
Cdd:PHA03247  2865 RPPSRSPAakPAAPARPPVRRLAR 2888
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-656 2.67e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 78.95  E-value: 2.67e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:TIGR02168  689 LEEKIAELEKALAELRKE-LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:TIGR02168  768 ERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  510 QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDsplptgeaggppsTREAELKLRLKLVEEEANILGRKIVELE 589
Cdd:TIGR02168  848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLE-------------EALALLRSELEELSEELRELESKRSELR 914
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1804072757  590 VENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIE 656
Cdd:TIGR02168  915 RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLE---EAEALENKIEDDEEEARRRLKRLE 978
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
345-668 1.77e-09

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 61.84  E-value: 1.77e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  345 SENDYLKDELDELRAEMEEMRDsyleedvyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQD 424
Cdd:COG4372     45 EELEQLREELEQAREELEQLEE--------ELEQARSELEQLEEELEELNEQLQAAQAELAQAQEE-------LESLQEE 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  425 LKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDS-- 502
Cdd:COG4372    110 AEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDEll 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  503 ADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPST---REAELKLRLKLVEEEAN 579
Cdd:COG4372    190 KEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEellEEVILKEIEELELAILV 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  580 ILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHN 659
Cdd:COG4372    270 EKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVG 349

                   ....*....
gi 1804072757  660 RQLTHELSK 668
Cdd:COG4372    350 LLDNDVLEL 358
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
350-542 2.70e-08

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 58.96  E-value: 2.70e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDsyleedvyqlqELRRELDRANKNCRILQYRLRKAEqKSLKVAETG------QVD--GELIRSL 421
Cdd:pfam05483  546 LRDELESVREEFIQKGD-----------EVKCKLDKSEENARSIEYEVLKKE-KQMKILENKcnnlkkQIEnkNKNIEEL 613
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQ---QMI-----EVEISKQALQNELERLKESSLKRRSTREMYKE 493
Cdd:pfam05483  614 HQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQkfeEIIdnyqkEIEDKKISEEKLLEEVEKAKAIADEAVKLQKE 693
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1804072757  494 kktfnqddsADLRCQLQFAkEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:pfam05483  694 ---------IDKRCQHKIA-EMVALMEKHKHQYDKIIEERDSELGLYKN 732
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
350-670 7.62e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 57.77  E-value: 7.62e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEmrdsyLEEDVYQLQELRRELDRANKNCRILQYRLR---------KAEQKSL--KVAETGQVDG--- 415
Cdd:PRK03918   219 LREELEKLEKEVKE-----LEELKEEIEELEKELESLEGSKRKLEEKIReleerieelKKEIEELeeKVKELKELKEkae 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  416 ---ELIRSLEQDLKVAKDVSVRL---HHELKTVEEKRAKAEDENETLRqqmiEVEISKQALQNELERLKESSLK----RR 485
Cdd:PRK03918   294 eyiKLSEFYEEYLDELREIEKRLsrlEEEINGIEERIKELEEKEERLE----ELKKKLKELEKRLEELEERHELyeeaKA 369
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  486 STREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREA 565
Cdd:PRK03918   370 KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEH 449
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  566 ELKLRLKLVEEEANILGRKIvELEVENRGLKAEMEDMRGQQEREgpgRDHAPSIPTSPFGDSLESSTEL--RRHLQFVEE 643
Cdd:PRK03918   450 RKELLEEYTAELKRIEKELK-EIEEKERKLRKELRELEKVLKKE---SELIKLKELAEQLKELEEKLKKynLEELEKKAE 525
                          330       340
                   ....*....|....*....|....*..
gi 1804072757  644 EAELLRRSISEIEDHNRQLTHELSKFK 670
Cdd:PRK03918   526 EYEKLKEKLIKLKGEIKSLKKELEKLE 552
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
13-319 2.08e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 46.21  E-value: 2.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   13 PDAKLQPPGQHHRHHHLHPVAERrrLHRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLA 92
Cdd:COG5180    158 SDPILAKDPDGDSASTLPPPAEK--LDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLTGGADHPRPEAASSPKVD 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   93 APGRLSRRSGgvPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAA-AKGRKAKRGSRAPPA 171
Cdd:COG5180    236 PPSTSEARSR--PATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAEtARPIDVKGVASAPPA 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  172 RTVGPPTPAAR---IPAVTLAVTSVAGSPARCSRISHTDSSSDlsdcPSEPLSDEQRLLPAASSDAESGTGSSDREPPRG 248
Cdd:COG5180    314 TRPVRPPGGARdpgTPRPGQPTERPAGVPEAASDAGQPPSAYP----PAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNG 389
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1804072757  249 APTPSPaARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAedvrgRSPPERPVPGTPKEP 319
Cdd:COG5180    390 APQPGL-GRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAGGAGQGPKAD-----FVPGDAESVSGPAGL 454
BAR cd07307
The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization module that binds membranes and detects ...
376-587 9.38e-04

The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization module that binds membranes and detects membrane curvature; BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions including organelle biogenesis, membrane trafficking or remodeling, and cell division and migration. Mutations in BAR containing proteins have been linked to diseases and their inactivation in cells leads to altered membrane dynamics. A BAR domain with an additional N-terminal amphipathic helix (an N-BAR) can drive membrane curvature. These N-BAR domains are found in amphiphysins and endophilins, among others. BAR domains are also frequently found alongside domains that determine lipid specificity, such as the Pleckstrin Homology (PH) and Phox Homology (PX) domains which are present in beta centaurins (ACAPs and ASAPs) and sorting nexins, respectively. A FES-CIP4 Homology (FCH) domain together with a coiled coil region is called the F-BAR domain and is present in Pombe/Cdc15 homology (PCH) family proteins, which include Fes/Fes tyrosine kinases, PACSIN or syndapin, CIP4-like proteins, and srGAPs, among others. The Inverse (I)-BAR or IRSp53/MIM homology Domain (IMD) is found in multi-domain proteins, such as IRSp53 and MIM, that act as scaffolding proteins and transducers of a variety of signaling pathways that link membrane dynamics and the underlying actin cytoskeleton. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. The I-BAR domain induces membrane protrusions in the opposite direction compared to classical BAR and F-BAR domains, which produce membrane invaginations. BAR domains that also serve as protein interaction domains include those of arfaptin and OPHN1-like proteins, among others, which bind to Rac and Rho GAP domains, respectively.


Pssm-ID: 153271 [Multi-domain]  Cd Length: 194  Bit Score: 42.43  E-value: 9.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  376 LQELRRELDRANKNCRILQYRLRKAEQKSLKVAEtgqvdgelirSLEqdlKVAKDVSVRLHHELKTVEEKRAKAEDENET 455
Cdd:cd07307      2 LDELEKLLKKLIKDTKKLLDSLKELPAAAEKLSE----------ALQ---ELGKELPDLSNTDLGEALEKFGKIQKELEE 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  456 LRQQMIE------VEISKQALQNELERLKESSLKRRSTREMYkekktfnqdDSAdlrcqlqFAKEEAflMRKKMAKLGRE 529
Cdd:cd07307     69 FRDQLEQklenkvIEPLKEYLKKDLKEIKKRRKKLDKARLDY---------DAA-------REKLKK--LRKKKKDSSKL 130
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757  530 KdELEQELQKYKSLYGDVdsplptgeaggppstrEAELKLRLKLVEEEANILGRKIVE 587
Cdd:cd07307    131 A-EAEEELQEAKEKYEEL----------------REELIEDLNKLEEKRKELFLSLLL 171
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
434-601 3.05e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.45  E-value: 3.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  434 RLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEK--KTFNQDDSADLRCQLQF 511
Cdd:COG1579     21 RLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQlgNVRNNKEYEALQKEIES 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  512 AKEEAFLMRKKMAKLGREKDELEQELQKYKSLYgdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVE 591
Cdd:COG1579    101 LKRRISDLEDEILELMERIEELEEELAELEAEL----------------AELEAELEEKKAELDEELAELEAELEELEAE 164
                          170
                   ....*....|
gi 1804072757  592 NRGLKAEMED 601
Cdd:COG1579    165 REELAAKIPP 174
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
56-329 4.98e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.87  E-value: 4.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   56 PAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLSRRS---------GGVP---GAKDKPPPGAGARAAGGA 123
Cdd:pfam17823  115 LAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAaiaaasaphAASPaprTAASSTTAASSTTAASSA 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  124 KAALGSRRAARVAPAEPLSRAGKPPGAepPSAAAKgrKAKRGSRAPPART----VGPPTPA--ARIPAVTLAVTSVAGSP 197
Cdd:pfam17823  195 PTTAASSAPATLTPARGISTAATATGH--PAAGTA--LAAVGNSSPAAGTvtaaVGTVTPAalATLAAAAGTVASAAGTI 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  198 ARCSRISHTDSSSdlSDCPSEPLSDEqrllPAASSDAES-GTGS--SDREP---PRGAPTPSPAARGAPPGSPEPPALLA 271
Cdd:pfam17823  271 NMGDPHARRLSPA--KHMPSDTMARN----PAAPMGAQAqGPIIqvSTDQPvhnTAGEPTPSPSNTTLEPNTPKSVASTN 344
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757  272 APLAAGACPGGRSiPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSLGEQSRLVP 329
Cdd:pfam17823  345 LAVVTTTKAQAKE-PSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAP 401
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
355-561 5.80e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.54  E-value: 5.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  355 DELRAEMEEMRDSYLEEdvyQLQELRRELDRA---------NKNCRILQ---YRLRKAEQKSlKVAETGQVDGELIRSLE 422
Cdd:NF033838    53 NESQKEHAKEVESHLEK---ILSEIQKSLDKRkhtqnvalnKKLSDIKTeylYELNVLKEKS-EAELTSKTKKELDAAFE 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  423 QDLKVAKDVSVRLHHELKTVEEKRAKAEDENE-------TLRQQMIEVEISKQALQnelerLKESSLkrrstrEMYKEKK 495
Cdd:NF033838   129 QFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEedrrnypTNTYKTLELEIAESDVE-----VKKAEL------ELVKEEA 197
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1804072757  496 TFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPS 561
Cdd:NF033838   198 KEPRDEEKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNVATSEQDKPKR 263
 
Name Accession Description Interval E-value
DUF4482 pfam14818
Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately ...
1269-1392 2.15e-52

Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam11365.


Pssm-ID: 464333 [Multi-domain]  Cd Length: 138  Bit Score: 180.65  E-value: 2.15e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757 1269 MDLRWQIHHSEKNWNREKVELLDRLDRDRQEWERQKKEFLWRIEQLQKENSPRR------------GGSFLCDQKDGNVR 1336
Cdd:pfam14818    1 MDLRWQLQHTEKNWHREKMELLDRFDRERQEWESQKKIMQKKIEQLQREVSLRRkinmnerakvidGEKFVPDQKESSSP 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757 1337 PFPHQGSLRMPR--PVAMWPCADADSIPFEDRPLSKLKESDRCSASENLYLDALSLDD 1392
Cdd:pfam14818   81 PFPDSGQCEFPRmnHPGSLSKSDSDEESFLDEGNQKLKEQKRCKASENLFLDALSLDN 138
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
501-595 3.16e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 138.58  E-value: 3.16e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  501 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI 580
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSDSSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1804072757  581 LGRKIVELEVENRGL 595
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
630-722 7.29e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 137.43  E-value: 7.29e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  630 SSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA--GGGAPLQEELKSARLQISE 707
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSdsSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1804072757  708 LSGKVLKLQHENHAL 722
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
PHA03247 PHA03247
large tegument protein UL36; Provisional
12-326 6.24e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 81.52  E-value: 6.24e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   12 APDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFL----KDLHARPAAPGPAVP---------SSGRAPAPAAPRSP 78
Cdd:PHA03247  2563 APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRApvddRGDPRGPAPPSPLPPdthapdpppPSPSPAANEPDPHP 2642
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   79 NLAGKAPPSPGSLAAPGRLSR----RSGGVPGAKDKPPpgagaraaggakaALGSRRAARvAPAEPLSRAGKPPGAEPPS 154
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRprraRRLGRAAQASSPP-------------QRPRRRAAR-PTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  155 AAAKgRKAKRGSRAPP----ARTVGPPTPAARIPAVTLAVTSVAGSPARCSRiSHTDSSSDLSDCPSEPLSDEQRLLPAA 230
Cdd:PHA03247  2709 EPAP-HALVSATPLPPgpaaARQASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  231 SSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVS------GGFAGPGvaEDVRG 304
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPppslplGGSVAPG--GDVRR 2864
                          330       340
                   ....*....|....*....|....
gi 1804072757  305 RSPPERPV--PGTPKEPSLGEQSR 326
Cdd:PHA03247  2865 RPPSRSPAakPAAPARPPVRRLAR 2888
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-656 2.67e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 78.95  E-value: 2.67e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:TIGR02168  689 LEEKIAELEKALAELRKE-LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:TIGR02168  768 ERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  510 QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDsplptgeaggppsTREAELKLRLKLVEEEANILGRKIVELE 589
Cdd:TIGR02168  848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLE-------------EALALLRSELEELSEELRELESKRSELR 914
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1804072757  590 VENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIE 656
Cdd:TIGR02168  915 RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLE---EAEALENKIEDDEEEARRRLKRLE 978
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
7-316 1.12e-10

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 67.12  E-value: 1.12e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757    7 PAGGGAPDAklqPPGQHHRHHHLHPVAERRRLHRAPSPA-RPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAP 85
Cdd:PHA03307   122 PPASPPPSP---APDLSEMLRPVGSPGPPPAASPPAAGAsPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   86 PSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRraARVAPAEPLSRAGKP---PGAEPPSAAAKGRKA 162
Cdd:PHA03307   199 PAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSE--SSGCGWGPENECPLPrpaPITLPTRIWEASGWN 276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  163 KRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDSSSDLSDCPSEPLSDEQRLLPAASSDAESGTGSSD 242
Cdd:PHA03307   277 GPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP 356
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1804072757  243 REPPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGP-----GVAEDVRGRSPPERPVPGTP 316
Cdd:PHA03307   357 PPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGrprpsPLDAGAASGAFYARYPLLTP 435
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-327 6.01e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.96  E-value: 6.01e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   10 GGAPDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFLKDLHARPAAPGPAVPSsgrapapaaprspnLAGKAPPSPg 89
Cdd:PHA03247  2639 DPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS--------------LTSLADPPP- 2703
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   90 slaapgrlsrrsggvPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAAAKGRK---AKRGS 166
Cdd:PHA03247  2704 ---------------PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPpttAGPPA 2768
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  167 RAPPARTVGPPTPAARIPAVTLAVTSVAGSPArcsrishtdsssdlsdcPSEPLSDEQRLLPAASSDAESGTGSSDREPP 246
Cdd:PHA03247  2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPS-----------------PWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  247 rgaPTPSPAARGAPPGSPEppallaaplaagacpggrsiPSGVSGGFAGPGvaEDVRgRSPPERPVPGTPKEPSLGEQSR 326
Cdd:PHA03247  2832 ---TSAQPTAPPPPPGPPP--------------------PSLPLGGSVAPG--GDVR-RRPPSRSPAAKPAAPARPPVRR 2885

                   .
gi 1804072757  327 L 327
Cdd:PHA03247  2886 L 2886
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
351-708 1.24e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 63.55  E-value: 1.24e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  351 KDELDELRAEMEEMrDSYLEEDVYQLQELRRELDRANKNcRILQYRLRKAEQKSL----------KVAETGQVDgELIRS 420
Cdd:TIGR02169  176 LEELEEVEENIERL-DLIIDEKRQQLERLRREREKAERY-QALLKEKREYEGYELlkekealerqKEAIERQLA-SLEEE 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  421 LEQDLKVAKDVSVRLHH---ELKTVEEK-RAKAEDENETLRQQM--IEVEISK-----QALQNELERLKESSLKRRSTRE 489
Cdd:TIGR02169  253 LEKLTEEISELEKRLEEieqLLEELNKKiKDLGEEEQLRVKEKIgeLEAEIASlersiAEKERELEDAEERLAKLEAEID 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  490 MYKEKKTfnqddsaDLRCQLQfakEEaflmRKKMAKLGREKDELEQELQKYKSLYGDVDsplptgeaggppsTREAELKL 569
Cdd:TIGR02169  333 KLLAEIE-------ELEREIE---EE----RKRRDKLTEEYAELKEELEDLRAELEEVD-------------KEFAETRD 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  570 RLKLVEEEANILGRKIVELEVENRGLKAEMedmrgqQEREGPGRDHAPSIPtspfgDSLESSTELrrhlqfvEEEAELLR 649
Cdd:TIGR02169  386 ELKDYREKLEKLKREINELKRELDRLQEEL------QRLSEELADLNAAIA-----GIEAKINEL-------EEEKEDKA 447
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1804072757  650 RSISEIEDHNRQLTHELSKFKFEpprepgwlgegASPGAGGGAPLQEELKSARLQISEL 708
Cdd:TIGR02169  448 LEIKKQEWKLEQLAADLSKYEQE-----------LYDLKEEYDRVEKELSKLQRELAEA 495
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
345-668 1.77e-09

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 61.84  E-value: 1.77e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  345 SENDYLKDELDELRAEMEEMRDsyleedvyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQD 424
Cdd:COG4372     45 EELEQLREELEQAREELEQLEE--------ELEQARSELEQLEEELEELNEQLQAAQAELAQAQEE-------LESLQEE 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  425 LKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDS-- 502
Cdd:COG4372    110 AEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDEll 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  503 ADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPST---REAELKLRLKLVEEEAN 579
Cdd:COG4372    190 KEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEellEEVILKEIEELELAILV 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  580 ILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHN 659
Cdd:COG4372    270 EKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVG 349

                   ....*....
gi 1804072757  660 RQLTHELSK 668
Cdd:COG4372    350 LLDNDVLEL 358
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
39-331 2.90e-09

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 62.50  E-value: 2.90e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   39 HRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLS--RRSGGVPGAKDKPPPGAG 116
Cdd:PHA03307    77 TEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEmlRPVGSPGPPPAASPPAAG 156
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  117 ARAAGGAKAALGSRRAARVAPA-EPLSRAGKPPGAEP----PSAAAKGRKAKRGS----RAPPARTVGPPTPAARIPAvt 187
Cdd:PHA03307   157 ASPAAVASDAASSRQAALPLSSpEETARAPSSPPAEPppstPPAAASPRPPRRSSpisaSASSPAPAPGRSAADDAGA-- 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  188 lAVTSVAGSPARCSRISHTDSSSDLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPP 267
Cdd:PHA03307   235 -SSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPR 313
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1804072757  268 ALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSLGEQSRLVPAA 331
Cdd:PHA03307   314 ASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSS 377
PHA03247 PHA03247
large tegument protein UL36; Provisional
8-330 5.25e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.88  E-value: 5.25e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757    8 AGGGAPDAKLQPPGQHHRHHHLHPVAERRRlHRAPSPARPflkdlharPAAPGPAVPSSGRapapaaprspnlAGKAPPS 87
Cdd:PHA03247  2728 ARQASPALPAAPAPPAVPAGPATPGGPARP-ARPPTTAGP--------PAPAPPAAPAAGP------------PRRLTRP 2786
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   88 PGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPP--SAAAKGRKAKRG 165
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRP 2866
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  166 -SRAPPARTVGPPTPAAR---IPAVTLAVTSVAGSPARCSRISHTDSssdlsdcPSEPLSDEQRLLPAASSDAEsgtgss 241
Cdd:PHA03247  2867 pSRSPAAKPAAPARPPVRrlaRPAVSRSTESFALPPDQPERPPQPQA-------PPPPQPQPQPPPPPQPQPPP------ 2933
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  242 drePPRGAPTPSPAARGAPPGSPEppallaaplAAGACPGGRSipsgvsgGFAGPGVAEDVRGRSPPERPVPGTPKEPSL 321
Cdd:PHA03247  2934 ---PPPPRPQPPLAPTTDPAGAGE---------PSGAVPQPWL-------GALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                   ....*....
gi 1804072757  322 GEQSRLVPA 330
Cdd:PHA03247  2995 PLTGHSLSR 3003
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
352-775 7.50e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 61.11  E-value: 7.50e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  352 DELDELRAEMEEmrdsyleedvyQLQELRRELDRANKNcRILQYRLRKAEQKSLKVAetgqvdgelIRSLEQDLKVAKDV 431
Cdd:COG1196    189 ERLEDILGELER-----------QLEPLERQAEKAERY-RELKEELKELEAELLLLK---------LRELEAELEELEAE 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  432 SVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQF 511
Cdd:COG1196    248 LEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAE 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  512 AKEEAFLMRKKMAKLGREKDELEQELQkykslygdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVE 591
Cdd:COG1196    328 LEEELEELEEELEELEEELEEAEEELE----------------------EAEAELAEAEEALLEAEAELAEAEEELEELA 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  592 NRGLKAEMEDMRGQQEREGPGRDHApsiptspfgdslESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKfkf 671
Cdd:COG1196    386 EELLEALRAAAELAAQLEELEEAEE------------ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEE--- 450
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  672 epprepgwlgegaspgaggGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRApsprdsdAESDA 751
Cdd:COG1196    451 -------------------EAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA-------EADYE 504
                          410       420
                   ....*....|....*....|....
gi 1804072757  752 GKKESDGEESRLPQPKREGPVGGE 775
Cdd:COG1196    505 GFLEGVKAALLLAGLRGLAGAVAV 528
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
350-725 2.19e-08

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 59.26  E-value: 2.19e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRD------SYLEEDVYQLQELRRELDRANKNCRILQYRLR--KAEQKSLKvaetGQVDGELIRSL 421
Cdd:TIGR04523  237 KQQEINEKTTEISNTQTqlnqlkDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNqlKSEISDLN----NQKEQDWNKEL 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERlKESSLKR-RSTREMYKEKKTFNQD 500
Cdd:TIGR04523  313 KSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEE-KQNEIEKlKKENQSYKQEIKNLES 391
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  501 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEaggppsTREAELKLRLKLVEEEANI 580
Cdd:TIGR04523  392 QINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLT------NQDSVKELIIKNLDNTRES 465
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  581 LGRKIVELEVENRGLKAEMEDMrgQQEREGPGRDHapSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNR 660
Cdd:TIGR04523  466 LETQLKVLSRSINKIKQNLEQK--QKELKSKEKEL--KKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKIS 541
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1804072757  661 QLTHELSKFKFepprepgwlgegaspgagggaplqeELKSARL--QISELSGKVLKLQHENHALLSN 725
Cdd:TIGR04523  542 DLEDELNKDDF-------------------------ELKKENLekEIDEKNKEIEELKQTQKSLKKK 583
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
370-1076 2.42e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 59.30  E-value: 2.42e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  370 EEDVYQLQELRRELDRANKNCRI-----LQYRLRKAEQKSLKVAETGQVDGELIRSLEQdLKVAKDVSVRLHHELktvEE 444
Cdd:TIGR02168  185 RENLDRLEDILNELERQLKSLERqaekaERYKELKAELRELELALLVLRLEELREELEE-LQEELKEAEEELEEL---TA 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  445 KRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRRstREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKK 522
Cdd:TIGR02168  261 ELQELEEKLEELRLEVSELEEEIEELQKELYALANeiSRLEQQ--KQILRERLANLERQLEELEAQLEELESKLDELAEE 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  523 MAKLGREKDELEQELQkykslygDVDSPLPTGEAggppstREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDM 602
Cdd:TIGR02168  339 LAELEEKLEELKEELE-------SLEAELEELEA------ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERL 405
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  603 RGQQER--------EGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPP 674
Cdd:TIGR02168  406 EARLERledrrerlQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELA 485
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  675 REPGWLgEGASPGAGGGAPLQEELKSARLQISELSGKVlklqhenhALLSNIQRCD------LAAHLGLRAPSPRDSDAE 748
Cdd:TIGR02168  486 QLQARL-DSLERLQENLEGFSEGVKALLKNQSGLSGIL--------GVLSELISVDegyeaaIEAALGGRLQAVVVENLN 556
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  749 S-----DAGKKESDGEESRLPqPKREGPVGGESDSEEMFEKTSGF-GSGKPSEASEPCPTELLKAREDSEYLVTLKHEAQ 822
Cdd:TIGR02168  557 AakkaiAFLKQNELGRVTFLP-LDSIKGTEIQGNDREILKNIEGFlGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNAL 635
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  823 RL---ERTVERLITDTDSFLHDAGLRGGAPLPGPGLQGEEEQ------GEGDQQEPQLLgTINAKMKAFKKELQAF---L 890
Cdd:TIGR02168  636 ELakkLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERRReieeleEKIEELEEKIA-ELEKALAELRKELEELeeeL 714
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  891 EQVNRIGDGLSPLPHLTESSSFLSTVTSVSRDSPIGNLGKELGpDLQSRLKEQLEwQLGPARG------DERESL----- 959
Cdd:TIGR02168  715 EQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELT-ELEAEIEELEE-RLEEAEEelaeaeAEIEELeaqie 792
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  960 -----------RLRAARELHRRADGDTGSHGLGGQtcfSLELRgppvLPEQSVSIEELQGQL------VQAARLHQEETE 1022
Cdd:TIGR02168  793 qlkeelkalreALDELRAELTLLNEEAANLRERLE---SLERR----IAATERRLEDLEEQIeelsedIESLAAEIEELE 865
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1804072757 1023 TFTNKIHKMEEEHLYALRWKELEMHSL--ALQNTLHE-RTWSDEKNLMQQELRSLKQ 1076
Cdd:TIGR02168  866 ELIEELESELEALLNERASLEEALALLrsELEELSEElRELESKRSELRRELEELRE 922
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
350-542 2.70e-08

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 58.96  E-value: 2.70e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDsyleedvyqlqELRRELDRANKNCRILQYRLRKAEqKSLKVAETG------QVD--GELIRSL 421
Cdd:pfam05483  546 LRDELESVREEFIQKGD-----------EVKCKLDKSEENARSIEYEVLKKE-KQMKILENKcnnlkkQIEnkNKNIEEL 613
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQ---QMI-----EVEISKQALQNELERLKESSLKRRSTREMYKE 493
Cdd:pfam05483  614 HQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQkfeEIIdnyqkEIEDKKISEEKLLEEVEKAKAIADEAVKLQKE 693
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1804072757  494 kktfnqddsADLRCQLQFAkEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:pfam05483  694 ---------IDKRCQHKIA-EMVALMEKHKHQYDKIIEERDSELGLYKN 732
PHA03247 PHA03247
large tegument protein UL36; Provisional
85-331 3.78e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.18  E-value: 3.78e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   85 PPSPGSLAAPGRLSRRSggVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPlsrAGKPPGAEPPSAAAKGRKAKR 164
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRS--VPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDD---RGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  165 ---GSRAPPARTVGPPTPAARIPAvtlAVTSVAGSPARCSRISHTDSSSDlsdcPSEPLSDEQRLLPAASSDAESGTGSS 241
Cdd:PHA03247  2626 pppPSPSPAANEPDPHPPPTVPPP---ERPRDDPAPGRVSRPRRARRLGR----AAQASSPPQRPRRRAARPTVGSLTSL 2698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  242 DREPPRGaPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSL 321
Cdd:PHA03247  2699 ADPPPPP-PTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
                          250
                   ....*....|
gi 1804072757  322 GEQSRLVPAA 331
Cdd:PHA03247  2778 GPPRRLTRPA 2787
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
443-769 5.57e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 58.16  E-value: 5.57e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  443 EEKRAKAEDENETLRQQMIEVEISKQALQNELERLKesslKRRSTREMYKEKKTFNQDDSADLRC-QLQFAKEEAFLMRK 521
Cdd:TIGR02169  169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLR----REREKAERYQALLKEKREYEGYELLkEKEALERQKEAIER 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  522 KMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI----------------LGRKI 585
Cdd:TIGR02169  245 QLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIaslersiaekereledAEERL 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  586 VELEVENRGLKAEMEDMRGQQEREGPGRDHApsipTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHE 665
Cdd:TIGR02169  325 AKLEAEIDKLLAEIEELEREIEEERKRRDKL----TEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE 400
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  666 LSKFKFEPPRepgwLGEGASPGAGGGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQRcdLAAHLGlrapsprDS 745
Cdd:TIGR02169  401 INELKRELDR----LQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ--LAADLS-------KY 467
                          330       340
                   ....*....|....*....|....
gi 1804072757  746 DAESDAGKKESDGEESRLPQPKRE 769
Cdd:TIGR02169  468 EQELYDLKEEYDRVEKELSKLQRE 491
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
7-264 6.62e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 57.94  E-value: 6.62e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757    7 PAGGGAPDAKLQPP---GQHHRHHHLHPVAERRRLHRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGK 83
Cdd:PRK07003   362 VTGGGAPGGGVPARvagAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDD 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   84 APPSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAK 163
Cdd:PRK07003   442 AADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDA 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  164 RGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCS--RIShtdsssdlsdcpseplSDEQRLLPAASSDAESGTGSS 241
Cdd:PRK07003   522 PAAAAPPAPEARPPTPAAAAPAARAGGAAAALDVLRNAgmRVS----------------SDRGARAAAAAKPAAAPAAAP 585
                          250       260
                   ....*....|....*....|....
gi 1804072757  242 DREPPRGA-PTPSPAARGAPPGSP 264
Cdd:PRK07003   586 KPAAPRVAvQVPTPRARAATGDAP 609
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
350-670 7.62e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 57.77  E-value: 7.62e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEmrdsyLEEDVYQLQELRRELDRANKNCRILQYRLR---------KAEQKSL--KVAETGQVDG--- 415
Cdd:PRK03918   219 LREELEKLEKEVKE-----LEELKEEIEELEKELESLEGSKRKLEEKIReleerieelKKEIEELeeKVKELKELKEkae 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  416 ---ELIRSLEQDLKVAKDVSVRL---HHELKTVEEKRAKAEDENETLRqqmiEVEISKQALQNELERLKESSLK----RR 485
Cdd:PRK03918   294 eyiKLSEFYEEYLDELREIEKRLsrlEEEINGIEERIKELEEKEERLE----ELKKKLKELEKRLEELEERHELyeeaKA 369
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  486 STREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREA 565
Cdd:PRK03918   370 KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEH 449
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  566 ELKLRLKLVEEEANILGRKIvELEVENRGLKAEMEDMRGQQEREgpgRDHAPSIPTSPFGDSLESSTEL--RRHLQFVEE 643
Cdd:PRK03918   450 RKELLEEYTAELKRIEKELK-EIEEKERKLRKELRELEKVLKKE---SELIKLKELAEQLKELEEKLKKynLEELEKKAE 525
                          330       340
                   ....*....|....*....|....*..
gi 1804072757  644 EAELLRRSISEIEDHNRQLTHELSKFK 670
Cdd:PRK03918   526 EYEKLKEKLIKLKGEIKSLKKELEKLE 552
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
350-609 9.81e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 57.25  E-value: 9.81e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQDLKVAK 429
Cdd:COG1196    265 LEAELEELRLELEELELE-LEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEE-------LAELEEELEELE 336
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:COG1196    337 EELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERL 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  510 QFAKEEAFLMRKKMAKLGREKDELEQELQkykslygdvdsplptgeaggppstreaELKLRLKLVEEEANILGRKIVELE 589
Cdd:COG1196    417 ERLEEELEELEEALAELEEEEEEEEEALE---------------------------EAAEEEAELEEEEEALLELLAELL 469
                          250       260
                   ....*....|....*....|
gi 1804072757  590 VENRGLKAEMEDMRGQQERE 609
Cdd:COG1196    470 EEAALLEAALAELLEELAEA 489
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
133-383 1.17e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 56.92  E-value: 1.17e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  133 ARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRGSR-APPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDSSSD 211
Cdd:PRK07764   586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAApAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG 665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  212 LSDCPSEPLSdeqrllPAASSDAESGTGSSDRePPRGAPTPSPAARGAPPGSPEPPallaaplaagacPGGRSIPSGVSG 291
Cdd:PRK07764   666 GDGWPAKAGG------AAPAAPPPAPAPAAPA-APAGAAPAQPAPAPAATPPAGQA------------DDPAAQPPQAAQ 726
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  292 GFAGPGVAEDVRGRSPPERPVPGTPKEPSLGEQSRLVPAAEEEELLREMEELRSEndylkdELDELRAEMEEMRDSYLEE 371
Cdd:PRK07764   727 GASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE------EEEMAEDDAPSMDDEDRRD 800
                          250
                   ....*....|...
gi 1804072757  372 -DVYQLQELRREL 383
Cdd:PRK07764   801 aEEVAMELLEEEL 813
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
81-331 1.30e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 56.78  E-value: 1.30e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   81 AGKAPPS-PGSLAAPGrlsRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAAAKG 159
Cdd:PRK07003   369 GGGVPARvAGAVPAPG---ARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADG 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  160 RKAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARcsrishtdsssdlSDCPSEPLSdeqrllPAASSDAESGTG 239
Cdd:PRK07003   446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAP-------------PDAAFEPAP------RAAAPSAATPAA 506
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  240 SSDREPPRGAPTPSPAARGAPPgSPEPPALLAAPLAAGACPGGRSIP---------------SGVSGGFAGPGVAEDVRG 304
Cdd:PRK07003   507 VPDARAPAAASREDAPAAAAPP-APEARPPTPAAAAPAARAGGAAAAldvlrnagmrvssdrGARAAAAAKPAAAPAAAP 585
                          250       260       270
                   ....*....|....*....|....*....|
gi 1804072757  305 RSPPER---PVPgTPKEPSLGEQSRLVPAA 331
Cdd:PRK07003   586 KPAAPRvavQVP-TPRARAATGDAPPNGAA 614
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
345-661 1.67e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 56.61  E-value: 1.67e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  345 SENDYLKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKV-AETGQVDGElIRSLEQ 423
Cdd:TIGR02169  695 SELRRIENRLDELSQELSDASRK-IGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVkSELKELEAR-IEELEE 772
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  424 DLKVAK----DVSVRL-HHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFN 498
Cdd:TIGR02169  773 DLHKLEealnDLEARLsHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSI 852
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  499 QDDSADLRCQL-----QFAKEEAFL--MRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEaggppsTREAELKLRL 571
Cdd:TIGR02169  853 EKEIENLNGKKeeleeELEELEAALrdLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKR------KRLSELKAKL 926
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  572 KLVEEEANILGRKIVELEVEN------RGLKAEMEDMRGQQEREGPGRDHAPsiptSPFGDSLESSTELRRHLQFVEEEA 645
Cdd:TIGR02169  927 EALEEELSEIEDPKGEDEEIPeeelslEDVQAELQRVEEEIRALEPVNMLAI----QEYEEVLKRLDELKEKRAKLEEER 1002
                          330
                   ....*....|....*.
gi 1804072757  646 ELLRRSISEIEDHNRQ 661
Cdd:TIGR02169 1003 KAILERIEEYEKKKRE 1018
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
39-317 1.79e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 56.72  E-value: 1.79e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   39 HRAPSPARPFLKDLHARPAA--PGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAP---GRLSRRSGGVPGAKDKP-P 112
Cdd:PHA03307   118 PPTPPPASPPPSPAPDLSEMlrPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPlssPEETARAPSSPPAEPPPsT 197
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  113 PGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRGSRA-----PPARTVGPPTPAARIPAVT 187
Cdd:PHA03307   198 PPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENecplpRPAPITLPTRIWEASGWNG 277
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  188 LAVTSVAGSPARCSRISHTDSSSDLSDCPSEP----LSDEQRLLPAASSDAESGTGSSDREP--PRGA-----PTPSPAA 256
Cdd:PHA03307   278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPAPssprASSSSSSSRESSSSSTSSSSESSRGAavSPGPspsrsPSPSRPP 357
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1804072757  257 RGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPK 317
Cdd:PHA03307   358 PPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLD 418
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
60-265 4.24e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 55.38  E-value: 4.24e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   60 GPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAE 139
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  140 PLSRAGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARipavtlAVTSVAGSPARCSRISHTDSSSDLSDCPSEP 219
Cdd:PRK07764   670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG------QADDPAAQPPQAAQGASAPSPAADDPVPLPP 743
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1804072757  220 LSDEQRLLPAASSDAESGTGSSDREPPrgAPTPSPAARGAPPGSPE 265
Cdd:PRK07764   744 EPDDPPDPAGAPAQPPPPPAPAPAAAP--AAAPPPSPPSEEEEMAE 787
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
350-662 5.29e-07

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 54.77  E-value: 5.29e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSYLE--------EDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdGELIRSL 421
Cdd:COG4717     93 LQEELEELEEELEELEAELEElreeleklEKLLQLLPLYQELEALEAELAELPERLEELEERLEELREL----EEELEEL 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  422 EQDLKVAK--------DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKE 493
Cdd:COG4717    169 EAELAELQeeleelleQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEA 248
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  494 K---------------------KTFNQDDSADLRCQLQFAkeEAFLMRKKMAKLGREKDEL----------EQELQKYKS 542
Cdd:COG4717    249 RlllliaaallallglggsllsLILTIAGVLFLVLGLLAL--LFLLLAREKASLGKEAEELqalpaleeleEEELEELLA 326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  543 LYGdVDSPLPTGEAGGPPST-----------REAELKLRLKLVEEEANILG-----------RKIVELEVENRGLKAEME 600
Cdd:COG4717    327 ALG-LPPDLSPEELLELLDRieelqellreaEELEEELQLEELEQEIAALLaeagvedeeelRAALEQAEEYQELKEELE 405
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1804072757  601 DMRGQQEREGPGRDHAPSIPTspfGDSLES-STELRRHLQFVEEEAELLRRSISEIEDHNRQL 662
Cdd:COG4717    406 ELEEQLEELLGELEELLEALD---EEELEEeLEELEEELEELEEELEELREELAELEAELEQL 465
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
6-264 8.84e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 54.11  E-value: 8.84e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757    6 GPAGGGAPDAklqppgqhhrHHHLHPVAERRRLHRAPSPARPflkdlharPAAPGPAVPSSGRAPAPAAPRSPNLAGKAP 85
Cdd:PRK12323   366 GQSGGGAGPA----------TAAAAPVAQPAPAAAAPAAAAP--------APAAPPAAPAAAPAAAAAARAVAAAPARRS 427
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   86 PSPGSLAAP-GRLSRRSGGVPGAKDKPPPGAgaraaggakaalgsrrAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKR 164
Cdd:PRK12323   428 PAPEALAAArQASARGPGGAPAPAPAPAAAP----------------AAAARPAAAGPRPVAAAAAAAPARAAPAAAPAP 491
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  165 GSRAPPARTVGPPTPAARIPAvtlavtsvagsparcsrishtDSSSDLSDCPSEPLSDEqrllpaaSSDAESGTGSSDRE 244
Cdd:PRK12323   492 ADDDPPPWEELPPEFASPAPA---------------------QPDAAPAGWVAESIPDP-------ATADPDDAFETLAP 543
                          250       260
                   ....*....|....*....|
gi 1804072757  245 PPRGAPTPSPAARGAPPGSP 264
Cdd:PRK12323   544 APAAAPAPRAAAATEPVVAP 563
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
352-542 1.72e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 53.38  E-value: 1.72e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  352 DELDELRAEMEEMRDSY--LEedvyQLQELRRELDRANKNCRILQYRLRKAEqkslkvAETGQVDGELIRSLEQDLKVAK 429
Cdd:COG4913    235 DDLERAHEALEDAREQIelLE----PIRELAERYAAARERLAELEYLRAALR------LWFAQRRLELLEAELEELRAEL 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  430 DvsvRLHHELKTVEEKRAKAEDENETLRQQMIEVE-ISKQALQNELERLKESSLKRRSTREMYKE------------KKT 496
Cdd:COG4913    305 A---RLEAELERLEARLDALREELDELEAQIRGNGgDRLEQLEREIERLERELEERERRRARLEAllaalglplpasAEE 381
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1804072757  497 FN------QDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:COG4913    382 FAalraeaAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLER 433
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
362-670 1.79e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 53.14  E-value: 1.79e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  362 EEMRDSYLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVD--GELIRSLEQDLKVakdvsvrlhHEL 439
Cdd:PRK03918   447 EEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKelAEQLKELEEKLKK---------YNL 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  440 KTVEEKrakaEDENETLRQQMIEVEISKQALQNELERLKEssLKRRStREMYKEKKTfnqddsadlrcqlqfAKEEAFLM 519
Cdd:PRK03918   518 EELEKK----AEEYEKLKEKLIKLKGEIKSLKKELEKLEE--LKKKL-AELEKKLDE---------------LEEELAEL 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  520 RKKMAKLGREK-DELEQELQKYKSLYGDVDsplptgEAGGPPSTREAELKlRLKLVEEEANILGRKIVELEVENRGLKAE 598
Cdd:PRK03918   576 LKELEELGFESvEELEERLKELEPFYNEYL------ELKDAEKELEREEK-ELKKLEEELDKAFEELAETEKRLEELRKE 648
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1804072757  599 MEDMRGQQEREgpgrdhapsiptsPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFK 670
Cdd:PRK03918   649 LEELEKKYSEE-------------EYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
54-313 1.85e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 53.31  E-value: 1.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   54 ARPAAPGPAVPSS-----GRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLSRRSGGVPGAKD-------KPPPGAGARAAG 121
Cdd:PRK07003   361 AVTGGGAPGGGVParvagAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATraeappaAPAPPATADRGD 440
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  122 GAKAALGSRRAARVAPAEPLSRAGkPPGAEPPSAAAKgrKAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCS 201
Cdd:PRK07003   441 DAADGDAPVPAKANARASADSRCD-ERDAQPPADSGS--ASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAAS 517
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  202 RishtdssSDLSDCPSEPLSDEQRLLPAASSDAESGTG---------------SSDREPPRGAPTPSPAARGAPPGSPEP 266
Cdd:PRK07003   518 R-------EDAPAAAAPPAPEARPPTPAAAAPAARAGGaaaaldvlrnagmrvSSDRGARAAAAAKPAAAPAAAPKPAAP 590
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1804072757  267 PALLAAPLaagacPGGRSIPSGVSGGFAGPGV-AEDVRGRSPPERPVP 313
Cdd:PRK07003   591 RVAVQVPT-----PRARAATGDAPPNGAARAEqAAESRGAPPPWEDIP 633
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-614 1.93e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.14  E-value: 1.93e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRD--SYLEEdvyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKV 427
Cdd:TIGR02168  279 LEEEIEELQKELYALANeiSRLEQ---QKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELES 355
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  428 AKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRRSTRemykekktfNQDDSADL 505
Cdd:TIGR02168  356 LEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEArlERLEDRRER---------LQQEIEEL 426
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  506 RCQLQfakeeaflmRKKMAKLGREKDELEQELQKYKslygdvdsplptgeaggppsTREAELKLRLKLVEEEANILGRKI 585
Cdd:TIGR02168  427 LKKLE---------EAELKELQAELEELEEELEELQ--------------------EELERLEEALEELREELEEAEQAL 477
                          250       260       270
                   ....*....|....*....|....*....|
gi 1804072757  586 VELEVENRGLKAEMEDMRGQQER-EGPGRD 614
Cdd:TIGR02168  478 DAAERELAQLQARLDSLERLQENlEGFSEG 507
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
346-484 2.01e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 53.15  E-value: 2.01e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  346 ENDYLKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDL 425
Cdd:TIGR02169  351 RRDKLTEEYAELKEELEDLRAELEEVDK-EFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAI 429
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1804072757  426 KVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKR 484
Cdd:TIGR02169  430 AGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKelSKLQR 490
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
353-542 2.10e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.14  E-value: 2.10e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  353 ELDELRAEMEEMRDsylEEDVYQ--LQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQ-------VDGELIRSLEQ 423
Cdd:TIGR02168  268 KLEELRLEVSELEE---EIEELQkeLYALANEISRLEQQKQILRERLANLERQLEELEAQLEeleskldELAEELAELEE 344
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  424 DLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEV-----EISKQALQN---------ELERLKESSLKRRSTRE 489
Cdd:TIGR02168  345 KLEELKEELESLEAELEELEAELEELESRLEELEEQLETLrskvaQLELQIASLnneierleaRLERLEDRRERLQQEIE 424
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757  490 MYKEKKTFN-----QDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:TIGR02168  425 ELLKKLEEAelkelQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAER 482
PTZ00121 PTZ00121
MAEBL; Provisional
358-662 2.85e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 52.84  E-value: 2.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  358 RAEMEEMRDSYLEEDVYQLQELRREldranKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHH 437
Cdd:PTZ00121  1591 EARIEEVMKLYEEEKKMKAEEAKKA-----EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAE 1665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  438 ELKTVEEKRAKAEdenETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLqfAKEEAF 517
Cdd:PTZ00121  1666 EAKKAEEDKKKAE---EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE--AKKEAE 1740
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  518 LMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVENRG--- 594
Cdd:PTZ00121  1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGnlv 1820
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1804072757  595 --LKAEMEDMRGQQ--EREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFvEEEAELLRRSISEIE--DHNRQL 662
Cdd:PTZ00121  1821 inDSKEMEDSAIKEvaDSKNMQLEEADAFEKHKFNKNNENGEDGNKEADF-NKEKDLKEDDEEEIEeaDEIEKI 1893
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
354-559 3.75e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 52.22  E-value: 3.75e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  354 LDELRAEMEEmrdsyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGElIRSLEQDLKVAKDVSV 433
Cdd:COG4913    612 LAALEAELAE-----LEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAERE-IAELEAELERLDASSD 685
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  434 rlhhELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAK 513
Cdd:COG4913    686 ----DLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGD 761
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1804072757  514 EEAFLMRKKM--------AKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGP 559
Cdd:COG4913    762 AVERELRENLeeridalrARLNRAEEELERAMRAFNREWPAETADLDADLESLP 815
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
350-485 4.95e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 51.84  E-value: 4.95e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSYLEEDVYQLQELRRELDRANKncrilqyRLRKAEQKSLKVAetgqvdgELIRSLEQDLKVAK 429
Cdd:COG4913    314 LEARLDALREELDELEAQIRGNGGDRLEQLEREIERLER-------ELEERERRRARLE-------ALLAALGLPLPASA 379
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757  430 DVSVRLHHELKtveEKRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRR 485
Cdd:COG4913    380 EEFAALRAEAA---ALLEALEEELEALEEALAEAEAALRDLRRELRELEAeiASLERR 434
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
350-608 6.19e-06

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 51.28  E-value: 6.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRdsyleedvYQLQELRRELDRANKNCRILQYRLRKAEQkslkvaetgqvdgeLIRSLEQDlkvak 429
Cdd:pfam05557  109 LKNELSELRRQIQRAE--------LELQSTNSELEELQERLDLLKAKASEAEQ--------------LRQNLEKQ----- 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  430 dvsvrlHHELKTVEEKRAKAEDENETLRQQMIEVEISKQAL------QNELERLKESSLKRRSTRE---MYKEKKtfnqd 500
Cdd:pfam05557  162 ------QSSLAEAEQRIKELEFEIQSQEQDSEIVKNSKSELaripelEKELERLREHNKHLNENIEnklLLKEEV----- 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  501 dsADLRCQLQfaKEEAflMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPP----STREAELKLRLKLVEE 576
Cdd:pfam05557  231 --EDLKRKLE--REEK--YREEAATLELEKEKLEQELQSWVKLAQDTGLNLRSPEDLSRRieqlQQREIVLKEENSSLTS 304
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1804072757  577 EANILGRKIVELEVENRGLKAEMEDMRGQQER 608
Cdd:pfam05557  305 SARQLEKARRELEQELAQYLKKIEDLNKKLKR 336
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
350-589 6.23e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 51.48  E-value: 6.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEmeemrdsyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:COG1196    321 LEEELAELEEE--------LEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEAL 392
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKtfnqddsADLRCQL 509
Cdd:COG1196    393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEE-------EALLELL 465
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  510 QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDvdsplptgEAGGPPSTREAELKLRLKLVEEEANILGRKIVELE 589
Cdd:COG1196    466 AELLEEAALLEAALAELLEELAEAAARLLLLLEAEAD--------YEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYE 537
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
350-831 1.46e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 50.32  E-value: 1.46e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:COG1196    314 LEERLEELEEELAELEEE-LEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEAL 392
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:COG1196    393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA 472
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  510 QFAKEEAflmRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELE 589
Cdd:COG1196    473 ALLEAAL---AELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQ 549
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  590 VENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSisEIEDHNRQLTHELSKF 669
Cdd:COG1196    550 NIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREA--DARYYVLGDTLLGRTL 627
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  670 KFEPPREPGWLGEGASPGAGGGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRAPSPRDSDAES 749
Cdd:COG1196    628 VAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEER 707
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  750 DAGKKESDGEESRLPQPKREGPVGGESDSEEMFEktsgfgsgkpSEASEPCPTELLKAREDSEYLVTLKHEAQRLERTVE 829
Cdd:COG1196    708 ELAEAEEERLEEELEEEALEEQLEAEREELLEEL----------LEEEELLEEEALEELPEPPDLEELERELERLEREIE 777

                   ..
gi 1804072757  830 RL 831
Cdd:COG1196    778 AL 779
PTZ00121 PTZ00121
MAEBL; Provisional
351-609 1.71e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 50.14  E-value: 1.71e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  351 KDELDELRAEMEEMRDSylEEDVYQLQELRRELDRANKNCRILQY--RLRKAEQK----SLKVAETGQVDGELIRSLE-- 422
Cdd:PTZ00121  1469 AKKADEAKKKAEEAKKA--DEAKKKAEEAKKKADEAKKAAEAKKKadEAKKAEEAkkadEAKKAEEAKKADEAKKAEEkk 1546
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  423 --QDLKVAKDVsvRLHHELKTVEEKRAKAEDENETLR--------------QQMIEVEISKQALQNELERLKESSLKRRS 486
Cdd:PTZ00121  1547 kaDELKKAEEL--KKAEEKKKAEEAKKAEEDKNMALRkaeeakkaeearieEVMKLYEEEKKMKAEEAKKAEEAKIKAEE 1624
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  487 TREMYKEKKTFNQ---DDSADLRCQLQFAKEEaflmRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAggppSTR 563
Cdd:PTZ00121  1625 LKKAEEEKKKVEQlkkKEAEEKKKAEELKKAE----EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA----LKK 1696
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1804072757  564 EAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQERE 609
Cdd:PTZ00121  1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEED 1742
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
41-326 2.10e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.78  E-value: 2.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   41 APSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPgsLAAPGRLSRRSGGVPGAKDKPPPGAGARAA 120
Cdd:PHA03307    48 AELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTL--APASPAREGSPTPPGPSSPDPPPPTPPPAS 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  121 GGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTLAV-TSVAGSPAR 199
Cdd:PHA03307   126 PPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPsTPPAAASPR 205
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  200 CSRISHTDSSSDLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEppallaaplaaGAC 279
Cdd:PHA03307   206 PPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWE-----------ASG 274
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1804072757  280 PGGRSIPSGVSGGFAGPgvaedvRGRSPPerPVPGTPKEPSLGEQSR 326
Cdd:PHA03307   275 WNGPSSRPGPASSSSSP------RERSPS--PSPSSPGSGPAPSSPR 313
PHA03247 PHA03247
large tegument protein UL36; Provisional
6-264 2.64e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 2.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757    6 GPAGGGAPDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFLKDLHARPAAPGP----AVPSSGRAPAPAAPRSPNLA 81
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPpptsAQPTAPPPPPGPPPPSLPLG 2853
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   82 GKAPP------------SPGSLAAPGR----------LSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAE 139
Cdd:PHA03247  2854 GSVAPggdvrrrppsrsPAAKPAAPARppvrrlarpaVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPP 2933
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  140 PLSRAGKPPGAEPPSAAAKGRKAKR-----------GSRAPPARTVGPPTPAARIPAVTlaVTSVAGSPArcSRISHTDS 208
Cdd:PHA03247  2934 PPPPRPQPPLAPTTDPAGAGEPSGAvpqpwlgalvpGRVAVPRFRVPQPAPSREAPASS--TPPLTGHSL--SRVSSWAS 3009
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  209 SSDLS-DCPSEPLSDEQRLLPAASSDAESGTGSSDREPPR---GAPTPSPAARGAPPGSP 264
Cdd:PHA03247  3010 SLALHeETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERsdlEALDPLPPEPHDPFAHE 3069
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
346-535 3.39e-05

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 48.97  E-value: 3.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  346 ENDYLKDELDELRAE--MEEMRDSYLEEDVYQLQELRRELDRANKN--CRILQYRLRKAEQKSLKVAETgQVDGELIRSL 421
Cdd:pfam17380  390 KNERVRQELEAARKVkiLEEERQRKIQQQKVEMEQIRAEQEEARQRevRRLEEERAREMERVRLEEQER-QQQVERLRQQ 468
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  422 EQDLKVAKDVSVRLHHELKTVEEKRAKA-EDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQ- 499
Cdd:pfam17380  469 EEERKRKKLELEKEKRDRKRAEEQRRKIlEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEm 548
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1804072757  500 DDSADLRCQLQFAKEEaflmRKKMAKLGREKDELEQ 535
Cdd:pfam17380  549 EERRRIQEQMRKATEE----RSRLEAMEREREMMRQ 580
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
338-668 5.28e-05

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 48.43  E-value: 5.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  338 REMEELRSENDYLKDELDELRAEMEEMRDSYLEEDVYQLQELRRELDRankncRILQYRLRKAEQKSLKVAETGQVDG-E 416
Cdd:pfam02463  177 KLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEE-----YLLYLDYLKLNEERIDLLQELLRDEqE 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  417 LIRSLEQDLKVAKDVSVRLHHEL-------KTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTRE 489
Cdd:pfam02463  252 EIESSKQEIEKEEEKLAQVLKENkeeekekKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELK 331
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  490 MYKEKKtfnqddsadlrcqLQFAKEEAFLMRKKmAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKL 569
Cdd:pfam02463  332 KEKEEI-------------EELEKELKELEIKR-EAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEEL 397
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  570 RLKLVEEeaNILGRKIVELEVENRGLKAEMEDMRGQQEREGpgrdhapsiptsPFGDSLESSTELRRHLQFVEEEAELLR 649
Cdd:pfam02463  398 ELKSEEE--KEAQLLLELARQLEDLLKEEKKEELEILEEEE------------ESIELKQGKLTEEKEELEKQELKLLKD 463
                          330
                   ....*....|....*....
gi 1804072757  650 RSISEIEDHNRQLTHELSK 668
Cdd:pfam02463  464 ELELKKSEDLLKETQLVKL 482
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
86-218 5.59e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 48.14  E-value: 5.59e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   86 PSPGSLAAPGRLSRRSGGVPGAKDKPPPGagaraaggakaalgSRRAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRG 165
Cdd:PRK14959   373 PSGGGASAPSGSAAEGPASGGAATIPTPG--------------TQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDD 438
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1804072757  166 SRAPPARTVGPPTPAARIPavtlAVTSVAGSPARCSRISHTDSSSDLSDCPSE 218
Cdd:PRK14959   439 APPAPPRSGIPPRPAPRMP----EASPVPGAPDSVASASDAPPTLGDPSDTAE 487
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
55-261 6.45e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.95  E-value: 6.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   55 RPAAPGPAVPSSGRAPAPAAprspnlAGKAPPSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAAR 134
Cdd:PRK12323   359 RMLAFRPGQSGGGAGPATAA------AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEA 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  135 VAPAEPLSRAGKPPGAEPPSAAAkgrkakrgsrAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDSS--SDL 212
Cdd:PRK12323   433 LAAARQASARGPGGAPAPAPAPA----------AAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPpwEEL 502
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1804072757  213 SDCPSEPLSDEQRLLPA-ASSDAESGTGSSDREPPRGAPTPSPAARGAPP 261
Cdd:PRK12323   503 PPEFASPAPAQPDAAPAgWVAESIPDPATADPDDAFETLAPAPAAAPAPR 552
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
352-591 7.25e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 47.45  E-value: 7.25e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  352 DELDELRAEMEEMRDsyleedvyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQDLKVAKDv 431
Cdd:COG4942     20 DAAAEAEAELEQLQQ--------EIAELEKELAALKKEEKALLKQLAALERRIAALARR-------IRALEQELAALEA- 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  432 svrlhhELKTVEEKRAKAEDENETLRQQMIEV--EISKQALQNELERL--KESSLKRRSTREMYKEKKTFNQDDSADLRC 507
Cdd:COG4942     84 ------ELAELEKEIAELRAELEAQKEELAELlrALYRLGRQPPLALLlsPEDFLDAVRRLQYLKYLAPARREQAEELRA 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  508 QLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAggppstREAELKLRLKLVEEEANILGRKIVE 587
Cdd:COG4942    158 DLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEK------ELAELAAELAELQQEAEELEALIAR 231

                   ....
gi 1804072757  588 LEVE 591
Cdd:COG4942    232 LEAE 235
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
375-539 8.28e-05

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 47.82  E-value: 8.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  375 QLQELRRELDRANKNcriLQYRLRKAEQKSLKVAETGQVD----GELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAE 450
Cdd:pfam07111  482 ELEQLREERNRLDAE---LQLSAHLIQQEVGRAREQGEAErqqlSEVAQQLEQELQRAQESLASVGQQLEVARQGQQEST 558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  451 DENETLRQQMI-EVEISKQALQN---ELE-RLKE--SSLKRR---STREMYK---------EKKTFNQDDSADL-RCQLQ 510
Cdd:pfam07111  559 EEAASLRQELTqQQEIYGQALQEkvaEVEtRLREqlSDTKRRlneARREQAKavvslrqiqHRATQEKERNQELrRLQDE 638
                          170       180
                   ....*....|....*....|....*....
gi 1804072757  511 FAKEEAFLMRKKMAKLGREKDELEQELQK 539
Cdd:pfam07111  639 ARKEEGQRLARRVQELERDKNLMLATLQQ 667
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
350-551 8.51e-05

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 46.18  E-value: 8.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSYLEEDVY-------------QLQELRRELDRANKNCRILQYRLRKAEQKS------LKVAE- 409
Cdd:pfam00261    6 IKEELDEAEERLKEAMKKLEEAEKRaekaeaevaalnrRIQLLEEELERTEERLAEALEKLEEAEKAAdesergRKVLEn 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  410 TGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTRE 489
Cdd:pfam00261   86 RALKDEEKMEILEAQLKEAKEIAEEADRKYEEVARKLVVVEGDLERAEERAELAESKIVELEEELKVVGNNLKSLEASEE 165
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1804072757  490 MYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPL 551
Cdd:pfam00261  166 KASEREDKYEEQIRFLTEKLKEAETRAEFAERSVQKLEKEVDRLEDELEAEKEKYKAISEEL 227
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
346-895 1.28e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 47.35  E-value: 1.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  346 ENDYLKDE----LDELRAEMEEMRD-SYLEEDVYQLQELRRelDRANKNCRILQYRLRKAEQKSLKVAE---TGQVDgEL 417
Cdd:TIGR00606  502 EVKSLQNEkadlDRKLRKLDQEMEQlNHHTTTRTQMEMLTK--DKMDKDEQIRKIKSRHSDELTSLLGYfpnKKQLE-DW 578
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  418 IRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVE------ISKQALQNELERLKESSLKRRSTREM- 490
Cdd:TIGR00606  579 LHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEdklfdvCGSQDEESDLERLKEEIEKSSKQRAMl 658
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  491 ----------------------------YKEKKTFNQDdSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:TIGR00606  659 agatavysqfitqltdenqsccpvcqrvFQTEAELQEF-ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQS 737
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  543 LYGDVDSPLP-TGEAGGPPSTREAELKLRLklvEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPT 621
Cdd:TIGR00606  738 IIDLKEKEIPeLRNKLQKVNRDIQRLKNDI---EEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAK 814
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  622 SPFGDSLESSTELRrhlQFVEEEAELLRRSISEIE-------DHNRQLTHELSKFKfepprepgwlgegaspgagggapl 694
Cdd:TIGR00606  815 LQGSDLDRTVQQVN---QEKQEKQHELDTVVSKIElnrkliqDQQEQIQHLKSKTN------------------------ 867
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  695 qeELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRAPSPRDSDAESDAGKKE----SDGEESRLPQPKREg 770
Cdd:TIGR00606  868 --ELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEelisSKETSNKKAQDKVN- 944
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  771 pvggesDSEEMFEKTSGFGSGKPSEASEPCPTELL-KAREDSEYLVTLKHEAQRLERTVERLITDTDSFlhdaglrggap 849
Cdd:TIGR00606  945 ------DIKEKVKNIHGYMKDIENKIQDGKDDYLKqKETELNTVNAQLEECEKHQEKINEDMRLMRQDI----------- 1007
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*.
gi 1804072757  850 lpgpglqGEEEQGEGDQQEPQLLGTINAKMKAFKKELQAFLEQVNR 895
Cdd:TIGR00606 1008 -------DTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQ 1046
PTZ00121 PTZ00121
MAEBL; Provisional
351-603 1.30e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 47.44  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  351 KDELDELRaEMEEMRDSyleEDVYQLQELRRELDRANKNCRILQyRLRKAEQK------------------SLKVAETGQ 412
Cdd:PTZ00121  1545 KKKADELK-KAEELKKA---EEKKKAEEAKKAEEDKNMALRKAE-EAKKAEEArieevmklyeeekkmkaeEAKKAEEAK 1619
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  413 VDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRaKAEDENETLRQQMI--EVEISKQALQ----NELERLKESSLKRRS 486
Cdd:PTZ00121  1620 IKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELK-KAEEENKIKAAEEAkkAEEDKKKAEEakkaEEDEKKAAEALKKEA 1698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  487 trEMYKEKKTFNQDDSADLRCQLQFAKEEaflmRKKMAKLGREKDELEQELQKYKSLYGDvdsplpTGEaggppSTREAE 566
Cdd:PTZ00121  1699 --EEAKKAEELKKKEAEEKKKAEELKKAE----EENKIKAEEAKKEAEEDKKKAEEAKKD------EEE-----KKKIAH 1761
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 1804072757  567 LKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMR 603
Cdd:PTZ00121  1762 LKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
392-1319 1.49e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.97  E-value: 1.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  392 ILQYRLRK--AEQKslkvaetgqvdgelIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENEtLRQQMIEVEIS--- 466
Cdd:TIGR02168  167 ISKYKERRkeTERK--------------LERTRENLDRLEDILNELERQLKSLERQAEKAERYKE-LKAELRELELAllv 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  467 --KQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLY 544
Cdd:TIGR02168  232 lrLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERL 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  545 GDVDSPLPTGEAggppstREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEregpgrdhapsiptspf 624
Cdd:TIGR02168  312 ANLERQLEELEA------QLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELE----------------- 368
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  625 gdslesstELRRHLQFVEEEAELLRRSISEIEdhnRQLThelskfkfepprepgwlgegaspgagggaPLQEELKSARLQ 704
Cdd:TIGR02168  369 --------ELESRLEELEEQLETLRSKVAQLE---LQIA-----------------------------SLNNEIERLEAR 408
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  705 ISELSGKVLKLQHENHALLSNIQRCDLAAHlglrapspRDSDAESDAGKKESDGEESRLPQPKregpvggESDSEEMFEK 784
Cdd:TIGR02168  409 LERLEDRRERLQQEIEELLKKLEEAELKEL--------QAELEELEEELEELQEELERLEEAL-------EELREELEEA 473
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  785 TSGFGSGKPSEASepcptelLKAREDSeyLVTLKHEAQRLERTVERLitdtdsFLHDAGLRGGAPLPGPGLQGEEEQ--- 861
Cdd:TIGR02168  474 EQALDAAERELAQ-------LQARLDS--LERLQENLEGFSEGVKAL------LKNQSGLSGILGVLSELISVDEGYeaa 538
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  862 -----GEGDQQ-----EPQLLGTINAkMKAFKKELQAFLEQVNRIGDGL--SPLPHLTESSSFLSTVTSVSRDSPignlg 929
Cdd:TIGR02168  539 ieaalGGRLQAvvvenLNAAKKAIAF-LKQNELGRVTFLPLDSIKGTEIqgNDREILKNIEGFLGVAKDLVKFDP----- 612
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  930 kELGPDLQSRLkeqleWQLGPArgderESLR--LRAARELHRRA-----DGDT-GSHGL--GGQTCFSLelrgppVLPEQ 999
Cdd:TIGR02168  613 -KLRKALSYLL-----GGVLVV-----DDLDnaLELAKKLRPGYrivtlDGDLvRPGGVitGGSAKTNS------SILER 675
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757 1000 SVSIEELQgqlvqaarlhqEETETFTNKIHKMEEEhLYALRwKELEmhslALQNTLHERtwsdeknlmQQELRSLKQNIF 1079
Cdd:TIGR02168  676 RREIEELE-----------EKIEELEEKIAELEKA-LAELR-KELE----ELEEELEQL---------RKELEELSRQIS 729
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757 1080 LFYVKLRWLLKHWRQGKQMEEEGEEftegehpetlsRLGELGVQgghqadgpdhdsdrgcgfpvgEHSPHSRVQIGDHSL 1159
Cdd:TIGR02168  730 ALRKDLARLEAEVEQLEERIAQLSK-----------ELTELEAE---------------------IEELEERLEEAEEEL 777
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757 1160 RLQTADRGQPHKQVVENQQLFSAFKALLEDFRAELREDERARLRLQ---QQYASDKAAWDVEWAVLKCRLEQLEEKTENK 1236
Cdd:TIGR02168  778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRerlESLERRIAATERRLEDLEEQIEELSEDIESL 857
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757 1237 LGELGSSAESKGALKKEREVHQKLLADSHSLVMDLRwqihhseknwnrekvELLDRLDRDRQEWERQKKEFLWRIEQLQK 1316
Cdd:TIGR02168  858 AAEIEELEELIEELESELEALLNERASLEEALALLR---------------SELEELSEELRELESKRSELRRELEELRE 922

                   ...
gi 1804072757 1317 ENS 1319
Cdd:TIGR02168  923 KLA 925
COG5022 COG5022
Myosin heavy chain [General function prediction only];
359-670 1.60e-04

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 46.99  E-value: 1.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  359 AEMEEMRDSYLEEDVYQLQE------LRRELDRANKNCRILQ--------YRLRKAEQKSL-------------KVAETG 411
Cdd:COG5022    734 AALEDMRDAKLDNIATRIQRairgryLRRRYLQALKRIKKIQviqhgfrlRRLVDYELKWRlfiklqpllsllgSRKEYR 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  412 QVDgELIRSLEQDLKVAKDVSVRLHHELKTVEE-------------KRAKAEDENETLRQQMIEVEISK---QALQNELE 475
Cdd:COG5022    814 SYL-ACIIKLQKTIKREKKLRETEEVEFSLKAEvliqkfgrslkakKRFSLLKKETIYLQSAQRVELAErqlQELKIDVK 892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  476 RLKESSLKRRSTREMYKEkktFNQDDSADLRCQLQFaKEEAFLMRKKMAKLGREKDELEQELQKYKSLygdvdsplptge 555
Cdd:COG5022    893 SISSLKLVNLELESEIIE---LKKSLSSDLIENLEF-KTELIARLKKLLNNIDLEEGPSIEYVKLPEL------------ 956
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  556 aggppstreaelklrLKLVEEEANiLGRKIVELEVENRGLKAEMEDMRGQQERegpgrdhapsipTSPFGDSLESSTELR 635
Cdd:COG5022    957 ---------------NKLHEVESK-LKETSEEYEDLLKKSTILVREGNKANSE------------LKNFKKELAELSKQY 1008
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1804072757  636 RHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFK 670
Cdd:COG5022   1009 GALQESTKQLKELPVEVAELQSASKIISSESTELS 1043
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
431-662 1.62e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 46.98  E-value: 1.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  431 VSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQ 510
Cdd:TIGR02169  668 FSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLS 747
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  511 FAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDV-----DSPLPTGEAGGPP----------STREAELKLRLKLVE 575
Cdd:TIGR02169  748 SLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLearlsHSRIPEIQAELSKleeevsrieaRLREIEQKLNRLTLE 827
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  576 EEanILGRKIVELEVENRGLKaEMEDMRGQQEREGPGRdhapsiptspFGDSLESSTELRRHLQFVEEEAELLRRSISEI 655
Cdd:TIGR02169  828 KE--YLEKEIQELQEQRIDLK-EQIKSIEKEIENLNGK----------KEELEEELEELEAALRDLESRLGDLKKERDEL 894

                   ....*..
gi 1804072757  656 EDHNRQL 662
Cdd:TIGR02169  895 EAQLREL 901
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
374-551 1.62e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 45.30  E-value: 1.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  374 YQLQELRRELDRANKNCRILQYRLRKAEQKslkvaetgqvdgelIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDEN 453
Cdd:COG1579     10 LDLQELDSELDRLEHRLKELPAELAELEDE--------------LAALEARLEAAKTELEDLEKEIKRLELEIEEVEARI 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  454 ETLRQQMIEVEISK--QALQNELERLKesslKRRSTRE------MYK-EKKtfnQDDSADLRCQLQFAKEEaflMRKKMA 524
Cdd:COG1579     76 KKYEEQLGNVRNNKeyEALQKEIESLK----RRISDLEdeilelMERiEEL---EEELAELEAELAELEAE---LEEKKA 145
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1804072757  525 KLGREKDELEQELQKYKS----LYGDVDSPL 551
Cdd:COG1579    146 ELDEELAELEAELEELEAereeLAAKIPPEL 176
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
350-494 1.74e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 45.30  E-value: 1.74e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDG---ElIRSLEQDLK 426
Cdd:COG1579     29 LPAELAELEDELAALEARLEAAKT-ELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNKEYEAlqkE-IESLKRRIS 106
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757  427 VAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVeisKQALQNELERLKESSLKRRSTREMYKEK 494
Cdd:COG1579    107 DLEDEILELMERIEELEEELAELEAELAELEAELEEK---KAELDEELAELEAELEELEAEREELAAK 171
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
12-265 1.89e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.52  E-value: 1.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   12 APDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPflkdlhARPAAPGPAVPSSGRapapaAPRSPNLAGKAPPSPGSL 91
Cdd:PRK07764   589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAP------AAPAPAGAAAAPAEA-----SAAPAPGVAAPEHHPKHV 657
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   92 AAPGRLSRRSGGVPGAKDKPPPgagaraaggakaalgsrrAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRGSRAPPA 171
Cdd:PRK07764   658 AVPDASDGGDGWPAKAGGAAPA------------------APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAA 719
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  172 RTVGPPTPAARIPAVTLAVTSVAGSParcsrishtdsssDLSDCPSEPLSDEQrllPAASSDAESGTGSSDREPPRGAPT 251
Cdd:PRK07764   720 QPPQAAQGASAPSPAADDPVPLPPEP-------------DDPPDPAGAPAQPP---PPPAPAPAAAPAAAPPPSPPSEEE 783
                          250
                   ....*....|....
gi 1804072757  252 PSPAARGAPPGSPE 265
Cdd:PRK07764   784 EMAEDDAPSMDDED 797
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
13-319 2.08e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 46.21  E-value: 2.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   13 PDAKLQPPGQHHRHHHLHPVAERrrLHRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLA 92
Cdd:COG5180    158 SDPILAKDPDGDSASTLPPPAEK--LDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLTGGADHPRPEAASSPKVD 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   93 APGRLSRRSGgvPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAA-AKGRKAKRGSRAPPA 171
Cdd:COG5180    236 PPSTSEARSR--PATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAEtARPIDVKGVASAPPA 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  172 RTVGPPTPAAR---IPAVTLAVTSVAGSPARCSRISHTDSSSDlsdcPSEPLSDEQRLLPAASSDAESGTGSSDREPPRG 248
Cdd:COG5180    314 TRPVRPPGGARdpgTPRPGQPTERPAGVPEAASDAGQPPSAYP----PAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNG 389
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1804072757  249 APTPSPaARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAedvrgRSPPERPVPGTPKEP 319
Cdd:COG5180    390 APQPGL-GRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAGGAGQGPKAD-----FVPGDAESVSGPAGL 454
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
131-264 2.22e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.13  E-value: 2.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  131 RAARVAPAEPLSRAGKPPGAEPPSAAAkgrkAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDSSS 210
Cdd:PRK07764   377 RLERLERRLGVAGGAGAPAAAAPSAAA----AAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1804072757  211 DLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAPPGSP 264
Cdd:PRK07764   453 APSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGA 506
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
144-264 2.44e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.86  E-value: 2.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  144 AGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTL-AVTSVAGSPArcSRISHTDSSSDLSDCPSEPLSD 222
Cdd:PRK14951   363 AFKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPaAAASAPAAPP--AAAPPAPVAAPAAAAPAAAPAA 440
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1804072757  223 EQRLLPAASSDAESGTGSSDREPPRGAPTPSPA-ARGAPPGSP 264
Cdd:PRK14951   441 APAAVALAPAPPAQAAPETVAIPVRVAPEPAVAsAAPAPAAAP 483
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-512 2.60e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.20  E-value: 2.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVA- 428
Cdd:TIGR02168  356 LEAELEELEAELEELESRLEELEE-QLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAe 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  429 -KDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRC 507
Cdd:TIGR02168  435 lKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKN 514

                   ....*
gi 1804072757  508 QLQFA 512
Cdd:TIGR02168  515 QSGLS 519
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
345-660 2.85e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 45.80  E-value: 2.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  345 SENDYLKDELDELRAEME---EMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSL 421
Cdd:PRK02224   213 SELAELDEEIERYEEQREqarETRDE-ADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEEL 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKktfnqdd 501
Cdd:PRK02224   292 EEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREE------- 364
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  502 SADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELqkykslygdvdsplptgeaGGPPSTREaELKLRLKLVEEEANIL 581
Cdd:PRK02224   365 AAELESELEEAREAVEDRREEIEELEEEIEELRERF-------------------GDAPVDLG-NAEDFLEELREERDEL 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  582 GRKIVELEVENRGLKAEMEDMRGQQErEGPGRDHAPSIPTSPFGDSLESSTELRRHLqfvEEEAELLRRSISEIED-HNR 660
Cdd:PRK02224   425 REREAELEATLRTARERVEEAEALLE-AGKCPECGQPVEGSPHVETIEEDRERVEEL---EAELEDLEEEVEEVEErLER 500
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
350-479 3.02e-04

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 45.62  E-value: 3.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSYL--EEDVYQLQELRRELDRANKNCRILQYRLRKAEQ-------------KSLKVAETGQVD 414
Cdd:pfam06160  303 AEEQNKELKEELERVQQSYTlnENELERVRGLEKQLEELEKRYDEIVERLEEKEVayselqeeleeilEQLEEIEEEQEE 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  415 G-ELIRSLEQDLKVAKDVSVRLHHELKTVE---EKR-------------AKAEDENETLRQQMIEV-----EISKQAL-- 470
Cdd:pfam06160  383 FkESLQSLRKDELEAREKLDEFKLELREIKrlvEKSnlpglpesyldyfFDVSDEIEDLADELNEVplnmdEVNRLLDea 462

                   ....*....
gi 1804072757  471 QNELERLKE 479
Cdd:pfam06160  463 QDDVDTLYE 471
DUF4201 pfam13870
Domain of unknown function (DUF4201); This is a family of coiled-coil proteins from eukaryotes. ...
376-539 3.28e-04

Domain of unknown function (DUF4201); This is a family of coiled-coil proteins from eukaryotes. The function is not known.


Pssm-ID: 464008 [Multi-domain]  Cd Length: 177  Bit Score: 43.36  E-value: 3.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  376 LQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQ----VD-----------GELIRSLEQDLKVAKDVSVRLHHELK 440
Cdd:pfam13870    1 MRAKRNELSKLRLELITLKHTLAKIQEKLEQKEELGEgltmIDflqlqienqalNEKIEERNKELKRLKLKVTNTVHALT 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  441 TVEEKRAKAEDENETLRQQMIEveisKQALQNELErlkesslkrrstREMYKEKKTFNQ--DDSADLRCQL--------- 509
Cdd:pfam13870   81 HLKEKLHFLSAELSRLKKELRE----RQELLAKLR------------KELYRVKLERDKlrKQNKKLRQQGgllhvpall 144
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1804072757  510 -QFAKEEAFL--MRKKMAKLGREKDELEQELQK 539
Cdd:pfam13870  145 hDYDKTKAEVeeKRKSVKKLRRKVKILEMRIKE 177
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
350-609 3.40e-04

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 44.90  E-value: 3.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEM---RDSY---LEEDVYQLQELRRELDRANKNCRILqyrlrKAEQKSLKvAETGQVDGELirsleQ 423
Cdd:COG1340     27 LKEKRDELNEELKELaekRDELnaqVKELREEAQELREKRDELNEKVKEL-----KEERDELN-EKLNELREEL-----D 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  424 DLKVAKDVSVRLHHELKTVEEKRAKAEDENET----LRQQMIEVEISKQaLQNELERLKESSLKRRSTREMYKEKKTFnQ 499
Cdd:COG1340     96 ELRKELAELNKAGGSIDKLRKEIERLEWRQQTevlsPEEEKELVEKIKE-LEKELEKAKKALEKNEKLKELRAELKEL-R 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  500 DDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYgdvdsplptgeaggppstreAELKLRLKLVEEEAN 579
Cdd:COG1340    174 KEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADELHKEI--------------------VEAQEKADELHEEII 233
                          250       260       270
                   ....*....|....*....|....*....|
gi 1804072757  580 ILGRKIVELEVENRGLKAEMEDMRGQQERE 609
Cdd:COG1340    234 ELQKELRELRKELKKLRKKQRALKREKEKE 263
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
102-323 3.53e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.75  E-value: 3.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  102 GGVPGAKDKPPPGAGARAAGGAKAalgSRRAARVAPAEPlSRAGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAA 181
Cdd:PRK07764   589 GPAPGAAGGEGPPAPASSGPPEEA---ARPAAPAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD 664
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  182 RIPAVTLAVTSVAGSParcsrishtdsssdlsdcPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAPP 261
Cdd:PRK07764   665 GGDGWPAKAGGAAPAA------------------PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQ 726
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1804072757  262 GSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSLGE 323
Cdd:PRK07764   727 GASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
349-544 3.73e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 45.39  E-value: 3.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  349 YLKDELDELRAEMEEMRDsYLEEdvyQLQELRRELDRANKncRILQYRlrkAEQKSLKVAETGQVDGELIRSLEQDLKVA 428
Cdd:COG3206    161 YLEQNLELRREEARKALE-FLEE---QLPELRKELEEAEA--ALEEFR---QKNGLVDLSEEAKLLLQQLSELESQLAEA 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  429 KDVSVRLHHELKTVEEKRAKAEDEN---------ETLRQQMIEVEISK--------------QALQNELERLKESSLKR- 484
Cdd:COG3206    232 RAELAEAEARLAALRAQLGSGPDALpellqspviQQLRAQLAELEAELaelsarytpnhpdvIALRAQIAALRAQLQQEa 311
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  485 RSTREMYKEKKTFNQDDSADLRCQLQFAKEEAflmrKKMAKLGREKDELEQELQKYKSLY 544
Cdd:COG3206    312 QRILASLEAELEALQAREASLQAQLAQLEARL----AELPELEAELRRLEREVEVARELY 367
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
345-657 3.88e-04

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 45.55  E-value: 3.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  345 SENDYLKDELDEL-RAEME-EMRDSYLEEdvyQLQELRRELDRANKNCRILQYRLRKAEqkslkvAETGQVDGELIRSLE 422
Cdd:pfam01576  384 SENAELQAELRTLqQAKQDsEHKRKKLEG---QLQELQARLSESERQRAELAEKLSKLQ------SELESVSSLLNEAEG 454
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  423 QDLKVAKDVSvRLHHELKTV-----EEKRAK---------AEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRR- 485
Cdd:pfam01576  455 KNIKLSKDVS-SLESQLQDTqellqEETRQKlnlstrlrqLEDERNSLQEQLEEEEEAKRNVERQLSTLQAqlSDMKKKl 533
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  486 ----STREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDV-------DSPLPTG 554
Cdd:pfam01576  534 eedaGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLekkqkkfDQMLAEE 613
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  555 EAGGPPSTRE---AELKLRLKlvEEEANILGR-------KIVELEVENRGLKAEMEDMRGQQEREGpgrdhapsiptspf 624
Cdd:pfam01576  614 KAISARYAEErdrAEAEAREK--ETRALSLARaleealeAKEELERTNKQLRAEMEDLVSSKDDVG-------------- 677
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1804072757  625 gdslESSTELRRHLQFVEEEAELLRRSISEIED 657
Cdd:pfam01576  678 ----KNVHELERSKRALEQQVEEMKTQLEELED 706
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
323-662 4.17e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 45.50  E-value: 4.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  323 EQSRLvpAAEEEELLREMEELRSendyLKDELDELRAEMEEMRDSYLEEDVYQLqELRRELDRankncriLQYRLRKAEQ 402
Cdd:pfam17380  297 EQERL--RQEKEEKAREVERRRK----LEEAEKARQAEMDRQAAIYAEQERMAM-ERERELER-------IRQEERKREL 362
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  403 KSLKVAETGqVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEI----SKQALQNELERLK 478
Cdd:pfam17380  363 ERIRQEEIA-MEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQiraeQEEARQREVRRLE 441
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  479 EsslKRRSTREMYKEKKTFNQDDSADLRcqlqfaKEEAFLMRKKMAKLGREKDELEQELQKYKSLygdvdsplptgeagg 558
Cdd:pfam17380  442 E---ERAREMERVRLEEQERQQQVERLR------QQEEERKRKKLELEKEKRDRKRAEEQRRKIL--------------- 497
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  559 ppstrEAELKLRLKLVEEEANilGRKIVELEVENRGLKAEMEDMRGQQEREgpGRDHAPSIPTSPFGDSLESSTELRRHL 638
Cdd:pfam17380  498 -----EKELEERKQAMIEEER--KRKLLEKEMEERQKAIYEEERRREAEEE--RRKQQEMEERRRIQEQMRKATEERSRL 568
                          330       340
                   ....*....|....*....|....
gi 1804072757  639 QFVEEEAELLRRsISEIEDHNRQL 662
Cdd:pfam17380  569 EAMEREREMMRQ-IVESEKARAEY 591
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
398-656 4.45e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.89  E-value: 4.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  398 RKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERL 477
Cdd:COG4372      6 EKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEEL 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  478 KEsSLKRRstremykekktfnQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLptgeag 557
Cdd:COG4372     86 NE-QLQAA-------------QAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEI------ 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  558 gppSTREAELK-LRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTELRR 636
Cdd:COG4372    146 ---AEREEELKeLEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLE 222
                          250       260
                   ....*....|....*....|
gi 1804072757  637 HLQFVEEEAELLRRSISEIE 656
Cdd:COG4372    223 AKDSLEAKLGLALSALLDAL 242
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
130-313 4.47e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 4.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  130 RRAARVAPAEPLSRA-GKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDS 208
Cdd:PHA03307   742 RRARARASAWDITDAlFSNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKS 821
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  209 SSDLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSG 288
Cdd:PHA03307   822 RSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAP 901
                          170       180
                   ....*....|....*....|....*
gi 1804072757  289 VSggfAGPGVAEDVRGRSPPERPVP 313
Cdd:PHA03307   902 AP---RPRPAPRVKLGPMPPGGPDP 923
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
345-543 4.80e-04

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 44.52  E-value: 4.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  345 SENDYLKDELDELRAEMEEMRDSYLE---------EDVYQLQELRRELDRANKNCRILQYRLRKAEQKSlkvaETGQVDG 415
Cdd:COG1340     57 EEAQELREKRDELNEKVKELKEERDElneklnelrEELDELRKELAELNKAGGSIDKLRKEIERLEWRQ----QTEVLSP 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  416 EL-------IRSLEQDLKVAKDvSVRLHHELKTVEEKRAKAEDENETLRQQMIEV--EISK-----QALQNELERLKESS 481
Cdd:COG1340    133 EEekelvekIKELEKELEKAKK-ALEKNEKLKELRAELKELRKEAEEIHKKIKELaeEAQElheemIELYKEADELRKEA 211
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1804072757  482 -------LKRRSTREMYKEKKTFNQDDSADLRCQL--QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSL 543
Cdd:COG1340    212 delhkeiVEAQEKADELHEEIIELQKELRELRKELkkLRKKQRALKREKEKEELEEKAEEIFEKLKKGEKL 282
PTZ00121 PTZ00121
MAEBL; Provisional
352-496 4.94e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.52  E-value: 4.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  352 DELDELRAEMEEMRDsylEEDVYQLQELRRELDRANKNCRILQyrlRKAEQKSlKVAETGQVDGELIRSLEQDLKVAKDV 431
Cdd:PTZ00121  1656 EEENKIKAAEEAKKA---EEDKKKAEEAKKAEEDEKKAAEALK---KEAEEAK-KAEELKKKEAEEKKKAEELKKAEEEN 1728
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1804072757  432 SVRLHHELKTVEEKRAKAE----DENETLRQQMIEVEISKQALQNELER---LKESSLKRRSTREMYKEKKT 496
Cdd:PTZ00121  1729 KIKAEEAKKEAEEDKKKAEeakkDEEEKKKIAHLKKEEEKKAEEIRKEKeavIEEELDEEDEKRRMEVDKKI 1800
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
149-331 5.08e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.25  E-value: 5.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  149 GAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRishtdsssdlsdcpsePLSDEQRLLP 228
Cdd:PRK12323   371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPA----------------RRSPAPEALA 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  229 AASSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPE------PPALLAAPLAAGAcPGGRSIP--SGVSGGFAGPGVAE 300
Cdd:PRK12323   435 AARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPvaaaaaAAPARAAPAAAPA-PADDDPPpwEELPPEFASPAPAQ 513
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1804072757  301 DVRGRSPPER---PVPGTPKEPSLGEQSRLVPAA 331
Cdd:PRK12323   514 PDAAPAGWVAesiPDPATADPDDAFETLAPAPAA 547
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
351-538 6.16e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 45.03  E-value: 6.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  351 KDELDELRAEMEEMRDSY------LEEDVYQLQELRRELDRANK-------NCRILQYRLRKAEQ--KSLKVAETGQ--- 412
Cdd:PRK02224   383 REEIEELEEEIEELRERFgdapvdLGNAEDFLEELREERDELREreaeleaTLRTARERVEEAEAllEAGKCPECGQpve 462
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  413 ----VDG-----ELIRSLEQDLKVAKDVSVRLHHELKTVEEKrAKAEDENETLRQQmieVEISKQALQNELERLKESSLK 483
Cdd:PRK02224   463 gsphVETieedrERVEELEAELEDLEEEVEEVEERLERAEDL-VEAEDRIERLEER---REDLEELIAERRETIEEKRER 538
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1804072757  484 RRSTREMYKEKKTfnqdDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQ 538
Cdd:PRK02224   539 AEELRERAAELEA----EAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIE 589
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
348-516 8.91e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.52  E-value: 8.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  348 DYLKDELDELRAEMEEMRDSY-----LEEDVYQLQ----ELRRELDRANKNCRILQYRLRKAEQ-----KSLKVAETGQV 413
Cdd:COG4913    664 ASAEREIAELEAELERLDASSddlaaLEEQLEELEaeleELEEELDELKGEIGRLEKELEQAEEeldelQDRLEAAEDLA 743
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  414 DGELIRSLEQDL------KVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQ--------MIEVEISKQAL---QNELER 476
Cdd:COG4913    744 RLELRALLEERFaaalgdAVERELRENLEERIDALRARLNRAEEELERAMRAfnrewpaeTADLDADLESLpeyLALLDR 823
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1804072757  477 LKESSLKRRsTREMYKEKKTFNQDDSADLRCQLQFAKEEA 516
Cdd:COG4913    824 LEEDGLPEY-EERFKELLNENSIEFVADLLSKLRRAIREI 862
BAR cd07307
The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization module that binds membranes and detects ...
376-587 9.38e-04

The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization module that binds membranes and detects membrane curvature; BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions including organelle biogenesis, membrane trafficking or remodeling, and cell division and migration. Mutations in BAR containing proteins have been linked to diseases and their inactivation in cells leads to altered membrane dynamics. A BAR domain with an additional N-terminal amphipathic helix (an N-BAR) can drive membrane curvature. These N-BAR domains are found in amphiphysins and endophilins, among others. BAR domains are also frequently found alongside domains that determine lipid specificity, such as the Pleckstrin Homology (PH) and Phox Homology (PX) domains which are present in beta centaurins (ACAPs and ASAPs) and sorting nexins, respectively. A FES-CIP4 Homology (FCH) domain together with a coiled coil region is called the F-BAR domain and is present in Pombe/Cdc15 homology (PCH) family proteins, which include Fes/Fes tyrosine kinases, PACSIN or syndapin, CIP4-like proteins, and srGAPs, among others. The Inverse (I)-BAR or IRSp53/MIM homology Domain (IMD) is found in multi-domain proteins, such as IRSp53 and MIM, that act as scaffolding proteins and transducers of a variety of signaling pathways that link membrane dynamics and the underlying actin cytoskeleton. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. The I-BAR domain induces membrane protrusions in the opposite direction compared to classical BAR and F-BAR domains, which produce membrane invaginations. BAR domains that also serve as protein interaction domains include those of arfaptin and OPHN1-like proteins, among others, which bind to Rac and Rho GAP domains, respectively.


Pssm-ID: 153271 [Multi-domain]  Cd Length: 194  Bit Score: 42.43  E-value: 9.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  376 LQELRRELDRANKNCRILQYRLRKAEQKSLKVAEtgqvdgelirSLEqdlKVAKDVSVRLHHELKTVEEKRAKAEDENET 455
Cdd:cd07307      2 LDELEKLLKKLIKDTKKLLDSLKELPAAAEKLSE----------ALQ---ELGKELPDLSNTDLGEALEKFGKIQKELEE 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  456 LRQQMIE------VEISKQALQNELERLKESSLKRRSTREMYkekktfnqdDSAdlrcqlqFAKEEAflMRKKMAKLGRE 529
Cdd:cd07307     69 FRDQLEQklenkvIEPLKEYLKKDLKEIKKRRKKLDKARLDY---------DAA-------REKLKK--LRKKKKDSSKL 130
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757  530 KdELEQELQKYKSLYGDVdsplptgeaggppstrEAELKLRLKLVEEEANILGRKIVE 587
Cdd:cd07307    131 A-EAEEELQEAKEKYEEL----------------REELIEDLNKLEEKRKELFLSLLL 171
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
416-627 1.24e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.21  E-value: 1.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  416 ELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLK-------------ESSL 482
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEaelaelekeiaelRAEL 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  483 KRRST------REMYKEKKT------FNQDDSAD-------LRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSL 543
Cdd:COG4942    100 EAQKEelaellRALYRLGRQpplallLSPEDFLDavrrlqyLKYLAPARREQAEELRADLAELAALRAELEAERAELEAL 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  544 YGDVdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQER-EGPGRDHAPSIPTS 622
Cdd:COG4942    180 LAEL-------------EEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARlEAEAAAAAERTPAA 246

                   ....*
gi 1804072757  623 PFGDS 627
Cdd:COG4942    247 GFAAL 251
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
110-262 1.29e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.55  E-value: 1.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  110 KPPPGAGaraaggakaalgsrrAARVAPAEPLSRAGKP-PGAEPPSAAAKGRKAKrgsRAPPARTVGPPTPAARI-PAVT 187
Cdd:PRK14951   365 KPAAAAE---------------AAAPAEKKTPARPEAAaPAAAPVAQAAAAPAPA---AAPAAAASAPAAPPAAApPAPV 426
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1804072757  188 LAVTSVAGSPARcsriSHTDSSSDLSDCPSEPLSDEQRLLPAAssdAESGTGSSDREPPRGAPTPSPAARGAPPG 262
Cdd:PRK14951   427 AAPAAAAPAAAP----AAAPAAVALAPAPPAQAAPETVAIPVR---VAPEPAVASAAPAPAAAPAAARLTPTEEG 494
PRK01156 PRK01156
chromosome segregation protein; Provisional
348-714 1.47e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 43.74  E-value: 1.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  348 DYLKDELDELRAEMEEMrdSYLEEDVY----QLQELRRELDRANKNCRILqyrLRKAEQKSlkvAETGQVDGELI--RSL 421
Cdd:PRK01156   169 DKLKDVIDMLRAEISNI--DYLEEKLKssnlELENIKKQIADDEKSHSIT---LKEIERLS---IEYNNAMDDYNnlKSA 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENetlrqqmieVEISkqALQNELERLKESSL--KRRSTREMYKEKKTFNQ 499
Cdd:PRK01156   241 LNELSSLEDMKNRYESEIKTAESDLSMELEKN---------NYYK--ELEERHMKIINDPVykNRNYINDYFKYKNDIEN 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  500 ---------------DDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDS--------------- 549
Cdd:PRK01156   310 kkqilsnidaeinkyHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESlkkkieeysknierm 389
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  550 PLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGpGRDHAPSIPTSpFGDslE 629
Cdd:PRK01156   390 SAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLN-GQSVCPVCGTT-LGE--E 465
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  630 SSTELRRH----LQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREpgwlgegASPGAGGGAPLQEELKSARLQI 705
Cdd:PRK01156   466 KSNHIINHynekKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINK-------SINEYNKIESARADLEDIKIKI 538

                   ....*....
gi 1804072757  706 SELSGKVLK 714
Cdd:PRK01156   539 NELKDKHDK 547
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
499-728 1.56e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.21  E-value: 1.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  499 QDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVdsplptgeaggppSTREAELKLRLKLVEEEA 578
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAAL-------------ARRIRALEQELAALEAEL 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  579 NILGRKIVELEVENRGLKAEMEDMRGQQEREgpGRDHAPSIPTSP--FGDSLESSTELRRHLQFVEEEAELLRRSISEIE 656
Cdd:COG4942     86 AELEKEIAELRAELEAQKEELAELLRALYRL--GRQPPLALLLSPedFLDAVRRLQYLKYLAPARREQAEELRADLAELA 163
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1804072757  657 DHNRQLTHELSKfkfepprepgwLGEGASPGAGGGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQR 728
Cdd:COG4942    164 ALRAELEAERAE-----------LEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEE 224
Rootletin pfam15035
Ciliary rootlet component, centrosome cohesion;
391-493 1.76e-03

Ciliary rootlet component, centrosome cohesion;


Pssm-ID: 464459 [Multi-domain]  Cd Length: 190  Bit Score: 41.56  E-value: 1.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  391 RILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDL-----KVAKDVSVRLHHELKTVEEKRAKAED---ENETLRQQMIE 462
Cdd:pfam15035   24 KVLQYKKRCSELEQQLLEKTSELEKTELLLRKLTLeprlqRLEREHSADLEEALIRLEEERQRSESlsqVNSLLREQLEQ 103
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1804072757  463 VEISKQALQNELERL-------------KESSLKRRstREMYKE 493
Cdd:pfam15035  104 ASRANEALREDLQKLtndwerareeleqKESEWRKE--EEAFNE 145
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
338-672 1.98e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 43.13  E-value: 1.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  338 REMEELRSENDYLKDELDELRAEMEEMRDS---------YLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVA 408
Cdd:PRK03918   262 RELEERIEELKKEIEELEEKVKELKELKEKaeeyiklseFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLE 341
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  409 ETgqvdGELIRSLEQDLKVAKDvSVRLHHELKTVEE-----KRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SS 481
Cdd:PRK03918   342 EL----KKKLKELEKRLEELEE-RHELYEEAKAKKEelerlKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITAriGE 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  482 LKRRSTRemykEKKTFNQDDSADLRCQLQFA----KEEAFLMRK---KMAKLGREKDELEQELQKYKSLYGDVDSPLptg 554
Cdd:PRK03918   417 LKKEIKE----LKKAIEELKKAKGKCPVCGRelteEHRKELLEEytaELKRIEKELKEIEEKERKLRKELRELEKVL--- 489
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  555 eAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVEN-RGLKAEMEDMRGQQERegpgrdhapsiptspFGDSLESSTE 633
Cdd:PRK03918   490 -KKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEyEKLKEKLIKLKGEIKS---------------LKKELEKLEE 553
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 1804072757  634 LrrhlqfvEEEAELLRRSISEIEDHNRQLTHELSKFKFE 672
Cdd:PRK03918   554 L-------KKKLAELEKKLDELEEELAELLKELEELGFE 585
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
340-607 2.41e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 2.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  340 MEELRSENDYLKDELDELRAEMEEMRDSYLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKsLKVAETGQVDGELIR 419
Cdd:COG4717    165 LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEE-LEQLENELEAAALEE 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  420 SLEQDLKVAKDVSVRL------------------------------------HHELKTVEEKRAKAEDENETLRQQMIEV 463
Cdd:COG4717    244 RLKEARLLLLIAAALLallglggsllsliltiagvlflvlgllallflllarEKASLGKEAEELQALPALEELEEEELEE 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  464 EISKQALQNELERLKESSLKRRSTRemYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKM------AKLGREKDELEQEL 537
Cdd:COG4717    324 LLAALGLPPDLSPEELLELLDRIEE--LQELLREAEELEEELQLEELEQEIAALLAEAGVedeeelRAALEQAEEYQELK 401
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  538 QKYKSLYGDVDSPLPTGEAGGPPSTREaELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQE 607
Cdd:COG4717    402 EELEELEEQLEELLGELEELLEALDEE-ELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGE 470
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
12-185 2.44e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 2.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   12 APDAKLQPPGQHHRHHHLHPV--AERRRLHRAPSPARPFLKDL----HARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAP 85
Cdd:PRK12323   391 APAAAAPAPAAPPAAPAAAPAaaAAARAVAAAPARRSPAPEALaaarQASARGPGGAPAPAPAPAAAPAAAARPAAAGPR 470
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   86 PSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRagkPPGAEPPSAAAKGRKAKRG 165
Cdd:PRK12323   471 PVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPD---PATADPDDAFETLAPAPAA 547
                          170       180
                   ....*....|....*....|
gi 1804072757  166 SRAPPARTVGPPTPAARIPA 185
Cdd:PRK12323   548 APAPRAAAATEPVVAPRPPR 567
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
392-609 2.47e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 2.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  392 ILQYRLRKAEQKSLKV-AETGQVDGELIRSLEQDLKvakdvsvrlhhELKTVEEKRAKAEDENETLRQQMIEVEISKQAL 470
Cdd:COG4717     46 MLLERLEKEADELFKPqGRKPELNLKELKELEEELK-----------EAEEKEEEYAELQEELEELEEELEELEAELEEL 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  471 QNELERLKesslKRRSTREMYKEKKTFNQDDsADLRCQLQFAKEEaflmRKKMAKLGREKDELEQELQKYKSLygdvdsp 550
Cdd:COG4717    115 REELEKLE----KLLQLLPLYQELEALEAEL-AELPERLEELEER----LEELRELEEELEELEAELAELQEE------- 178
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1804072757  551 LPTGEAGGPPSTREA--ELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQERE 609
Cdd:COG4717    179 LEELLEQLSLATEEElqDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAA 239
PHA03247 PHA03247
large tegument protein UL36; Provisional
7-311 2.53e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 2.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757    7 PAGGGAPD------AKLQPPGQHHRHHHLHPVAERRRLHRAPsparpflkdLHARPAAPGPAVPSSGRAPAPAAPRSPNL 80
Cdd:PHA03247   206 PSGPGPAApadltaAALHLYGASETYLQDEPFVERRVVISHP---------LRGDIAAPAPPPVVGEGADRAPETARGAT 276
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   81 AGKAPPSPGSLAAPGRLSRRS-----GGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKP-------- 147
Cdd:PHA03247   277 GPPPPPEAAAPNGAAAPPDGVwgaalAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAMEVVSPLPRPRQHYPlgfpkrrr 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  148 PGAEPPSAA---AKGRKAKRgsRAPPARTVGPPTPAARIPavtlavtsVAGSPArcsrishtdssSDLSDCPSEPlsdeq 224
Cdd:PHA03247   357 PTWTPPSSLedlSAGRHHPK--RASLPTRKRRSARHAATP--------FARGPG-----------GDDQTRPAAP----- 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  225 rlLPAASSDAESGTGSSDREPPRGAPTPSP-AARGAPPGSPeppallaaplAAGACPGGRSIPSGVSGGFAGPGVAEDVR 303
Cdd:PHA03247   411 --VPASVPTPAPTPVPASAPPPPATPLPSAePGSDDGPAPP----------PERQPPAPATEPAPDDPDDATRKALDALR 478

                   ....*...
gi 1804072757  304 GRSPPERP 311
Cdd:PHA03247   479 ERRPPEPP 486
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
31-156 2.60e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.78  E-value: 2.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   31 PVAERRRLHRaPSPARPFLKDLHARPAAPGPAVPSSGrapapaaprspnlAGKAPPSPGSLAAPGRLSRRSGGVPGAKDK 110
Cdd:PRK14951   374 APAEKKTPAR-PEAAAPAAAPVAQAAAAPAPAAAPAA-------------AASAPAAPPAAAPPAPVAAPAAAAPAAAPA 439
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1804072757  111 PPPGAGARAAGGAKAALGSRRA--ARVAPAEPLSRAGKPPGAEPPSAA 156
Cdd:PRK14951   440 AAPAAVALAPAPPAQAAPETVAipVRVAPEPAVASAAPAPAAAPAAAR 487
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
350-487 2.77e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 42.70  E-value: 2.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKncrilqyRLRKAEQKSLKVAETgqvDGELIRSLEQDLKvak 429
Cdd:COG3206    268 LRAQLAELEAELAELSARYTPNHP-DVIALRAQIAALRA-------QLQQEAQRILASLEA---ELEALQAREASLQ--- 333
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1804072757  430 dvsvrlhhelKTVEEKRAKAEDENETLRQQMI---EVEISKQALQNELERLKESSLKRRST 487
Cdd:COG3206    334 ----------AQLAQLEARLAELPELEAELRRlerEVEVARELYESLLQRLEEARLAEALT 384
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
344-671 2.81e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 42.70  E-value: 2.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  344 RSENDYLKDELDELRAEMEEmrdsyLEEDVYQLQELRRELDRANKNCRIlQYRLRKAEQKSLKVAETGQvdGELIRSLEQ 423
Cdd:TIGR04523  376 KKENQSYKQEIKNLESQIND-----LESKIQNQEKLNQQKDEQIKKLQQ-EKELLEKEIERLKETIIKN--NSEIKDLTN 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  424 DLKVAKdvsvrlhhelKTVEEKRAKAEDENETLRQQMIEVEISKQALQN---ELERlKESSLKrrstreMYKEKKTfnqd 500
Cdd:TIGR04523  448 QDSVKE----------LIIKNLDNTRESLETQLKVLSRSINKIKQNLEQkqkELKS-KEKELK------KLNEEKK---- 506
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  501 dsaDLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELqkyKSLYGDVDsplptgeaggppstreaELKLRLK--LVEEEA 578
Cdd:TIGR04523  507 ---ELEEKVKDLTKKISSLKEKIEKLESEKKEKESKI---SDLEDELN-----------------KDDFELKkeNLEKEI 563
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  579 NILGRKIVELEVENRGLKA---EMEDMRGQQEREgpgrdhapsiptspfgdslesSTELRRHLQFVEEEAELLRRSISEI 655
Cdd:TIGR04523  564 DEKNKEIEELKQTQKSLKKkqeEKQELIDQKEKE---------------------KKDLIKEIEEKEKKISSLEKELEKA 622
                          330
                   ....*....|....*.
gi 1804072757  656 EDHNRQLTHELSKFKF 671
Cdd:TIGR04523  623 KKENEKLSSIIKNIKS 638
PHA03378 PHA03378
EBNA-3B; Provisional
36-184 2.97e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.75  E-value: 2.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   36 RRLHRAPSPARPflkdlhaRPAAPGPAVPSSGRAPAPAAPrspnlAGKAPPSPGSLAAPGRLsRRSGGVPGAKdkPPPGA 115
Cdd:PHA03378   685 LPIQWAPGTMQP-------PPRAPTPMRPPAAPPGRAQRP-----AAATGRARPPAAAPGRA-RPPAAAPGRA--RPPAA 749
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1804072757  116 GARAAGGAKAALGSRRAARVAPaeplsraGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIP 184
Cdd:PHA03378   750 APGRARPPAAAPGRARPPAAAP-------GAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMP 811
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
434-601 3.05e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.45  E-value: 3.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  434 RLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEK--KTFNQDDSADLRCQLQF 511
Cdd:COG1579     21 RLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQlgNVRNNKEYEALQKEIES 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  512 AKEEAFLMRKKMAKLGREKDELEQELQKYKSLYgdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVE 591
Cdd:COG1579    101 LKRRISDLEDEILELMERIEELEEELAELEAEL----------------AELEAELEEKKAELDEELAELEAELEELEAE 164
                          170
                   ....*....|
gi 1804072757  592 NRGLKAEMED 601
Cdd:COG1579    165 REELAAKIPP 174
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
347-600 3.25e-03

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 42.52  E-value: 3.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  347 NDYLKDELDELRAEMEEMRDSY--LEEDVYQ----LQELRRELDRAN---KNCRILQYRLrKAEQKSLKVAETGQVD--- 414
Cdd:pfam12128  599 EEELRERLDKAEEALQSAREKQaaAEEQLVQangeLEKASREETFARtalKNARLDLRRL-FDEKQSEKDKKNKALAerk 677
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  415 ---GELIRSLEQDLKVakdvsvrLHHELKTVEEKRAKAEDENETLRQQMIEVEISkqALQNELERLKESSLKRRSTREmy 491
Cdd:pfam12128  678 dsaNERLNSLEAQLKQ-------LDKKHQAWLEEQKEQKREARTEKQAYWQVVEG--ALDAQLALLKAAIAARRSGAK-- 746
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  492 KEKKTFNQDDSADLRCQ-------LQFAKEEAFLMRKKMAKLGREKDELE----------QELQKYKSLYGDVDSPLPtg 554
Cdd:pfam12128  747 AELKALETWYKRDLASLgvdpdviAKLKREIRTLERKIERIAVRRQEVLRyfdwyqetwlQRRPRLATQLSNIERAIS-- 824
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1804072757  555 EAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEME 600
Cdd:pfam12128  825 ELQQQLARLIADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEMS 870
KASH_CCD pfam14662
Coiled-coil region of CCDC155 or KASH; This coiled-coil region is found in the central part of ...
418-598 3.37e-03

Coiled-coil region of CCDC155 or KASH; This coiled-coil region is found in the central part of KASH or Klarsicht/ANC-1/Syne/homology proteins. KASH are a meiosis-specific proteins that localize at telomeres and interact with SUN1, thus being implicated in meiotic chromosome dynamics and homolog pairing.


Pssm-ID: 405365 [Multi-domain]  Cd Length: 191  Bit Score: 40.55  E-value: 3.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  418 IRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMievEISKQALQNELE--------RLKESSLKRRSTRE 489
Cdd:pfam14662   10 VEDLQANNQKLLQENSKLKATVETREETNAKLLEENLNLRKQA---KSQQQAVQKEKLleeeledlKLIVNSLEEARRSL 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  490 MYKEK-KTFNQDDSA----DLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDvdsplptgeaggppstRE 564
Cdd:pfam14662   87 LAQNKqLEKENQSLLqeieSLQEENKKNQAERDKLQKKKKELLKSKACLKEQLHSCEDLACN----------------RE 150
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1804072757  565 AELKLRLKLVEEEAnilgRKIVELEVENRGLKAE 598
Cdd:pfam14662  151 TILIEKTTQIEELK----STVEEYSSIEEELRAE 180
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
351-665 3.50e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 42.34  E-value: 3.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  351 KDELDELR---AEMEEMRDSYLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLkv 427
Cdd:TIGR00606  244 ENELDPLKnrlKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKEREL-- 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  428 akdvsVRLHHELKtveekraKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREmykekktfnqddSADLRC 507
Cdd:TIGR00606  322 -----VDCQRELE-------KLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQ------------SLATRL 377
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  508 QLQFAKEEAFLMR--KKMAKLGREKdeLEQELQKYKSLYGDVDSPLPTGEAGGppSTREAELKLRLKLVEEEANILGRKI 585
Cdd:TIGR00606  378 ELDGFERGPFSERqiKNFHTLVIER--QEDEAKTAAQLCADLQSKERLKQEQA--DEIRDEKKGLGRTIELKKEILEKKQ 453
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  586 VELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLEssTELRRHLQFVEEEAELLRRSISEIEDhNRQLTHE 665
Cdd:TIGR00606  454 EELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTE--TLKKEVKSLQNEKADLDRKLRKLDQE-MEQLNHH 530
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
56-329 4.98e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.87  E-value: 4.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   56 PAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLSRRS---------GGVP---GAKDKPPPGAGARAAGGA 123
Cdd:pfam17823  115 LAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAaiaaasaphAASPaprTAASSTTAASSTTAASSA 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  124 KAALGSRRAARVAPAEPLSRAGKPPGAepPSAAAKgrKAKRGSRAPPART----VGPPTPA--ARIPAVTLAVTSVAGSP 197
Cdd:pfam17823  195 PTTAASSAPATLTPARGISTAATATGH--PAAGTA--LAAVGNSSPAAGTvtaaVGTVTPAalATLAAAAGTVASAAGTI 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  198 ARCSRISHTDSSSdlSDCPSEPLSDEqrllPAASSDAES-GTGS--SDREP---PRGAPTPSPAARGAPPGSPEPPALLA 271
Cdd:pfam17823  271 NMGDPHARRLSPA--KHMPSDTMARN----PAAPMGAQAqGPIIqvSTDQPvhnTAGEPTPSPSNTTLEPNTPKSVASTN 344
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1804072757  272 APLAAGACPGGRSiPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSLGEQSRLVP 329
Cdd:pfam17823  345 LAVVTTTKAQAKE-PSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAP 401
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
355-561 5.80e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.54  E-value: 5.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  355 DELRAEMEEMRDSYLEEdvyQLQELRRELDRA---------NKNCRILQ---YRLRKAEQKSlKVAETGQVDGELIRSLE 422
Cdd:NF033838    53 NESQKEHAKEVESHLEK---ILSEIQKSLDKRkhtqnvalnKKLSDIKTeylYELNVLKEKS-EAELTSKTKKELDAAFE 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  423 QDLKVAKDVSVRLHHELKTVEEKRAKAEDENE-------TLRQQMIEVEISKQALQnelerLKESSLkrrstrEMYKEKK 495
Cdd:NF033838   129 QFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEedrrnypTNTYKTLELEIAESDVE-----VKKAEL------ELVKEEA 197
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1804072757  496 TFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPS 561
Cdd:NF033838   198 KEPRDEEKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNVATSEQDKPKR 263
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
38-199 6.68e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 6.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757   38 LHRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLSRRS---------GGVPGAK 108
Cdd:PHA03307   757 LFSNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKsrshtpdggSESSGPA 836
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  109 DKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGkppgAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTL 188
Cdd:PHA03307   837 RPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRR----PRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVK 912
                          170
                   ....*....|.
gi 1804072757  189 AVTSVAGSPAR 199
Cdd:PHA03307   913 LGPMPPGGPDP 923
EzrA COG4477
Septation ring formation regulator EzrA [Cell cycle control, cell division, chromosome ...
350-541 8.28e-03

Septation ring formation regulator EzrA [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443574 [Multi-domain]  Cd Length: 567  Bit Score: 40.98  E-value: 8.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDS--YLEEDVY--QLQELRRELDRANKNcrILQYRLRKAEQKsLKVAETgQVDgELIRSLEQDL 425
Cdd:COG4477    227 LPDQLEELKSGYREMKEQgyVLEHLNIekEIEQLEEQLKEALEL--LEELDLDEAEEE-LEEIEE-EID-ELYDLLEKEV 301
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  426 KVAKDVsvrlHHELKTVEEKRAKAEDENETLRqqmIEVEISKQALQ---NELERLKE-----SSLKRR---STREMYKEK 494
Cdd:COG4477    302 EAKKYV----DKNQEELEEYLEHLKEQNRELK---EEIDRVQQSYRlneNELEKVRNlekqiEELEKRydeIDERIEEEK 374
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1804072757  495 KTFN--QDDSADLRCQLQFAKEEAFLMRKKMAKLgrEKDELE--QELQKYK 541
Cdd:COG4477    375 VAYSelQEELEEIEEQLEEIEEEQEEFSEKLKSL--RKDELEarEKLDELK 423
PHA03378 PHA03378
EBNA-3B; Provisional
130-260 8.62e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.21  E-value: 8.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  130 RRAARVAPAEPLSRAgkpPGAEPPSAAAKGRkakrgsRAPPARTVGPPTPAARIPAvtlAVTSVAGSPARCSRISHTDSS 209
Cdd:PHA03378   686 PIQWAPGTMQPPPRA---PTPMRPPAAPPGR------AQRPAAATGRARPPAAAPG---RARPPAAAPGRARPPAAAPGR 753
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1804072757  210 SDLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAP 260
Cdd:PHA03378   754 ARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGP 804
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
350-516 8.65e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 40.58  E-value: 8.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  350 LKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQdLKVAK 429
Cdd:COG3883     35 AQAELDALQAELEELNEEYNELQA-ELEALQAEIDKLQAEIAEAEAEIEERREELGERARALYRSGGSVSYLDV-LLGSE 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1804072757  430 DVS-------------------VRLHHELK-TVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTRE 489
Cdd:COG3883    113 SFSdfldrlsalskiadadadlLEELKADKaELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEA 192
                          170       180
                   ....*....|....*....|....*..
gi 1804072757  490 MYKEKKTFNQDDSADLRCQLQFAKEEA 516
Cdd:COG3883    193 AAEAQLAELEAELAAAEAAAAAAAAAA 219
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH