|
Name |
Accession |
Description |
Interval |
E-value |
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
172-747 |
2.07e-18 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 91.15 E-value: 2.07e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 172 IEDALKKLADAEAKMRAAEALQAE--------------ANRYHRI-AERKLKEVES---REDDLTRRLASFKSECETKEN 233
Cdd:COG1196 174 KEEAERKLEATEENLERLEDILGElerqleplerqaekAERYRELkEELKELEAELlllKLRELEAELEELEAELEELEA 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 234 EMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCA 313
Cdd:COG1196 254 ELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELE 333
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 314 KREEAVSERESSLLKKEQELLVAEEKIASKESELiQNVLANQEVILRKRKSDVEAELECKSKsvEVEIESKRRAWELREV 393
Cdd:COG1196 334 ELEEELEELEEELEEAEEELEEAEAELAEAEEAL-LEAEAELAEAEEELEELAEELLEALRA--AAELAAQLEELEEAEE 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 394 DIKQREDLVGEKEHDLEVQSRALAEKEKDITEKsfnLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLE 473
Cdd:COG1196 411 ALLERLERLEEELEELEEALAELEEEEEEEEEA---LEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELA 487
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 474 DKRKRVDSatQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREE--LRKEAEY 551
Cdd:COG1196 488 EAAARLLL--LLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDevAAAAIEY 565
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 552 ITRQ---REAFSMYLKDERDNIKEERDALRNQHK--NDVESLNREREEFMNKMVEEHSEWLSKIQRERADfllGIEMQKR 626
Cdd:COG1196 566 LKAAkagRATFLPLDKIRARAALAAALARGAIGAavDLVASDLREADARYYVLGDTLLGRTLVAARLEAA---LRRAVTL 642
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 627 ELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKELEHVQVELKRLDAERLEIKLDRERREREWAELKDS 706
Cdd:COG1196 643 AGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELE 722
|
570 580 590 600
....*....|....*....|....*....|....*....|.
gi 332010722 707 VEELKVQREKLETQRHMLRAERDEIRHEIEELKKLENLKVA 747
Cdd:COG1196 723 EEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLE 763
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
172-741 |
2.11e-16 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 84.60 E-value: 2.11e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 172 IEDALKKLADAEAKMRAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQ 251
Cdd:COG1196 234 LRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRE 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 252 EHERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKEQ 331
Cdd:COG1196 314 LEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALR 393
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 332 ELLVAEEKIASKEsELIQNVLANQEVILRKRKSDVEAELEcksksVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEV 411
Cdd:COG1196 394 AAAELAAQLEELE-EAEEALLERLERLEEELEELEEALAE-----LEEEEEEEEEALEEAAEEEAELEEEEEALLELLAE 467
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 412 QSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKS 491
Cdd:COG1196 468 LLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAA 547
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 492 ETSELSTLEMKLKEELDDLRAQKLEmLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAFsmYLKDERDNIK 571
Cdd:COG1196 548 LQNIVVEDDEVAAAAIEYLKAAKAG-RATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADAR--YYVLGDTLLG 624
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 572 EERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREELENSSRDREKAFE 651
Cdd:COG1196 625 RTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEE 704
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 652 QEKKLEEERIQSLKEMAEKELEHVQVELKRLDAERLEIK---------LDRERREREWAELKDSVEELKVQREKLET--- 719
Cdd:COG1196 705 EERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEeeelleeeaLEELPEPPDLEELERELERLEREIEALGPvnl 784
|
570 580 590
....*....|....*....|....*....|...
gi 332010722 720 ----------QRH-MLRAERDEIRHEIEELKKL 741
Cdd:COG1196 785 laieeyeeleERYdFLSEQREDLEEARETLEEA 817
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
93-747 |
9.31e-15 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 79.41 E-value: 9.31e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 93 QYEEIKASVDESDLTHMREKSAYVSALAEAKKREESLKKDVGIAKECISSLE---------KTLHEMRAECAETKVSAGS 163
Cdd:PTZ00121 1068 QDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEakkkaedarKAEEARKAEDARKAEEARK 1147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 164 TMSEAHVMIEDALKKLADAEAKMRAAEALQAEANRyHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLN 243
Cdd:PTZ00121 1148 AEDAKRVEIARKAEDARKAEEARKAEDAKKAEAAR-KAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKA 1226
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 244 ERRKSLQQEHERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNlEIALALCAKREEAVseRE 323
Cdd:PTZ00121 1227 EAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAE-EKKKADEAKKAEEK--KK 1303
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 324 SSLLKKEQELLVAEEKIASKESELIQNVLANQEVILRKRKSDVEAELECKSKSVEVE-IESKRRAWELREVDIKQREDLV 402
Cdd:PTZ00121 1304 ADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEaAEEKAEAAEKKKEEAKKKADAA 1383
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 403 GEKEHDL----EVQSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDlelqQSLTSLEDKRKR 478
Cdd:PTZ00121 1384 KKKAEEKkkadEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKAD----EAKKKAEEAKKA 1459
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 479 VDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREA 558
Cdd:PTZ00121 1460 EEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 559 FSMYLKDERDNIKEERDALRNQHKNDVESLNREREEfmNKMVEEHSEWLSKIQRERadfllgIEMQKRELEYCIENKREE 638
Cdd:PTZ00121 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED--KNMALRKAEEAKKAEEAR------IEEVMKLYEEEKKMKAEE 1611
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 639 LENSSRDREKAfEQEKKLEEERIQSLKEMAEKELEHVQVELKRLDAERLEIKLDRERREREwaELKDSVEELKVQREKLE 718
Cdd:PTZ00121 1612 AKKAEEAKIKA-EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE--EDKKKAEEAKKAEEDEK 1688
|
650 660 670
....*....|....*....|....*....|....*.
gi 332010722 719 TQRHMLRAERDEIR-------HEIEELKKLENLKVA 747
Cdd:PTZ00121 1689 KAAEALKKEAEEAKkaeelkkKEAEEKKKAEELKKA 1724
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
89-738 |
1.11e-14 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 78.95 E-value: 1.11e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 89 ELSSQYEEIKASVDESDLTHMREK-SAYVSALAEAKKREESLKKDVGIAKECISSLEKTLHEMRAECAEtkvsAGSTMSE 167
Cdd:TIGR02168 217 ELKAELRELELALLVLRLEELREElEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE----LQKELYA 292
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 168 AHVMIEDALKKLADAEAKMRAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKEN---EMVIERQTLNE 244
Cdd:TIGR02168 293 LANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAeleELEAELEELES 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 245 RRKSLQQEHERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALAlcaKREEAVSERES 324
Cdd:TIGR02168 373 RLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKEL---QAELEELEEEL 449
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 325 SLLKKEQELLVAEEKIASKESELIQNVL---ANQEVILRKRKSDVEAELECKSKSVEVEIESKRRAWELREVD------I 395
Cdd:TIGR02168 450 EELQEELERLEEALEELREELEEAEQALdaaERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILgvlselI 529
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 396 KQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVAT--EEDINRKTTMLEDEKER-------------LRK 460
Cdd:TIGR02168 530 SVDEGYEAAIEAALGGRLQAVVVENLNAAKKAIAFLKQNELGRVTflPLDSIKGTEIQGNDREIlkniegflgvakdLVK 609
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 461 LDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELS--TLEMKL---------------------KEELDDLRAQKLEM 517
Cdd:TIGR02168 610 FDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRivTLDGDLvrpggvitggsaktnssilerRREIEELEEKIEEL 689
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 518 LAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEfmN 597
Cdd:TIGR02168 690 EEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEEL--E 767
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 598 KMVEEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQslKEMAEKELEHVQV 677
Cdd:TIGR02168 768 ERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERR--IAATERRLEDLEE 845
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 332010722 678 ELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEEL 738
Cdd:TIGR02168 846 QIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELREL 906
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
120-751 |
3.49e-14 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 77.49 E-value: 3.49e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 120 AEAKKREESLKK--DVGIAKECISSLEKTLHEMRAECAETKVSAGSTMSEAHVMIEDALKKLADAEAKMRAAEALQAEAN 197
Cdd:PTZ00121 1274 AEEARKADELKKaeEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 198 RYHRIAERKLKEVESreDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDhifarSQEL 277
Cdd:PTZ00121 1354 AAADEAEAAEEKAEA--AEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADE-----AKKK 1426
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 278 AELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKEQELLVAEEkIASKESELIQNVLANQEV 357
Cdd:PTZ00121 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADE-AKKKAEEAKKKADEAKKA 1505
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 358 ILRKRKSDVEAELECKSKSVEV-EIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKN 436
Cdd:PTZ00121 1506 AEAKKKADEAKKAEEAKKADEAkKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEE 1585
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 437 LVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSE--LSTLEMKLKEELDDLRAQK 514
Cdd:PTZ00121 1586 AKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEekKKAEELKKAEEENKIKAAE 1665
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 515 LEMLAEADRLKVEKAKFEAEWEHidvKREELRKEAEYITRQREAFSMYLKDERDNIKEERDAlRNQHKNDVESLNREREE 594
Cdd:PTZ00121 1666 EAKKAEEDKKKAEEAKKAEEDEK---KAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA-EEENKIKAEEAKKEAEE 1741
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 595 FMNKMVE--EHSEWLSKIQRERADFLLGIEMQKRELEYCIEnkrEELENSSRDREKAFEQEKKLEEERIQSLKEMAEKEL 672
Cdd:PTZ00121 1742 DKKKAEEakKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIE---EELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGN 1818
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 673 EHVQVELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKKL-ENLKVALDDM 751
Cdd:PTZ00121 1819 LVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEAdEIEKIDKDDI 1898
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
42-741 |
5.56e-14 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 76.65 E-value: 5.56e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 42 KRLKDAGFDEQSIKNRDKAALIAYIAKLESEVYDYQHNMGLLLLEKNELSSQYEEIKASVDESDlthmreksayvSALAE 121
Cdd:TIGR02169 279 KKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE-----------REIEE 347
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 122 AKKREESLKkdvgiakECISSLEKTLHEMRAECAETKVSAGSTMSEahvmIEDALKKLADAEAKMraaEALQAEANRYHR 201
Cdd:TIGR02169 348 ERKRRDKLT-------EEYAELKEELEDLRAELEEVDKEFAETRDE----LKDYREKLEKLKREI---NELKRELDRLQE 413
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 202 IAERKLKEVESREDDLTR---RLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQELA 278
Cdd:TIGR02169 414 ELQRLSEELADLNAAIAGieaKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA 493
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 279 ELEKGLDTAKTTFEEERKAFEDKKSN---------------------LEIA----LALCAKREEAVSERESSLLKKEQ-- 331
Cdd:TIGR02169 494 EAEAQARASEERVRGGRAVEEVLKASiqgvhgtvaqlgsvgeryataIEVAagnrLNNVVVEDDAVAKEAIELLKRRKag 573
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 332 ------------------------------ELLVAEEKIAS------KESELIQNVLANQEVILRKRKSDVEAELECKSK 375
Cdd:TIGR02169 574 ratflplnkmrderrdlsilsedgvigfavDLVEFDPKYEPafkyvfGDTLVVEDIEAARRLMGKYRMVTLEGELFEKSG 653
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 376 SVeVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEK 455
Cdd:TIGR02169 654 AM-TGGSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEE 732
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 456 ERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSE-TSELSTLEMKLKEELDDLRAQKLEML-AEADRLKVEKAKFEA 533
Cdd:TIGR02169 733 EKLKERLEELEEDLSSLEQEIENVKSELKELEARIEElEEDLHKLEEALNDLEARLSHSRIPEIqAELSKLEEEVSRIEA 812
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 534 EWEHIDVKREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQhKNDVESLNREREEFMNKmVEEHSEWLSKIQRE 613
Cdd:TIGR02169 813 RLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENL-NGKKEELEEELEELEAA-LRDLESRLGDLKKE 890
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 614 RADF---LLGIEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEK-ELEHVQVELKRLDAERLEI 689
Cdd:TIGR02169 891 RDELeaqLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEElSLEDVQAELQRVEEEIRAL 970
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|..
gi 332010722 690 KLDRERREREWAELKDSVEELKVQREKLEtqrhmlrAERDEIRHEIEELKKL 741
Cdd:TIGR02169 971 EPVNMLAIQEYEEVLKRLDELKEKRAKLE-------EERKAILERIEEYEKK 1015
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
183-762 |
1.05e-13 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 75.95 E-value: 1.05e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 183 EAKMRAAEALQAEANRYHriaERKLKEVESREDDLTRRLASFKSECETKENEM--VIERQTLNERRKSLQQEHERLLDAQ 260
Cdd:PTZ00121 1234 EAKKDAEEAKKAEEERNN---EEIRKFEEARMAHFARRQAAIKAEEARKADELkkAEEKKKADEAKKAEEKKKADEAKKK 1310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 261 VSLNQREDHIFARSQELaelEKGLDTAKTTFEEERKAFEDKKSNLEIAlalCAKREEAVSERESSLLKKEQELLVAEEKI 340
Cdd:PTZ00121 1311 AEEAKKADEAKKKAEEA---KKKADAAKKKAEEAKKAAEAAKAEAEAA---ADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 341 ASKESELIQNVLANQEVILRKRKSDVEAELECKSKSVEVE--IESKRRAWELRevdiKQREDLVGEKEHDLEVQSRALAE 418
Cdd:PTZ00121 1385 KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKkkAEEKKKADEAK----KKAEEAKKADEAKKKAEEAKKAE 1460
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 419 KEKDITEKSFNLDEKEKnlvatEEDINRKTTMLEDEKERLRKLDLELQQSltslEDKRKRVDSATQKLEALKSETSELST 498
Cdd:PTZ00121 1461 EAKKKAEEAKKADEAKK-----KAEEAKKADEAKKKAEEAKKKADEAKKA----AEAKKKADEAKKAEEAKKADEAKKAE 1531
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 499 LEMKLKEELDDLRAQKLEMLAEADRL-------KVEKAKFEAEWEHIDVKREELRKEAEyitRQREAFSMYLKDERDNIK 571
Cdd:PTZ00121 1532 EAKKADEAKKAEEKKKADELKKAEELkkaeekkKAEEAKKAEEDKNMALRKAEEAKKAE---EARIEEVMKLYEEEKKMK 1608
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 572 EERDALRNQHKNDVESLNRERE-----EFMNKMVEEHSEWLSKIQRERADFLLGIEMQKRELEYcIENKREELENSSRDR 646
Cdd:PTZ00121 1609 AEEAKKAEEAKIKAEELKKAEEekkkvEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE-DKKKAEEAKKAEEDE 1687
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 647 EKAFEQEKKLEEE--RIQSLKEMAEKELEHVQvELKRLDAERL----EIKLDRERREREWAELKDSVEELKVQREKLETQ 720
Cdd:PTZ00121 1688 KKAAEALKKEAEEakKAEELKKKEAEEKKKAE-ELKKAEEENKikaeEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEE 1766
|
570 580 590 600
....*....|....*....|....*....|....*....|..
gi 332010722 721 RHMLRAERDEIRHEIEELKKLENLKVALDDMSMAKMQLSNLE 762
Cdd:PTZ00121 1767 EKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFA 1808
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
62-743 |
4.44e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 73.94 E-value: 4.44e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 62 LIAYIAKLESEVYDYQHNMGLLLLEKNELSSQYEEIKASVDESDlthmREKSAYVSALAEAKKREESLKKDVGIAKECIS 141
Cdd:TIGR02168 272 LRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLE----RQLEELEAQLEELESKLDELAEELAELEEKLE 347
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 142 SLEKTLHEMRAECAE---TKVSAGSTMSEAHVMIEDALKKLADAEAKMRAAEA-LQAEANRYHRIAERKLKEVESREDDL 217
Cdd:TIGR02168 348 ELKEELESLEAELEEleaELEELESRLEELEEQLETLRSKVAQLELQIASLNNeIERLEARLERLEDRRERLQQEIEELL 427
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 218 TRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDhifARSQELAELEKGLDTAKTTFEEERKA 297
Cdd:TIGR02168 428 KKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALD---AAERELAQLQARLDSLERLQENLEGF 504
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 298 FEDKKsnleiALALCAKREEAVSERESSLLKKEQELLVAEEKIASkesELIQNVLANQeviLRKRKSDVEAELECKSKSV 377
Cdd:TIGR02168 505 SEGVK-----ALLKNQSGLSGILGVLSELISVDEGYEAAIEAALG---GRLQAVVVEN---LNAAKKAIAFLKQNELGRV 573
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 378 EVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRA---------------LAEKEKDITEKSFNLDEKEKNLVATEE 442
Cdd:TIGR02168 574 TFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDpklrkalsyllggvlVVDDLDNALELAKKLRPGYRIVTLDGD 653
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 443 DINRKTTMLEDEKERlRKLDLELQQSLTSLEDKRKR----VDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEML 518
Cdd:TIGR02168 654 LVRPGGVITGGSAKT-NSSILERRREIEELEEKIEEleekIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 519 AEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAFSMYLKDERDNIkEERDALRNQHKNDVESLNREREEFmnk 598
Cdd:TIGR02168 733 KDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEI-EELEAQIEQLKEELKALREALDEL--- 808
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 599 mveehSEWLSKIQRERADFLLGIEMQKRELEYCienkREELENSSRDREKAFEQEKKLEEER--IQSLKEMAEKELEHVQ 676
Cdd:TIGR02168 809 -----RAELTLLNEEAANLRERLESLERRIAAT----ERRLEDLEEQIEELSEDIESLAAEIeeLEELIEELESELEALL 879
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 332010722 677 VELKRLDAERleikldrerrerewaelkdsvEELKVQREKLETQRHMLRAERDEIRHEIEELKKLEN 743
Cdd:TIGR02168 880 NERASLEEAL---------------------ALLRSELEELSEELRELESKRSELRRELEELREKLA 925
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
183-727 |
3.06e-12 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 70.84 E-value: 3.06e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 183 EAKMRAAEALQAEANRYHRIAERkLKEVESREDDLtRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLdAQVS 262
Cdd:PRK02224 227 EQREQARETRDEADEVLEEHEER-REELETLEAEI-EDLRETIAETEREREELAEEVRDLRERLEELEEERDDLL-AEAG 303
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 263 LNQREDHifARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSEressllkkeqellvAEEKIAS 342
Cdd:PRK02224 304 LDDADAE--AVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEE--------------LREEAAE 367
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 343 KESELiqnvlANQEVILRKRKSDVEAelecksksVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKD 422
Cdd:PRK02224 368 LESEL-----EEAREAVEDRREEIEE--------LEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEAT 434
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 423 ITEKSFNLDEKEKNLVATE--------EDINRKTTmLEDEKERLRKLDLELQQsltsLEDKRKRVDSATQKLEALKSETS 494
Cdd:PRK02224 435 LRTARERVEEAEALLEAGKcpecgqpvEGSPHVET-IEEDRERVEELEAELED----LEEEVEEVEERLERAEDLVEAED 509
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 495 ELSTLEMKlKEELDDLRAQKLEMLAE----ADRLKVEKAKFEAEWEhidVKREELRKEAEYITRQREAFSMyLKDERDNI 570
Cdd:PRK02224 510 RIERLEER-REDLEELIAERRETIEEkrerAEELRERAAELEAEAE---EKREAAAEAEEEAEEAREEVAE-LNSKLAEL 584
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 571 KEERDALrnqhkNDVESLNREREEFMNKMVE--EHSEWLSKIQRERADFLLGIEMQKRELEYC-----IENKREELENSS 643
Cdd:PRK02224 585 KERIESL-----ERIRTLLAAIADAEDEIERlrEKREALAELNDERRERLAEKRERKRELEAEfdearIEEAREDKERAE 659
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 644 RDREKAFEQEKKLEEER--IQSLKEMAEKELEhvqvELKRLDAERLEIkldrerrerewAELKDSVEELKVQREKLETQR 721
Cdd:PRK02224 660 EYLEQVEEKLDELREERddLQAEIGAVENELE----ELEELRERREAL-----------ENRVEALEALYDEAEELESMY 724
|
....*.
gi 332010722 722 HMLRAE 727
Cdd:PRK02224 725 GDLRAE 730
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
318-655 |
3.62e-12 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 70.86 E-value: 3.62e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 318 AVSERESSLLKKEQELLVAEEKIASKESELIQnvlANQEVI-LRKRKSDVEAELEckskSVEVEIESKRRAWELREVDIK 396
Cdd:TIGR02168 664 GSAKTNSSILERRREIEELEEKIEELEEKIAE---LEKALAeLRKELEELEEELE----QLRKELEELSRQISALRKDLA 736
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 397 QREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKR 476
Cdd:TIGR02168 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLN 816
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 477 KRVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQR 556
Cdd:TIGR02168 817 EEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSEL 896
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 557 EAFSMYLKDERDNIKEERDALR------NQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERADFLLGIEMQKRELEY 630
Cdd:TIGR02168 897 EELSEELRELESKRSELRRELEelreklAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKR 976
|
330 340
....*....|....*....|....*
gi 332010722 631 cIENKREELENSSRDREKAFEQEKK 655
Cdd:TIGR02168 977 -LENKIKELGPVNLAAIEEYEELKE 1000
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
432-763 |
9.68e-12 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 69.32 E-value: 9.68e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 432 EKEKNLVATEEDINRKTTMLEDEKERLRKL------------------DLELQQSLTSLEDKRKRVDSATQKLEALKSET 493
Cdd:TIGR02168 176 ETERKLERTRENLDRLEDILNELERQLKSLerqaekaerykelkaelrELELALLVLRLEELREELEELQEELKEAEEEL 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 494 SELSTLEMKLKEELDDLRAQKLEMLAEADR-------LKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAfsmyLKDE 566
Cdd:TIGR02168 256 EELTAELQELEEKLEELRLEVSELEEEIEElqkelyaLANEISRLEQQKQILRERLANLERQLEELEAQLEE----LESK 331
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 567 RDNIKEERDALRNQhkndVESLNREREEFMNKMVEEHSEWlskiqreradfllgiemqkRELEYCIENKREELENSSRDR 646
Cdd:TIGR02168 332 LDELAEELAELEEK----LEELKEELESLEAELEELEAEL-------------------EELESRLEELEEQLETLRSKV 388
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 647 EKAFEQEKKLEEERIQslkemAEKELEHVQVELKRLDAERLEikLDRERREREWAELKDSVEELKVQREKLETQRHMLRA 726
Cdd:TIGR02168 389 AQLELQIASLNNEIER-----LEARLERLEDRRERLQQEIEE--LLKKLEEAELKELQAELEELEEELEELQEELERLEE 461
|
330 340 350
....*....|....*....|....*....|....*..
gi 332010722 727 ERDEIRHEIEELKklENLKVALDDMSMAKMQLSNLER 763
Cdd:TIGR02168 462 ALEELREELEEAE--QALDAAERELAQLQARLDSLER 496
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
173-762 |
1.53e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 68.93 E-value: 1.53e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 173 EDALKKLADAEAKMRAAEALQAEANR----YHRIAER--KLKEVESREDDL-----TRRLASFKSECETKE---NEMVIE 238
Cdd:TIGR02168 175 KETERKLERTRENLDRLEDILNELERqlksLERQAEKaeRYKELKAELRELelallVLRLEELREELEELQeelKEAEEE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 239 RQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEA 318
Cdd:TIGR02168 255 LEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 319 VSERESSLlkkEQELLVAEEKIASKESELiqnvlanqevilrKRKSDVEAELECKSKSVEVEIESKRRAWELREVDIKQR 398
Cdd:TIGR02168 335 LAEELAEL---EEKLEELKEELESLEAEL-------------EELEAELEELESRLEELEEQLETLRSKVAQLELQIASL 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 399 EDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEedINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKR 478
Cdd:TIGR02168 399 NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAE--LEELEEELEELQEELERLEEALEELREELEEAEQA 476
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 479 VDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKV--EKAKFEAEWE------------HIDVKREE 544
Cdd:TIGR02168 477 LDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVlsELISVDEGYEaaieaalggrlqAVVVENLN 556
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 545 LRKEAEYITRQREAFS---MYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSK------------ 609
Cdd:TIGR02168 557 AAKKAIAFLKQNELGRvtfLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGvlvvddldnale 636
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 610 ------------------------IQRERADFLLGIEMQKRELEYCiENKREELENSSRDREKAFeQEKKLEEERIQSLK 665
Cdd:TIGR02168 637 lakklrpgyrivtldgdlvrpggvITGGSAKTNSSILERRREIEEL-EEKIEELEEKIAELEKAL-AELRKELEELEEEL 714
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 666 EMAEKELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKklENLK 745
Cdd:TIGR02168 715 EQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE--AQIE 792
|
650
....*....|....*..
gi 332010722 746 VALDDMSMAKMQLSNLE 762
Cdd:TIGR02168 793 QLKEELKALREALDELR 809
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
202-751 |
2.15e-11 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 68.14 E-value: 2.15e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 202 IAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLldaqvslnqredhifarsQELAELE 281
Cdd:PRK02224 196 IEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERR------------------EELETLE 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 282 KGLDTAKTTFEE---ERKAFEDKKSNL-EIALALCAKREEAVSEREssLLKKEQELLVAEEKIASKESELIQNVLANQEV 357
Cdd:PRK02224 258 AEIEDLRETIAEterEREELAEEVRDLrERLEELEEERDDLLAEAG--LDDADAEAVEARREELEDRDEELRDRLEECRV 335
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 358 ILRKRKSDVEAELEcksksveveieskrRAWELREVDIKQREDlVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNL 437
Cdd:PRK02224 336 AAQAHNEEAESLRE--------------DADDLEERAEELREE-AAELESELEEAREAVEDRREEIEELEEEIEELRERF 400
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 438 VATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEM 517
Cdd:PRK02224 401 GDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEEL 480
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 518 LAEADRLKVEKAKFEAEWEHIdVKREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKN--DVESLNREREEF 595
Cdd:PRK02224 481 EAELEDLEEEVEEVEERLERA-EDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERaaELEAEAEEKREA 559
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 596 MNKM---VEEHSEWLSKIQRERADFLLGIEMQKReleycIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKEL 672
Cdd:PRK02224 560 AAEAeeeAEEAREEVAELNSKLAELKERIESLER-----IRTLLAAIADAEDEIERLREKREALAELNDERRERLAEKRE 634
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 673 EHVQVELKrLDAERLE-IKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKKLENLKVALDDM 751
Cdd:PRK02224 635 RKRELEAE-FDEARIEeAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEELRERREALENRVEALEAL 713
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
117-594 |
9.48e-11 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 66.22 E-value: 9.48e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 117 SALAEAKKREESLKKDVGIAKECISSLEKTLHEMRAECAETKVSAgSTMSEAHVMIEDALKKLADAEAKMR-AAEALQAE 195
Cdd:PRK02224 265 ETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADA-EAVEARREELEDRDEELRDRLEECRvAAQAHNEE 343
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 196 ANRYHRIAerklKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQ 275
Cdd:PRK02224 344 AESLREDA----DDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELRE 419
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 276 ELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKeqellvaEEKIASKESELIQnvlanq 355
Cdd:PRK02224 420 ERDELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETIEED-------RERVEELEAELED------ 486
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 356 eviLRKRKSDVEAELECKSKSVEVEieskRRAWELREvDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEK 435
Cdd:PRK02224 487 ---LEEEVEEVEERLERAEDLVEAE----DRIERLEE-RREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKRE 558
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 436 NLVATEEDINRKTTMLEDEKERLRKLDLELqQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKL 515
Cdd:PRK02224 559 AAAEAEEEAEEAREEVAELNSKLAELKERI-ESLERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAEKRERKR 637
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 516 EMLAEADRLKVEKAK-----FEAEWEHIDVKREELRKEAEYITRQREAFSMYLKdERDNIKEERDALRNQHKnDVESLNR 590
Cdd:PRK02224 638 ELEAEFDEARIEEARedkerAEEYLEQVEEKLDELREERDDLQAEIGAVENELE-ELEELRERREALENRVE-ALEALYD 715
|
....
gi 332010722 591 EREE 594
Cdd:PRK02224 716 EAEE 719
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
85-673 |
1.36e-10 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 65.93 E-value: 1.36e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 85 LEKNELSSQYEEIKASVDESDLTHMREKSAYVSALAEAKKREESLKKDVGIAKECISSLEKT--LHEMRAECAETKVSAG 162
Cdd:PTZ00121 1283 LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAaeAAKAEAEAAADEAEAA 1362
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 163 STMSEAHVMIEDALKKLAD-----AEAKMRAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVI 237
Cdd:PTZ00121 1363 EEKAEAAEKKKEEAKKKADaakkkAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEE 1442
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 238 ERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQE---LAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAK 314
Cdd:PTZ00121 1443 AKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEakkADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 315 REEAVSERESSllKKEQELLVAEEKIASKESELIQNVLANQEV--ILRKRKSDVEAELECKSKSVEVEIESKRRAWELR- 391
Cdd:PTZ00121 1523 KADEAKKAEEA--KKADEAKKAEEKKKADELKKAEELKKAEEKkkAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKl 1600
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 392 --EVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEK------------------NLVATEEDINRKTTML 451
Cdd:PTZ00121 1601 yeEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKkkaeelkkaeeenkikaaEEAKKAEEDKKKAEEA 1680
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 452 ----EDEKERLRKLDLELQQSlTSLEDKRKRVDSATQKLEALKSETSE--LSTLEMKLKEELDDLRAQKLEMlAEADRLK 525
Cdd:PTZ00121 1681 kkaeEDEKKAAEALKKEAEEA-KKAEELKKKEAEEKKKAEELKKAEEEnkIKAEEAKKEAEEDKKKAEEAKK-DEEEKKK 1758
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 526 VEKAKFEAEWEHIDVKREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSE 605
Cdd:PTZ00121 1759 IAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADS 1838
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 332010722 606 wlSKIQRERADfllgiEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKELE 673
Cdd:PTZ00121 1839 --KNMQLEEAD-----AFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIEKIDKDDIE 1899
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
293-788 |
5.72e-10 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 63.54 E-value: 5.72e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 293 EERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKEQELLVAEEKIASKESELIQnvlanqeviLRKRKSDVEaELEC 372
Cdd:PRK03918 169 EVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEK---------LEKEVKELE-ELKE 238
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 373 KSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVqsraLAEKEKDITEKSFNLDEKEKnLVATEEDINRKTTMLE 452
Cdd:PRK03918 239 EIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE----LEEKVKELKELKEKAEEYIK-LSEFYEEYLDELREIE 313
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 453 DEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEmklkEELDDLRAQKLEMLAEADRLK-VEKAKF 531
Cdd:PRK03918 314 KRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERH----ELYEEAKAKKEELERLKKRLTgLTPEKL 389
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 532 EAEWEHIDVKREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQ 611
Cdd:PRK03918 390 EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELK 469
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 612 rERADFLLGIEMQKRELEYCIENKREELENssrdrEKAFEQEKKLEEEriqsLKEMAEKELEHVQVELKRLDAERLEIKL 691
Cdd:PRK03918 470 -EIEEKERKLRKELRELEKVLKKESELIKL-----KELAEQLKELEEK----LKKYNLEELEKKAEEYEKLKEKLIKLKG 539
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 692 DRERREREWAELkdsvEELKVQREKLETQRHMLRAERDEIRHEIEEL---------KKLENLKVALDDMSMAKMQLSNLE 762
Cdd:PRK03918 540 EIKSLKKELEKL----EELKKKLAELEKKLDELEEELAELLKELEELgfesveeleERLKELEPFYNEYLELKDAEKELE 615
|
490 500
....*....|....*....|....*.
gi 332010722 763 RSWEKVSALKQKVVSRDDELDLQNGV 788
Cdd:PRK03918 616 REEKELKKLEEELDKAFEELAETEKR 641
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
172-738 |
1.43e-09 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 62.24 E-value: 1.43e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 172 IEDALKKLADAEAKMRAAEALQAEANRYHriAERKLKEVESREDDLTRRLASFKSECETKENEmvieRQTLNERRKSLQQ 251
Cdd:COG4913 257 IRELAERYAAARERLAELEYLRAALRLWF--AQRRLELLEAELEELRAELARLEAELERLEAR----LDALREELDELEA 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 252 EH------------ERLLDAQVSLNQREdhifarsQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAV 319
Cdd:COG4913 331 QIrgnggdrleqleREIERLERELEERE-------RRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEAL 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 320 SERESSLLKKEQELlvaEEKIASKESELIQnvlanqeviLRKRKSDVEAEL---------ECKSKSVE-------VEIES 383
Cdd:COG4913 404 EEALAEAEAALRDL---RRELRELEAEIAS---------LERRKSNIPARLlalrdalaeALGLDEAElpfvgelIEVRP 471
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 384 KRRAWE------------------------LREVD-IKQREDLVGEKEHDLEVQSRALAEKEKDITEKsfnLDEKEKNL- 437
Cdd:COG4913 472 EEERWRgaiervlggfaltllvppehyaaaLRWVNrLHLRGRLVYERVRTGLPDPERPRLDPDSLAGK---LDFKPHPFr 548
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 438 ---------------VATEEDINR------KTTMLEDEKERLRKldlelqqsltsleDKRKRVDS-------ATQKLEAL 489
Cdd:COG4913 549 awleaelgrrfdyvcVDSPEELRRhpraitRAGQVKGNGTRHEK-------------DDRRRIRSryvlgfdNRAKLAAL 615
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 490 KSETSELSTLEMKLKEELDDLRAQKLEMlaeADRLKVEKAKFEAEWEHIDVkrEELRKEAEYITRQREAFS------MYL 563
Cdd:COG4913 616 EAELAELEEELAEAEERLEALEAELDAL---QERREALQRLAEYSWDEIDV--ASAEREIAELEAELERLDassddlAAL 690
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 564 KDERDNIKEERDALRNQhkndVESLNREREefmnKMVEEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREEL--EN 641
Cdd:COG4913 691 EEQLEELEAELEELEEE----LDELKGEIG----RLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAAlgDA 762
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 642 SSRDREKAFEQEKKLEEERIQSLKEMAEK---------------------ELEHVQVELKRLDAERLEikldreRREREW 700
Cdd:COG4913 763 VERELRENLEERIDALRARLNRAEEELERamrafnrewpaetadldadleSLPEYLALLDRLEEDGLP------EYEERF 836
|
650 660 670
....*....|....*....|....*....|....*...
gi 332010722 701 AELKDSVEElkvqrEKLETQRHMLRAERDEIRHEIEEL 738
Cdd:COG4913 837 KELLNENSI-----EFVADLLSKLRRAIREIKERIDPL 869
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
352-684 |
2.24e-09 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 61.62 E-value: 2.24e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 352 LANQEVILRKRKSDVEAELECKSKSVEV---EIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSF 428
Cdd:TIGR02169 686 LKRELSSLQSELRRIENRLDELSQELSDasrKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEA 765
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 429 NLDEKEKNLVATEEDINRKTTMLEDEKERlrkldlELQQSLTSLEDKRKRVDSATQKLEA-LKSETSELSTLEMK---LK 504
Cdd:TIGR02169 766 RIEELEEDLHKLEEALNDLEARLSHSRIP------EIQAELSKLEEEVSRIEARLREIEQkLNRLTLEKEYLEKEiqeLQ 839
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 505 EELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAFSMYLKDERDNI---------KEERD 575
Cdd:TIGR02169 840 EQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIeeleaqiekKRKRL 919
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 576 ALRNQHKNDVESLNREREEFMNKMVEEHSEWLS--KIQRERadflLGIEMQKRELEYCIENKREELENSSRDREKAFEQE 653
Cdd:TIGR02169 920 SELKAKLEALEEELSEIEDPKGEDEEIPEEELSleDVQAEL----QRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKR 995
|
330 340 350
....*....|....*....|....*....|.
gi 332010722 654 KKLEEERIQSLKEMAEKELEHVQVELKRLDA 684
Cdd:TIGR02169 996 AKLEEERKAILERIEEYEKKKREVFMEAFEA 1026
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
313-658 |
5.30e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 60.46 E-value: 5.30e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 313 AKREEAVSERESSLLKKEQELLVAEEKIASKESELI-----QNVLANQEVILRKRKSDVEAELECKSKSV---EVEIESK 384
Cdd:TIGR02168 666 AKTNSSILERRREIEELEEKIEELEEKIAELEKALAelrkeLEELEEELEQLRKELEELSRQISALRKDLarlEAEVEQL 745
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 385 RRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLE 464
Cdd:TIGR02168 746 EERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRER 825
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 465 LQQSLTSLEDKRKRVDSATQK-------LEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEH 537
Cdd:TIGR02168 826 LESLERRIAATERRLEDLEEQieelsedIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRE 905
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 538 IDVKREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVEslnrEREEFMNKMVEEHSEW---LSKIQRER 614
Cdd:TIGR02168 906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLE----EAEALENKIEDDEEEArrrLKRLENKI 981
|
330 340 350 360
....*....|....*....|....*....|....*....|....*....
gi 332010722 615 ADF----LLGIEmqkrelEYCIENKR-EELENSSRDREKAfeqEKKLEE 658
Cdd:TIGR02168 982 KELgpvnLAAIE------EYEELKERyDFLTAQKEDLTEA---KETLEE 1021
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
378-762 |
8.15e-09 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 59.70 E-value: 8.15e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 378 EVEIESKRRAWELREVdIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKER 457
Cdd:TIGR02169 181 EVEENIERLDLIIDEK-RQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEE 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 458 LRKLDLELQQSLTSLEDKRKRVDSATQklealkSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEH 537
Cdd:TIGR02169 260 ISELEKRLEEIEQLLEELNKKIKDLGE------EEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDK 333
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 538 IDVKREELRKEAEYITRQREAfsmyLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEehsewlskiqreradf 617
Cdd:TIGR02169 334 LLAEIEELEREIEEERKRRDK----LTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREK---------------- 393
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 618 llgIEMQKRELEycienkreelenssrdrekafeqEKKLEEERIQSLKEMAEKELEHVQVELKRLDAERLEIKldrerre 697
Cdd:TIGR02169 394 ---LEKLKREIN-----------------------ELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELE------- 440
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 332010722 698 rewAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKKLEnlkvalDDMSMAKMQLSNLE 762
Cdd:TIGR02169 441 ---EEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVE------KELSKLQRELAEAE 496
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
410-741 |
1.02e-08 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 59.28 E-value: 1.02e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 410 EVQSRALAEKEKDITEKsfnLDEKEKNLVATEEDINRkttmLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEAL 489
Cdd:PRK02224 191 QLKAQIEEKEEKDLHER---LNGLESELAELDEEIER----YEEQREQARETRDEADEVLEEHEERREELETLEAEIEDL 263
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 490 KSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYItrqreafsmylkdeRDN 569
Cdd:PRK02224 264 RETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEEL--------------RDR 329
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 570 IKEERDALrNQHKNDVESLN---REREEFMNKMVEEHSEWLSKIQRERADFLLGiEMQKRELEYCIENKREELENSSRDR 646
Cdd:PRK02224 330 LEECRVAA-QAHNEEAESLRedaDDLEERAEELREEAAELESELEEAREAVEDR-REEIEELEEEIEELRERFGDAPVDL 407
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 647 EKAFEQEKKLEEERIQSLKEMAEKELEHVQVELKRLDAERL--EIKLDRERREREWAELKDSVEELKVQREKLETQRHML 724
Cdd:PRK02224 408 GNAEDFLEELREERDELREREAELEATLRTARERVEEAEALleAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDL 487
|
330
....*....|....*..
gi 332010722 725 RAERDEIRHEIEELKKL 741
Cdd:PRK02224 488 EEEVEEVEERLERAEDL 504
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
510-782 |
2.13e-08 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 58.41 E-value: 2.13e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 510 LRAQKLEmlAEADRLKVEKAKfeAEWEHIDVKREELRKEAEYITRQREAfsmyLKDERDNIKEERDALRNQHKNDVESLN 589
Cdd:COG1196 213 ERYRELK--EELKELEAELLL--LKLRELEAELEELEAELEELEAELEE----LEAELAELEAELEELRLELEELELELE 284
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 590 REREEFmNKMVEEHSEWLSKIQRERAdfllgiemQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAE 669
Cdd:COG1196 285 EAQAEE-YELLAELARLEQDIARLEE--------RRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEE 355
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 670 KELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKKLENLKVALD 749
Cdd:COG1196 356 AEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEE 435
|
250 260 270
....*....|....*....|....*....|...
gi 332010722 750 DMSMAKMQLSNLERSWEKVSALKQKVVSRDDEL 782
Cdd:COG1196 436 EEEEEEEALEEAAEEEAELEEEEEALLELLAEL 468
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
276-577 |
2.29e-08 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 58.54 E-value: 2.29e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 276 ELAELEKGLDTAKTTFEE---ERKAFEDKKSNLEIALALCA-KREEAVSERESSLLKKEQELLVAEEKIASKESELIQNV 351
Cdd:TIGR02169 178 ELEEVEENIERLDLIIDEkrqQLERLRREREKAERYQALLKeKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLT 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 352 LANQEviLRKRKSDVEAELECKSKSVEVEIESKRRAWELR----EVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKS 427
Cdd:TIGR02169 258 EEISE--LEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKigelEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLL 335
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 428 FNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEEL 507
Cdd:TIGR02169 336 AEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEEL 415
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 508 DDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAfsmyLKDERDNIKEERDAL 577
Cdd:TIGR02169 416 QRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSK----YEQELYDLKEEYDRV 481
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
83-518 |
3.80e-08 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 57.74 E-value: 3.80e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 83 LLLEKNELSSQYEEIKASVDESDLTHMREKSAYVSALAEAKKREESLKKdvgiAKECISSLEKTLHEMRAECAETKvsag 162
Cdd:PRK02224 312 VEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEE----LREEAAELESELEEAREAVEDRR---- 383
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 163 STMSEAHVMIEDALKKLADAEAKMRAA----EALQAEANRYH-RIAER--KLKEVESREDDLTRRLASFK-SEC--ETKE 232
Cdd:PRK02224 384 EEIEELEEEIEELRERFGDAPVDLGNAedflEELREERDELReREAELeaTLRTARERVEEAEALLEAGKcPECgqPVEG 463
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 233 NEMVierQTLNERRKSLQQEHERLLDAQVSLNQREDHIfARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALC 312
Cdd:PRK02224 464 SPHV---ETIEEDRERVEELEAELEDLEEEVEEVEERL-ERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERA 539
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 313 AKREEAVSERESSLLKKEQELLVAEEKIASKESELIQnvlanqeviLRKRKSDVEAELECKSKSVEV--EIESKRRAWEL 390
Cdd:PRK02224 540 EELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAE---------LNSKLAELKERIESLERIRTLlaAIADAEDEIER 610
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 391 REVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFN-LDEKEKNLVATEEDINRKTTMLEDEKERLRK----LDLEL 465
Cdd:PRK02224 611 LREKREALAELNDERRERLAEKRERKRELEAEFDEARIEeAREDKERAEEYLEQVEEKLDELREERDDLQAeigaVENEL 690
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|...
gi 332010722 466 QQsLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEElddLRAQKLEML 518
Cdd:PRK02224 691 EE-LEELRERREALENRVEALEALYDEAEELESMYGDLRAE---LRQRNVETL 739
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
380-773 |
6.52e-08 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 56.70 E-value: 6.52e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 380 EIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEknLVATEEDINRKTTMLEDEKERLR 459
Cdd:COG4717 75 ELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLP--LYQELEALEAELAELPERLEELE 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 460 KLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMK-LKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHI 538
Cdd:COG4717 153 ERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQdLAEELEELQQRLAELEEELEEAQEELEELEEELEQL 232
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 539 DVKREELRKEAEYITRQREAFSM---------------------------------------YLKDERDNIKEERDALRN 579
Cdd:COG4717 233 ENELEAAALEERLKEARLLLLIAaallallglggsllsliltiagvlflvlgllallflllaREKASLGKEAEELQALPA 312
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 580 QHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREELEN--------SSRDREKAFE 651
Cdd:COG4717 313 LEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAAllaeagveDEEELRAALE 392
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 652 QEKKLEE--ERIQSLKEMAEKELEHVQVELKRLDAERLEIKLdrerrerewaelkdsvEELKVQREKLETQRHMLRAERD 729
Cdd:COG4717 393 QAEEYQElkEELEELEEQLEELLGELEELLEALDEEELEEEL----------------EELEEELEELEEELEELREELA 456
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 332010722 730 EIRHEIEELKKLENLKVALDDMSMAKMQLSNLERSWEKVSALKQ 773
Cdd:COG4717 457 ELEAELEQLEEDGELAELLQELEELKAELRELAEEWAALKLALE 500
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
40-786 |
7.05e-08 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 56.90 E-value: 7.05e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 40 MWKRLKDAGFDEQSIKNRDKAALIAYIAKLESEVYDYQHNMGLLLLEKNELSSQYEEIKASVDE---SDLTHMREKSAYV 116
Cdd:pfam02463 150 MKPERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYqlkEKLELEEEYLLYL 229
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 117 SALAEAKKREESLKKDVGIAKECISSLEKTLHEMRAECAETKVSAGSTMSEAHVMIEDALKKLADAEAKMRAAEALQAEA 196
Cdd:pfam02463 230 DYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 309
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 197 NRYHRIAERKLKEVESREDDLTRRLASF-------KSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDH 269
Cdd:pfam02463 310 VDDEEKLKESEKEKKKAEKELKKEKEEIeelekelKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSA 389
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 270 IFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKEQELLVAEEKIASKESELIQ 349
Cdd:pfam02463 390 AKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKK 469
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 350 NVLANQEVILRKRKSDVEAELECKSK--SVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKS 427
Cdd:pfam02463 470 SEDLLKETQLVKLQEQLELLLSRQKLeeRSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAV 549
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 428 FNLDEKEKNLVATEE-----------DINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSEL 496
Cdd:pfam02463 550 IVEVSATADEVEERQklvraltelplGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGIL 629
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 497 STLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAFSMYLKDERDNIKEER-- 574
Cdd:pfam02463 630 KDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQRek 709
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 575 DALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERADFLLGIE--MQKRELEYCIENKREELENSSRDREKAFEQ 652
Cdd:pfam02463 710 EELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLkkEEKEEEKSELSLKEKELAEEREKTEKLKVE 789
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 653 EKKLEEERIQSLKEMAEKELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVE-------ELKVQREKLETQRHMLR 725
Cdd:pfam02463 790 EEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEqkleklaEEELERLEEEITKEELL 869
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 332010722 726 AERDEIRHEIEELKKLENLKVALDDMSMAKMQLSNLERSWEKVSALKQKVVSRDDELDLQN 786
Cdd:pfam02463 870 QELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEIL 930
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
62-753 |
7.99e-08 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 56.67 E-value: 7.99e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 62 LIAYIAK--LESEVYDYQHNMGLLLLEKNELSSQYEEIKASVDESDL---THMREKSAYVSALAEAKKREESLKKDVGia 136
Cdd:pfam15921 67 IIAYPGKehIERVLEEYSHQVKDLQRRLNESNELHEKQKFYLRQSVIdlqTKLQEMQMERDAMADIRRRESQSQEDLR-- 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 137 keciSSLEKTLHEMRAecaeTKVSAGSTMSEAHVMIEDALKKLADAEAKMRAAEAL-----QAEANRYHR---------- 201
Cdd:pfam15921 145 ----NQLQNTVHELEA----AKCLKEDMLEDSNTQIEQLRKMMLSHEGVLQEIRSIlvdfeEASGKKIYEhdsmstmhfr 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 202 ----IAERKLKEVESREDDLTRRLASFKSECET----KENEMVIERQTLNERRKSLQQEHERLLD----------AQVSL 263
Cdd:pfam15921 217 slgsAISKILRELDTEISYLKGRIFPVEDQLEAlkseSQNKIELLLQQHQDRIEQLISEHEVEITgltekassarSQANS 296
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 264 NQREDHIF---ARSQ------ELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALC-AKREEAVSERESsllkkeqel 333
Cdd:pfam15921 297 IQSQLEIIqeqARNQnsmymrQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLAnSELTEARTERDQ--------- 367
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 334 lvaeekiASKESELIQNVLANQEVILRKRKSDVEAELECKSK------SVEVEIESKRRAWELREVDIKQREDLV----G 403
Cdd:pfam15921 368 -------FSQESGNLDDQLQKLLADLHKREKELSLEKEQNKRlwdrdtGNSITIDHLRRELDDRNMEVQRLEALLkamkS 440
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 404 EKEHDLEVQSRALAEKEKDIteksfnldekeknlvateEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSAT 483
Cdd:pfam15921 441 ECQGQMERQMAAIQGKNESL------------------EKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLT 502
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 484 QKLE----ALKSETSELSTLEMKLkeeldDLRAQKLEMLA-EADRLKVEKAKFEA---EWEHIDVKREELRKEAEYITR- 554
Cdd:pfam15921 503 ASLQekerAIEATNAEITKLRSRV-----DLKLQELQHLKnEGDHLRNVQTECEAlklQMAEKDKVIEILRQQIENMTQl 577
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 555 -------------QREAFSMYLKDERDNIKE------ERDALRNQHKNDVESLNREREEFMNKMvEEHSEWLSKIQRERA 615
Cdd:pfam15921 578 vgqhgrtagamqvEKAQLEKEINDRRLELQEfkilkdKKDAKIRELEARVSDLELEKVKLVNAG-SERLRAVKDIKQERD 656
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 616 DFLLGIEMQKRELEYCIE----------NKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKELEHVQVEL---KRL 682
Cdd:pfam15921 657 QLLNEVKTSRNELNSLSEdyevlkrnfrNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAMgmqKQI 736
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 683 DAERLEIKLDRERREREWAELKDSVEE---LKVQREKLETQRHMLRAERDEIRHEIEELK--------KLENLKVALDDM 751
Cdd:pfam15921 737 TAKRGQIDALQSKIQFLEEAMTNANKEkhfLKEEKNKLSQELSTVATEKNKMAGELEVLRsqerrlkeKVANMEVALDKA 816
|
..
gi 332010722 752 SM 753
Cdd:pfam15921 817 SL 818
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
117-474 |
9.48e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 56.22 E-value: 9.48e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 117 SALAEAKKREESLKKDVGIAKECISSLEKTLHEMRAECAETKVSAGSTMSEAH---VMIEDALKKLADAEAKMRAAEALQ 193
Cdd:TIGR02168 691 EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEqleERIAQLSKELTELEAEIEELEERL 770
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 194 AEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFAR 273
Cdd:TIGR02168 771 EEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEEL 850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 274 SQELAELEKGLDTAKTTFEEERKAfedkksnLEIALALCAKREEAVSERESSLLKKEQELLVAEEKIASKESELIQnvla 353
Cdd:TIGR02168 851 SEDIESLAAEIEELEELIEELESE-------LEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEE---- 919
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 354 nqeviLRKRKSDVEAELEckskSVEVEIESKR-RAWELREVDikqrEDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDe 432
Cdd:TIGR02168 920 -----LREKLAQLELRLE----GLEVRIDNLQeRLSEEYSLT----LEEAEALENKIEDDEEEARRRLKRLENKIKELG- 985
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 332010722 433 kEKNLVATEEdinrkttmLEDEKERLRKLDLE---LQQSLTSLED 474
Cdd:TIGR02168 986 -PVNLAAIEE--------YEELKERYDFLTAQkedLTEAKETLEE 1021
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
172-742 |
2.08e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 55.07 E-value: 2.08e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 172 IEDALKKLADAEAKMRAAEALQAEAnryhriaERKLKEVESREDDLTRRLASFKSECETKE---NEMVIERQTLNERRKS 248
Cdd:PRK03918 181 LEKFIKRTENIEELIKEKEKELEEV-------LREINEISSELPELREELEKLEKEVKELEelkEEIEELEKELESLEGS 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 249 LQQEHERLLDAQVSLNQREDHIFARSQELAELE--KGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSL 326
Cdd:PRK03918 254 KRKLEEKIRELEERIEELKKEIEELEEKVKELKelKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKEL 333
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 327 LKKEQELlvaeEKIASKESELIQNvlanqeviLRKRKSDVEAELECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKE 406
Cdd:PRK03918 334 EEKEERL----EELKKKLKELEKR--------LEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKE 401
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 407 HdlevqsraLAEKEKDITEKSFNLDEKEKNLVATEEDIN---------RKTTMLEDEKERLRKLDLELQQS---LTSLED 474
Cdd:PRK03918 402 E--------IEEEISKITARIGELKKEIKELKKAIEELKkakgkcpvcGRELTEEHRKELLEEYTAELKRIekeLKEIEE 473
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 475 KRKRVDSATQKLEALKSETSELSTLE------MKLKEELDDLRAQKLEMLA-EADRLKVEKAKFEAEwehIDVKREELRK 547
Cdd:PRK03918 474 KERKLRKELRELEKVLKKESELIKLKelaeqlKELEEKLKKYNLEELEKKAeEYEKLKEKLIKLKGE---IKSLKKELEK 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 548 EAEYITRQREafsmyLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSkiqreradfLLGIEMQKRE 627
Cdd:PRK03918 551 LEELKKKLAE-----LEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLE---------LKDAEKELER 616
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 628 LEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKELEHVQVELKRLDAErleikldrerrereWAELKDSV 707
Cdd:PRK03918 617 EEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRE--------------LAGLRAEL 682
|
570 580 590
....*....|....*....|....*....|....*
gi 332010722 708 EELKVQREKLETQRHMLRAERDEIRHEIEELKKLE 742
Cdd:PRK03918 683 EELEKRREEIKKTLEKLKEELEEREKAKKELEKLE 717
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
83-685 |
3.17e-07 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 54.54 E-value: 3.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 83 LLLEKNELSSQYEEIKASVDesDLTHMREksayvsALAEAKKREESLKKdvgiakecISSLEKTLHEMRAECAETKVSAg 162
Cdd:COG4913 216 YMLEEPDTFEAADALVEHFD--DLERAHE------ALEDAREQIELLEP--------IRELAERYAAARERLAELEYLR- 278
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 163 stmseAHVMIEDALKKLADAEAKMRAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETK-ENEMVIERQT 241
Cdd:COG4913 279 -----AALRLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGGDRLEQlEREIERLERE 353
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 242 LNERRKSLQQEHERLLDAQVSLNQREDhifarsqELAELEKGLDTAKTTFEEERKAFEDKKSNLEIAL-ALCAKREEAVS 320
Cdd:COG4913 354 LEERERRRARLEALLAALGLPLPASAE-------EFAALRAEAAALLEALEEELEALEEALAEAEAALrDLRRELRELEA 426
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 321 ERES-----SLLKKEQELLVAE--EKIASKESEL-------------------IQNVLANQ------------------- 355
Cdd:COG4913 427 EIASlerrkSNIPARLLALRDAlaEALGLDEAELpfvgelievrpeeerwrgaIERVLGGFaltllvppehyaaalrwvn 506
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 356 ----------EVILRKRKSDVEAELECKSKSVEVEI-ESKRRAW------------------ELREVD--------IKQR 398
Cdd:COG4913 507 rlhlrgrlvyERVRTGLPDPERPRLDPDSLAGKLDFkPHPFRAWleaelgrrfdyvcvdspeELRRHPraitragqVKGN 586
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 399 EDLvGEKEHDLEVQSR---------ALAEKEKDITEksfnLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDlELQQSL 469
Cdd:COG4913 587 GTR-HEKDDRRRIRSRyvlgfdnraKLAALEAELAE----LEEELAEAEERLEALEAELDALQERREALQRLA-EYSWDE 660
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 470 TSLEDKRKRVDSATQKLEALKSETSELSTLEmklkEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEA 549
Cdd:COG4913 661 IDVASAEREIAELEAELERLDASSDDLAALE----EQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRL 736
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 550 EYITRQREAFSMYLKDER-------DNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERADfLLGIE 622
Cdd:COG4913 737 EAAEDLARLELRALLEERfaaalgdAVERELRENLEERIDALRARLNRAEEELERAMRAFNREWPAETADLDAD-LESLP 815
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 332010722 623 mqkrelEYciENKREELENSS--RDREKAFEQEKKLEEERIQSLKEMAEKELEHVQVELKRLDAE 685
Cdd:COG4913 816 ------EY--LALLDRLEEDGlpEYEERFKELLNENSIEFVADLLSKLRRAIREIKERIDPLNDS 872
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
190-737 |
1.19e-06 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 52.66 E-value: 1.19e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 190 EALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDH 269
Cdd:TIGR00618 179 TQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKK 258
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 270 IFARSQELAELEkgldtaKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKEQELLVAEEKIAsKESELIQ 349
Cdd:TIGR00618 259 QQLLKQLRARIE------ELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRA-KLLMKRA 331
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 350 NVLANQEVILRKRKSDVEAELECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFN 429
Cdd:TIGR00618 332 AHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQAT 411
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 430 LDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEeldd 509
Cdd:TIGR00618 412 IDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETR---- 487
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 510 lraQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITR--QREAFSMYLKDERDNIKEERDALRNQHKNDVES 587
Cdd:TIGR00618 488 ---KKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRmqRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQ 564
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 588 LNREREEF--MNKMVEEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREELEnssRDREKAFEQEKKLEEERIQSLK 665
Cdd:TIGR00618 565 MQEIQQSFsiLTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLR---KLQPEQDLQDVRLHLQQCSQEL 641
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 332010722 666 EMAEKELEHVQVELKRlDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLrAERDEIRHEIEE 737
Cdd:TIGR00618 642 ALKLTALHALQLTLTQ-ERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEML-AQCQTLLRELET 711
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
211-777 |
4.34e-06 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 50.79 E-value: 4.34e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 211 ESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQELAELEKGLdtakTT 290
Cdd:TIGR04523 32 DTEEKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDL----SK 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 291 FEEERKAFEDKKSNLEIALALCAKREEAVSERE----SSLLKKEQELlvaeEKIASKESELIQNV--LANQEVILRKRKS 364
Cdd:TIGR04523 108 INSEIKNDKEQKNKLEVELNKLEKQKKENKKNIdkflTEIKKKEKEL----EKLNNKYNDLKKQKeeLENELNLLEKEKL 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 365 DVEAELEcksksvevEIESKRRAWELREVDIKQREdlvgEKEHDLEVQSRALAEKEKDITeksfnldekeKNLVATEEDI 444
Cdd:TIGR04523 184 NIQKNID--------KIKNKLLKLELLLSNLKKKI----QKNKSLESQISELKKQNNQLK----------DNIEKKQQEI 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 445 NRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELstlemklKEELDDLRAQKlemlaEADRL 524
Cdd:TIGR04523 242 NEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQL-------KSEISDLNNQK-----EQDWN 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 525 KVEKAKFEAEWEHIDVKREELRKEAEYITRqreafsmyLKDERDNIKEERDALRNqhknDVESLNREREEFMNKmveehs 604
Cdd:TIGR04523 310 KELKSELKNQEKKLEEIQNQISQNNKIISQ--------LNEQISQLKKELTNSES----ENSEKQRELEEKQNE------ 371
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 605 ewLSKIQRERADFLLGIEmqkreleycienkreELENSSRDREKAFEQEKKLEEERIQSLKEmAEKELEHVQVELKRLDA 684
Cdd:TIGR04523 372 --IEKLKKENQSYKQEIK---------------NLESQINDLESKIQNQEKLNQQKDEQIKK-LQQEKELLEKEIERLKE 433
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 685 ERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKKleNLKVALDDMSMAKMQLSNLErs 764
Cdd:TIGR04523 434 TIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQK--ELKSKEKELKKLNEEKKELE-- 509
|
570
....*....|...
gi 332010722 765 wEKVSALKQKVVS 777
Cdd:TIGR04523 510 -EKVKDLTKKISS 521
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
87-780 |
6.84e-06 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 50.22 E-value: 6.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 87 KNELSSQYEEIKASVDESDLthmrEKSAYVSALAEAKKREESLKKDVGIakecissLEKTLHEMRAECAETKVSAGSTMS 166
Cdd:pfam12128 289 NQLLRTLDDQWKEKRDELNG----ELSAADAAVAKDRSELEALEDQHGA-------FLDADIETAAADQEQLPSWQSELE 357
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 167 EAHVMIEDALKKLADAEAKMRAAEALQAEANRyhRIAERKLKEVESREDDLTRRLASFKSECETKENEMvieRQTLNERR 246
Cdd:pfam12128 358 NLEERLKALTGKHQDVTAKYNRRRSKIKEQNN--RDIAGIKDKLAKIREARDRQLAVAEDDLQALESEL---REQLEAGK 432
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 247 KSLQQEHERLLDAQVSLNQREDHIFARSQELAELEKG---LDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERE 323
Cdd:pfam12128 433 LEFNEEEYRLKSRLGELKLRLNQATATPELLLQLENFderIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQAS 512
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 324 SSLLKKEQELLVAEEKIASKESELIQnVLANQEVILRKRKSDVEA-ELECKSKSVEveieskrrawELREVDIKQREDLV 402
Cdd:pfam12128 513 RRLEERQSALDELELQLFPQAGTLLH-FLRKEAPDWEQSIGKVISpELLHRTDLDP----------EVWDGSVGGELNLY 581
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 403 GEKEHDLEVQSRALAEKEKDITEKsfnLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSA 482
Cdd:pfam12128 582 GVKLDLKRIDVPEWAASEEELRER---LDKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTALKNARLDLRRL 658
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 483 TQKLEALKSETSELSTLEMKLK-EELDDLRAQKlemlaeadrlKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAfsm 561
Cdd:pfam12128 659 FDEKQSEKDKKNKALAERKDSAnERLNSLEAQL----------KQLDKKHQAWLEEQKEQKREARTEKQAYWQVVEG--- 725
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 562 YLKDERDNIKEERDALRNQHKNDVESLNRERE-EFMNKMVEEhsEWLSKIQRERADFllgiemqKRELEYCIENKREELe 640
Cdd:pfam12128 726 ALDAQLALLKAAIAARRSGAKAELKALETWYKrDLASLGVDP--DVIAKLKREIRTL-------ERKIERIAVRRQEVL- 795
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 641 nssrdREKAFEQEK-KLEEERIQSLKEMAEKELEHVQVELKRLDAErleikldrerrerewaelkdsveeLKVQREKLET 719
Cdd:pfam12128 796 -----RYFDWYQETwLQRRPRLATQLSNIERAISELQQQLARLIAD------------------------TKLRRAKLEM 846
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 332010722 720 QRHMLRAERDEIRHEIEELK----KLENLKVALDDMSMAKM------QLSNLERSWEKVSALKQKVVSRDD 780
Cdd:pfam12128 847 ERKASEKQQVRLSENLRGLRcemsKLATLKEDANSEQAQGSigerlaQLEDLKLKRDYLSESVKKYVEHFK 917
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
204-609 |
7.43e-06 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 50.02 E-value: 7.43e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 204 ERKLKEVESRE---DDLTRRLASFKSECETKENEMviERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQELAEL 280
Cdd:TIGR04523 270 SEKQKELEQNNkkiKELEKQLNQLKSEISDLNNQK--EQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQL 347
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 281 EKGLDTAKTTFEEERKAFEDKKSNLEIA-------LALCAKREEAVSERESSLLKKEQELLVAEEKIASKESELiqNVLA 353
Cdd:TIGR04523 348 KKELTNSESENSEKQRELEEKQNEIEKLkkenqsyKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEK--ELLE 425
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 354 NQEVILRKRKSDVEAE---LECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDI---TEKS 427
Cdd:TIGR04523 426 KEIERLKETIIKNNSEikdLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELkklNEEK 505
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 428 FNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKR------VDSATQKLEALKSETSELSTLEM 501
Cdd:TIGR04523 506 KELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKenlekeIDEKNKEIEELKQTQKSLKKKQE 585
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 502 KLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREafsmYLKDERDNIKEERDALRNQh 581
Cdd:TIGR04523 586 EKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKN----KLKQEVKQIKETIKEIRNK- 660
|
410 420
....*....|....*....|....*...
gi 332010722 582 KNDVESLNREREEFMNKMVEEHSEWLSK 609
Cdd:TIGR04523 661 WPEIIKKIKESKTKIDDIIELMKDWLKE 688
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
367-535 |
9.61e-06 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 49.39 E-value: 9.61e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 367 EAELECKSKSVEVEIESKRRAWELREvdikqredlvgekehdlevqsralaEKEKDITEKSFNLDEKEKNLVATEEDINR 446
Cdd:PRK12704 46 EAKKEAEAIKKEALLEAKEEIHKLRN-------------------------EFEKELRERRNELQKLEKRLLQKEENLDR 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 447 KTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMK---LKEELDDLRAQKLEMLAEADR 523
Cdd:PRK12704 101 KLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKeilLEKVEEEARHEAAVLIKEIEE 180
|
170
....*....|..
gi 332010722 524 LKVEKAKFEAEW 535
Cdd:PRK12704 181 EAKEEADKKAKE 192
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
194-528 |
1.27e-05 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 49.30 E-value: 1.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 194 AEANRYHRIAERKLKEVESREDdltrRLASFKSECETKENEMVIERQTLnERRKSLQQEHERLlDAQVSLNQREDHIFAR 273
Cdd:TIGR02169 166 AEFDRKKEKALEELEEVEENIE----RLDLIIDEKRQQLERLRREREKA-ERYQALLKEKREY-EGYELLKEKEALERQK 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 274 SQELAELEkGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKK-----EQELLVAEEKIASKESELI 348
Cdd:TIGR02169 240 EAIERQLA-SLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEkigelEAEIASLERSIAEKERELE 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 349 QnvLANQEVILRKRKSDVEAELECKSKSVEvEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSF 428
Cdd:TIGR02169 319 D--AEERLAKLEAEIDKLLAEIEELEREIE-EERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLE 395
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 429 NLDEK----EKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLK 504
Cdd:TIGR02169 396 KLKREinelKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLK 475
|
330 340
....*....|....*....|....
gi 332010722 505 EELDDLRAQKLEMLAEADRLKVEK 528
Cdd:TIGR02169 476 EEYDRVEKELSKLQRELAEAEAQA 499
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
173-574 |
3.87e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.45 E-value: 3.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 173 EDALKKLADAEAKMRAAEALQAEanryHRIAERKLKEVESREDDLTRRLASFKSECETKEN-----EMVIERQTLNERRK 247
Cdd:COG4717 67 ELNLKELKELEEELKEAEEKEEE----YAELQEELEELEEELEELEAELEELREELEKLEKllqllPLYQELEALEAELA 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 248 SLQQEHERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKTTF-EEERKAFEDKKSNLEIALALCAKREEAVSERESSL 326
Cdd:COG4717 143 ELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLsLATEEELQDLAEELEELQQRLAELEEELEEAQEEL 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 327 LKKEQELLVAEEKIASKESE--------------------------------------LIQNVLANQEVILRKRKSDVEA 368
Cdd:COG4717 223 EELEEELEQLENELEAAALEerlkearlllliaaallallglggsllsliltiagvlfLVLGLLALLFLLLAREKASLGK 302
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 369 ---ELECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKN------LVA 439
Cdd:COG4717 303 eaeELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAallaeaGVE 382
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 440 TEEDINRKTTMLEDEKERLRKLDlELQQSLTSLEDKRkRVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLA 519
Cdd:COG4717 383 DEEELRAALEQAEEYQELKEELE-ELEEQLEELLGEL-EELLEALDEEELEEELEELEEELEELEEELEELREELAELEA 460
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 332010722 520 EADRLKVEKAKFEAEwEHIDVKREELRKEAEYITRQREAFSMyLKDERDNIKEER 574
Cdd:COG4717 461 ELEQLEEDGELAELL-QELEELKAELRELAEEWAALKLALEL-LEEAREEYREER 513
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
167-746 |
5.80e-05 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 47.27 E-value: 5.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 167 EAHVMIEDALKKLADAEAKMRAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQtlnERR 246
Cdd:TIGR00618 311 RIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTL---QQQ 387
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 247 KSLQQEHERLLDAQVSLNQREDH-IFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALAlCAKREEAVSERESS 325
Cdd:TIGR00618 388 KTTLTQKLQSLCKELDILQREQAtIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQ-CEKLEKIHLQESAQ 466
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 326 LLKKEQELLVAEEKIASKESELIQNVLANQEVIlrkrksdveAELECKSKSVEVEIESKRRAWELREVDikQREDLVGEK 405
Cdd:TIGR00618 467 SLKEREQQLQTKEQIHLQETRKKAVVLARLLEL---------QEEPCPLCGSCIHPNPARQDIDNPGPL--TRRMQRGEQ 535
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 406 EHDLEVQSRALAEKEKD-ITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQ 484
Cdd:TIGR00618 536 TYAQLETSEEDVYHQLTsERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQH 615
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 485 KLEALKSEtsELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAFSMYLK 564
Cdd:TIGR00618 616 ALLRKLQP--EQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLT 693
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 565 DERDNIKEERDALRNQHKNdVESLNREREEF----------MNKMVEEHSEWLSKIQRERADFLlgiemQKRELEYCIEN 634
Cdd:TIGR00618 694 YWKEMLAQCQTLLRELETH-IEEYDREFNEIenassslgsdLAAREDALNQSLKELMHQARTVL-----KARTEAHFNNN 767
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 635 KREELENSSRDREKAFEQE----KKLEEERIQSLKEMAEKELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEEL 710
Cdd:TIGR00618 768 EEVTAALQTGAELSHLAAEiqffNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEI 847
|
570 580 590
....*....|....*....|....*....|....*.
gi 332010722 711 KVQREKLETQRHMLRAERDEIRHEIEELKKLENLKV 746
Cdd:TIGR00618 848 THQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQ 883
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
206-782 |
9.82e-05 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 46.55 E-value: 9.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 206 KLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQELAELEKgld 285
Cdd:TIGR04523 90 KLKKNKDKINKLNSDLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKK--- 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 286 tAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKEQELLVAEEKIAskESELIQNVLANQEVILRKRKSD 365
Cdd:TIGR04523 167 -QKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQIS--ELKKQNNQLKDNIEKKQQEINE 243
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 366 VEAELECKSKSVEVEIESKRRA---WELREVDIKQREDLVGEKEHDL-EVQSRALA---EKEKDITEK-SFNLDEKEKNL 437
Cdd:TIGR04523 244 KTTEISNTQTQLNQLKDEQNKIkkqLSEKQKELEQNNKKIKELEKQLnQLKSEISDlnnQKEQDWNKElKSELKNQEKKL 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 438 VATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELstlemklKEELDDLRAQKLEM 517
Cdd:TIGR04523 324 EEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSY-------KQEIKNLESQINDL 396
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 518 LAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAfsmyLKDERDNIKEERDALR---NQHKNDVESLNREREE 594
Cdd:TIGR04523 397 ESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIK----NNSEIKDLTNQDSVKEliiKNLDNTRESLETQLKV 472
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 595 FMN--KMVEEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREELEnSSRDREKAFEQEKKLEEERIQSLkemaEKEL 672
Cdd:TIGR04523 473 LSRsiNKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKIS-SLKEKIEKLESEKKEKESKISDL----EDEL 547
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 673 EHVQVELKRldaERLEIKLDRERREREwaELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKKLenlkvalddMS 752
Cdd:TIGR04523 548 NKDDFELKK---ENLEKEIDEKNKEIE--ELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKK---------IS 613
|
570 580 590
....*....|....*....|....*....|
gi 332010722 753 MAKMQLSNLERSWEKVSALKQKVVSRDDEL 782
Cdd:TIGR04523 614 SLEKELEKAKKENEKLSSIIKNIKSKKNKL 643
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
204-764 |
1.00e-04 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 46.65 E-value: 1.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 204 ERKLKEVESREDDLTRRLASFKS--------------ECETKENEMVIERQTLNERRKSLQQEHERL----------LDA 259
Cdd:pfam15921 77 ERVLEEYSHQVKDLQRRLNESNElhekqkfylrqsviDLQTKLQEMQMERDAMADIRRRESQSQEDLrnqlqntvheLEA 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 260 QVSLnqREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKS-----------------------------NLEIALA 310
Cdd:pfam15921 157 AKCL--KEDMLEDSNTQIEQLRKMMLSHEGVLQEIRSILVDFEEasgkkiyehdsmstmhfrslgsaiskilrELDTEIS 234
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 311 LCAKREEAVSERESSLLKKEQ---ELLVA------EEKIASKESELI------------QNVLANQEVILRKRKSDVEAE 369
Cdd:pfam15921 235 YLKGRIFPVEDQLEALKSESQnkiELLLQqhqdriEQLISEHEVEITgltekassarsqANSIQSQLEIIQEQARNQNSM 314
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 370 LECKSKSVEVEIESKRRawELREVDiKQREDLVGEKEHDLEVQSRALAE--KEKD-ITEKSFNLDEKEKNLVAteeDINR 446
Cdd:pfam15921 315 YMRQLSDLESTVSQLRS--ELREAK-RMYEDKIEELEKQLVLANSELTEarTERDqFSQESGNLDDQLQKLLA---DLHK 388
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 447 KTTMLEDEKERLRKL-DLELQQSLT------SLEDKRKRVDSATQKLEALKSE-------------------------TS 494
Cdd:pfam15921 389 REKELSLEKEQNKRLwDRDTGNSITidhlrrELDDRNMEVQRLEALLKAMKSEcqgqmerqmaaiqgkneslekvsslTA 468
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 495 ELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAFSmYLKDERD---NIK 571
Cdd:pfam15921 469 QLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQ-HLKNEGDhlrNVQ 547
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 572 EERDALRNQ--HKNDVESLNREREEFMNKMVEEHSEWLSKIQRERAdfllgiemqkreleycienkreELENSSRDREKA 649
Cdd:pfam15921 548 TECEALKLQmaEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA----------------------QLEKEINDRRLE 605
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 650 FEQEKKLEEERIQSLKEMAEK--ELEHVQVELKRLDAERLEIKLD-RERREREWAELKDSVEELKVQREKLETQRHMLRA 726
Cdd:pfam15921 606 LQEFKILKDKKDAKIRELEARvsDLELEKVKLVNAGSERLRAVKDiKQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRN 685
|
650 660 670
....*....|....*....|....*....|....*...
gi 332010722 727 ERDEIRHEIEELKKleNLKVALDDMSMAKMQLSNLERS 764
Cdd:pfam15921 686 KSEEMETTTNKLKM--QLKSAQSELEQTRNTLKSMEGS 721
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
271-497 |
1.20e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 45.53 E-value: 1.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 271 FARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKEQELLVAEEKIASKESEL--- 347
Cdd:COG4942 16 AAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIael 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 348 ------IQNVLANQEVILRKRKSDVEAELECKSKSVEveiESKRRAWELREVdIKQREDLVGEKEHDLEVQSRALAEKEK 421
Cdd:COG4942 96 raeleaQKEELAELLRALYRLGRQPPLALLLSPEDFL---DAVRRLQYLKYL-APARREQAEELRADLAELAALRAELEA 171
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 332010722 422 DITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRkldlELQQSLTSLEDKRKRVDSATQKLEALKSETSELS 497
Cdd:COG4942 172 ERAELEALLAELEEERAALEALKAERQKLLARLEKELA----ELAAELAELQQEAEELEALIARLEAEAAAAAERT 243
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
542-783 |
1.27e-04 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 46.20 E-value: 1.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 542 REELRKEAEYITRQREAFSMY--LKDERDNIK--------EERDALRNQHKNDVESLNREREEFMNKMVEEHSEW----- 606
Cdd:TIGR02168 195 LNELERQLKSLERQAEKAERYkeLKAELRELElallvlrlEELREELEELQEELKEAEEELEELTAELQELEEKLeelrl 274
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 607 ----LSKIQRERADFLLGIEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQslkemAEKELEHVQVELKRL 682
Cdd:TIGR02168 275 evseLEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE-----LAEELAELEEKLEEL 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 683 DAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELK-KLENLKVALDdmsmaKMQLSNL 761
Cdd:TIGR02168 350 KEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEaRLERLEDRRE-----RLQQEIE 424
|
250 260
....*....|....*....|..
gi 332010722 762 ERSWEKVSALKQKVVSRDDELD 783
Cdd:TIGR02168 425 ELLKKLEEAELKELQAELEELE 446
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
68-720 |
1.35e-04 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 45.88 E-value: 1.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 68 KLESEVYDYQHNMGLLLLEKNELSSQYeeikASVDESDLTHMREKSAYVSALAEAKKRE-----ESLKKDVGIAKECISS 142
Cdd:pfam15921 246 QLEALKSESQNKIELLLQQHQDRIEQL----ISEHEVEITGLTEKASSARSQANSIQSQleiiqEQARNQNSMYMRQLSD 321
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 143 LEKTLHEMRAEcaetkvsagstMSEAHVMIEDALKKLaDAEAKMRAAEALQAEANRyhriaERKLKEVESREDDLTRRLA 222
Cdd:pfam15921 322 LESTVSQLRSE-----------LREAKRMYEDKIEEL-EKQLVLANSELTEARTER-----DQFSQESGNLDDQLQKLLA 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 223 sfksECETKENEMVIERqtlnERRKSLQqehERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKT----TFEEERKAF 298
Cdd:pfam15921 385 ----DLHKREKELSLEK----EQNKRLW---DRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSecqgQMERQMAAI 453
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 299 EDKKSNLEIALALCAKREEAvseressllkKEQELLVAEEKIASKESeliqnvLANQEvilrKRKSDVEAELECKSKSVE 378
Cdd:pfam15921 454 QGKNESLEKVSSLTAQLEST----------KEMLRKVVEELTAKKMT------LESSE----RTVSDLTASLQEKERAIE 513
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 379 VEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRA------LAEKEKDITeksfNLDEKEKNLVATEEDINRKTTMLE 452
Cdd:pfam15921 514 ATNAEITKLRSRVDLKLQELQHLKNEGDHLRNVQTECealklqMAEKDKVIE----ILRQQIENMTQLVGQHGRTAGAMQ 589
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 453 DEKERLRKldlELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKL----KEELDDLRAQKLEMLAEADRLKVEK 528
Cdd:pfam15921 590 VEKAQLEK---EINDRRLELQEFKILKDKKDAKIRELEARVSDLELEKVKLvnagSERLRAVKDIKQERDQLLNEVKTSR 666
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 529 AKFEAEWEHIDVKREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREefMNKMVEEHSEWLS 608
Cdd:pfam15921 667 NELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAMG--MQKQITAKRGQID 744
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 609 KIQREradfllgiemqkreleycIENKREELENSSRDREKAFEQEKKLEEE--RIQSLKEMAEKELEHVQVELKRLDAER 686
Cdd:pfam15921 745 ALQSK------------------IQFLEEAMTNANKEKHFLKEEKNKLSQElsTVATEKNKMAGELEVLRSQERRLKEKV 806
|
650 660 670
....*....|....*....|....*....|....
gi 332010722 687 LEIKLDRERREREWAELKDSVEELKVQREKLETQ 720
Cdd:pfam15921 807 ANMEVALDKASLQFAECQDIIQRQEQESVRLKLQ 840
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
539-750 |
1.44e-04 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 45.69 E-value: 1.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 539 DVKREELRKEAEYITRQREAFSmYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKI-------- 610
Cdd:PRK05771 39 ELSNERLRKLRSLLTKLSEALD-KLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEIseleneik 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 611 ----QRERADFL--LGIEMQK-RELEYCI----ENKREELENSSRDREKAFEQEKKLEEER----IQSLKEMAEKELEhv 675
Cdd:PRK05771 118 eleqEIERLEPWgnFDLDLSLlLGFKYVSvfvgTVPEDKLEELKLESDVENVEYISTDKGYvyvvVVVLKELSDEVEE-- 195
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 332010722 676 qvELKRLDAERLEIKldrerrerewaELKDSVEELKVQREKLETqrhmLRAERDEIRHEIEEL-KKLENLKVALDD 750
Cdd:PRK05771 196 --ELKKLGFERLELE-----------EEGTPSELIREIKEELEE----IEKERESLLEELKELaKKYLEELLALYE 254
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
201-495 |
1.59e-04 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 45.83 E-value: 1.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 201 RIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHE--------RLLDAQVSLNQREDHIFA 272
Cdd:TIGR02169 233 EALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEeeqlrvkeKIGELEAEIASLERSIAE 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 273 RSQELAELEKGLDTAkttfEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKEQELLVAEEKIASKESELIQNVL 352
Cdd:TIGR02169 313 KERELEDAEERLAKL----EAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELK 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 353 AnqeviLRKRKSDVEAELEcksksvEVEIESKRRAWELREVDIKQREdlvgeKEHDLEVQSRALAEKEKDITEKSFNLDE 432
Cdd:TIGR02169 389 D-----YREKLEKLKREIN------ELKRELDRLQEELQRLSEELAD-----LNAAIAGIEAKINELEEEKEDKALEIKK 452
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 332010722 433 KEKNLVATEEDinrkttmLEDEKERLRKLdlelQQSLTSLEDKRKRVDSATQKLEALKSETSE 495
Cdd:TIGR02169 453 QEWKLEQLAAD-------LSKYEQELYDL----KEEYDRVEKELSKLQRELAEAEAQARASEE 504
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
172-387 |
2.10e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 44.75 E-value: 2.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 172 IEDALKKLADAEAKMRAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQ 251
Cdd:COG4942 22 AAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEA 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 252 EH----ERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLL 327
Cdd:COG4942 102 QKeelaELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLA 181
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 332010722 328 KKEQELLVAEEKIASKESEL--IQNVLANQEVILRKRKSDvEAELECKSKSVEVEIESKRRA 387
Cdd:COG4942 182 ELEEERAALEALKAERQKLLarLEKELAELAAELAELQQE-AEELEALIARLEAEAAAAAER 242
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
61-653 |
3.11e-04 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 45.07 E-value: 3.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 61 ALIAYIAKLESEVYDYQhnmGLLLLEKNELSSQYEEIKASVDESDLTHMR----EKSA----YVSALAEAKKREESLKKD 132
Cdd:COG5022 814 SYLACIIKLQKTIKREK---KLRETEEVEFSLKAEVLIQKFGRSLKAKKRfsllKKETiylqSAQRVELAERQLQELKID 890
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 133 VgiakECISSLEKTLHEMRAECAETKVSAGSTMSEAHVMIEDALKKLADAEAKMRAAEALQAEANRYHRI-----AERKL 207
Cdd:COG5022 891 V----KSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELnklheVESKL 966
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 208 KEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQ--QEHERLLDAQVSLNQR----------EDHIFARSQ 275
Cdd:COG5022 967 KETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGalQESTKQLKELPVEVAElqsaskiissESTELSILK 1046
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 276 ELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKRE-------------EAVSERESSLLKKEQELLVAEE---K 339
Cdd:COG5022 1047 PLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQLEstenllktinvkdLEVTNRNLVKPANVLQFIVAQMiklN 1126
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 340 IASKESELIQNVLANQEVILRKRKSDVEAELECKSKSVEVEIESKRRAWELREVDIKQrEDLVGEKEHDLEVQSRALAEK 419
Cdd:COG5022 1127 LLQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQ-SALYDEKSKLSSSEVNDLKNE 1205
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 420 EKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTL 499
Cdd:COG5022 1206 LIALFSKIFSGWPRGDKLKKLISEGWVPTEYSTSLKGFNNLNKKFDTPASMSNEKLLSLLNSIDNLLSSYKLEEEVLPAT 1285
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 500 EMKLKEELDDLRAQKLEMLAEADRLKV-----EKAKFEAEWEH---IDVKREELR--KEAEYITRQREAFSMYLKDERDN 569
Cdd:COG5022 1286 INSLLQYINVGLFNALRTKASSLRWKSatevnYNSEELDDWCRefeISDVDEELEelIQAVKVLQLLKDDLNKLDELLDA 1365
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 570 IKEERDALRNQHKNDVESLNRE---REEFMNKMVEehsewLSKIQRERADFLLGIEMQKRELEYCIENKRE-ELENSSRD 645
Cdd:COG5022 1366 CYSLNPAEIQNLKSRYDPADKEnnlPKEILKKIEA-----LLIKQELQLSLEGKDETEVHLSEIFSEEKSLiSLDRNSIY 1440
|
....*...
gi 332010722 646 REKAFEQE 653
Cdd:COG5022 1441 KEEVLSSL 1448
|
|
| ClpA |
COG0542 |
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein ... |
321-467 |
3.50e-04 |
|
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 440308 [Multi-domain] Cd Length: 836 Bit Score: 44.69 E-value: 3.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 321 ERESSLLKKEQELLVAEEKIASKEseliqnvlanqevilrkRKSDVEAELEckskSVEVEIESKRRAWELRevdiKQRED 400
Cdd:COG0542 417 ERRLEQLEIEKEALKKEQDEASFE-----------------RLAELRDELA----ELEEELEALKARWEAE----KELIE 471
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 332010722 401 LVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVA--TEEDI----NRKT-----TMLEDEKERLRKLDLELQQ 467
Cdd:COG0542 472 EIQELKEELEQRYGKIPELEKELAELEEELAELAPLLREevTEEDIaevvSRWTgipvgKLLEGEREKLLNLEEELHE 549
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
590-783 |
3.58e-04 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 44.67 E-value: 3.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 590 REREEFMNKMVEEHSEWLSKIQRERADFLLGIEMQKRELEY-------CIENKREELENSSRDREKAFEQEKKLEEERIQ 662
Cdd:TIGR02169 183 EENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYegyellkEKEALERQKEAIERQLASLEEELEKLTEEISE 262
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 663 SLKEMAEKE--LEHVQVELKRLDAER--------LEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIR 732
Cdd:TIGR02169 263 LEKRLEEIEqlLEELNKKIKDLGEEEqlrvkekiGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE 342
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 332010722 733 HEIEELKK-LENLKVALDDMSMAKMQL-SNLERSWEKVSALKQKVVSRDDELD 783
Cdd:TIGR02169 343 REIEEERKrRDKLTEEYAELKEELEDLrAELEEVDKEFAETRDELKDYREKLE 395
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
653-876 |
4.33e-04 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 44.15 E-value: 4.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 653 EKKLEEERIQSLKEMAEKELEHVqVELKR-LDAERLEIKLDRERREREWAELKDSVEE--LKVQRE--KLETQRHMLRAE 727
Cdd:PRK05771 37 KEELSNERLRKLRSLLTKLSEAL-DKLRSyLPKLNPLREEKKKVSVKSLEELIKDVEEelEKIEKEikELEEEISELENE 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 728 RDEIRHEIEELKKLENLKVALDDmsmakmqLSNLERSWEKVSALKQKVVSRDDELDLQNGVSTVSNSEDGYNSSMerqNG 807
Cdd:PRK05771 116 IKELEQEIERLEPWGNFDLDLSL-------LLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVVV---VV 185
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 808 LTPSSATPFSWIKRCTNLIFKTSPEK--STLMHHYEEE-GGVPSEKLKLESS--------RREEKAYTEGLSIAVERLEA 876
Cdd:PRK05771 186 LKELSDEVEEELKKLGFERLELEEEGtpSELIREIKEElEEIEKERESLLEElkelakkyLEELLALYEYLEIELERAEA 265
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
155-375 |
5.28e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 43.60 E-value: 5.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 155 AETKVSAGSTMSEAHVMIEDALKKLADAEAKMRAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKENE 234
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 235 MVIERQTLNERRKSLQQeHERLLDAQVSLNQREDHIFARSQE-LAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCA 313
Cdd:COG4942 99 LEAQKEELAELLRALYR-LGRQPPLALLLSPEDFLDAVRRLQyLKYLAPARREQAEELRADLAELAALRAELEAERAELE 177
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 332010722 314 KREEAVSERESSL--LKKEQELLVA--EEKIASKESELIQnvLANQEVILRKRKSDVEAELECKSK 375
Cdd:COG4942 178 ALLAELEEERAALeaLKAERQKLLArlEKELAELAAELAE--LQQEAEELEALIARLEAEAAAAAE 241
|
|
| PRK00106 |
PRK00106 |
ribonuclease Y; |
310-443 |
7.04e-04 |
|
ribonuclease Y;
Pssm-ID: 178867 [Multi-domain] Cd Length: 535 Bit Score: 43.32 E-value: 7.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 310 ALCAKREEAVSERESSLLKKEQELLVAEEKiASKESELIQNVlANQEVILRKRKSDVEAELECKSKSVEVEIESKRRAWE 389
Cdd:PRK00106 21 LISIKMKSAKEAAELTLLNAEQEAVNLRGK-AERDAEHIKKT-AKRESKALKKELLLEAKEEARKYREEIEQEFKSERQE 98
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 332010722 390 LREVD--IKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEED 443
Cdd:PRK00106 99 LKQIEsrLTERATSLDRKDENLSSKEKTLESKEQSLTDKSKHIDEREEQVEKLEEQ 154
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
412-688 |
8.55e-04 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 43.19 E-value: 8.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 412 QSRALAEKEKDI--TEKSFNLDEKEKnlvATEEDINRKTTMLEDEKERLRKLDLELQQslTSLEDKRKRVDSATQKLEAL 489
Cdd:pfam17380 298 QERLRQEKEEKAreVERRRKLEEAEK---ARQAEMDRQAAIYAEQERMAMERERELER--IRQEERKRELERIRQEEIAM 372
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 490 K-SETSELSTLEMKLKEELDDLRAQklemLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYiTRQREAFSMYLKDERD 568
Cdd:pfam17380 373 EiSRMRELERLQMERQQKNERVRQE----LEAARKVKILEEERQRKIQQQKVEMEQIRAEQEE-ARQREVRRLEEERARE 447
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 569 NIKEERDALRNQHKndVESLNREREEFMNKMVE-EHSEWLSKIQRERADFLLGIEMQKRELEYCIE-NKREELENSSRDR 646
Cdd:pfam17380 448 MERVRLEEQERQQQ--VERLRQQEEERKRKKLElEKEKRDRKRAEEQRRKILEKELEERKQAMIEEeRKRKLLEKEMEER 525
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 332010722 647 EKAF--EQEKKLEEERIQSLKEMAEKELEHVQVELKRLDAERLE 688
Cdd:pfam17380 526 QKAIyeEERRREAEEERRKQQEMEERRRIQEQMRKATEERSRLE 569
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
464-766 |
8.58e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.37 E-value: 8.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 464 ELQQSLTSLEDKRKRVDSATQKLEALksetSELSTLEMKLKEELDDLRAQKlemlAEADRLKVEKAkfEAEWEHIDVKRE 543
Cdd:COG4913 229 ALVEHFDDLERAHEALEDAREQIELL----EPIRELAERYAAARERLAELE----YLRAALRLWFA--QRRLELLEAELE 298
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 544 ELRKEAEYITRQREAfsmyLKDERDNIKEERDALRNQHKN----DVESLNREreefmnkmveehsewlskiqreradfll 619
Cdd:COG4913 299 ELRAELARLEAELER----LEARLDALREELDELEAQIRGnggdRLEQLERE---------------------------- 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 620 giemqkreleycIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEkELEHVQVELKRLDAERLEIKLDRERRERE 699
Cdd:COG4913 347 ------------IERLERELEERERRRARLEALLAALGLPLPASAEEFAA-LRAEAAALLEALEEELEALEEALAEAEAA 413
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332010722 700 WAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEelkklENLKVALDDMSMAK--MQLSNLERSWE 766
Cdd:COG4913 414 LRDLRRELRELEAEIASLERRKSNIPARLLALRDALA-----EALGLDEAELPFVGelIEVRPEEERWR 477
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
454-747 |
9.80e-04 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 43.19 E-value: 9.80e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 454 EKERLRKldlELQQSLTSLEDKRKRVDSATQKLEALKSETS---ELSTLEMKLKEELDdlRAQKLEMLAEADRLKVEKAK 530
Cdd:pfam17380 297 EQERLRQ---EKEEKAREVERRRKLEEAEKARQAEMDRQAAiyaEQERMAMERERELE--RIRQEERKRELERIRQEEIA 371
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 531 FEA----EWEHIDVKR----EELRKEAEYITRQR---EAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKM 599
Cdd:pfam17380 372 MEIsrmrELERLQMERqqknERVRQELEAARKVKileEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERV 451
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 600 VEEHSEWLSKIQRERADfllgiEMQKRELEYCIENKREELENSSRDREKAFEQEKK------LEEERIQSL--KEMAEKE 671
Cdd:pfam17380 452 RLEEQERQQQVERLRQQ-----EEERKRKKLELEKEKRDRKRAEEQRRKILEKELEerkqamIEEERKRKLleKEMEERQ 526
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 332010722 672 LEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEElkvqREKLETqrhmLRAERDEIRHEIEELKKLENLKVA 747
Cdd:pfam17380 527 KAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATEE----RSRLEA----MEREREMMRQIVESEKARAEYEAT 594
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
267-786 |
1.00e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.37 E-value: 1.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 267 EDHIFARSQELAELEKGLDTAKTTFEEERKafedKKSNLEIALALCAKREEAVSERE--------SSLLKKEQELLVAEE 338
Cdd:COG4913 220 EPDTFEAADALVEHFDDLERAHEALEDARE----QIELLEPIRELAERYAAARERLAeleylraaLRLWFAQRRLELLEA 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 339 KIASKESELiqNVLANQEVILRKRKSDVEAELEcksksvevEIESKRRAWELREVDIKQREdlVGEKEHDLEVQSRALAE 418
Cdd:COG4913 296 ELEELRAEL--ARLEAELERLEARLDALREELD--------ELEAQIRGNGGDRLEQLERE--IERLERELEERERRRAR 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 419 KEKDITEKSFNLDEKEKNLVATEEDINRkttMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSt 498
Cdd:COG4913 364 LEALLAALGLPLPASAEEFAALRAEAAA---LLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIP- 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 499 lemklkEELDDLRAQklemLAEADRLKVEKAKFEAEweHIDVKREELRkeaeyitrQREAFSMYLKDERDNI----KEER 574
Cdd:COG4913 440 ------ARLLALRDA----LAEALGLDEAELPFVGE--LIEVRPEEER--------WRGAIERVLGGFALTLlvppEHYA 499
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 575 DALR--NQHKN----DVESLNREREEFMNKMVEEHS-------------EWLSKIQRERADFLlgiemqkreleyCIENk 635
Cdd:COG4913 500 AALRwvNRLHLrgrlVYERVRTGLPDPERPRLDPDSlagkldfkphpfrAWLEAELGRRFDYV------------CVDS- 566
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 636 REELEN----------------------------------SSRDREKAFEQEKKLEEERIQSLKEMAEKelehVQVELKR 681
Cdd:COG4913 567 PEELRRhpraitragqvkgngtrhekddrrrirsryvlgfDNRAKLAALEAELAELEEELAEAEERLEA----LEAELDA 642
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 682 LDAERLEIK--LDRERREREWAELKDSVEELKVQREKLETQRHMLRA---ERDEIRHEIEELKklENLKVALDDMSMAKM 756
Cdd:COG4913 643 LQERREALQrlAEYSWDEIDVASAEREIAELEAELERLDASSDDLAAleeQLEELEAELEELE--EELDELKGEIGRLEK 720
|
570 580 590
....*....|....*....|....*....|
gi 332010722 757 QLSNLERSWEKVSALKQKVVSRDDELDLQN 786
Cdd:COG4913 721 ELEQAEEELDELQDRLEAAEDLARLELRAL 750
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
204-690 |
1.08e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 43.09 E-value: 1.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 204 ERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQvSLNQREDHIfarSQELAELEKG 283
Cdd:TIGR04523 151 EKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLK-KKIQKNKSL---ESQISELKKQ 226
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 284 LDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSERESSLLKKEQELLVAEEKIASKESELiqNVLANQEVILRKRK 363
Cdd:TIGR04523 227 NNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQL--NQLKSEISDLNNQK 304
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 364 SdveaelECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEED 443
Cdd:TIGR04523 305 E------QDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKE 378
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 444 INRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADR 523
Cdd:TIGR04523 379 NQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKN 458
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 524 LKVEKAKFEAEWEHIDVKREELRKEAEYITRQ---REAFSMYLKDERDNIKEERDALRNQH---KNDVESLNREREEFMN 597
Cdd:TIGR04523 459 LDNTRESLETQLKVLSRSINKIKQNLEQKQKElksKEKELKKLNEEKKELEEKVKDLTKKIsslKEKIEKLESEKKEKES 538
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 598 KMVEEHSEWLSKIQRERADFLlgiEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMA--------- 668
Cdd:TIGR04523 539 KISDLEDELNKDDFELKKENL---EKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEekekkissl 615
|
490 500
....*....|....*....|..
gi 332010722 669 EKELEHVQVELKRLDAERLEIK 690
Cdd:TIGR04523 616 EKELEKAKKENEKLSSIIKNIK 637
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
446-661 |
1.42e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 42.45 E-value: 1.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 446 RKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTlemklkeelddlRAQKLEMLAEADRLK 525
Cdd:COG4717 64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELRE------------ELEKLEKLLQLLPLY 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 526 VEKAKFEAEWEHIDVKREELRKEAEYITRQREafsmylkdERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSE 605
Cdd:COG4717 132 QELEALEAELAELPERLEELEERLEELRELEE--------ELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEE 203
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 332010722 606 WLSKIQRERADFllgiemqkRELEYCIENKREELENSSRDREKAFEQEKKLEEERI 661
Cdd:COG4717 204 LQQRLAELEEEL--------EEAQEELEELEEELEQLENELEAAALEERLKEARLL 251
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
201-309 |
1.97e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 42.07 E-value: 1.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 201 RIAERKLKEVESREDDL----TRRLASFKSECETKENEMVIERQT-----LNERRKSLQQEHERLLDAQVSLNQREDHIF 271
Cdd:PRK12704 27 KIAEAKIKEAEEEAKRIleeaKKEAEAIKKEALLEAKEEIHKLRNefekeLRERRNELQKLEKRLLQKEENLDRKLELLE 106
|
90 100 110
....*....|....*....|....*....|....*...
gi 332010722 272 ARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIAL 309
Cdd:PRK12704 107 KREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL 144
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
406-782 |
2.15e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 42.03 E-value: 2.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 406 EHDLEVQSRALAEKEKDITEkSFNLDEKEKnLVATEEDINRKTTMLEDEKERLRKLDLELQQSlTSLEDKRKRVDSATQK 485
Cdd:pfam15921 77 ERVLEEYSHQVKDLQRRLNE-SNELHEKQK-FYLRQSVIDLQTKLQEMQMERDAMADIRRRES-QSQEDLRNQLQNTVHE 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 486 LEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKV--EKAKFEAEWEHIDVKREELRKEAEYIT---RQREAFS 560
Cdd:pfam15921 154 LEAAKCLKEDMLEDSNTQIEQLRKMMLSHEGVLQEIRSILVdfEEASGKKIYEHDSMSTMHFRSLGSAISkilRELDTEI 233
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 561 MYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREELE 640
Cdd:pfam15921 234 SYLKGRIFPVEDQLEALKSESQNKIELLLQQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNS 313
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 641 NSSRDREKAFEQEKKLEEErIQSLKEMAEKELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQ 720
Cdd:pfam15921 314 MYMRQLSDLESTVSQLRSE-LREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLLADLHKREKE 392
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 721 RHMLRAER--------------DEIRHEIE----ELKKLENLKVALDDMSMAKMQ-----LSNLERSWEKVSALKQKVVS 777
Cdd:pfam15921 393 LSLEKEQNkrlwdrdtgnsitiDHLRRELDdrnmEVQRLEALLKAMKSECQGQMErqmaaIQGKNESLEKVSSLTAQLES 472
|
....*
gi 332010722 778 RDDEL 782
Cdd:pfam15921 473 TKEML 477
|
|
| PLN02939 |
PLN02939 |
transferase, transferring glycosyl groups |
432-782 |
2.46e-03 |
|
transferase, transferring glycosyl groups
Pssm-ID: 215507 [Multi-domain] Cd Length: 977 Bit Score: 41.81 E-value: 2.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 432 EKEKNLVATEEDINRKTTMLEDEKERLRKLDL----ELQQSLTSLEDKRKRVDSatQKLEALKSETSELSTlEMKLKEEL 507
Cdd:PLN02939 49 KRGKNIAPKQRSSNSKLQSNTDENGQLENTSLrtvmELPQKSTSSDDDHNRASM--QRDEAIAAIDNEQQT-NSKDGEQL 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 508 DDLRAQKL-EML--AEADRLKVEKAKFEAeWEHIDvkreELRKEAEYITRQREAFSMYLKDERDNIKeerdaLRNQHKND 584
Cdd:PLN02939 126 SDFQLEDLvGMIqnAEKNILLLNQARLQA-LEDLE----KILTEKEALQGKINILEMRLSETDARIK-----LAAQEKIH 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 585 VESLNREREEFMNKMVEEHSEWLSKIQRERADFLLgIEMQKRELEYCIENKREELENSSRDREKAFeqekKLEEERiqSL 664
Cdd:PLN02939 196 VEILEEQLEKLRNELLIRGATEGLCVHSLSKELDV-LKEENMLLKDDIQFLKAELIEVAETEERVF----KLEKER--SL 268
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 665 KEMAEKELEHvqvelKRLDAERLEIKLDRERREREWaelkdsveelkvqrEKLETQRHMLRAERDEIRHEIEEL------ 738
Cdd:PLN02939 269 LDASLRELES-----KFIVAQEDVSKLSPLQYDCWW--------------EKVENLQDLLDRATNQVEKAALVLdqnqdl 329
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 332010722 739 -KKLENLKVALDDMSMAKMQLSNLERSWEKVSALKQKVVSRDDEL 782
Cdd:PLN02939 330 rDKVDKLEASLKEANVSKFSSYKVELLQQKLKLLEERLQASDHEI 374
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
306-785 |
2.59e-03 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 41.88 E-value: 2.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 306 EIALALCAKREEAVSERESSLLKKEQELLVAEEKIASKESELIQNVLANQEVILRKRKSDVEA--ELECKSKSVEVEIES 383
Cdd:pfam02463 144 IEIIAMMKPERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAkkALEYYQLKEKLELEE 223
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 384 KRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDL 463
Cdd:pfam02463 224 EYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELL 303
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 464 ELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLkvekakfeaewehidvKRE 543
Cdd:pfam02463 304 KLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKL----------------QEK 367
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 544 ELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERADFLLGIEM 623
Cdd:pfam02463 368 LEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLT 447
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 624 QKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKELEHVQVELKRLDAERLEIKLDRERREREWAEL 703
Cdd:pfam02463 448 EEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISA 527
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 704 KDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKKLENLKVALDDMSMAKMQLSNLERSWEKVSALKQKVVSRDDELD 783
Cdd:pfam02463 528 HGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQ 607
|
..
gi 332010722 784 LQ 785
Cdd:pfam02463 608 LD 609
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
416-578 |
3.03e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 40.29 E-value: 3.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 416 LAEKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETsE 495
Cdd:COG1579 12 LQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNK-E 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 496 LSTLEM---KLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQreafsmyLKDERDNIKE 572
Cdd:COG1579 91 YEALQKeieSLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAE-------LEAELEELEA 163
|
....*.
gi 332010722 573 ERDALR 578
Cdd:COG1579 164 EREELA 169
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
434-775 |
3.79e-03 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 41.20 E-value: 3.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 434 EKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQ 513
Cdd:PRK03918 164 YKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEEL 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 514 KLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQRE------------AFSMYLKDERDNIKEERDALRNQH 581
Cdd:PRK03918 244 EKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKElkekaeeyiklsEFYEEYLDELREIEKRLSRLEEEI 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 582 KN------DVESLNREREEFMNKMVE------EHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREELENSSRDREKA 649
Cdd:PRK03918 324 NGieerikELEEKEERLEELKKKLKElekrleELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEI 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 650 FEQEKKLEEERIQSLKEMAEKELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERD 729
Cdd:PRK03918 404 EEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELR 483
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 332010722 730 EIRHEIEELKKLENLKVALDDM-----SMAKMQLSNLERSWEKVSALKQKV 775
Cdd:PRK03918 484 ELEKVLKKESELIKLKELAEQLkeleeKLKKYNLEELEKKAEEYEKLKEKL 534
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
90-293 |
4.39e-03 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 41.16 E-value: 4.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 90 LSSQYEEIKASVDESD--LTHMREKSAYVSALAEAK---KREESLKKDVGIAKECISSLEKTLHEMRAECAETKVSAGST 164
Cdd:COG3206 180 LEEQLPELRKELEEAEaaLEEFRQKNGLVDLSEEAKlllQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPEL 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 165 MSEAhvMIEDALKKLADAEAKMRAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASfksecetkenEMVIERQTLNE 244
Cdd:COG3206 260 LQSP--VIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILA----------SLEAELEALQA 327
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 332010722 245 RRKSLQQEHERLLDAQVSLNQREdhifarsQELAELEKGLDTAKTTFEE 293
Cdd:COG3206 328 REASLQAQLAQLEARLAELPELE-------AELRRLEREVEVARELYES 369
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
245-658 |
4.58e-03 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 41.22 E-value: 4.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 245 RRKSLQQEHERLLDAQVSLNQREDHIFARSQE--LAELEKGLDTAKTTFEEERKAFEDK-KSNLEIA------LALCAKR 315
Cdd:COG5022 814 SYLACIIKLQKTIKREKKLRETEEVEFSLKAEvlIQKFGRSLKAKKRFSLLKKETIYLQsAQRVELAerqlqeLKIDVKS 893
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 316 EEAVSERESSLLKKEQELLVAEEKIASKESELIQNVLANQEVILRKRKSDVEAELECKSKSVEVEIESKRRawELREVdI 395
Cdd:COG5022 894 ISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVES--KLKET-S 970
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 396 KQREDLVgEKEHDLEVQS-----------RALAEKEKD---ITEKSFNLDEKEKNLVATEEDINRKTTM------LEDEK 455
Cdd:COG5022 971 EEYEDLL-KKSTILVREGnkanselknfkKELAELSKQygaLQESTKQLKELPVEVAELQSASKIISSEstelsiLKPLQ 1049
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 456 ERLRKLDLELQQ---SLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKlKEELDDLRAQKLEMLAEADRLKVEKAKFE 532
Cdd:COG5022 1050 KLKGLLLLENNQlqaRYKALKLRRENSLLDDKQLYQLESTENLLKTINVK-DLEVTNRNLVKPANVLQFIVAQMIKLNLL 1128
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 533 AE-----------WEHIDVKREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVE 601
Cdd:COG5022 1129 QEiskflsqlvntLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKSKLSSSEVNDLKNELIA 1208
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 332010722 602 EHSewlskiqRERADFLLGIEMQK--RELEYCIENKREELENSSRDREKAFEQEKKLEE 658
Cdd:COG5022 1209 LFS-------KIFSGWPRGDKLKKliSEGWVPTEYSTSLKGFNNLNKKFDTPASMSNEK 1260
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
470-673 |
5.00e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 40.58 E-value: 5.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 470 TSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEwehIDVKREELRKEA 549
Cdd:COG3883 16 PQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAE---IEERREELGERA 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 550 EYITRQREAFSMY-----------LKDERDNIkeerDALRNQHKNDVESLNREREEFMNKMVEehsewLSKIQRERADFL 618
Cdd:COG3883 93 RALYRSGGSVSYLdvllgsesfsdFLDRLSAL----SKIADADADLLEELKADKAELEAKKAE-----LEAKLAELEALK 163
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 332010722 619 LGIEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKELE 673
Cdd:COG3883 164 AELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAA 218
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
628-744 |
5.92e-03 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 40.61 E-value: 5.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 628 LEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKELEHVQV---ELKRLDA--ERLEIKLDRERREREWAE 702
Cdd:COG2433 382 LEELIEKELPEEEPEAEREKEHEERELTEEEEEIRRLEEQVERLEAEVEEleaELEEKDEriERLERELSEARSEERREI 461
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 332010722 703 LKDS-VEELKVQREKLETQRHMLRAERDEIRHEIEELKKLENL 744
Cdd:COG2433 462 RKDReISRLDREIERLERELEEERERIEELKRKLERLKELWKL 504
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
502-740 |
6.20e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 40.13 E-value: 6.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 502 KLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAfsmyLKDERDNIKEERDALRNQH 581
Cdd:COG4942 24 EAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAA----LEAELAELEKEIAELRAEL 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 582 KNDVESL-NREREEFMNKMVEEHSEWLSkiQRERADFLLGIEMQKRELEYcienkreelenssrDREKAFEQEKKLEEer 660
Cdd:COG4942 100 EAQKEELaELLRALYRLGRQPPLALLLS--PEDFLDAVRRLQYLKYLAPA--------------RREQAEELRADLAE-- 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 661 IQSLKEMAEKELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKK 740
Cdd:COG4942 162 LAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
225-742 |
7.17e-03 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 40.27 E-value: 7.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 225 KSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSN 304
Cdd:PRK01156 189 EEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAESDLSME 268
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 305 LEIALALCAKREEAVSERESSLLKKEQELL--VAEEKIASKESELIQNVLA-------------------NQEVILRKRK 363
Cdd:PRK01156 269 LEKNNYYKELEERHMKIINDPVYKNRNYINdyFKYKNDIENKKQILSNIDAeinkyhaiikklsvlqkdyNDYIKKKSRY 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 364 SDVEAEL------ECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAE-------KEKDITEKSFNL 430
Cdd:PRK01156 349 DDLNNQIlelegyEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKelneinvKLQDISSKVSSL 428
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 431 DEKEKNLVATEEDINRKTTMLEDE-KERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEELDD 509
Cdd:PRK01156 429 NQRIRALRENLDELSRNMEMLNGQsVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEY 508
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 510 LRAQKLEMLAEADRLKVEKakfEAEWEHIDVKREELR-KEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESl 588
Cdd:PRK01156 509 LESEEINKSINEYNKIESA---RADLEDIKIKINELKdKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIET- 584
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 589 NREREEFMNKMVEEHSEWLSKIQRERADFLLGIEMQKRELE---YCIENKREELENSSRDREKAFEQEKKLEEEriqslk 665
Cdd:PRK01156 585 NRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIEneaNNLNNKYNEIQENKILIEKLRGKIDNYKKQ------ 658
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 332010722 666 emaEKELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRHEIEELKKLE 742
Cdd:PRK01156 659 ---IAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIK 732
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
637-806 |
8.67e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 40.14 E-value: 8.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 637 EELENSSRDREKAFEQEKKLEEERIQslKEMAEKELEHVQVELKRLDAE--RLEIKLDRERREREWAELKDSVEELKVQR 714
Cdd:COG4717 71 KELKELEEELKEAEEKEEEYAELQEE--LEELEEELEELEAELEELREEleKLEKLLQLLPLYQELEALEAELAELPERL 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 715 EKLETQRHMLRAERDEIRHEIEELKKLEN-LKVALDDMSMAKmqLSNLERSWEKVSALKQKVVSRDDELD-LQNGVSTVS 792
Cdd:COG4717 149 EELEERLEELRELEEELEELEAELAELQEeLEELLEQLSLAT--EEELQDLAEELEELQQRLAELEEELEeAQEELEELE 226
|
170
....*....|....
gi 332010722 793 NSEDGYNSSMERQN 806
Cdd:COG4717 227 EELEQLENELEAAA 240
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
172-381 |
9.25e-03 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 39.91 E-value: 9.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 172 IEDALKKLADAeakMRAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQ 251
Cdd:PRK05771 45 LRKLRSLLTKL---SEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQ 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332010722 252 EHERL-----LDAQVSLNQREDHIFARSQELAelEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEAVSEREssl 326
Cdd:PRK05771 122 EIERLepwgnFDLDLSLLLGFKYVSVFVGTVP--EDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVEEE--- 196
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 332010722 327 LKKE--QELLVAEEKIAskeSELIQNvLANQEVILRKRKSDVEAELECKSKSVEVEI 381
Cdd:PRK05771 197 LKKLgfERLELEEEGTP---SELIRE-IKEELEEIEKERESLLEELKELAKKYLEEL 249
|
|
|