Conserved domains on  [gi|2217375254|ref|XP_047278635|]

centriolin isoform X12 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
PPP1R42 super family cl42388
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
89-261 2.41e-23

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


The actual alignment was detected with superfamily member cd21340:

Pssm-ID: 455733 [Multi-domain]  Cd Length: 220  Bit Score: 100.63  E-value: 2.41e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   89 KKLTKQDNLALIKSL-------NlslskdggkKFKYIENLEKCVKLEVLNLSYNLIGKIEKLDKLLKLRELNLSYNKISK 161
Cdd:cd21340     12 KNITKIDNLSLCKNLkvlylydN---------KITKIENLEFLTNLTHLYLQNNQIEKIENLENLVNLKKLYLGGNRISV 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  162 IEGIENMCNLQKLNLAGNEIEhipvwLGKKL-----------KSLRVLNLKGNKISSLQDISKLKPLQDLI--------- 221
Cdd:cd21340     83 VEGLENLTNLEELHIENQRLP-----PGEKLtfdprslaalsNSLRVLNISGNNIDSLEPLAPLRNLEQLDasnnqisdl 157
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2217375254  222 --------------SLILVENPVVTLPHYLQFTIFHLRSLESLEGQPVTTQDRQ 261
Cdd:cd21340    158 eelldllsswpslrELDLTGNPVCKKPKYRDKIILASKSLEVLDGKEITDTERQ 211
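The share of identical residues in an alignment block like the one above can be estimated with a short script. This is a minimal Python sketch and not part of the CD-Search output. It assumes that uppercase letters mark aligned residues, while lowercase letters and dashes mark unaligned residues and gaps. The function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for one pair of alignment rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, s in zip(query_row, subject_row):
        # Assumption: a column is aligned only if both residues are uppercase letters.
        if q.isalpha() and s.isalpha() and q.isupper() and s.isupper():
            aligned += 1
            identical += q == s
    return identical, aligned


# Last row pair of the cd21340 alignment (query residues 222-261).
query = "--------------SLILVENPVVTLPHYLQFTIFHLRSLESLEGQPVTTQDRQ"
subject = "eelldllsswpslrELDLTGNPVCKKPKYRDKIILASKSLEVLDGKEITDTERQ"
identical, aligned = aligned_identity(query, subject)
print(f"{identical}/{aligned} aligned columns identical")
```

To score a whole hit, concatenate the sequence portions of all query rows and all Cdd rows before calling the function.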
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1287-2161 3.76e-23

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 108.22  E-value: 3.76e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1287 RASKRQSEKEMEELHHN---IDDLLQEkksLECEVEELHRTVQKRQQQKDfidgnVESLMTELEIEKSLKHHEDIVDEIE 1363
Cdd:TIGR02168  171 KERRKETERKLERTRENldrLEDILNE---LERQLKSLERQAEKAERYKE-----LKAELRELELALLVLRLEELREELE 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1364 CIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQ 1443
Cdd:TIGR02168  243 ELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELE 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1444 ADAKDLEQHKIKQEEILKEINKivaakdsDFQCLSKKKEKLTEELQKLQKDIEMAERNedhhlqvLKESEVLLQAKRAEL 1523
Cdd:TIGR02168  323 AQLEELESKLDELAEELAELEE-------KLEELKEELESLEAELEELEAELEELESR-------LEELEEQLETLRSKV 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1524 EKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEA-LRLGETEVTEKCNHIREVKSLLEELSFQKGELNV 1602
Cdd:TIGR02168  389 AQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAeLKELQAELEELEEELEELQEELERLEEALEELRE 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1603 QISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQlENHELQGLKLQHDQRVSELEKTQVAVLEEKLELE 1682
Cdd:TIGR02168  469 ELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLK-NQSGLSGILGVLSELISVDEGYEAAIEAALGGRL 547
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1683 nlQQI----SQQQKGEIEWQKQ----------LLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTK 1748
Cdd:TIGR02168  548 --QAVvvenLNAAKKAIAFLKQnelgrvtflpLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGV 625
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1749 RV---LAAAEENSKMEQSNL--------------------EKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKRE 1805
Cdd:TIGR02168  626 LVvddLDNALELAKKLRPGYrivtldgdlvrpggvitggsAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK 705
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1806 AVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLV 1885
Cdd:TIGR02168  706 ELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE 785
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1886 QQEMMFQRLQKERESEESKLEtskvtlkeqqhQLEKELTDQKSKLDQVLSKvlaaeervrtlqeeerwCESLEKTLSQTK 1965
Cdd:TIGR02168  786 ELEAQIEQLKEELKALREALD-----------ELRAELTLLNEEAANLRER-----------------LESLERRIAATE 837
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1966 RQLSEREQQLVEKSGELLALQKEadsmRADFSLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKnlltinedserdssl 2045
Cdd:TIGR02168  838 RRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEE--------------- 898
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2046 lskeqkqenscIQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMANQKD--LERRQMEISDAMRTLKSEVKD- 2122
Cdd:TIGR02168  899 -----------LSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQErlSEEYSLTLEEAEALENKIEDDe 967
                          890       900       910       920
                   ....*....|....*....|....*....|....*....|....*..
gi 2217375254 2123 -EIRTSLKNLNQFLPEL-PADLEAILE------RNENLEGELESLKE 2161
Cdd:TIGR02168  968 eEARRRLKRLENKIKELgPVNLAAIEEyeelkeRYDFLTAQKEDLTE 1014
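Each hit carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following Python sketch shows one way to turn such a line into typed fields for downstream filtering. The regular expression and the name `parse_summary` are illustrative assumptions based only on the lines shown on this page, not an NCBI-provided parser.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption, not a spec).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<flag>[^\]]+)\]\s*)?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>\S+)"
)


def parse_summary(line: str) -> dict:
    """Parse one 'Pssm-ID ...' summary line into a dictionary of typed values."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "flag": m["flag"],  # e.g. "Multi-domain"; None if the bracket is absent
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "evalue": float(m["evalue"]),
    }


summary = parse_summary(
    "Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 108.22  E-value: 3.76e-23"
)
print(summary)
```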
Smc super family cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
435-995 2.09e-19

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 95.77  E-value: 2.09e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAgkdllykQLSGRLQLVNKLRQEALD 514
Cdd:COG1196    244 LEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELA-------RLEQDIARLEERRRELEE 316
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQKQEIAGKQKEIKDLQIAIdsldskdpkhshmKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKD 594
Cdd:COG1196    317 RLEELEEELAELEEELEELEEELEEL-------------EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  595 LEEQLTEGQIAANEALKKDLEgvisglqeylgtIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAMDAENMR 674
Cdd:COG1196    384 LAEELLEALRAAAELAAQLEE------------LEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEE 451
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  675 KLEQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNL--------LKQQLKDFQNHLNHVVDGLVR 746
Cdd:COG1196    452 AELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADyegflegvKAALLLAGLRGLAGAVAVLIG 531
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  747 PEEVAARVDELRrklkLGTGEMNIHSPSDVLGKSLADLQKQ-------FSEILARSKWERDEAQVRERKLQEEMALQQEK 819
Cdd:COG1196    532 VEAAYEAALEAA----LAAALQNIVVEDDEVAAAAIEYLKAakagratFLPLDKIRARAALAAALARGAIGAAVDLVASD 607
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  820 LATGQEEFRQACERALEARMnfDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRELEKKK 899
Cdd:COG1196    608 LREADARYYVLGDTLLGRTL--VAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELA 685
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  900 KLEDAKSQEQVFGLDKELKKLKKAVATSDKLATAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAArDLTR 979
Cdd:COG1196    686 ERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPP-DLEE 764
                          570
                   ....*....|....*.
gi 2217375254  980 AEAEIELLQNLLRQKG 995
Cdd:COG1196    765 LERELERLEREIEALG 780
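Every alignment row above has the layout "label, start position, sequence, end position". As a sanity check on copied or re-wrapped alignments, the sketch below parses a row and verifies that the number of residue letters equals end minus start plus one. It is a hedged Python example: the pattern and the names `parse_row` and `coordinates_consistent` are assumptions derived from the rows shown here.

```python
import re

# Row layout inferred from this page: "<label> <start> <sequence> <end>".
ROW_RE = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)$"
)


def parse_row(line: str) -> tuple[str, int, str, int]:
    """Split one alignment row into (label, start, sequence, end)."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return m["label"], int(m["start"]), m["seq"], int(m["end"])


def coordinates_consistent(line: str) -> bool:
    """True if the residue count (letters only, gaps ignored) matches end - start + 1."""
    _, start, seq, end = parse_row(line)
    residues = sum(ch.isalpha() for ch in seq)
    return end - start + 1 == residues


query_row = "gi 2217375254  980 AEAEIELLQNLLRQKG 995"
model_row = "Cdd:COG1196    765 LERELERLEREIEALG 780"
print(coordinates_consistent(query_row), coordinates_consistent(model_row))
```

Lowercase (unaligned) residues still count toward the coordinates; only dashes are skipped.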
SMC_N super family cl47134
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
270-597 2.65e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


The actual alignment was detected with superfamily member TIGR02169:

Pssm-ID: 481474 [Multi-domain]  Cd Length: 1164  Bit Score: 46.60  E-value: 2.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  270 EEVERLERDLEKKMIETEELK----------SKQTRFLEEIKNQDKLnksLKEEAMLQKQSCEELKSDlntknelLKQKT 339
Cdd:TIGR02169  681 ERLEGLKRELSSLQSELRRIEnrldelsqelSDASRKIGEIEKEIEQ---LEQEEEKLKERLEELEED-------LSSLE 750
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  340 IELTRACQKQYELEQElafykIDAKFEPLNYYPSEYAEIDKAPDESPYIGKSRYKRNMFATESYIIDSAQAVQ--IKKME 417
Cdd:TIGR02169  751 QEIENVKSELKELEAR-----IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEqkLNRLT 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  418 PDEQLRNDHMN--LRGHTPLDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEeaiqlKKISEAGKDLly 495
Cdd:TIGR02169  826 LEKEYLEKEIQelQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLK-----KERDELEAQL-- 898
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  496 KQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMKAQKSGKEQQLDIMN------- 568
Cdd:TIGR02169  899 RELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEpvnmlai 978
                          330       340
                   ....*....|....*....|....*....
gi 2217375254  569 KQYQQLESRLDEILSRIAKETEEIKDLEE 597
Cdd:TIGR02169  979 QEYEEVLKRLDELKEKRAKLEEERKAILE 1007
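Hits in the list above can cover overlapping stretches of the query. For instance, the Smc interval (435-995) and the SMC_N interval (270-597) share part of the sequence. The Python sketch below parses the "Interval E-value" lines and computes the overlap between two hits. The helper names `parse_hit_line` and `overlap` are hypothetical, and intervals are treated as inclusive residue ranges.

```python
def parse_hit_line(line: str) -> tuple[int, int, float]:
    """Parse an 'Interval E-value' line such as '435-995 2.09e-19'."""
    interval, evalue = line.split()
    start, end = (int(part) for part in interval.split("-"))
    return start, end, float(evalue)


def overlap(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of query residues shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


smc = parse_hit_line("435-995 2.09e-19")
smc_n = parse_hit_line("270-597 2.65e-04")
shared = overlap(smc[:2], smc_n[:2])
print(f"Smc and SMC_N share {shared} query residues")
```

When two hits overlap, the E-value parsed alongside each interval gives one simple criterion for preferring one annotation over the other.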
 
Name Accession Description Interval E-value
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
89-261 2.41e-23

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 100.63  E-value: 2.41e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   89 KKLTKQDNLALIKSL-------NlslskdggkKFKYIENLEKCVKLEVLNLSYNLIGKIEKLDKLLKLRELNLSYNKISK 161
Cdd:cd21340     12 KNITKIDNLSLCKNLkvlylydN---------KITKIENLEFLTNLTHLYLQNNQIEKIENLENLVNLKKLYLGGNRISV 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  162 IEGIENMCNLQKLNLAGNEIEhipvwLGKKL-----------KSLRVLNLKGNKISSLQDISKLKPLQDLI--------- 221
Cdd:cd21340     83 VEGLENLTNLEELHIENQRLP-----PGEKLtfdprslaalsNSLRVLNISGNNIDSLEPLAPLRNLEQLDasnnqisdl 157
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2217375254  222 --------------SLILVENPVVTLPHYLQFTIFHLRSLESLEGQPVTTQDRQ 261
Cdd:cd21340    158 eelldllsswpslrELDLTGNPVCKKPKYRDKIILASKSLEVLDGKEITDTERQ 211
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1287-2161 3.76e-23

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 108.22  E-value: 3.76e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1287 RASKRQSEKEMEELHHN---IDDLLQEkksLECEVEELHRTVQKRQQQKDfidgnVESLMTELEIEKSLKHHEDIVDEIE 1363
Cdd:TIGR02168  171 KERRKETERKLERTRENldrLEDILNE---LERQLKSLERQAEKAERYKE-----LKAELRELELALLVLRLEELREELE 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1364 CIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQ 1443
Cdd:TIGR02168  243 ELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELE 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1444 ADAKDLEQHKIKQEEILKEINKivaakdsDFQCLSKKKEKLTEELQKLQKDIEMAERNedhhlqvLKESEVLLQAKRAEL 1523
Cdd:TIGR02168  323 AQLEELESKLDELAEELAELEE-------KLEELKEELESLEAELEELEAELEELESR-------LEELEEQLETLRSKV 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1524 EKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEA-LRLGETEVTEKCNHIREVKSLLEELSFQKGELNV 1602
Cdd:TIGR02168  389 AQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAeLKELQAELEELEEELEELQEELERLEEALEELRE 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1603 QISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQlENHELQGLKLQHDQRVSELEKTQVAVLEEKLELE 1682
Cdd:TIGR02168  469 ELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLK-NQSGLSGILGVLSELISVDEGYEAAIEAALGGRL 547
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1683 nlQQI----SQQQKGEIEWQKQ----------LLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTK 1748
Cdd:TIGR02168  548 --QAVvvenLNAAKKAIAFLKQnelgrvtflpLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGV 625
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1749 RV---LAAAEENSKMEQSNL--------------------EKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKRE 1805
Cdd:TIGR02168  626 LVvddLDNALELAKKLRPGYrivtldgdlvrpggvitggsAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK 705
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1806 AVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLV 1885
Cdd:TIGR02168  706 ELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE 785
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1886 QQEMMFQRLQKERESEESKLEtskvtlkeqqhQLEKELTDQKSKLDQVLSKvlaaeervrtlqeeerwCESLEKTLSQTK 1965
Cdd:TIGR02168  786 ELEAQIEQLKEELKALREALD-----------ELRAELTLLNEEAANLRER-----------------LESLERRIAATE 837
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1966 RQLSEREQQLVEKSGELLALQKEadsmRADFSLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKnlltinedserdssl 2045
Cdd:TIGR02168  838 RRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEE--------------- 898
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2046 lskeqkqenscIQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMANQKD--LERRQMEISDAMRTLKSEVKD- 2122
Cdd:TIGR02168  899 -----------LSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQErlSEEYSLTLEEAEALENKIEDDe 967
                          890       900       910       920
                   ....*....|....*....|....*....|....*....|....*..
gi 2217375254 2123 -EIRTSLKNLNQFLPEL-PADLEAILE------RNENLEGELESLKE 2161
Cdd:TIGR02168  968 eEARRRLKRLENKIKELgPVNLAAIEEyeelkeRYDFLTAQKEDLTE 1014
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
96-345 5.81e-21

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 98.08  E-value: 5.81e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   96 NLALIKSLNLSlskdgGKKFKYI-ENLEKCVKLEVLNLSYNLIGKI-EKLDKLLKLRELNLSYNKISKI-EGIENMCNLQ 172
Cdd:COG4886    134 NLTNLKELDLS-----NNQLTDLpEPLGNLTNLKSLDLSNNQLTDLpEELGNLTNLKELDLSNNQITDLpEPLGNLTNLE 208
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  173 KLNLAGNEIEHIPVWLGkKLKSLRVLNLKGNKISSLQDISKLKPLQdliSLILVENPVVTLPHYLQFTifHLRSLeSLEG 252
Cdd:COG4886    209 ELDLSGNQLTDLPEPLA-NLTNLETLDLSNNQLTDLPELGNLTNLE---ELDLSNNQLTDLPPLANLT--NLKTL-DLSN 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  253 QPVTTQDRQEAFERFSLEEVERLERDLEKKMIETEELKSKQTRFLEEIKNQDKLNKSLKEEAMLQKQSCEELKSDLNTKN 332
Cdd:COG4886    282 NQLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSL 361
                          250
                   ....*....|...
gi 2217375254  333 ELLKQKTIELTRA 345
Cdd:COG4886    362 LLTLLLTLGLLGL 374
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
435-995 2.09e-19

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 95.77  E-value: 2.09e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAgkdllykQLSGRLQLVNKLRQEALD 514
Cdd:COG1196    244 LEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELA-------RLEQDIARLEERRRELEE 316
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQKQEIAGKQKEIKDLQIAIdsldskdpkhshmKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKD 594
Cdd:COG1196    317 RLEELEEELAELEEELEELEEELEEL-------------EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  595 LEEQLTEGQIAANEALKKDLEgvisglqeylgtIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAMDAENMR 674
Cdd:COG1196    384 LAEELLEALRAAAELAAQLEE------------LEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEE 451
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  675 KLEQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNL--------LKQQLKDFQNHLNHVVDGLVR 746
Cdd:COG1196    452 AELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADyegflegvKAALLLAGLRGLAGAVAVLIG 531
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  747 PEEVAARVDELRrklkLGTGEMNIHSPSDVLGKSLADLQKQ-------FSEILARSKWERDEAQVRERKLQEEMALQQEK 819
Cdd:COG1196    532 VEAAYEAALEAA----LAAALQNIVVEDDEVAAAAIEYLKAakagratFLPLDKIRARAALAAALARGAIGAAVDLVASD 607
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  820 LATGQEEFRQACERALEARMnfDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRELEKKK 899
Cdd:COG1196    608 LREADARYYVLGDTLLGRTL--VAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELA 685
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  900 KLEDAKSQEQVFGLDKELKKLKKAVATSDKLATAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAArDLTR 979
Cdd:COG1196    686 ERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPP-DLEE 764
                          570
                   ....*....|....*.
gi 2217375254  980 AEAEIELLQNLLRQKG 995
Cdd:COG1196    765 LERELERLEREIEALG 780
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1741-2002 9.37e-15

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 80.75  E-value: 9.37e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1741 EKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDH 1820
Cdd:COG1196    238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEER 317
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1821 LNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERES 1900
Cdd:COG1196    318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAE 397
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1901 EESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSG 1980
Cdd:COG1196    398 LAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEA 477
                          250       260
                   ....*....|....*....|..
gi 2217375254 1981 ELLALQKEADSMRADFSLLRNQ 2002
Cdd:COG1196    478 ALAELLEELAEAAARLLLLLEA 499
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
435-1060 1.07e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 77.40  E-value: 1.07e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLYKQLSGRLQlvnKLRQEALD 514
Cdd:TIGR02168  258 LTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLE---ELESKLDE 334
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKhshMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKD 594
Cdd:TIGR02168  335 LAEELAELEEKLEELKEELESLEAELEELEAELEE---LESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLER 411
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  595 LEEQLTEGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLE--------IV 666
Cdd:TIGR02168  412 LEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAErelaqlqaRL 491
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  667 AMDAENMRKLE---------------------------------QSALQAELEKERQAL----------------KNALG 697
Cdd:TIGR02168  492 DSLERLQENLEgfsegvkallknqsglsgilgvlselisvdegyEAAIEAALGGRLQAVvvenlnaakkaiaflkQNELG 571
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  698 KAQFSEE---KEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLVRPEEVAARVDELRRKLKLGTGEMNIHSPS 774
Cdd:TIGR02168  572 RVTFLPLdsiKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLD 651
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  775 DVL---------GKSLADLQKQFSEI-LARSKWERDEAQVRERKLQEEMALQQEKLATGQEEFRQACERALEAR-----M 839
Cdd:TIGR02168  652 GDLvrpggvitgGSAKTNSSILERRReIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSrqisaL 731
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  840 NFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRELEkkKKLEDAKSQEQVFGLDKELKK 919
Cdd:TIGR02168  732 RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE--AQIEQLKEELKALREALDELR 809
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  920 LKKAvATSDKLATAELTIA--KDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAARDLTRAEAEIELLQNLLRQKGEQ 997
Cdd:TIGR02168  810 AELT-LLNEEAANLRERLEslERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEA 888
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217375254  998 F-RLEMEKTGVGTGANSQVLEIEKLNETMERQRTEIARLQNVLDLTgsdnKGGFENVLEEIAEL 1060
Cdd:TIGR02168  889 LaLLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGL----EVRIDNLQERLSEE 948
LRR_9 pfam14580
Leucine-rich repeat;
130-279 1.62e-13


Pssm-ID: 405295 [Multi-domain]  Cd Length: 175  Bit Score: 70.95  E-value: 1.62e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  130 LNLSYNLIGKIEKLDKLL-KLRELNLSYNKISKIEGIENMCNLQKLNLAGNEIEHIPVWLGKKLKSLRVLNLKGNKISSL 208
Cdd:pfam14580   24 LDLRGYKIPIIENLGATLdQFDTIDFSDNEIRKLDGFPLLRRLKTLLLNNNRICRIGEGLGEALPNLTELILTNNNLQEL 103
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254  209 QDISKLKPLQDLISLILVENPVVTLPHYLQFTIFHLRSLESLEGQPVTTQDRQEAFERFSLEEVERLERDL 279
Cdd:pfam14580  104 GDLDPLASLKKLTFLSLLRNPVTNKPHYRLYVIYKVPQLRLLDFRKVKQKERQAAEKMFRSKQGKQLAKEI 174
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1280-2021 1.40e-12

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 73.61  E-value: 1.40e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1280 KREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVESLMTELEIEKSLKHH--ED 1357
Cdd:pfam15921   88 KDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCLKEDmlED 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1358 IVDEIECIEKTLLKRRSELREADRLLAEAEsELSCTKEKTKNAVEkfTDAKRSLLQTESDA-EELERRAQETAVNLVKAD 1436
Cdd:pfam15921  168 SNTQIEQLRKMMLSHEGVLQEIRSILVDFE-EASGKKIYEHDSMS--TMHFRSLGSAISKIlRELDTEISYLKGRIFPVE 244
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1437 QQLRSLQADAKD-----LEQHKIKQEEILKEinkivaaKDSDFQCLSKKKEKLTEELQKLQKDIEMAERnedhhlQVLKE 1511
Cdd:pfam15921  245 DQLEALKSESQNkiellLQQHQDRIEQLISE-------HEVEITGLTEKASSARSQANSIQSQLEIIQE------QARNQ 311
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1512 SEVLLQakraELEKLKSQVTSQQQEMAVLDRQLGHKKEElhlLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLE 1591
Cdd:pfam15921  312 NSMYMR----QLSDLESTVSQLRSELREAKRMYEDKIEE---LEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLLA 384
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1592 ELSFQKGELNVQiserKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQ 1671
Cdd:pfam15921  385 DLHKREKELSLE----KEQNKRLWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQGKNESL 460
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1672 VAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRvl 1751
Cdd:pfam15921  461 EKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQHLKN-- 538
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1752 aaAEENSKMEQSNLEKLELN-------VRKLQQELDQLNR--------------DKLSLHNDISAMQQQLQE----KREA 1806
Cdd:pfam15921  539 --EGDHLRNVQTECEALKLQmaekdkvIEILRQQIENMTQlvgqhgrtagamqvEKAQLEKEINDRRLELQEfkilKDKK 616
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1807 VNSLQEELANVQDhLNLAKQDLLHTTkhqdvllSEQTRLQKDIS----EWANRFEDCQKEEETKQQQLQVLQNEIEEnkl 1882
Cdd:pfam15921  617 DAKIRELEARVSD-LELEKVKLVNAG-------SERLRAVKDIKqerdQLLNEVKTSRNELNSLSEDYEVLKRNFRN--- 685
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1883 KLVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQ----------LEKELTDQKSKLDQVLSKVLAAEERVRTLQEEER 1952
Cdd:pfam15921  686 KSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSdghamkvamgMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKH 765
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217375254 1953 WCESLEKTLSQtkrqlsereqqlveksgELLALQKEADSMRADFSLLRNQfltERKKAEKqVASLKEAL 2021
Cdd:pfam15921  766 FLKEEKNKLSQ-----------------ELSTVATEKNKMAGELEVLRSQ---ERRLKEK-VANMEVAL 813
PLN02939 PLN02939
transferase, transferring glycosyl groups
1714-2042 5.38e-10


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 64.92  E-value: 5.38e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1714 AESRALQScVECLSKEKEDLQEKCDIWEKKLAQT-KRVLAAAEENSKME--QSNLEKLELN-----------VRKLQQEL 1779
Cdd:PLN02939   150 ARLQALED-LEKILTEKEALQGKINILEMRLSETdARIKLAAQEKIHVEilEEQLEKLRNEllirgateglcVHSLSKEL 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1780 DQLNRDKLSLHNDISAMQQQLQEKREAVNSLqeelanvqdhLNLAKQDLLHTTKHQDvLLSEQTRLQKDISEWANRFEDC 1859
Cdd:PLN02939   229 DVLKEENMLLKDDIQFLKAELIEVAETEERV----------FKLEKERSLLDASLRE-LESKFIVAQEDVSKLSPLQYDC 297
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1860 QKEE-ETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEE----SKLETSKVTLKEQQHQLEKELTDQKSKldQVL 1934
Cdd:PLN02939   298 WWEKvENLQDLLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKeanvSKFSSYKVELLQQKLKLLEERLQASDH--EIH 375
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1935 SKVLAAEERVRTLQeeerwcESLEKTLSQTKRQlsereqqlveksgellALQKEADSMRADF----SLLRNQFLTERKKA 2010
Cdd:PLN02939   376 SYIQLYQESIKEFQ------DTLSKLKEESKKR----------------SLEHPADDMPSEFwsriLLLIDGWLLEKKIS 433
                          330       340       350
                   ....*....|....*....|....*....|..
gi 2217375254 2011 EKQVASLKEALKIQRSQLEKNLLTINEDSERD 2042
Cdd:PLN02939   434 NNDAKLLREMVWKRDGRIREAYLSCKGKNERE 465
PTZ00121 PTZ00121
MAEBL; Provisional
406-1017 7.36e-10


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 64.78  E-value: 7.36e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  406 DSAQAVQIKKMEPDEQLRNDHMNLRGHTPLDTQLEDKEKKISAAQTRLSElhdEIEKAEQqiLRATEEFKQLEEAiqLKK 485
Cdd:PTZ00121  1238 DAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAE---EKKKADE--AKKAEEKKKADEA--KKK 1310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  486 ISEAGK-DLLYKQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIK--DLQIAIDSLDSKDPKHSHMKAQKSGKE- 561
Cdd:PTZ00121  1311 AEEAKKaDEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEaaEEKAEAAEKKKEEAKKKADAAKKKAEEk 1390
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  562 QQLDIMNKQYQQLESRLDEILSRIA--KETEEIKDLEEQLTEgqiaANEALKKDLEGVISglqEYLGTIKGQATQAQNEC 639
Cdd:PTZ00121  1391 KKADEAKKKAEEDKKKADELKKAAAakKKADEAKKKAEEKKK----ADEAKKKAEEAKKA---DEAKKKAEEAKKAEEAK 1463
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  640 RKlRDEKETLLQRLTEVEQERDQLEIVAMDAENMRKLEQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHL 719
Cdd:PTZ00121  1464 KK-AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKA 1542
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  720 QDDNNllkqqlkdfqnhlnhvVDGLVRPEEVaaRVDELRRKLKLGTGEMNIHSPSDVLGKSLADLQKQFSEILARSKWER 799
Cdd:PTZ00121  1543 EEKKK----------------ADELKKAEEL--KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEE 1604
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  800 DEAQVRERKLQEEMALQQEKLATGQEEfRQACERALEARMNFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQ 879
Cdd:PTZ00121  1605 KKMKAEEAKKAEEAKIKAEELKKAEEE-KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  880 EADEEKerilaqlRELEKKKKLEDAKSQEQVFGLDKELKKLKKAVATSDKLATAELTIAKDQLKSlhgtvmkiNQERAEE 959
Cdd:PTZ00121  1684 EEDEKK-------AAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE--------DKKKAEE 1748
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217375254  960 LQEAERFSRKAAQAARDLTRAEAEIE-----LLQNLLRQKGEQFRLEMEKTGVGTGANSQVLE 1017
Cdd:PTZ00121  1749 AKKDEEEKKKIAHLKKEEEKKAEEIRkekeaVIEEELDEEDEKRRMEVDKKIKDIFDNFANII 1811
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
781-1567 3.31e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 59.30  E-value: 3.31e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  781 LADLQKQFSEILARSKWERDEAQVRERKLQEEMALQQEKLATGQEEFRQACERALEA------------RMNFDKRQHEA 848
Cdd:TIGR02168  230 LVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELqkelyalaneisRLEQQKQILRE 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  849 RIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRELEKKKKLEDAKSQEQVFGLDKELKKLKKAVATSD 928
Cdd:TIGR02168  310 RLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  929 KL------ATAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQaaRDLTRAEAEIELLQNLLRQKGEQFRLEM 1002
Cdd:TIGR02168  390 QLelqiasLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQ--AELEELEEELEELQEELERLEEALEELR 467
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1003 EktgvgtgansqvlEIEKLNETMERQRTEIARLQNVLDLTGSDnKGGFENVLEEIAELRREVSYQNDYISSMADPFKRRG 1082
Cdd:TIGR02168  468 E-------------ELEEAEQALDAAERELAQLQARLDSLERL-QENLEGFSEGVKALLKNQSGLSGILGVLSELISVDE 533
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1083 YWyfmppppsskvsshSSQATKDSGVGLKYSAStpvrkprPGQQDGKEG--SQPPPASGYWVYSPIRSGLHKLFPSRDAD 1160
Cdd:TIGR02168  534 GY--------------EAAIEAALGGRLQAVVV-------ENLNAAKKAiaFLKQNELGRVTFLPLDSIKGTEIQGNDRE 592
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1161 S--GGDSQEESELDDQEEPPFVPPP-GYMMYTVLPDGSpVPQGMALYAPPPPLPNNSRP----LTPGTVVYGpppagapm 1233
Cdd:TIGR02168  593 IlkNIEGFLGVAKDLVKFDPKLRKAlSYLLGGVLVVDD-LDNALELAKKLRPGYRIVTLdgdlVRPGGVITG-------- 663
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1234 vyGPPPPNFSIpfipmgvlhcnvpehHNLENEVSRLedimqhlkskkreerwmraskrqsEKEMEELHHNIDDLLQEKKS 1313
Cdd:TIGR02168  664 --GSAKTNSSI---------------LERRREIEEL------------------------EEKIEELEEKIAELEKALAE 702
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1314 LECEVEELhrTVQKRQQQKDFIDGNVESLMTELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCT 1393
Cdd:TIGR02168  703 LRKELEEL--EEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEA 780
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1394 KEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVaakdsd 1473
Cdd:TIGR02168  781 EAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI------ 854
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1474 fqclskkkEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHL 1553
Cdd:TIGR02168  855 --------ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQ 926
                          810
                   ....*....|....
gi 2217375254 1554 LQGSMVQAKADLQE 1567
Cdd:TIGR02168  927 LELRLEGLEVRIDN 940
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
270-1081 9.99e-08

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 57.67  E-value: 9.99e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  270 EEVERLERDLEKKMIETEELKSKQTRFLEEIKNQDKLNKSLKEEAMLQKQSCEELKSDLNTKNELLKQKTIELTRACQKQ 349
Cdd:pfam02463  166 RLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQEL 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  350 YELEQElafyKIDAKFEPLNYYPSEYAEIDKAPDEspyigKSRYKRNMFATESYIIDSAQAVQIKKMEPDEQLRNDHMNL 429
Cdd:pfam02463  246 LRDEQE----EIESSKQEIEKEEEKLAQVLKENKE-----EEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKL 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  430 RGHtplDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLYKQLSGRLQLVNKLR 509
Cdd:pfam02463  317 KES---EKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLK 393
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  510 QEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKhshmKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKET 589
Cdd:pfam02463  394 EEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILE----EEEESIELKQGKLTEEKEELEKQELKLLKDELELKK 469
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  590 EEIKDLEEQLTEGQIAANEALK-KDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAM 668
Cdd:pfam02463  470 SEDLLKETQLVKLQEQLELLLSrQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAV 549
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  669 DAENMRKLEQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLVRPE 748
Cdd:pfam02463  550 IVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGIL 629
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  749 EVAARVDELRRKLKLGTGEMNIHSpSDVLGKSLADLQKQFSEILARSKWERDEAQVRERKLQEEMALQQEKLATGQEEFR 828
Cdd:pfam02463  630 KDTELTKLKESAKAKESGLRKGVS-LEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQRE 708
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  829 QACER--ALEARMNFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQ-LRELEKKKKLEDAK 905
Cdd:pfam02463  709 KEELKklKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEkELAEEREKTEKLKV 788
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  906 SQEQVFGLDKELKKLKKAVATSDKLATAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAARDLTRAEAEIE 985
Cdd:pfam02463  789 EEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEEL 868
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  986 LLQNLLRQKGEQFRLEMEKTGvgtGANSQVLEIEKLNETMERQRTEIARLQNVLDLTGSDNKGGFENVLEEIAELRREVS 1065
Cdd:pfam02463  869 LQELLLKEEELEEQKLKDELE---SKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEA 945
                          810
                   ....*....|....*.
gi 2217375254 1066 YQNDYISSMADPFKRR 1081
Cdd:pfam02463  946 DEKEKEENNKEEEEER 961
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
270-597 2.65e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 46.60  E-value: 2.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  270 EEVERLERDLEKKMIETEELK----------SKQTRFLEEIKNQDKLnksLKEEAMLQKQSCEELKSDlntknelLKQKT 339
Cdd:TIGR02169  681 ERLEGLKRELSSLQSELRRIEnrldelsqelSDASRKIGEIEKEIEQ---LEQEEEKLKERLEELEED-------LSSLE 750
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  340 IELTRACQKQYELEQElafykIDAKFEPLNYYPSEYAEIDKAPDESPYIGKSRYKRNMFATESYIIDSAQAVQ--IKKME 417
Cdd:TIGR02169  751 QEIENVKSELKELEAR-----IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEqkLNRLT 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  418 PDEQLRNDHMN--LRGHTPLDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEeaiqlKKISEAGKDLly 495
Cdd:TIGR02169  826 LEKEYLEKEIQelQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLK-----KERDELEAQL-- 898
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  496 KQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMKAQKSGKEQQLDIMN------- 568
Cdd:TIGR02169  899 RELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEpvnmlai 978
                          330       340
                   ....*....|....*....|....*....
gi 2217375254  569 KQYQQLESRLDEILSRIAKETEEIKDLEE 597
Cdd:TIGR02169  979 QEYEEVLKRLDELKEKRAKLEEERKAILE 1007
PLN03150 PLN03150
hypothetical protein; Provisional
145-206 1.88e-03


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 43.27  E-value: 1.88e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217375254  145 KLLKLRELNLSYNKIskiEG-----IENMCNLQKLNLAGNEIE-HIPVWLGKkLKSLRVLNLKGNKIS 206
Cdd:PLN03150   440 KLRHLQSINLSGNSI---RGnippsLGSITSLEVLDLSYNSFNgSIPESLGQ-LTSLRILNLNGNSLS 503
 
Name Accession Description Interval E-value
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
89-261 2.41e-23

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 100.63  E-value: 2.41e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   89 KKLTKQDNLALIKSL-------NlslskdggkKFKYIENLEKCVKLEVLNLSYNLIGKIEKLDKLLKLRELNLSYNKISK 161
Cdd:cd21340     12 KNITKIDNLSLCKNLkvlylydN---------KITKIENLEFLTNLTHLYLQNNQIEKIENLENLVNLKKLYLGGNRISV 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  162 IEGIENMCNLQKLNLAGNEIEhipvwLGKKL-----------KSLRVLNLKGNKISSLQDISKLKPLQDLI--------- 221
Cdd:cd21340     83 VEGLENLTNLEELHIENQRLP-----PGEKLtfdprslaalsNSLRVLNISGNNIDSLEPLAPLRNLEQLDasnnqisdl 157
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2217375254  222 --------------SLILVENPVVTLPHYLQFTIFHLRSLESLEGQPVTTQDRQ 261
Cdd:cd21340    158 eelldllsswpslrELDLTGNPVCKKPKYRDKIILASKSLEVLDGKEITDTERQ 211
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1287-2161 3.76e-23

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 108.22  E-value: 3.76e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1287 RASKRQSEKEMEELHHN---IDDLLQEkksLECEVEELHRTVQKRQQQKDfidgnVESLMTELEIEKSLKHHEDIVDEIE 1363
Cdd:TIGR02168  171 KERRKETERKLERTRENldrLEDILNE---LERQLKSLERQAEKAERYKE-----LKAELRELELALLVLRLEELREELE 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1364 CIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQ 1443
Cdd:TIGR02168  243 ELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELE 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1444 ADAKDLEQHKIKQEEILKEINKivaakdsDFQCLSKKKEKLTEELQKLQKDIEMAERNedhhlqvLKESEVLLQAKRAEL 1523
Cdd:TIGR02168  323 AQLEELESKLDELAEELAELEE-------KLEELKEELESLEAELEELEAELEELESR-------LEELEEQLETLRSKV 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1524 EKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEA-LRLGETEVTEKCNHIREVKSLLEELSFQKGELNV 1602
Cdd:TIGR02168  389 AQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAeLKELQAELEELEEELEELQEELERLEEALEELRE 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1603 QISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQlENHELQGLKLQHDQRVSELEKTQVAVLEEKLELE 1682
Cdd:TIGR02168  469 ELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLK-NQSGLSGILGVLSELISVDEGYEAAIEAALGGRL 547
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1683 nlQQI----SQQQKGEIEWQKQ----------LLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTK 1748
Cdd:TIGR02168  548 --QAVvvenLNAAKKAIAFLKQnelgrvtflpLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGV 625
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1749 RV---LAAAEENSKMEQSNL--------------------EKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKRE 1805
Cdd:TIGR02168  626 LVvddLDNALELAKKLRPGYrivtldgdlvrpggvitggsAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK 705
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1806 AVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLV 1885
Cdd:TIGR02168  706 ELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE 785
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1886 QQEMMFQRLQKERESEESKLEtskvtlkeqqhQLEKELTDQKSKLDQVLSKvlaaeervrtlqeeerwCESLEKTLSQTK 1965
Cdd:TIGR02168  786 ELEAQIEQLKEELKALREALD-----------ELRAELTLLNEEAANLRER-----------------LESLERRIAATE 837
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1966 RQLSEREQQLVEKSGELLALQKEadsmRADFSLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKnlltinedserdssl 2045
Cdd:TIGR02168  838 RRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEE--------------- 898
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2046 lskeqkqenscIQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMANQKD--LERRQMEISDAMRTLKSEVKD- 2122
Cdd:TIGR02168  899 -----------LSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQErlSEEYSLTLEEAEALENKIEDDe 967
                          890       900       910       920
                   ....*....|....*....|....*....|....*....|....*..
gi 2217375254 2123 -EIRTSLKNLNQFLPEL-PADLEAILE------RNENLEGELESLKE 2161
Cdd:TIGR02168  968 eEARRRLKRLENKIKELgPVNLAAIEEyeelkeRYDFLTAQKEDLTE 1014
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
96-345 5.81e-21


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 98.08  E-value: 5.81e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   96 NLALIKSLNLSlskdgGKKFKYI-ENLEKCVKLEVLNLSYNLIGKI-EKLDKLLKLRELNLSYNKISKI-EGIENMCNLQ 172
Cdd:COG4886    134 NLTNLKELDLS-----NNQLTDLpEPLGNLTNLKSLDLSNNQLTDLpEELGNLTNLKELDLSNNQITDLpEPLGNLTNLE 208
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  173 KLNLAGNEIEHIPVWLGkKLKSLRVLNLKGNKISSLQDISKLKPLQdliSLILVENPVVTLPHYLQFTifHLRSLeSLEG 252
Cdd:COG4886    209 ELDLSGNQLTDLPEPLA-NLTNLETLDLSNNQLTDLPELGNLTNLE---ELDLSNNQLTDLPPLANLT--NLKTL-DLSN 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  253 QPVTTQDRQEAFERFSLEEVERLERDLEKKMIETEELKSKQTRFLEEIKNQDKLNKSLKEEAMLQKQSCEELKSDLNTKN 332
Cdd:COG4886    282 NQLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSL 361
                          250
                   ....*....|...
gi 2217375254  333 ELLKQKTIELTRA 345
Cdd:COG4886    362 LLTLLLTLGLLGL 374
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
435-995 2.09e-19

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 95.77  E-value: 2.09e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAgkdllykQLSGRLQLVNKLRQEALD 514
Cdd:COG1196    244 LEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELA-------RLEQDIARLEERRRELEE 316
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQKQEIAGKQKEIKDLQIAIdsldskdpkhshmKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKD 594
Cdd:COG1196    317 RLEELEEELAELEEELEELEEELEEL-------------EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  595 LEEQLTEGQIAANEALKKDLEgvisglqeylgtIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAMDAENMR 674
Cdd:COG1196    384 LAEELLEALRAAAELAAQLEE------------LEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEE 451
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  675 KLEQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNL--------LKQQLKDFQNHLNHVVDGLVR 746
Cdd:COG1196    452 AELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADyegflegvKAALLLAGLRGLAGAVAVLIG 531
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  747 PEEVAARVDELRrklkLGTGEMNIHSPSDVLGKSLADLQKQ-------FSEILARSKWERDEAQVRERKLQEEMALQQEK 819
Cdd:COG1196    532 VEAAYEAALEAA----LAAALQNIVVEDDEVAAAAIEYLKAakagratFLPLDKIRARAALAAALARGAIGAAVDLVASD 607
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  820 LATGQEEFRQACERALEARMnfDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRELEKKK 899
Cdd:COG1196    608 LREADARYYVLGDTLLGRTL--VAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELA 685
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  900 KLEDAKSQEQVFGLDKELKKLKKAVATSDKLATAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAArDLTR 979
Cdd:COG1196    686 ERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPP-DLEE 764
                          570
                   ....*....|....*.
gi 2217375254  980 AEAEIELLQNLLRQKG 995
Cdd:COG1196    765 LERELERLEREIEALG 780
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1262-2022 2.15e-19

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 95.89  E-value: 2.15e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1262 LENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVES 1341
Cdd:TIGR02168  234 LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLAN 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1342 LMTELeiekslkhhEDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEEL 1421
Cdd:TIGR02168  314 LERQL---------EELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETL 384
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1422 ERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKivAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERN 1501
Cdd:TIGR02168  385 RSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLK--KLEEAELKELQAELEELEEELEELQEELERLEEA 462
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1502 EDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLD------RQLGHKKEELHLLQG---SMVQAKADLQEALR-- 1570
Cdd:TIGR02168  463 LEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEgfsegvKALLKNQSGLSGILGvlsELISVDEGYEAAIEaa 542
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1571 LGETE---VTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDML- 1646
Cdd:TIGR02168  543 LGGRLqavVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLl 622
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1647 -------QLENHELQGLKLQHDQRVSELEKTQVAVLEEKLelenlqqisqqqKGEIEWQKQLLERdKREIERMTAESRAL 1719
Cdd:TIGR02168  623 ggvlvvdDLDNALELAKKLRPGYRIVTLDGDLVRPGGVIT------------GGSAKTNSSILER-RREIEELEEKIEEL 689
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1720 QSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQ 1799
Cdd:TIGR02168  690 EEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEER 769
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1800 LQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEE 1879
Cdd:TIGR02168  770 LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEE 849
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1880 NKLKLVQQEMMFQRLQKERESEESKLEtskvTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEerwCESLEK 1959
Cdd:TIGR02168  850 LSEDIESLAAEIEELEELIEELESELE----ALLNERASLEEALALLRSELEELSEELRELESKRSELRRE---LEELRE 922
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217375254 1960 TLSQTKRQLSEREQQLVEKSGELLALQK-EADSMRAdfslLRNQFLTERKKAEKQVASLKEALK 2022
Cdd:TIGR02168  923 KLAQLELRLEGLEVRIDNLQERLSEEYSlTLEEAEA----LENKIEDDEEEARRRLKRLENKIK 982
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
443-1064 9.15e-19

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 93.85  E-value: 9.15e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  443 EKKISAAQTRLSELHDEIEKAEQQI--L----RATEEFKQLEEAIQLKKISEAGkdLLYKQLSGRLQlvnKLRQEALDLE 516
Cdd:COG1196    178 ERKLEATEENLERLEDILGELERQLepLerqaEKAERYRELKEELKELEAELLL--LKLRELEAELE---ELEAELEELE 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  517 LQMEKQKQEIAGKQKEIKDLQIAIDSLdskdpkhshmkaqksgkEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKDLE 596
Cdd:COG1196    253 AELEELEAELAELEAELEELRLELEEL-----------------ELELEEAQAEEYELLAELARLEQDIARLEERRRELE 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  597 EQLTEgqiaaNEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEivaMDAENMRKL 676
Cdd:COG1196    316 ERLEE-----LEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAE---EELEELAEE 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  677 EQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLVRPEEVAARVDE 756
Cdd:COG1196    388 LLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAE 467
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  757 LRRKLKLGTGEMNihspsDVLGKSLADLQKQFSEILARSKWERDEAQVRERKLQEEMALQQEKLA---TGQEEFRQACER 833
Cdd:COG1196    468 LLEEAALLEAALA-----ELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAvliGVEAAYEAALEA 542
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  834 ALEARMNFDKRQHEARIQQMeneIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQ----------LRELEKKKKLED 903
Cdd:COG1196    543 ALAAALQNIVVEDDEVAAAA---IEYLKAAKAGRATFLPLDKIRARAALAAALARGAIgaavdlvasdLREADARYYVLG 619
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  904 AKSQEQVFGLDKELKKLKKAVATSDKL--ATAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAARDLTRAE 981
Cdd:COG1196    620 DTLLGRTLVAARLEAALRRAVTLAGRLreVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEAL 699
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  982 AEIELLQNLLRQKGEQFRLEmektgvgtgansqVLEIEKLNETMERQRTEIARLQNVLDLTGSDNKGGFENVLEEIAELR 1061
Cdd:COG1196    700 LAEEEEERELAEAEEERLEE-------------ELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELE 766

                   ...
gi 2217375254 1062 REV 1064
Cdd:COG1196    767 REL 769
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
55-250 1.96e-18


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 90.38  E-value: 1.96e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   55 QVEIADENNMLLDYQDHKGADSHAGVRYITEALIKKLTKQDNLALIKSLNLSLSKDGGKKFKYIENLEKCVKLEVLNLSY 134
Cdd:COG4886     43 LSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSG 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  135 NLIGKI-EKLDKLLKLRELNLSYNKISKI-EGIENMCNLQKLNLAGNEIEHIPVWLGkKLKSLRVLNLKGNKISSLQD-I 211
Cdd:COG4886    123 NQLTDLpEELANLTNLKELDLSNNQLTDLpEPLGNLTNLKSLDLSNNQLTDLPEELG-NLTNLKELDLSNNQITDLPEpL 201
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2217375254  212 SKLKPLQdliSLILVENPVVTLPHylqfTIFHLRSLESL 250
Cdd:COG4886    202 GNLTNLE---ELDLSGNQLTDLPE----PLANLTNLETL 233
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1477-2208 1.42e-17

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 90.12  E-value: 1.42e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1477 LSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKeseVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQG 1556
Cdd:TIGR02168  198 LERQLKSLERQAEKAERYKELKAELRELELALLV---LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRL 274
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1557 SMVQAKADLQEAlrlgETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQvvlrQMSKHK 1636
Cdd:TIGR02168  275 EVSELEEEIEEL----QKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELA----ELEEKL 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1637 TELKNILDMLQLENHELQGLKLQHDQRVSELEKtqvAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAES 1716
Cdd:TIGR02168  347 EELKEELESLEAELEELEAELEELESRLEELEE---QLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEI 423
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1717 RALQScvECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENskmeqsnLEKLELNVRKLQQELDQLNRDKLSLHNDISAM 1796
Cdd:TIGR02168  424 EELLK--KLEEAELKELQAELEELEEELEELQEELERLEEA-------LEELREELEEAEQALDAAERELAQLQARLDSL 494
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1797 QQQLQEKREAVNSLQEELAN------------------------------------VQDHLNLAKQDLLHTTKHQDVLLS 1840
Cdd:TIGR02168  495 ERLQENLEGFSEGVKALLKNqsglsgilgvlselisvdegyeaaieaalggrlqavVVENLNAAKKAIAFLKQNELGRVT 574
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1841 -------EQTRLQKDISEWANRFEDCQ---KEEETKQQQLQVLQN------------EIEENKLKLVQQEMMF------- 1891
Cdd:TIGR02168  575 flpldsiKGTEIQGNDREILKNIEGFLgvaKDLVKFDPKLRKALSyllggvlvvddlDNALELAKKLRPGYRIvtldgdl 654
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1892 ----------------QRLQKERESEEskLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCE 1955
Cdd:TIGR02168  655 vrpggvitggsaktnsSILERRREIEE--LEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1956 SLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFSLLRNQFLTERKKAEKQVASLKEALKIQRSQ---LEKNL 2032
Cdd:TIGR02168  733 KDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREAldeLRAEL 812
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2033 LTINEDSERDSSLLSKEQKQENSCiQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMA---NQKDLERRQMEI 2109
Cdd:TIGR02168  813 TLLNEEAANLRERLESLERRIAAT-ERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEallNERASLEEALAL 891
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2110 SDAMRTLKSEVKDEIRTSLKNLNQFLPELPADLEAILERNENLEGELESLKENLPftmnegpfeEKLNFSQVHIMDEHWR 2189
Cdd:TIGR02168  892 LRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLS---------EEYSLTLEEAEALENK 962
                          810
                   ....*....|....*....
gi 2217375254 2190 GEALREKLRHREDRLKAQL 2208
Cdd:TIGR02168  963 IEDDEEEARRRLKRLENKI 981
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1741-2002 9.37e-15

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 80.75  E-value: 9.37e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1741 EKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDH 1820
Cdd:COG1196    238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEER 317
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1821 LNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERES 1900
Cdd:COG1196    318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAE 397
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1901 EESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSG 1980
Cdd:COG1196    398 LAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEA 477
                          250       260
                   ....*....|....*....|..
gi 2217375254 1981 ELLALQKEADSMRADFSLLRNQ 2002
Cdd:COG1196    478 ALAELLEELAEAAARLLLLLEA 499
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1685-2209 1.81e-14

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 79.60  E-value: 1.81e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1685 QQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSN 1764
Cdd:COG1196    238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEER 317
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1765 LEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTR 1844
Cdd:COG1196    318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAE 397
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1845 LQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQemmfQRLQKERESEESKLETSKVTLKEQQHQLEKELT 1924
Cdd:COG1196    398 LAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEE----EEALEEAAEEEAELEEEEEALLELLAELLEEAA 473
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1925 DQKSKLDQVLSKVLAAEERVRTLQEEErwcESLEKTLSQTKRQLSEREQQLVEKSGELLA---LQKEADSMRADFSLLRN 2001
Cdd:COG1196    474 LLEAALAELLEELAEAAARLLLLLEAE---ADYEGFLEGVKAALLLAGLRGLAGAVAVLIgveAAYEAALEAALAAALQN 550
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2002 QFLTERKKAEKQVASLKEAL----------KIQRSQLEKNLLTINEdSERDSSLLSKEQKQENSCIQKEMATI--ELVAQ 2069
Cdd:COG1196    551 IVVEDDEVAAAAIEYLKAAKagratflpldKIRARAALAAALARGA-IGAAVDLVASDLREADARYYVLGDTLlgRTLVA 629
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2070 DNHERARRLMKELNQMQYEYT------------ELKKQMANQKDLERRQMEISDAMRTLKSEVKDEIRTSLKNLNQFLPE 2137
Cdd:COG1196    630 ARLEAALRRAVTLAGRLREVTlegeggsaggslTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEEREL 709
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217375254 2138 LPADLEAILERNENLEGELESLKENLPFTMNEGPFEEKLNFSQVHIMDEhwrgEALREKLRHREDRLKAQLR 2209
Cdd:COG1196    710 AEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPE----PPDLEELERELERLEREIE 777
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1260-1924 5.69e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 78.17  E-value: 5.69e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1260 HNLENEVSRLEDIMQHLKSKKREerwMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNV 1339
Cdd:TIGR02168  312 ANLERQLEELEAQLEELESKLDE---LAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1340 ESLmtELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAE-SELSCTKEKTKNAVEKFTDAKRSLLQTESDA 1418
Cdd:TIGR02168  389 AQL--ELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAElKELQAELEELEEELEELQEELERLEEALEEL 466
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1419 EELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLS------KKKEK--------- 1483
Cdd:TIGR02168  467 REELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSelisvdEGYEAaieaalggr 546
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1484 ----LTEELQKLQKDIEMAERNEDHHLQVLkeseVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELH-----LL 1554
Cdd:TIGR02168  547 lqavVVENLNAAKKAIAFLKQNELGRVTFL----PLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRkalsyLL 622
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1555 QGSMV----QAKADLQEALRLGETEVTEKCNHIREVKSlleeLSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLR 1630
Cdd:TIGR02168  623 GGVLVvddlDNALELAKKLRPGYRIVTLDGDLVRPGGV----ITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEK 698
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1631 QMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIE 1710
Cdd:TIGR02168  699 ALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELA 778
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1711 RMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLH 1790
Cdd:TIGR02168  779 EAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLA 858
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1791 NDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEwanrfedCQKEEETKQQQL 1870
Cdd:TIGR02168  859 AEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEE-------LREKLAQLELRL 931
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 1871 QVLQNEIEENKLKL-VQQEMMFQrlqkERESEESKLETSKVTLKEQQHQLEKELT 1924
Cdd:TIGR02168  932 EGLEVRIDNLQERLsEEYSLTLE----EAEALENKIEDDEEEARRRLKRLENKIK 982
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1294-2104 5.91e-14

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 78.19  E-value: 5.91e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1294 EKEMEELHHNIDDLLQEKKSLECEVEELHR--TVQKRQQQKDFIDGNVESLMTELEIEKSLKHHEDIVDEIECIEKTLLK 1371
Cdd:TIGR02169  183 EENIERLDLIIDEKRQQLERLRREREKAERyqALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISE 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1372 RRSELREADRLLAEAESELSCTKEKTKNAV-EKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLE 1450
Cdd:TIGR02169  263 LEKRLEEIEQLLEELNKKIKDLGEEEQLRVkEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1451 QhkikqeeilkeinkivaakdsDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQV 1530
Cdd:TIGR02169  343 R---------------------EIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREI 401
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1531 TSQQQEMavldRQLGHKKEELHLLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQ 1610
Cdd:TIGR02169  402 NELKREL----DRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEE 477
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1611 LTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQVAVleEKLELENLQQISQQ 1690
Cdd:TIGR02169  478 YDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQGVHGTVAQLGSVGERYATAI--EVAAGNRLNNVVVE 555
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1691 QKGEIEWQKQLLERdkREIERMT----AESRALQSCVECLSKEKE-----DLQEKCDIWEKKLAQ------------TKR 1749
Cdd:TIGR02169  556 DDAVAKEAIELLKR--RKAGRATflplNKMRDERRDLSILSEDGVigfavDLVEFDPKYEPAFKYvfgdtlvvedieAAR 633
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1750 VLAAAEENSKMEQSNLEK-------------LELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELAN 1816
Cdd:TIGR02169  634 RLMGKYRMVTLEGELFEKsgamtggsraprgGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSD 713
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1817 vqdhlnlAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRL-Q 1895
Cdd:TIGR02169  714 -------ASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLeA 786
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1896 KERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWC-----------ESLEKTLSQT 1964
Cdd:TIGR02169  787 RLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLkeqiksiekeiENLNGKKEEL 866
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1965 KRQLSEREQQLVEKSGELLALQKEADSMRADFSLLR---NQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSER 2041
Cdd:TIGR02169  867 EEELEELEAALRDLESRLGDLKKERDELEAQLRELErkiEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEI 946
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254 2042 DSSLLSKEQKQEN-SCIQKEMATIELV---AQDNHERARRLMKEL----NQMQYEYTELKKQMANQKDLER 2104
Cdd:TIGR02169  947 PEEELSLEDVQAElQRVEEEIRALEPVnmlAIQEYEEVLKRLDELkekrAKLEEERKAILERIEEYEKKKR 1017
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1290-1879 6.14e-14

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 78.05  E-value: 6.14e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1290 KRQSEKEMEELHHN---IDDLLQEK----KSLE--CEVEELHRTVQKRQQQKDfidgnveSLMTELEIEKSLKHHEDIVD 1360
Cdd:COG1196    174 KEEAERKLEATEENlerLEDILGELerqlEPLErqAEKAERYRELKEELKELE-------AELLLLKLRELEAELEELEA 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1361 EIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLR 1440
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELA 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1441 SLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKR 1520
Cdd:COG1196    327 ELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELE 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1521 AELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGEL 1600
Cdd:COG1196    407 EAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEEL 486
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1601 NVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNIL----------------DMLQLENHE--------LQGL 1656
Cdd:COG1196    487 AEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLigveaayeaaleaalaAALQNIVVEddevaaaaIEYL 566
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1657 KLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEk 1736
Cdd:COG1196    567 KAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGR- 645
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1737 cdiWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELAN 1816
Cdd:COG1196    646 ---LREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELE 722
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217375254 1817 VQDHLNLAKQDLLHttkhqdvLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEE 1879
Cdd:COG1196    723 EEALEEQLEAEREE-------LLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEA 778
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
435-1060 1.07e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 77.40  E-value: 1.07e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLYKQLSGRLQlvnKLRQEALD 514
Cdd:TIGR02168  258 LTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLE---ELESKLDE 334
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKhshMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKD 594
Cdd:TIGR02168  335 LAEELAELEEKLEELKEELESLEAELEELEAELEE---LESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLER 411
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  595 LEEQLTEGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLE--------IV 666
Cdd:TIGR02168  412 LEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAErelaqlqaRL 491
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  667 AMDAENMRKLE---------------------------------QSALQAELEKERQAL----------------KNALG 697
Cdd:TIGR02168  492 DSLERLQENLEgfsegvkallknqsglsgilgvlselisvdegyEAAIEAALGGRLQAVvvenlnaakkaiaflkQNELG 571
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  698 KAQFSEE---KEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLVRPEEVAARVDELRRKLKLGTGEMNIHSPS 774
Cdd:TIGR02168  572 RVTFLPLdsiKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLD 651
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  775 DVL---------GKSLADLQKQFSEI-LARSKWERDEAQVRERKLQEEMALQQEKLATGQEEFRQACERALEAR-----M 839
Cdd:TIGR02168  652 GDLvrpggvitgGSAKTNSSILERRReIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSrqisaL 731
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  840 NFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRELEkkKKLEDAKSQEQVFGLDKELKK 919
Cdd:TIGR02168  732 RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE--AQIEQLKEELKALREALDELR 809
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  920 LKKAvATSDKLATAELTIA--KDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAARDLTRAEAEIELLQNLLRQKGEQ 997
Cdd:TIGR02168  810 AELT-LLNEEAANLRERLEslERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEA 888
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217375254  998 F-RLEMEKTGVGTGANSQVLEIEKLNETMERQRTEIARLQNVLDLTgsdnKGGFENVLEEIAEL 1060
Cdd:TIGR02168  889 LaLLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGL----EVRIDNLQERLSEE 948
LRR_9 pfam14580
Leucine-rich repeat;
130-279 1.62e-13

Leucine-rich repeat;


Pssm-ID: 405295 [Multi-domain]  Cd Length: 175  Bit Score: 70.95  E-value: 1.62e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  130 LNLSYNLIGKIEKLDKLL-KLRELNLSYNKISKIEGIENMCNLQKLNLAGNEIEHIPVWLGKKLKSLRVLNLKGNKISSL 208
Cdd:pfam14580   24 LDLRGYKIPIIENLGATLdQFDTIDFSDNEIRKLDGFPLLRRLKTLLLNNNRICRIGEGLGEALPNLTELILTNNNLQEL 103
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254  209 QDISKLKPLQDLISLILVENPVVTLPHYLQFTIFHLRSLESLEGQPVTTQDRQEAFERFSLEEVERLERDL 279
Cdd:pfam14580  104 GDLDPLASLKKLTFLSLLRNPVTNKPHYRLYVIYKVPQLRLLDFRKVKQKERQAAEKMFRSKQGKQLAKEI 174
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1418-2017 1.65e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 76.51  E-value: 1.65e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1418 AEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEM 1497
Cdd:COG1196    220 EELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELAR 299
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1498 AERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEALRLgetevt 1577
Cdd:COG1196    300 LEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAE------ 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1578 ekcnhIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGLK 1657
Cdd:COG1196    374 -----LAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA 448
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1658 LQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRAlqscveclSKEKEDLQEKC 1737
Cdd:COG1196    449 EEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEG--------VKAALLLAGLR 520
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1738 DIWEKK--LAQTKRVLAAAEENSkMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELA 1815
Cdd:COG1196    521 GLAGAVavLIGVEAAYEAALEAA-LAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGA 599
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1816 NVQDHLNLAKQDLLHTTKHQDVLLSEqtrlqkdiSEWANRFEDCQKEEETKQQQLQVLQNEIEEnklkLVQQEMMFQRLQ 1895
Cdd:COG1196    600 AVDLVASDLREADARYYVLGDTLLGR--------TLVAARLEAALRRAVTLAGRLREVTLEGEG----GSAGGSLTGGSR 667
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1896 KERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEErwceslEKTLSQTKRQLSEREQQL 1975
Cdd:COG1196    668 RELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEE------EALEEQLEAEREELLEEL 741
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|..
gi 2217375254 1976 VEKSGELLALQKEADSMRADFSLLRnqflTERKKAEKQVASL 2017
Cdd:COG1196    742 LEEEELLEEEALEELPEPPDLEELE----RELERLEREIEAL 779
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
509-904 3.05e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 75.86  E-value: 3.05e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  509 RQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKhshMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKE 588
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQ---LRKELEELSRQISALRKDLARLEAEVEQLEERIAQL 752
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  589 TEEIKDLEEQLTE--GQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEiv 666
Cdd:TIGR02168  753 SKELTELEAEIEEleERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLE-- 830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  667 amdaENMRKLEQSALQAELEKERQALKNALGKAQFSEEKEQENsELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLvr 746
Cdd:TIGR02168  831 ----RRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIE-ELESELEALLNERASLEEALALLRSELEELSEEL-- 903
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  747 pEEVAARVDELRRKLKLgtgemnihspsdvlgksladLQKQFSEILARskweRDEAQVRERKLQEEMALQQEKLATGQEE 826
Cdd:TIGR02168  904 -RELESKRSELRRELEE--------------------LREKLAQLELR----LEGLEVRIDNLQERLSEEYSLTLEEAEA 958
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  827 frqaceraLEARMNFDKRQHEARIQQMENEIHYLQE-NLKSMEEiqgltdlqLQEADEEKERILAQLRELEK-KKKLEDA 904
Cdd:TIGR02168  959 --------LENKIEDDEEEARRRLKRLENKIKELGPvNLAAIEE--------YEELKERYDFLTAQKEDLTEaKETLEEA 1022
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1606-2161 8.18e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.20  E-value: 8.18e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1606 ERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQvavleeKLELENLQ 1685
Cdd:COG1196    222 LKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQ------AEEYELLA 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1686 QISQQQKGEIEwQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNL 1765
Cdd:COG1196    296 ELARLEQDIAR-LEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAEL 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1766 EKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRL 1845
Cdd:COG1196    375 AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAEL 454
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1846 QKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEmmfQRLQKERESEESKLETSKVTLKEQQHQLEKELTD 1925
Cdd:COG1196    455 EEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLL---EAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIG 531
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1926 QKSKLDQVLSKVLAA--EERVRTLQEEERWCESLEKT-----------LSQTKRQLSEREQQLVEKSGELLALQKEADSM 1992
Cdd:COG1196    532 VEAAYEAALEAALAAalQNIVVEDDEVAAAAIEYLKAakagratflplDKIRARAALAAALARGAIGAAVDLVASDLREA 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1993 RADFSLLRNQFL--------------------TERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSLLSKEQKQ 2052
Cdd:COG1196    612 DARYYVLGDTLLgrtlvaarleaalrravtlaGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEE 691
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2053 ENSCIQKEMATIELVAQDNHERARRLMKELNQMQYE-YTELKKQMANQKDLERRQMEISDAMRTLKSEVK-DEIRTSLKN 2130
Cdd:COG1196    692 ELELEEALLAEEEEERELAEAEEERLEEELEEEALEeQLEAEREELLEELLEEEELLEEEALEELPEPPDlEELERELER 771
                          570       580       590
                   ....*....|....*....|....*....|....*...
gi 2217375254 2131 LNQFLPELPA-------DLEAILERNENLEGELESLKE 2161
Cdd:COG1196    772 LEREIEALGPvnllaieEYEELEERYDFLSEQREDLEE 809
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1291-1975 8.30e-13

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 74.33  E-value: 8.30e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1291 RQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVESLMTELEIEKSLKhhEDIVDEIECIEKTLL 1370
Cdd:TIGR02169  290 LRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRR--DKLTEEYAELKEELE 367
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1371 KRRSELREADRLLAEaeselscTKEKTKNAVEKFTDAKRSLlqtesdaEELERRAQETAVNLVKADQQLRSLQADAKDLE 1450
Cdd:TIGR02169  368 DLRAELEEVDKEFAE-------TRDELKDYREKLEKLKREI-------NELKRELDRLQEELQRLSEELADLNAAIAGIE 433
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1451 QHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQV 1530
Cdd:TIGR02169  434 AKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVE 513
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1531 T---SQQQEMAVLDRQLGHKKEELHL---------LQGSMVQAKADLQEALRLGETEVTEKC-----NHIREVKSLLEEL 1593
Cdd:TIGR02169  514 EvlkASIQGVHGTVAQLGSVGERYATaievaagnrLNNVVVEDDAVAKEAIELLKRRKAGRAtflplNKMRDERRDLSIL 593
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1594 SfQKGELN-----VQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKtelknildMLQLENHELQ--------GLKLQH 1660
Cdd:TIGR02169  594 S-EDGVIGfavdlVEFDPKYEPAFKYVFGDTLVVEDIEAARRLMGKYR--------MVTLEGELFEksgamtggSRAPRG 664
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1661 DQRVSELEKTQVAVLEEKLELENLQQISQQQkgEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIW 1740
Cdd:TIGR02169  665 GILFSRSEPAELQRLRERLEGLKRELSSLQS--ELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEEL 742
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1741 EKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLslhndisamQQQLQEKREAVNSLQEELANVQDH 1820
Cdd:TIGR02169  743 EEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLS---------HSRIPEIQAELSKLEEEVSRIEAR 813
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1821 LNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERE- 1899
Cdd:TIGR02169  814 LREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDe 893
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217375254 1900 --SEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKtLSQTKRQLSEREQQL 1975
Cdd:TIGR02169  894 leAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLED-VQAELQRVEEEIRAL 970
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
451-1001 1.24e-12

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 73.80  E-value: 1.24e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  451 TRLSELHDEIEKAEQQ------ILRATEEFKQLEEAIQLKKISEAGKDLLYKQLSGRL--QLVNKLRQEALDLELQMEKQ 522
Cdd:COG4913    235 DDLERAHEALEDAREQiellepIRELAERYAAARERLAELEYLRAALRLWFAQRRLELleAELEELRAELARLEAELERL 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  523 KQEIAGKQKEIKDLQIAIDSLDSKDpkHSHMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKDLEEQLteg 602
Cdd:COG4913    315 EARLDALREELDELEAQIRGNGGDR--LEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEA--- 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  603 qiaanEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLT----EVEQERDQL-EIVAMDAENMR--- 674
Cdd:COG4913    390 -----AALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSnipaRLLALRDALaEALGLDEAELPfvg 464
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  675 -----KLEQSALQAELEK---------------ERQALK--NAL---GKAQFSEEKEQENSELHAKLkhlqDDNNLL-KQ 728
Cdd:COG4913    465 elievRPEEERWRGAIERvlggfaltllvppehYAAALRwvNRLhlrGRLVYERVRTGLPDPERPRL----DPDSLAgKL 540
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  729 QLKD--FQNHLNHVvdgLVRPEEVA--ARVDELRR-----------KLKLGTGEMNIHSPSD---VLGKS----LADLQK 786
Cdd:COG4913    541 DFKPhpFRAWLEAE---LGRRFDYVcvDSPEELRRhpraitragqvKGNGTRHEKDDRRRIRsryVLGFDnrakLAALEA 617
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  787 QfseiLARSKWERDEAQVRERKLQEEMALQQEKlatgqeefRQACERALEARMNF-DKRQHEARIQQMENEihyLQENLK 865
Cdd:COG4913    618 E----LAELEEELAEAEERLEALEAELDALQER--------REALQRLAEYSWDEiDVASAEREIAELEAE---LERLDA 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  866 SMEEIQGLTDlQLQEADEEKERILAQLRELEKK-----KKLEDAKSQEQvfgldkelkklkkavATSDKLATAELTIAKD 940
Cdd:COG4913    683 SSDDLAALEE-QLEELEAELEELEEELDELKGEigrleKELEQAEEELD---------------ELQDRLEAAEDLARLE 746
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254  941 QLKSLhgtvmkinQERAEELQEAERFSRKAAQAARDLTRAEAEIELLQNLLRQKGEQFRLE 1001
Cdd:COG4913    747 LRALL--------EERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRAFNRE 799
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1280-2021 1.40e-12

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 73.61  E-value: 1.40e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1280 KREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVESLMTELEIEKSLKHH--ED 1357
Cdd:pfam15921   88 KDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCLKEDmlED 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1358 IVDEIECIEKTLLKRRSELREADRLLAEAEsELSCTKEKTKNAVEkfTDAKRSLLQTESDA-EELERRAQETAVNLVKAD 1436
Cdd:pfam15921  168 SNTQIEQLRKMMLSHEGVLQEIRSILVDFE-EASGKKIYEHDSMS--TMHFRSLGSAISKIlRELDTEISYLKGRIFPVE 244
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1437 QQLRSLQADAKD-----LEQHKIKQEEILKEinkivaaKDSDFQCLSKKKEKLTEELQKLQKDIEMAERnedhhlQVLKE 1511
Cdd:pfam15921  245 DQLEALKSESQNkiellLQQHQDRIEQLISE-------HEVEITGLTEKASSARSQANSIQSQLEIIQE------QARNQ 311
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1512 SEVLLQakraELEKLKSQVTSQQQEMAVLDRQLGHKKEElhlLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLE 1591
Cdd:pfam15921  312 NSMYMR----QLSDLESTVSQLRSELREAKRMYEDKIEE---LEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLLA 384
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1592 ELSFQKGELNVQiserKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQ 1671
Cdd:pfam15921  385 DLHKREKELSLE----KEQNKRLWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQGKNESL 460
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1672 VAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRvl 1751
Cdd:pfam15921  461 EKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQHLKN-- 538
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1752 aaAEENSKMEQSNLEKLELN-------VRKLQQELDQLNR--------------DKLSLHNDISAMQQQLQE----KREA 1806
Cdd:pfam15921  539 --EGDHLRNVQTECEALKLQmaekdkvIEILRQQIENMTQlvgqhgrtagamqvEKAQLEKEINDRRLELQEfkilKDKK 616
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1807 VNSLQEELANVQDhLNLAKQDLLHTTkhqdvllSEQTRLQKDIS----EWANRFEDCQKEEETKQQQLQVLQNEIEEnkl 1882
Cdd:pfam15921  617 DAKIRELEARVSD-LELEKVKLVNAG-------SERLRAVKDIKqerdQLLNEVKTSRNELNSLSEDYEVLKRNFRN--- 685
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1883 KLVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQ----------LEKELTDQKSKLDQVLSKVLAAEERVRTLQEEER 1952
Cdd:pfam15921  686 KSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSdghamkvamgMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKH 765
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217375254 1953 WCESLEKTLSQtkrqlsereqqlveksgELLALQKEADSMRADFSLLRNQfltERKKAEKqVASLKEAL 2021
Cdd:pfam15921  766 FLKEEKNKLSQ-----------------ELSTVATEKNKMAGELEVLRSQ---ERRLKEK-VANMEVAL 813
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
430-894 1.42e-12

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 73.43  E-value: 1.42e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  430 RGHTPLDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLYK----QLSGRLQLV 505
Cdd:COG1196    302 QDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLeaeaELAEAEEEL 381
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  506 NKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSK----DPKHSHMKAQKSGKEQQLDIMNKQYQQLESRLDEI 581
Cdd:COG1196    382 EELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEEleelEEALAELEEEEEEEEEALEEAAEEEAELEEEEEAL 461
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  582 LSRIAKETEEIKDLEEQLTE--GQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNecRKLRDEKETLLQRLTEVEQE 659
Cdd:COG1196    462 LELLAELLEEAALLEAALAEllEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGL--RGLAGAVAVLIGVEAAYEAA 539
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  660 RDQLEIVAMDAENMRKLEQSALQAELEKERQA-----LKNALGKAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLKDFQ 734
Cdd:COG1196    540 LEAALAAALQNIVVEDDEVAAAAIEYLKAAKAgratfLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLG 619
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  735 NHLNHVVDGLVRPEEVAARVDELRRKLKLGTGEMNIHSPsdvlGKSLADLQKQFSEILARSKwERDEAQVRERKLQEEMA 814
Cdd:COG1196    620 DTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSA----GGSLTGGSRRELLAALLEA-EAELEELAERLAEEELE 694
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  815 LQQEKLATGQEEFRQACERALEARMNFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRE 894
Cdd:COG1196    695 LEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLER 774
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1258-1785 1.87e-12

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 73.05  E-value: 1.87e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1258 EHHNLENEVSRLEDIMQHLKSKKREERwmrASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFidg 1337
Cdd:COG1196    233 KLRELEAELEELEAELEELEAELEELE---AELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIAR--- 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1338 nveslmTELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESD 1417
Cdd:COG1196    307 ------LEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEE 380
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1418 AEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEM 1497
Cdd:COG1196    381 LEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEA 460
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1498 AERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMV----------QAKADLQE 1567
Cdd:COG1196    461 LLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLagavavligvEAAYEAAL 540
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1568 ALRLGETEVTEKCNHIREVKSLLEELSFQKGE------LNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKN 1641
Cdd:COG1196    541 EAALAAALQNIVVEDDEVAAAAIEYLKAAKAGratflpLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGD 620
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1642 ILDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQS 1721
Cdd:COG1196    621 TLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALL 700
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 1722 CVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLEL-----------NVRKLQQELDQLNRD 1785
Cdd:COG1196    701 AEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEeealeelpeppDLEELERELERLERE 775
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
83-250 2.53e-12

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 71.50  E-value: 2.53e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   83 ITEALIKKLTKQDNLALIKSLNLSLSKDGGKKFKYIENLEKCVKLEVLNLSYNLIGKIEKLDKLLKLRELNLSYNkiski 162
Cdd:COG4886     32 LLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGN----- 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  163 EGIENMCNLQKLNLAGNEIEHIPVWLGkKLKSLRVLNLKGNKISSLQdiSKLKPLQDLISLILVENPVVTLPHylqfTIF 242
Cdd:COG4886    107 EELSNLTNLESLDLSGNQLTDLPEELA-NLTNLKELDLSNNQLTDLP--EPLGNLTNLKSLDLSNNQLTDLPE----ELG 179

                   ....*...
gi 2217375254  243 HLRSLESL 250
Cdd:COG4886    180 NLTNLKEL 187
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involved in recombination, ...
1310-1952 3.20e-11

rad50; All proteins in this family for which functions are known are involved in recombination, recombinational repair, and/or non-homologous end joining. They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 69.31  E-value: 3.20e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1310 EKKSLECEVEELHRTVQKRQQQKDFIDGNVESLMTELEIEK--------SLKHHEDIVDEIECIEKTLLKRRSELREADR 1381
Cdd:TIGR00606  406 EAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKeilekkqeELKFVIKELQQLEGSSDRILELDQELRKAER 485
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1382 LLAEAE--SELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEI 1459
Cdd:TIGR00606  486 ELSKAEknSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSL 565
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1460 LKEINKIVAAKDSdFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLL-------------QAKRAELEKL 1526
Cdd:TIGR00606  566 LGYFPNKKQLEDW-LHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLssyedklfdvcgsQDEESDLERL 644
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1527 KSQVTSQQQEMAVLD----------RQLGHKKEELHLLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLEELSFQ 1596
Cdd:TIGR00606  645 KEEIEKSSKQRAMLAgatavysqfiTQLTDENQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKR 724
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1597 KGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGLK-LQHDQRVSElektQVAVL 1675
Cdd:TIGR00606  725 RDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKvCLTDVTIME----RFQME 800
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1676 EEKLELENLQQISQQQKGEIEWQKQLLERDKREIERmtaESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAE 1755
Cdd:TIGR00606  801 LKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQH---ELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIG 877
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1756 ENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKH- 1834
Cdd:TIGR00606  878 TNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGYm 957
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1835 -------QDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENK---------LKLVQQEMMFQRLQKER 1898
Cdd:TIGR00606  958 kdienkiQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQKiqerwlqdnLTLRKRENELKEVEEEL 1037
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 1899 ESEESKLETSKVTLKEQQHQ-LEKELTDQKSKLDQVLSKVLAAEERVRTLQEEER 1952
Cdd:TIGR00606 1038 KQHLKEMGQMQVLQMKQEHQkLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELR 1092
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
245-1038 3.91e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 68.93  E-value: 3.91e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  245 RSLESLEGQPVTTQDRQEAferfsLEEVERLERDLEKKmiETEELKSKQTRFLEEIK-NQDKLNKSLKEEAMLQKQScEE 323
Cdd:TIGR02168  200 RQLKSLERQAEKAERYKEL-----KAELRELELALLVL--RLEELREELEELQEELKeAEEELEELTAELQELEEKL-EE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  324 LKSDLNTKNELLKQKTIELTRACQKQYELEQELAFYKidAKFEPLNYYPSEYAEidkapdespYIGKSRYKRNMFATESy 403
Cdd:TIGR02168  272 LRLEVSELEEEIEELQKELYALANEISRLEQQKQILR--ERLANLERQLEELEA---------QLEELESKLDELAEEL- 339
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  404 iidSAQAVQIKKMEPD-EQLRNDHMNLRGHTP-LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAI 481
Cdd:TIGR02168  340 ---AELEEKLEELKEElESLEAELEELEAELEeLESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRR 416
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  482 Q---------LKKISEAGKDLLYKQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSH 552
Cdd:TIGR02168  417 ErlqqeieelLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLER 496
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  553 MKAQKSGKEQQLDIMNKQYQQLESRLDEILSRI--------AKETEEIKDLEEQLTEGQIAANEA---LKKDLEGVISGL 621
Cdd:TIGR02168  497 LQENLEGFSEGVKALLKNQSGLSGILGVLSELIsvdegyeaAIEAALGGRLQAVVVENLNAAKKAiafLKQNELGRVTFL 576
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  622 QeyLGTIKGQATQAQNecRKLRDEKETLLQRLTEVEQERDQLEIV-------------AMDAENMRKLEQSA-----LQA 683
Cdd:TIGR02168  577 P--LDSIKGTEIQGND--REILKNIEGFLGVAKDLVKFDPKLRKAlsyllggvlvvddLDNALELAKKLRPGyrivtLDG 652
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  684 ELEKERQALKNALGKAQFSE-EKEQENSELHAKLKHLQDDNNLLKQQLKD----FQNHLNHVVDGLVRPEEVAARVDELR 758
Cdd:TIGR02168  653 DLVRPGGVITGGSAKTNSSIlERRREIEELEEKIEELEEKIAELEKALAElrkeLEELEEELEQLRKELEELSRQISALR 732
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  759 RKL-KLGTGEMNIHSPSDVLGKSLADLQKQFSEILARSKWERDEAQVRERKLQEemalQQEKLATGQEEFRQACERALEA 837
Cdd:TIGR02168  733 KDLaRLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE----LEAQIEQLKEELKALREALDEL 808
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  838 RM-----NFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRELEKK-KKLEDAKSQEQvf 911
Cdd:TIGR02168  809 RAeltllNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESElEALLNERASLE-- 886
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  912 gldkelkklkkavatsdklatAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAARDLTRAEAEIellQNLL 991
Cdd:TIGR02168  887 ---------------------EALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRI---DNLQ 942
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....*..
gi 2217375254  992 RQKGEQFRLEMEktGVGTGANSQVLEIEKLNETMERQRTEIARLQNV 1038
Cdd:TIGR02168  943 ERLSEEYSLTLE--EAEALENKIEDDEEEARRRLKRLENKIKELGPV 987
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
96-280 4.75e-11

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 67.27  E-value: 4.75e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   96 NLALIKSLNLSlskdgGKKFKYI-ENLEKCVKLEVLNLSYNLIGKIEKLDKLLKLRELNLSYNKISKIEGIENMCNLQKL 174
Cdd:COG4886    203 NLTNLEELDLS-----GNQLTDLpEPLANLTNLETLDLSNNQLTDLPELGNLTNLEELDLSNNQLTDLPPLANLTNLKTL 277
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  175 NLAGNEIEHIPVWLGKKLKSLRVLNLKGNKISSLQDISKLKPLQDLISLILVENPVVTLPHYLQFTIFHLRSLESLEGQP 254
Cdd:COG4886    278 DLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLN 357
                          170       180
                   ....*....|....*....|....*.
gi 2217375254  255 VTTQDRQEAFERFSLEEVERLERDLE 280
Cdd:COG4886    358 LLSLLLTLLLTLGLLGLLEATLLTLA 383
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1265-2094 8.78e-11

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 67.69  E-value: 8.78e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1265 EVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEV------------EELHRTVQKRQQQK 1332
Cdd:pfam02463  171 KKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEyllyldylklneERIDLLQELLRDEQ 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1333 DFIDGNVESLMTELEI-EKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSL 1411
Cdd:pfam02463  251 EEIESSKQEIEKEEEKlAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKEL 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1412 LQTESDAEELERRAQE----TAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEE 1487
Cdd:pfam02463  331 KKEKEEIEELEKELKEleikREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLL 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1488 LQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQE 1567
Cdd:pfam02463  411 LELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLL 490
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1568 ALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRqmskHKTELKNILDMLQ 1647
Cdd:pfam02463  491 SRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEV----SATADEVEERQKL 566
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1648 LENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLS 1727
Cdd:pfam02463  567 VRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKES 646
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1728 KEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAV 1807
Cdd:pfam02463  647 GLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADR 726
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1808 NSLQEELANVQdhLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQ 1887
Cdd:pfam02463  727 VQEAQDKINEE--LKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELR 804
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1888 EMMFQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQ 1967
Cdd:pfam02463  805 ALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKL 884
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1968 LSEREQQLVEKSGELLALQKEadsmradfsLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSLLS 2047
Cdd:pfam02463  885 KDELESKEEKEKEEKKELEEE---------SQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNK 955
                          810       820       830       840       850
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2217375254 2048 KEQKQENSCIQKEMATIELV-------AQDNHERARRLMKELNQMQYEYTELKK 2094
Cdd:pfam02463  956 EEEEERNKRLLLAKEELGKVnlmaieeFEEKEERYNKDELEKERLEEEKKKLIR 1009
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involved in recombination, ...
1262-2120 1.35e-10

rad50; All proteins in this family for which functions are known are involved in recombination, recombinational repair, and/or non-homologous end joining. They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 66.99  E-value: 1.35e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1262 LENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHH---NIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGN 1338
Cdd:TIGR00606  212 LKQYKEKACEIRDQITSKEAQLESSREIVKSYENELDPLKNrlkEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELK 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1339 VESLM--TELEIEKSLKHHEDIVDEIE----CIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFT--DAKRS 1410
Cdd:TIGR00606  292 MEKVFqgTDEQLNDLYHNHQRTVREKErelvDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRarDSLIQ 371
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1411 LLQTESDAEELER-----RAQETAVNLVKADQQlRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLT 1485
Cdd:TIGR00606  372 SLATRLELDGFERgpfseRQIKNFHTLVIERQE-DEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILE 450
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1486 EELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRaELEKLKSQVTSQQQEMAVLdrqlghkkeelhllqgSMVQAKADL 1565
Cdd:TIGR00606  451 KKQEELKFVIKELQQLEGSSDRILELDQELRKAER-ELSKAEKNSLTETLKKEVK----------------SLQNEKADL 513
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1566 QEALRlGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDM 1645
Cdd:TIGR00606  514 DRKLR-KLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDWLHSKSKEINQTRDR 592
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1646 LQLENHELQGLKLQHDQRVSELEKTQvAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVEC 1725
Cdd:TIGR00606  593 LAKLNKELASLEQNKNHINNELESKE-EQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQ 671
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1726 LSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKRE 1805
Cdd:TIGR00606  672 LTDENQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRN 751
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1806 AVNSLQEELANVQDHLnlakqdllhttkhqdvllSEQTRLQKDISEWANRFEDCQKEEETKQQqlqvLQNEIEENKLKLV 1885
Cdd:TIGR00606  752 KLQKVNRDIQRLKNDI------------------EEQETLLGTIMPEEESAKVCLTDVTIMER----FQMELKDVERKIA 809
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1886 QQEmmfqrlqkereseeSKLETSKVTLKEQQhqLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTK 1965
Cdd:TIGR00606  810 QQA--------------AKLQGSDLDRTVQQ--VNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEK 873
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1966 RQLSEREQQLVEKSGELLALQKEADSMradfsllrnqfLTERKKAEKQVASLKEALKiQRSQLEKNLLTINEDSERDSSL 2045
Cdd:TIGR00606  874 LQIGTNLQRRQQFEEQLVELSTEVQSL-----------IREIKDAKEQDSPLETFLE-KDQQEKEELISSKETSNKKAQD 941
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 2046 LSKEQKQENSCIQKEMATIELVAQDNHERarrlmkelnQMQYEYTELKKQMANQKDLERRQMEISDAMRTLKSEV 2120
Cdd:TIGR00606  942 KVNDIKEKVKNIHGYMKDIENKIQDGKDD---------YLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDI 1007
PLN02939 PLN02939
transferase, transferring glycosyl groups
1714-2042 5.38e-10

transferase, transferring glycosyl groups


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 64.92  E-value: 5.38e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1714 AESRALQScVECLSKEKEDLQEKCDIWEKKLAQT-KRVLAAAEENSKME--QSNLEKLELN-----------VRKLQQEL 1779
Cdd:PLN02939   150 ARLQALED-LEKILTEKEALQGKINILEMRLSETdARIKLAAQEKIHVEilEEQLEKLRNEllirgateglcVHSLSKEL 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1780 DQLNRDKLSLHNDISAMQQQLQEKREAVNSLqeelanvqdhLNLAKQDLLHTTKHQDvLLSEQTRLQKDISEWANRFEDC 1859
Cdd:PLN02939   229 DVLKEENMLLKDDIQFLKAELIEVAETEERV----------FKLEKERSLLDASLRE-LESKFIVAQEDVSKLSPLQYDC 297
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1860 QKEE-ETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEE----SKLETSKVTLKEQQHQLEKELTDQKSKldQVL 1934
Cdd:PLN02939   298 WWEKvENLQDLLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKeanvSKFSSYKVELLQQKLKLLEERLQASDH--EIH 375
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1935 SKVLAAEERVRTLQeeerwcESLEKTLSQTKRQlsereqqlveksgellALQKEADSMRADF----SLLRNQFLTERKKA 2010
Cdd:PLN02939   376 SYIQLYQESIKEFQ------DTLSKLKEESKKR----------------SLEHPADDMPSEFwsriLLLIDGWLLEKKIS 433
                          330       340       350
                   ....*....|....*....|....*....|..
gi 2217375254 2011 EKQVASLKEALKIQRSQLEKNLLTINEDSERD 2042
Cdd:PLN02939   434 NNDAKLLREMVWKRDGRIREAYLSCKGKNERE 465
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1728-2163 6.27e-10

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 64.68  E-value: 6.27e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1728 KEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNV---RKLQQELDQLNRDKLSLHNDISAMQQQLQEKR 1804
Cdd:PRK02224   199 KEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLeehEERREELETLEAEIEDLRETIAETEREREELA 278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1805 EAVNSLQEELANVQDHLNlakqDLLHTTKHQDvllSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKL 1884
Cdd:PRK02224   279 EEVRDLRERLEELEEERD----DLLAEAGLDD---ADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDA 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1885 VQQEMMFQRLQKERESEESKLETSKVTL---KEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTL 1961
Cdd:PRK02224   352 DDLEERAEELREEAAELESELEEAREAVedrREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAEL 431
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1962 SQTKRQLSEReqqlVEKSGELLAL-------QKEADSMRADFsllrnqfLTERKKAEKQVASLKEALKIQRSQLEKNLLT 2034
Cdd:PRK02224   432 EATLRTARER----VEEAEALLEAgkcpecgQPVEGSPHVET-------IEEDRERVEELEAELEDLEEEVEEVEERLER 500
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2035 INEDSERDSSLLSKEQKQENscIQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMANQKDLERRQMEISDAMR 2114
Cdd:PRK02224   501 AEDLVEAEDRIERLEERRED--LEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELN 578
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*....
gi 2217375254 2115 TLKSEVKDEIrtslknlnqflpELPADLEAILERNENLEGELESLKENL 2163
Cdd:PRK02224   579 SKLAELKERI------------ESLERIRTLLAAIADAEDEIERLREKR 615
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
641-1021 6.87e-10

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 64.57  E-value: 6.87e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  641 KLRDEKETLLQRLTEVEQERDQLEIVAmdAENMRKLEQSALQAELEKERQALKnalgkaqfSEEKEQENSELHAKLKHLQ 720
Cdd:COG1196    169 KYKERKEEAERKLEATEENLERLEDIL--GELERQLEPLERQAEKAERYRELK--------EELKELEAELLLLKLRELE 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  721 DDNNLLKQQLKDFQNHLNHVVDGLvrpEEVAARVDELRRKLklgtgemnihspsDVLGKSLADLQKQFSEILAR-SKWER 799
Cdd:COG1196    239 AELEELEAELEELEAELEELEAEL---AELEAELEELRLEL-------------EELELELEEAQAEEYELLAElARLEQ 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  800 DEAQVRERKLQEEMALQQEKLATGQEEFRQACERALEARMNFDKRQHEARIQQMENEIHYLQENLKsmEEIQGLTDLQLQ 879
Cdd:COG1196    303 DIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALL--EAEAELAEAEEE 380
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  880 EADEEKERILAQLRELEKKKKLEDAKSQEQvfgldkELKKLKKAVATSDKLATAELTIAKDQLKSLHGTVMKINQERAEE 959
Cdd:COG1196    381 LEELAEELLEALRAAAELAAQLEELEEAEE------ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAEL 454
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217375254  960 LQEAERFSRKAAQAARDLTRAEAEIELLQNLLRQKGEQFRLEMEKTGVGTGANSQVLEIEKL 1021
Cdd:COG1196    455 EEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLL 516
PTZ00121 PTZ00121
MAEBL; Provisional
406-1017 7.36e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 64.78  E-value: 7.36e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  406 DSAQAVQIKKMEPDEQLRNDHMNLRGHTPLDTQLEDKEKKISAAQTRLSElhdEIEKAEQqiLRATEEFKQLEEAiqLKK 485
Cdd:PTZ00121  1238 DAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAE---EKKKADE--AKKAEEKKKADEA--KKK 1310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  486 ISEAGK-DLLYKQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIK--DLQIAIDSLDSKDPKHSHMKAQKSGKE- 561
Cdd:PTZ00121  1311 AEEAKKaDEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEaaEEKAEAAEKKKEEAKKKADAAKKKAEEk 1390
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  562 QQLDIMNKQYQQLESRLDEILSRIA--KETEEIKDLEEQLTEgqiaANEALKKDLEGVISglqEYLGTIKGQATQAQNEC 639
Cdd:PTZ00121  1391 KKADEAKKKAEEDKKKADELKKAAAakKKADEAKKKAEEKKK----ADEAKKKAEEAKKA---DEAKKKAEEAKKAEEAK 1463
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  640 RKlRDEKETLLQRLTEVEQERDQLEIVAMDAENMRKLEQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHL 719
Cdd:PTZ00121  1464 KK-AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKA 1542
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  720 QDDNNllkqqlkdfqnhlnhvVDGLVRPEEVaaRVDELRRKLKLGTGEMNIHSPSDVLGKSLADLQKQFSEILARSKWER 799
Cdd:PTZ00121  1543 EEKKK----------------ADELKKAEEL--KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEE 1604
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  800 DEAQVRERKLQEEMALQQEKLATGQEEfRQACERALEARMNFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQ 879
Cdd:PTZ00121  1605 KKMKAEEAKKAEEAKIKAEELKKAEEE-KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  880 EADEEKerilaqlRELEKKKKLEDAKSQEQVFGLDKELKKLKKAVATSDKLATAELTIAKDQLKSlhgtvmkiNQERAEE 959
Cdd:PTZ00121  1684 EEDEKK-------AAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE--------DKKKAEE 1748
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217375254  960 LQEAERFSRKAAQAARDLTRAEAEIE-----LLQNLLRQKGEQFRLEMEKTGVGTGANSQVLE 1017
Cdd:PTZ00121  1749 AKKDEEEKKKIAHLKKEEEKKAEEIRkekeaVIEEELDEEDEKRRMEVDKKIKDIFDNFANII 1811
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1741-1974 7.79e-10

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 63.24  E-value: 7.79e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1741 EKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDH 1820
Cdd:COG4942     26 EAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEE 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1821 LN--LAKQDLLHTTKHQDVLLSeqtrlQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEEnklklvqqemmFQRLQKER 1898
Cdd:COG4942    106 LAelLRALYRLGRQPPLALLLS-----PEDFLDAVRRLQYLKYLAPARREQAEELRADLAE-----------LAALRAEL 169
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217375254 1899 ESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEErwcESLEKTLSQTKRQLSEREQQ 1974
Cdd:COG4942    170 EAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEA---EELEALIARLEAEAAAAAER 242
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1257-1851 1.06e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 64.31  E-value: 1.06e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1257 PEHHNLENEVSRLEDIMQHLKSKKR--EERWMRASKRQSEKEMEELHHN--IDDLLQEKKSLECEVEELH---RTVQKRQ 1329
Cdd:TIGR02168  351 EELESLEAELEELEAELEELESRLEelEEQLETLRSKVAQLELQIASLNneIERLEARLERLEDRRERLQqeiEELLKKL 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1330 QQKDFIDGNVESLMTELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCTkEKTKNAVEKFTDAKR 1409
Cdd:TIGR02168  431 EEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSL-ERLQENLEGFSEGVK 509
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1410 SLLQTESD--------AEELERRAQ-ETAVNLVKAD--QQLRSLQADAKDLEQHKIKQEEI---------------LKEI 1463
Cdd:TIGR02168  510 ALLKNQSGlsgilgvlSELISVDEGyEAAIEAALGGrlQAVVVENLNAAKKAIAFLKQNELgrvtflpldsikgteIQGN 589
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1464 NKIVAAKDSDFQCLSKKKEKLTEELQK--------------LQKDIEMAERNEDHHLQVLKESEVL-------------- 1515
Cdd:TIGR02168  590 DREILKNIEGFLGVAKDLVKFDPKLRKalsyllggvlvvddLDNALELAKKLRPGYRIVTLDGDLVrpggvitggsaktn 669
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1516 --LQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLEEL 1593
Cdd:TIGR02168  670 ssILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERI 749
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1594 SFQKGELNVQISERKTQLTlikqeiekeeenlqvvlrQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQVA 1673
Cdd:TIGR02168  750 AQLSKELTELEAEIEELEE------------------RLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAE 811
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1674 VLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAEsralqscVECLSKEKEDLQEKCDIWEKKLAQTKRVLAA 1753
Cdd:TIGR02168  812 LTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSED-------IESLAAEIEELEELIEELESELEALLNERAS 884
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1754 AEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELAN-VQDHLNLAKQDLLHTT 1832
Cdd:TIGR02168  885 LEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEeYSLTLEEAEALENKIE 964
                          650
                   ....*....|....*....
gi 2217375254 1833 KHQDVLLSEQTRLQKDISE 1851
Cdd:TIGR02168  965 DDEEEARRRLKRLENKIKE 983
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1741-2121 1.70e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 63.55  E-value: 1.70e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1741 EKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNR------------------DKLSLHNDISAMQQQLQE 1802
Cdd:TIGR02169  169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERyqallkekreyegyellkEKEALERQKEAIERQLAS 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1803 KREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLS-EQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIeenk 1881
Cdd:TIGR02169  249 LEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEeEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERL---- 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1882 lklvqqemmfQRLQKERESEESKLEtskvtlkeqqhQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEerwCESLEKTL 1961
Cdd:TIGR02169  325 ----------AKLEAEIDKLLAEIE-----------ELEREIEEERKRRDKLTEEYAELKEELEDLRAE---LEEVDKEF 380
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1962 SQTKRQLSEREQQLVEKSGELLALQKEADSMradfsllrnqfLTERKKAEKQVASLKEALKIQRSQLEKnlltiNEDSER 2041
Cdd:TIGR02169  381 AETRDELKDYREKLEKLKREINELKRELDRL-----------QEELQRLSEELADLNAAIAGIEAKINE-----LEEEKE 444
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2042 DSSLLSKEQKQENSCIQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQmanQKDLERRQMEISDAMRTLKSEVK 2121
Cdd:TIGR02169  445 DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ---ARASEERVRGGRAVEEVLKASIQ 521
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1356-1936 2.76e-09

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 62.73  E-value: 2.76e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1356 EDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKA 1435
Cdd:TIGR04523   36 KQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKND 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1436 DQQLRSLQADAKDLEQHKIKQEEILKEIN-------KIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDH---- 1504
Cdd:TIGR04523  116 KEQKNKLEVELNKLEKQKKENKKNIDKFLteikkkeKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKiknk 195
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1505 ---------HLQVLKESEVLLQAKRAELEK----LKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEA--- 1568
Cdd:TIGR04523  196 llklelllsNLKKKIQKNKSLESQISELKKqnnqLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKqke 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1569 LRLGETEVTEKCNHIREVKSLLEELSFQK-----GELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKN-- 1641
Cdd:TIGR04523  276 LEQNNKKIKELEKQLNQLKSEISDLNNQKeqdwnKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNse 355
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1642 -----ILDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAES 1716
Cdd:TIGR04523  356 senseKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETI 435
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1717 RALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAM 1796
Cdd:TIGR04523  436 IKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDL 515
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1797 QQQLQEKREAVNSLQEELANVQDHLNLAKQDL--LHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQ 1874
Cdd:TIGR04523  516 TKKISSLKEKIEKLESEKKEKESKISDLEDELnkDDFELKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKE 595
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 1875 NEIEENKLKLVQQEMMFQRLQKERES---EESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSK 1936
Cdd:TIGR04523  596 KEKKDLIKEIEEKEKKISSLEKELEKakkENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNK 660
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
460-1065 5.71e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 61.62  E-value: 5.71e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  460 IEKAEQQILRATEEFKQLEEaiQLKKIseagkDLLYKQLSGRLQLVNKLRQEALD-LELQMEKQKQEIAGKQKEIKDLQI 538
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEE--NIERL-----DLIIDEKRQQLERLRREREKAERyQALLKEKREYEGYELLKEKEALER 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  539 AIDSLDskdpkhshmkAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKDL--EEQLT-EGQIAANEALKKDLE 615
Cdd:TIGR02169  238 QKEAIE----------RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLgeEEQLRvKEKIGELEAEIASLE 307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  616 GVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQL-EIVAMDAENMRKLEQSAlqAELEKERQALKN 694
Cdd:TIGR02169  308 RSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLtEEYAELKEELEDLRAEL--EEVDKEFAETRD 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  695 ALGKAQfsEEKEQ---ENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLV----RPEEVAARVDELRRKLKLGTGE 767
Cdd:TIGR02169  386 ELKDYR--EKLEKlkrEINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINeleeEKEDKALEIKKQEWKLEQLAAD 463
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  768 M--------NIHSPSDVLGKSLADLQKQFSEILARSKWERDEaqVRERKLQEEMALQQEKLATGQ--------EEFRQAC 831
Cdd:TIGR02169  464 LskyeqelyDLKEEYDRVEKELSKLQRELAEAEAQARASEER--VRGGRAVEEVLKASIQGVHGTvaqlgsvgERYATAI 541
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  832 ERALEARMNFDKRQHEA-----------------------RIQQMENEIHYLQE--------NLKSMEE------IQGLT 874
Cdd:TIGR02169  542 EVAAGNRLNNVVVEDDAvakeaiellkrrkagratflplnKMRDERRDLSILSEdgvigfavDLVEFDPkyepafKYVFG 621
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  875 DLQLQEADEEKERILAQLRELEKKKKLEDaKSQEQVFGLDKELKKLKKAVATSDKLATAeltiaKDQLKSLHGTVMKINQ 954
Cdd:TIGR02169  622 DTLVVEDIEAARRLMGKYRMVTLEGELFE-KSGAMTGGSRAPRGGILFSRSEPAELQRL-----RERLEGLKRELSSLQS 695
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  955 ERAEELQEAERFSRKAAQAARDLTRAEAEIELL---QNLLRQKGEQFRLEMEKTGVGTGANSQvlEIEKLNETMERQRTE 1031
Cdd:TIGR02169  696 ELRRIENRLDELSQELSDASRKIGEIEKEIEQLeqeEEKLKERLEELEEDLSSLEQEIENVKS--ELKELEARIEELEED 773
                          650       660       670
                   ....*....|....*....|....*....|....*
gi 2217375254 1032 IARLQNVL-DLTGSDNKGGFENVLEEIAELRREVS 1065
Cdd:TIGR02169  774 LHKLEEALnDLEARLSHSRIPEIQAELSKLEEEVS 808
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1298-2162 8.51e-09

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 61.14  E-value: 8.51e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1298 EELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDgnveslmteleiekslkhheDIVDEIECIEKTLLKRRSELR 1377
Cdd:pfam02463  152 PERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAELII--------------------DLEELKLQELKLKEQAKKALE 211
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1378 EADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQE 1457
Cdd:pfam02463  212 YYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLL 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1458 EILKEINKIVAAK-DSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQ----VTS 1532
Cdd:pfam02463  292 AKEEEELKSELLKlERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLqeklEQL 371
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1533 QQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLEELSfQKGELNVQISERKTQLT 1612
Cdd:pfam02463  372 EEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILE-EEEESIELKQGKLTEEK 450
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1613 LIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQK 1692
Cdd:pfam02463  451 EELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGR 530
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1693 GEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNV 1772
Cdd:pfam02463  531 LGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDK 610
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1773 RKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQE----ELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKD 1848
Cdd:pfam02463  611 ATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKgvslEEGLAEKSEVKASLSELTKELLEIQELQEKAESELA 690
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1849 ISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEM----MFQRLQKERESEESKLETSKVTLKEQQHQLEKELT 1924
Cdd:pfam02463  691 KEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQdkinEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELS 770
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1925 DQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFSLLRNQFL 2004
Cdd:pfam02463  771 LKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEK 850
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2005 TERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSLLSKEQKQENSCIQKEMATIELVAQDNHERARRLMKELNQ 2084
Cdd:pfam02463  851 LAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEIL 930
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217375254 2085 MQYEYTELKKQMANQKDLERRQMEISDAMRTLKSEVKDEIRTSLKNLNQFlpelpADLEAILERNENLEGELESLKEN 2162
Cdd:pfam02463  931 LKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAI-----EEFEEKEERYNKDELEKERLEEE 1003
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
438-841 1.09e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 60.85  E-value: 1.09e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  438 QLEDKEKKISAAQTRLSELHDEIEKAEQQILRAtEEFKqleeAIQLKKISEAGKDLLYKQLSGRLQLVNKLRQEAlDLEL 517
Cdd:TIGR02169  178 ELEEVEENIERLDLIIDEKRQQLERLRREREKA-ERYQ----ALLKEKREYEGYELLKEKEALERQKEAIERQLA-SLEE 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  518 QMEKQKQEIAGKQKEIKDLQIAIDSLDSKdpkhshmkAQKSGKEQQLDImNKQYQQLESRLDEILSRIAKETEEIKDLEE 597
Cdd:TIGR02169  252 ELEKLTEEISELEKRLEEIEQLLEELNKK--------IKDLGEEEQLRV-KEKIGELEAEIASLERSIAEKERELEDAEE 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  598 QLTEGqiaanEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLeivamdaenmrKLE 677
Cdd:TIGR02169  323 RLAKL-----EAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAET-----------RDE 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  678 QSALQAELEK---ERQALKNALGKAQF-SEEKEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLvrpEEVAAR 753
Cdd:TIGR02169  387 LKDYREKLEKlkrEINELKRELDRLQEeLQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKL---EQLAAD 463
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  754 VDELRRKLKlgtgemNIHSPSDVLGKSLADLQKQFSEILARSKWERDEaqVRERKLQEEMALQQEKLATGQ--------E 825
Cdd:TIGR02169  464 LSKYEQELY------DLKEEYDRVEKELSKLQRELAEAEAQARASEER--VRGGRAVEEVLKASIQGVHGTvaqlgsvgE 535
                          410
                   ....*....|....*.
gi 2217375254  826 EFRQACERALEARMNF 841
Cdd:TIGR02169  536 RYATAIEVAAGNRLNN 551
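Each hit in this listing carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value) ahead of its alignment. The sketch below shows one way such a line could be read into typed fields. The regular expression and the helper name parse_summary are illustrative assumptions written against the line layout seen here; they are not part of any NCBI tool or API.

```python
import re

# Hypothetical pattern, written against the summary line as it appears in this listing.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),            # e.g. "Multi-domain"; None when absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


# Summary line copied from the TIGR02169 hit covering residues 438-841.
hit = parse_summary(
    "Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 60.85  E-value: 1.09e-08"
)
print(hit)
```

Converting the E-value to a float lets hits be sorted or filtered numerically rather than as strings.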
SMC_prok_A TIGR02169
521-900 1.34e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is present in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 60.47  E-value: 1.34e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  521 KQKQEIAGKQKEIKDLQIAIDSLDSKdpkHSHMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKDLEEQL- 599
Cdd:TIGR02169  671 SEPAELQRLRERLEGLKRELSSLQSE---LRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLs 747
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  600 -TEGQIAANEALKKDLEGVISGLQEYLGTIkgQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAMDAEnmRKLEQ 678
Cdd:TIGR02169  748 sLEQEIENVKSELKELEARIEELEEDLHKL--EEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIE--QKLNR 823
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  679 SALQAE-LEKERQalknalgkaqfseEKEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLnhvvdglvrpEEVAARVDEL 757
Cdd:TIGR02169  824 LTLEKEyLEKEIQ-------------ELQEQRIDLKEQIKSIEKEIENLNGKKEELEEEL----------EELEAALRDL 880
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  758 RRKLKlgtgemNIHSPSDVLGKSLADLQKQFSEIlarsKWERDEAQVRERKLQEEMALQQEKLATGQEEFRQACEralEA 837
Cdd:TIGR02169  881 ESRLG------DLKKERDELEAQLRELERKIEEL----EAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEE---IP 947
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  838 RMNFDKRQHEARIQQMENEIHYLQE-NLKSMEE----IQGLTDLQLQEA--DEEKERILAQLRELEKKKK 900
Cdd:TIGR02169  948 EEELSLEDVQAELQRVEEEIRALEPvNMLAIQEyeevLKRLDELKEKRAklEEERKAILERIEEYEKKKR 1017
sbcc TIGR00618
1277-1978 1.35e-08

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 60.37  E-value: 1.35e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1277 KSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVESLMTELEIEKSLKHHE 1356
Cdd:TIGR00618  187 AKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLR 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1357 DIVDEIECIEKTLLKRRSEL---READRLLAEAESelsctkektknavekftdakrsLLQTESDAEELERRAQETAVNLV 1433
Cdd:TIGR00618  267 ARIEELRAQEAVLEETQERInraRKAAPLAAHIKA----------------------VTQIEQQAQRIHTELQSKMRSRA 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1434 KADQQLRSLQADAKDLEQHKIKQEEILKE--INKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEmaernedhhlqVLKE 1511
Cdd:TIGR00618  325 KLLMKRAAHVKQQSSIEEQRRLLQTLHSQeiHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKT-----------TLTQ 393
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1512 SEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEALrlgeTEVTEKCNHIREVKSLLE 1591
Cdd:TIGR00618  394 KLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTA----QCEKLEKIHLQESAQSLK 469
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1592 ELSFQKGelnvqiserktqltlikqeiekeeeNLQVVLRQMSKHKTELKNILDMLQLENHELQGLKL---QHDQRVSELE 1668
Cdd:TIGR00618  470 EREQQLQ-------------------------TKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIhpnPARQDIDNPG 524
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1669 KTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEkcdiwekklaQTK 1748
Cdd:TIGR00618  525 PLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQN----------ITV 594
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1749 RVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLnrdklslhnDISAMQQQLQEKREavnslQEELANVQDHLNLAKQDL 1828
Cdd:TIGR00618  595 RLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQ---------DVRLHLQQCSQELA-----LKLTALHALQLTLTQERV 660
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1829 LHT---TKHQDVLLSEQTRLQKDISEWANRFEDCQKEE-ETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESK 1904
Cdd:TIGR00618  661 REHalsIRVLPKELLASRQLALQKMQSEKEQLTYWKEMlAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDA 740
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217375254 1905 LETSkvtLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEK 1978
Cdd:TIGR00618  741 LNQS---LKELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQE 811
YhaN COG4717
495-898 1.84e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 59.78  E-value: 1.84e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  495 YKQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMKAQKSgKEQQLDIMNKQYQQL 574
Cdd:COG4717     73 LKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEA-LEAELAELPERLEEL 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  575 ESRLDEILSRIaketEEIKDLEEQLTEGQIAANEALKKDLEGVisglQEYLGTIKGQATQAQNECRKLRDEKETLLQRLT 654
Cdd:COG4717    152 EERLEELRELE----EELEELEAELAELQEELEELLEQLSLAT----EEELQDLAEELEELQQRLAELEEELEEAQEELE 223
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  655 EVEQERDQLEIVAMDAENMRKLEQ--------------SALQAELEKERQALKNAL--------------GKAQFSEEKE 706
Cdd:COG4717    224 ELEEELEQLENELEAAALEERLKEarlllliaaallalLGLGGSLLSLILTIAGVLflvlgllallflllAREKASLGKE 303
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  707 QENSELHAKLKHLQDDNnlLKQQLKDFQNHLNHVVDGLVRPEEVAARVDELRRKLKLGTGEMNIHspsdvlgkslaDLQK 786
Cdd:COG4717    304 AEELQALPALEELEEEE--LEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLE-----------ELEQ 370
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  787 QFSEILARSKWERDEAQVRERKLQEEMALQQEKLATGQEEFRQACERALEARMNFDKRQHEARIQQMENEIHYLQENLKS 866
Cdd:COG4717    371 EIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELEE 450
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 2217375254  867 M-EEIQGL--------TDLQLQEADEEKERILAQLRELEKK 898
Cdd:COG4717    451 LrEELAELeaeleqleEDGELAELLQELEELKAELRELAEE 491
SMC_prok_A TIGR02169
1521-1881 2.61e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is present in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 59.70  E-value: 2.61e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1521 AELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEALRLGETEVTEkcnHIREVKSLLEELSFQKGEL 1600
Cdd:TIGR02169  170 RKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYE---LLKEKEALERQKEAIERQL 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1601 NvQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTEL-KNILDMLQLENHELQGlklqhdqrvsELEKTQVAVLEEKL 1679
Cdd:TIGR02169  247 A-SLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLgEEEQLRVKEKIGELEA----------EIASLERSIAEKER 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1680 ELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSK 1759
Cdd:TIGR02169  316 ELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLE 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1760 MEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLlhttkhqDVLL 1839
Cdd:TIGR02169  396 KLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADL-------SKYE 468
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 2217375254 1840 SEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENK 1881
Cdd:TIGR02169  469 QELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGR 510
SMC_prok_B TIGR02168
781-1567 3.31e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 59.30  E-value: 3.31e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  781 LADLQKQFSEILARSKWERDEAQVRERKLQEEMALQQEKLATGQEEFRQACERALEA------------RMNFDKRQHEA 848
Cdd:TIGR02168  230 LVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELqkelyalaneisRLEQQKQILRE 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  849 RIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRELEKKKKLEDAKSQEQVFGLDKELKKLKKAVATSD 928
Cdd:TIGR02168  310 RLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  929 KL------ATAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQaaRDLTRAEAEIELLQNLLRQKGEQFRLEM 1002
Cdd:TIGR02168  390 QLelqiasLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQ--AELEELEEELEELQEELERLEEALEELR 467
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1003 EktgvgtgansqvlEIEKLNETMERQRTEIARLQNVLDLTGSDnKGGFENVLEEIAELRREVSYQNDYISSMADPFKRRG 1082
Cdd:TIGR02168  468 E-------------ELEEAEQALDAAERELAQLQARLDSLERL-QENLEGFSEGVKALLKNQSGLSGILGVLSELISVDE 533
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1083 YWyfmppppsskvsshSSQATKDSGVGLKYSAStpvrkprPGQQDGKEG--SQPPPASGYWVYSPIRSGLHKLFPSRDAD 1160
Cdd:TIGR02168  534 GY--------------EAAIEAALGGRLQAVVV-------ENLNAAKKAiaFLKQNELGRVTFLPLDSIKGTEIQGNDRE 592
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1161 S--GGDSQEESELDDQEEPPFVPPP-GYMMYTVLPDGSpVPQGMALYAPPPPLPNNSRP----LTPGTVVYGpppagapm 1233
Cdd:TIGR02168  593 IlkNIEGFLGVAKDLVKFDPKLRKAlSYLLGGVLVVDD-LDNALELAKKLRPGYRIVTLdgdlVRPGGVITG-------- 663
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1234 vyGPPPPNFSIpfipmgvlhcnvpehHNLENEVSRLedimqhlkskkreerwmraskrqsEKEMEELHHNIDDLLQEKKS 1313
Cdd:TIGR02168  664 --GSAKTNSSI---------------LERRREIEEL------------------------EEKIEELEEKIAELEKALAE 702
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1314 LECEVEELhrTVQKRQQQKDFIDGNVESLMTELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCT 1393
Cdd:TIGR02168  703 LRKELEEL--EEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEA 780
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1394 KEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVaakdsd 1473
Cdd:TIGR02168  781 EAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI------ 854
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1474 fqclskkkEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHL 1553
Cdd:TIGR02168  855 --------ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQ 926
                          810
                   ....*....|....
gi 2217375254 1554 LQGSMVQAKADLQE 1567
Cdd:TIGR02168  927 LELRLEGLEVRIDN 940
EnvC COG4942
438-660 4.06e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 57.85  E-value: 4.06e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  438 QLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEeaiqlKKISEAGKDLlyKQLSGRLqlvNKLRQEALDLEL 517
Cdd:COG4942     21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALE-----RRIAALARRI--RALEQEL---AALEAELAELEK 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  518 QMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMKAQKSgkeQQLDIMNKQYQQLESRLDEILSRIAKETEEIKDLEE 597
Cdd:COG4942     91 EIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDF---LDAVRRLQYLKYLAPARREQAEELRADLAELAALRA 167
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217375254  598 QLT------EGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQER 660
Cdd:COG4942    168 ELEaeraelEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEA 236
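The alignment blocks pair a query row (gi ...) with a profile row (Cdd:...), each carrying start and end coordinates. As a rough illustration of how one block can be checked, the sketch below splits two rows copied from the hit above and counts identical residues over ungapped columns. The helper names parse_row and identity are hypothetical, and comparing residues case-insensitively is an assumption made here for simplicity.

```python
def parse_row(line):
    """Split one alignment row into (label, start, residues, end)."""
    parts = line.split()
    # The last three whitespace-separated fields are start, residues, end;
    # whatever precedes them is the row label ("gi 2217375254" or "Cdd:COG4942").
    start, residues, end = parts[-3], parts[-2], parts[-1]
    return " ".join(parts[:-3]), int(start), residues, int(end)


def identity(query, subject):
    """Return (identical, compared) over columns where neither row has a gap."""
    pairs = [(q.upper(), s.upper()) for q, s in zip(query, subject)
             if q != "-" and s != "-"]
    same = sum(1 for q, s in pairs if q == s)
    return same, len(pairs)


# Rows copied from the final block of the COG4942 hit covering residues 438-660.
q_label, q_start, q_res, q_end = parse_row(
    "gi 2217375254  598 QLT------EGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQER 660"
)
s_label, s_start, s_res, s_end = parse_row(
    "Cdd:COG4942    168 ELEaeraelEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEA 236"
)

same, compared = identity(q_res, s_res)
print(f"{q_label} {q_start}-{q_end} vs {s_label} {s_start}-{s_end}: "
      f"{same}/{compared} identical over ungapped columns")
```

A quick consistency check on any row is that end minus start plus one equals the number of non-gap characters in its residue string.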
CCDC158 pfam15921
1523-2173 4.25e-08

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 58.98  E-value: 4.25e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1523 LEKLKSQVTSQQQEMAvlDRQLGHKKEELHLLQGSMvqakaDLQEALRLGETEVTEKCNHIREVKSLLEELsfqKGELNV 1602
Cdd:pfam15921   80 LEEYSHQVKDLQRRLN--ESNELHEKQKFYLRQSVI-----DLQTKLQEMQMERDAMADIRRRESQSQEDL---RNQLQN 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1603 QISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLqLENHELQGLKLQHDQRVSELEKTQVAvleeklelE 1682
Cdd:pfam15921  150 TVHELEAAKCLKEDMLEDSNTQIEQLRKMMLSHEGVLQEIRSIL-VDFEEASGKKIYEHDSMSTMHFRSLG--------S 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1683 NLQQISQQQKGEIEWQKQLLERDKREIERMTAESralQSCVECLskekedLQEKCDIWEKKLAQTKRVLAAAEENSKMEQ 1762
Cdd:pfam15921  221 AISKILRELDTEISYLKGRIFPVEDQLEALKSES---QNKIELL------LQQHQDRIEQLISEHEVEITGLTEKASSAR 291
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1763 SNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQ-EKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSE 1841
Cdd:pfam15921  292 SQANSIQSQLEIIQEQARNQNSMYMRQLSDLESTVSQLRsELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQE 371
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1842 QTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNE-------IEENKLKLVQQEMMFQRLQ---KERESE-ESKLETSKV 1910
Cdd:pfam15921  372 SGNLDDQLQKLLADLHKREKELSLEKEQNKRLWDRdtgnsitIDHLRRELDDRNMEVQRLEallKAMKSEcQGQMERQMA 451
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1911 TLKEQQHQLEK--ELTDQKSKLDQVLSKVlaaeerVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKe 1988
Cdd:pfam15921  452 AIQGKNESLEKvsSLTAQLESTKEMLRKV------VEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRS- 524
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1989 adsmRADFSLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSLLSKEQKQENScIQKEMATIElva 2068
Cdd:pfam15921  525 ----RVDLKLQELQHLKNEGDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGA-MQVEKAQLE--- 596
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2069 qdnherarrlmKELNQMQYEYTELKKqMANQKDLERRQME--ISD-----------------AMRTLKSEvKDEIRTSLK 2129
Cdd:pfam15921  597 -----------KEINDRRLELQEFKI-LKDKKDAKIRELEarVSDlelekvklvnagserlrAVKDIKQE-RDQLLNEVK 663
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|....
gi 2217375254 2130 NLNQFLPELPADLEAILERNENLEGELESLKENLPFTMNEGPFE 2173
Cdd:pfam15921  664 TSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSE 707
EnvC COG4942
510-731 4.40e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 57.85  E-value: 4.40e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  510 QEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLdskdpkhshmKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKET 589
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKAL----------LKQLAALERRIAALARRIRALEQELAALEAELAELE 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  590 EEIKDLEEQLtegqiaanEALKKDLEGVI-----SGLQEYLGTIKGQ--ATQAQNECRKLRDEKETLLQRLTEVEQERDQ 662
Cdd:COG4942     90 KEIAELRAEL--------EAQKEELAELLralyrLGRQPPLALLLSPedFLDAVRRLQYLKYLAPARREQAEELRADLAE 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  663 LEIVAMDAENMRKlEQSALQAELEKERQALKNALG-KAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLK 731
Cdd:COG4942    162 LAALRAELEAERA-ELEALLAELEEERAALEALKAeRQKLLARLEKELAELAAELAELQQEAEELEALIA 230
CwlO1 COG3883
1750-1991 5.99e-08

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 57.53  E-value: 5.99e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1750 VLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLL 1829
Cdd:COG3883     10 TPAFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELG 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1830 HTTKHQ----------DVLLSeqtrlQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKlklvqqemmfqRLQKERE 1899
Cdd:COG3883     90 ERARALyrsggsvsylDVLLG-----SESFSDFLDRLSALSKIADADADLLEELKADKAELE-----------AKKAELE 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1900 SEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKS 1979
Cdd:COG3883    154 AKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAA 233
                          250
                   ....*....|..
gi 2217375254 1980 GELLALQKEADS 1991
Cdd:COG3883    234 AAAAAAAAAAAS 245
SMC_prok_A TIGR02169
1402-2208 8.87e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is present in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 57.77  E-value: 8.87e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1402 EKFTDAKRSLLQTESDAEELERRAQETAVNLVK-ADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAK---DSDFQCL 1477
Cdd:TIGR02169  170 RKKEKALEELEEVEENIERLDLIIDEKRQQLERlRREREKAERYQALLKEKREYEGYELLKEKEALERQKeaiERQLASL 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1478 SKKKEKLTEELQKLQKDIEMAERN-EDHHLQVLKESEVLLQAKRAELEKLKSQVTSqqqemavLDRQLGHKKEELHllqg 1556
Cdd:TIGR02169  250 EEELEKLTEEISELEKRLEEIEQLlEELNKKIKDLGEEEQLRVKEKIGELEAEIAS-------LERSIAEKERELE---- 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1557 smvqakaDLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHK 1636
Cdd:TIGR02169  319 -------DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYR 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1637 TEL---KNILDMLQLENHELQGLKLQHDQRVSELEktqvavleeklelenlqqisQQQKGEIEWQKQLLERDKREIERMT 1713
Cdd:TIGR02169  392 EKLeklKREINELKRELDRLQEELQRLSEELADLN--------------------AAIAGIEAKINELEEEKEDKALEIK 451
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1714 AESRALQSCVECLSKEKE---DLQEKCDIWEKKLAQTKRVLAAAEEnskmEQSNLEKLELNVRKLQQELDQLNRDKLSLH 1790
Cdd:TIGR02169  452 KQEWKLEQLAADLSKYEQelyDLKEEYDRVEKELSKLQRELAEAEA----QARASEERVRGGRAVEEVLKASIQGVHGTV 527
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1791 NDISAMQQQLQEKRE--AVNSLQ----EELANVQDHLNLAKQ------DLLHTTKHQDVLLSEQTRLQKDISEWANRFED 1858
Cdd:TIGR02169  528 AQLGSVGERYATAIEvaAGNRLNnvvvEDDAVAKEAIELLKRrkagraTFLPLNKMRDERRDLSILSEDGVIGFAVDLVE 607
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1859 CQKEEETKQQQL---QVLQNEIEENKLKLVQQEMM----------------------FQRLQKERESEESKLETSKVTLK 1913
Cdd:TIGR02169  608 FDPKYEPAFKYVfgdTLVVEDIEAARRLMGKYRMVtlegelfeksgamtggsraprgGILFSRSEPAELQRLRERLEGLK 687
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1914 EQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEErwcESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMR 1993
Cdd:TIGR02169  688 RELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEI---EQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELE 764
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1994 ADFSLLrnqflterkkaEKQVASLKEALkiqrsqleknlltinEDSERDsslLSKEQKQEnscIQKEMATIELVAQDNHE 2073
Cdd:TIGR02169  765 ARIEEL-----------EEDLHKLEEAL---------------NDLEAR---LSHSRIPE---IQAELSKLEEEVSRIEA 812
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2074 RARRLMKELNQMQYEYTELKKQMANqkdLERRQMEISDAMRTLKSEVkDEIRTSLKNLNQFLPELPADLEAILERNENLE 2153
Cdd:TIGR02169  813 RLREIEQKLNRLTLEKEYLEKEIQE---LQEQRIDLKEQIKSIEKEI-ENLNGKKEELEEELEELEAALRDLESRLGDLK 888
                          810       820       830       840       850
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 2154 GELESLKENLpftmneGPFEEKLNfsqvhimdehwRGEALREKLRHREDRLKAQL 2208
Cdd:TIGR02169  889 KERDELEAQL------RELERKIE-----------ELEAQIEKKRKRLSELKAKL 926
PTZ00121 PTZ00121
MAEBL; Provisional
423-1034 8.94e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 57.84  E-value: 8.94e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  423 RNDHMN-LRGHTPLDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQ----LKKISEAGK-DLLYK 496
Cdd:PTZ00121  1062 AKAHVGqDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKkaedARKAEEARKaEDARK 1141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  497 QLSGRLQLVNKLRQEALDLElqmEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHmKAQKSGKEQQLDIMNKQYQQLES 576
Cdd:PTZ00121  1142 AEEARKAEDAKRVEIARKAE---DARKAEEARKAEDAKKAEAARKAEEVRKAEELR-KAEDARKAEAARKAEEERKAEEA 1217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  577 RLDEILSRI--AKETEEI-KDLEEQLTEGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRL 653
Cdd:PTZ00121  1218 RKAEDAKKAeaVKKAEEAkKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKK 1297
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  654 TEVEQERDQLEIVAMDAENMRKLEQSALQAELEKE--RQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLK 731
Cdd:PTZ00121  1298 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADaaKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  732 DFQNHLNHVVDGLVRPEEVAARVDELRRKLKLGTGEMNIHSPSDVLGKSLADLQKqfSEILARSKWERDEAQvrERKLQE 811
Cdd:PTZ00121  1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK--ADEAKKKAEEAKKAD--EAKKKA 1453
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  812 EMALQQEKLATGQEEFRQACE---RALEARMNFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTdlQLQEADE----E 884
Cdd:PTZ00121  1454 EEAKKAEEAKKKAEEAKKADEakkKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAE--EAKKADEakkaE 1531
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  885 KERILAQLRELEKKKKLEDAKSQEQVFGLDKELKKLKKAVATSDK-LATAELTIAKDQLKSLHGTVMKINQE----RAEE 959
Cdd:PTZ00121  1532 EAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKnMALRKAEEAKKAEEARIEEVMKLYEEekkmKAEE 1611
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254  960 LQEAERFSRKAAQaardlTRAEAEIELLQNLLRQKGEQFRLEMEKTGVGTGANSQVLEIEKLNETMERQRTEIAR 1034
Cdd:PTZ00121  1612 AKKAEEAKIKAEE-----LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAK 1681
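Each hit on this page carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch of how such a line could be read into numbers and how the aligned region relates to the model length. The names `parse_summary` and `model_coverage` are hypothetical helpers introduced for illustration and are not part of any NCBI tool; the field layout is assumed from the lines shown on this page.

```python
import re

# Assumed layout, taken from the summary lines on this page:
#   Pssm-ID: <int> [<kind>]  Cd Length: <int>  Bit Score: <float>  E-value: <float>
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one hit summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),  # e.g. "Multi-domain"; None when the bracket is absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def model_coverage(first, last, cd_length):
    """Fraction of the domain model spanned by the aligned model positions first..last."""
    return (last - first + 1) / cd_length


if __name__ == "__main__":
    line = "Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 57.84  E-value: 8.94e-08"
    rec = parse_summary(line)
    # Model positions 1062..1681 are read from the Cdd:PTZ00121 rows of the alignment above.
    print(rec, model_coverage(1062, 1681, rec["cd_length"]))
```

For the PTZ00121 hit above, the aligned model positions run from 1062 to 1681 out of a model length of 2084, so the alignment spans under a third of the model. This is typical of the partial coiled-coil matches listed on this page.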
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1262-1916 9.16e-08

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 57.82  E-value: 9.16e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1262 LENEVSRLEDIMQHLKSKKRE--ERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTV-----QKRQQQKDF 1334
Cdd:pfam15921  236 LKGRIFPVEDQLEALKSESQNkiELLLQQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLeiiqeQARNQNSMY 315
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1335 ID--GNVESLMTEL--EIEKSLKHHEDivdEIECIEKTLLKRRSELREA------------------DRLLAE---AESE 1389
Cdd:pfam15921  316 MRqlSDLESTVSQLrsELREAKRMYED---KIEELEKQLVLANSELTEArterdqfsqesgnlddqlQKLLADlhkREKE 392
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1390 LSCTKEKTKNAVEKFTDAKRSL--LQTESDAEELERRAQETAVNLVKAD------QQLRSLQAdakdleqhkikQEEILK 1461
Cdd:pfam15921  393 LSLEKEQNKRLWDRDTGNSITIdhLRRELDDRNMEVQRLEALLKAMKSEcqgqmeRQMAAIQG-----------KNESLE 461
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1462 EINKIVAAKDSDFQCLskkkEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLd 1541
Cdd:pfam15921  462 KVSSLTAQLESTKEML----RKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQHL- 536
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1542 rqlghKKEELHLLQgsmVQAKAdlqEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTlikqeieke 1621
Cdd:pfam15921  537 -----KNEGDHLRN---VQTEC---EALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLE--------- 596
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1622 eenlqvvlRQMSKHKTELKnildmlqlenhELQGLKLQHDQRVSELEKtqvavleeklelenlqQISQQQKGEIEWQKQL 1701
Cdd:pfam15921  597 --------KEINDRRLELQ-----------EFKILKDKKDAKIRELEA----------------RVSDLELEKVKLVNAG 641
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1702 LERdKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLaqtkrvlaaaEENSKMEQSNLEKLELNVRKLQQELDQ 1781
Cdd:pfam15921  642 SER-LRAVKDIKQERDQLLNEVKTSRNELNSLSEDYEVLKRNF----------RNKSEEMETTTNKLKMQLKSAQSELEQ 710
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1782 LNRDKLSLHND-------ISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWAN 1854
Cdd:pfam15921  711 TRNTLKSMEGSdghamkvAMGMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAG 790
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217375254 1855 RFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRlqKERESEESKLETSkVTLKEQQ 1916
Cdd:pfam15921  791 ELEVLRSQERRLKEKVANMEVALDKASLQFAECQDIIQR--QEQESVRLKLQHT-LDVKELQ 849
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
270-1081 9.99e-08

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 57.67  E-value: 9.99e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  270 EEVERLERDLEKKMIETEELKSKQTRFLEEIKNQDKLNKSLKEEAMLQKQSCEELKSDLNTKNELLKQKTIELTRACQKQ 349
Cdd:pfam02463  166 RLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQEL 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  350 YELEQElafyKIDAKFEPLNYYPSEYAEIDKAPDEspyigKSRYKRNMFATESYIIDSAQAVQIKKMEPDEQLRNDHMNL 429
Cdd:pfam02463  246 LRDEQE----EIESSKQEIEKEEEKLAQVLKENKE-----EEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKL 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  430 RGHtplDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLYKQLSGRLQLVNKLR 509
Cdd:pfam02463  317 KES---EKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLK 393
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  510 QEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKhshmKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKET 589
Cdd:pfam02463  394 EEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILE----EEEESIELKQGKLTEEKEELEKQELKLLKDELELKK 469
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  590 EEIKDLEEQLTEGQIAANEALK-KDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAM 668
Cdd:pfam02463  470 SEDLLKETQLVKLQEQLELLLSrQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAV 549
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  669 DAENMRKLEQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLVRPE 748
Cdd:pfam02463  550 IVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGIL 629
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  749 EVAARVDELRRKLKLGTGEMNIHSpSDVLGKSLADLQKQFSEILARSKWERDEAQVRERKLQEEMALQQEKLATGQEEFR 828
Cdd:pfam02463  630 KDTELTKLKESAKAKESGLRKGVS-LEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQRE 708
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  829 QACER--ALEARMNFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQ-LRELEKKKKLEDAK 905
Cdd:pfam02463  709 KEELKklKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEkELAEEREKTEKLKV 788
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  906 SQEQVFGLDKELKKLKKAVATSDKLATAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAARDLTRAEAEIE 985
Cdd:pfam02463  789 EEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEEL 868
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  986 LLQNLLRQKGEQFRLEMEKTGvgtGANSQVLEIEKLNETMERQRTEIARLQNVLDLTGSDNKGGFENVLEEIAELRREVS 1065
Cdd:pfam02463  869 LQELLLKEEELEEQKLKDELE---SKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEA 945
                          810
                   ....*....|....*.
gi 2217375254 1066 YQNDYISSMADPFKRR 1081
Cdd:pfam02463  946 DEKEKEENNKEEEEER 961
PTZ00121 PTZ00121
MAEBL; Provisional
1295-2151 1.01e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 57.84  E-value: 1.01e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1295 KEMEEL--HHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVESL----MTELEIEKSLKHHEDIVDEIECIEKT 1368
Cdd:PTZ00121  1027 EKIEELteYGNNDDVLKEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYkdfdFDAKEDNRADEATEEAFGKAEEAKKT 1106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1369 LLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSllqTESDAEELERRAQET-AVNLVKADQQLRSLQADAK 1447
Cdd:PTZ00121  1107 ETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKA---EDAKRVEIARKAEDArKAEEARKAEDAKKAEAARK 1183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1448 DLEQHKIKQEEILKEINKIVAAKDSDfqclskkKEKLTEELQKLQ--KDIEMAERNEdhhlQVLKESEvllQAKRAELEK 1525
Cdd:PTZ00121  1184 AEEVRKAEELRKAEDARKAEAARKAE-------EERKAEEARKAEdaKKAEAVKKAE----EAKKDAE---EAKKAEEER 1249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1526 LKSQVTSQQQEMAV--LDRQLGHKKEELHllQGSMVQAKADLQEALRLGETEVTEKCNHIR----------EVKSLLEEL 1593
Cdd:PTZ00121  1250 NNEEIRKFEEARMAhfARRQAAIKAEEAR--KADELKKAEEKKKADEAKKAEEKKKADEAKkkaeeakkadEAKKKAEEA 1327
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1594 SFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMS---KHKTELKNILDMLQLENHE------LQGLKLQHDQRV 1664
Cdd:PTZ00121  1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEaaeKKKEEAKKKADAAKKKAEEkkkadeAKKKAEEDKKKA 1407
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1665 SELEKTQVAVLEEKLELENLQQI--SQQQKGEIEWQKQLLERDKREIERMTAES---RALQSCVECLSKEKEDLQEKCDI 1739
Cdd:PTZ00121  1408 DELKKAAAAKKKADEAKKKAEEKkkADEAKKKAEEAKKADEAKKKAEEAKKAEEakkKAEEAKKADEAKKKAEEAKKADE 1487
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1740 WEKKLAQTKRvlaAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQ----EELA 1815
Cdd:PTZ00121  1488 AKKKAEEAKK---KADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEElkkaEEKK 1564
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1816 NVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDI----SEWANRFEDCQKEEETKQQQLQVLQneiEENKLKLVQQEMMF 1891
Cdd:PTZ00121  1565 KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMklyeEEKKMKAEEAKKAEEAKIKAEELKK---AEEEKKKVEQLKKK 1641
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1892 QRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSER 1971
Cdd:PTZ00121  1642 EAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEEL 1721
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1972 EQQLVEKSGELLALQKEADSmradfsllrnqfltERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSL---LSK 2048
Cdd:PTZ00121  1722 KKAEEENKIKAEEAKKEAEE--------------DKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIeeeLDE 1787
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2049 EQKQENSCIQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMANQKDLERRQMEISDAMRTLKSEVKDEIRTSL 2128
Cdd:PTZ00121  1788 EDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKE 1867
                          890       900
                   ....*....|....*....|...
gi 2217375254 2129 KNLNQFLPELPADLEAILERNEN 2151
Cdd:PTZ00121  1868 ADFNKEKDLKEDDEEEIEEADEI 1890
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
629-969 1.24e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 57.39  E-value: 1.24e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  629 KGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAMDAENM---RKLEQSALQAELEKERQALKNALGKAQFSEEK 705
Cdd:TIGR02169  673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKigeIEKEIEQLEQEEEKLKERLEELEEDLSSLEQE 752
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  706 ----EQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNH-----VVDGLVRPEEVAARVDELRRKLKLGTGEMniHSPSDV 776
Cdd:TIGR02169  753 ienvKSELKELEARIEELEEDLHKLEEALNDLEARLSHsripeIQAELSKLEEEVSRIEARLREIEQKLNRL--TLEKEY 830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  777 LGKSLADLQKQFSEILARSKWERDEaqvrerklQEEMALQQEKLATGQEEFrQACERALEAR---MNFDKRQHEARIQQM 853
Cdd:TIGR02169  831 LEKEIQELQEQRIDLKEQIKSIEKE--------IENLNGKKEELEEELEEL-EAALRDLESRlgdLKKERDELEAQLREL 901
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  854 ENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQLRELEKKKklEDAKSQEQVFGLDKELKKLKKAVATSDKLATA 933
Cdd:TIGR02169  902 ERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIP--EEELSLEDVQAELQRVEEEIRALEPVNMLAIQ 979
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 2217375254  934 ELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRK 969
Cdd:TIGR02169  980 EYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKK 1015
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1261-1815 1.43e-07

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 57.00  E-value: 1.43e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1261 NLENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKrqqqkdfidgnve 1340
Cdd:PRK03918   225 KLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEK------------- 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1341 slmtELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESElsctKEKTKNAVEKFTDAKRSLLQTESDAEE 1420
Cdd:PRK03918   292 ----AEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK----EERLEELKKKLKELEKRLEELEERHEL 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1421 LER-RAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKdSDFQCLSKKKEKLTEELQKLQKDIEMAE 1499
Cdd:PRK03918   364 YEEaKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARI-GELKKEIKELKKAIEELKKAKGKCPVCG 442
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1500 R--NEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQgsMVQAKADLQEALRLGETEVT 1577
Cdd:PRK03918   443 RelTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKE--LAEQLKELEEKLKKYNLEEL 520
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1578 EKCNhiREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEkeeenLQVVLRQMSKHKTELKNILDMLQLENHElqglk 1657
Cdd:PRK03918   521 EKKA--EEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAE-----LEKKLDELEEELAELLKELEELGFESVE----- 588
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1658 lQHDQRVSELEKTQVAVLEEKLElenlqqisqqqKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKC 1737
Cdd:PRK03918   589 -ELEERLKELEPFYNEYLELKDA-----------EKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKY 656
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1738 DIWE-----KKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNdISAMQQQLQEKREAVNSLQE 1812
Cdd:PRK03918   657 SEEEyeelrEEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEK-LEKALERVEELREKVKKYKA 735

                   ...
gi 2217375254 1813 ELA 1815
Cdd:PRK03918   736 LLK 738
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
419-1079 1.83e-07

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 and 1234 amino acids in length. This family contains a P-loop motif, suggesting it is a nucleotide-binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 56.77  E-value: 1.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  419 DEQLRNDHMNLRGHTpldTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISeagKDLLYKQL 498
Cdd:pfam12128  229 DIQAIAGIMKIRPEF---TKLQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRT---LDDQWKEK 302
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  499 SGRLqlvnKLRQEALDLELQMEKQKQEIAGKQKEI---KDLQIAIDSLDSKDPKHSHMKAQKSGKEQQLDIMNKQYQQLE 575
Cdd:pfam12128  303 RDEL----NGELSAADAAVAKDRSELEALEDQHGAfldADIETAAADQEQLPSWQSELENLEERLKALTGKHQDVTAKYN 378
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  576 SRLDEILSRIAKETEEIKDLEEQLTEGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTE 655
Cdd:pfam12128  379 RRRSKIKEQNNRDIAGIKDKLAKIREARDRQLAVAEDDLQALESELREQLEAGKLEFNEEEYRLKSRLGELKLRLNQATA 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  656 VEQERDQLEIVAMDAENMRK------LEQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDN-----N 724
Cdd:pfam12128  459 TPELLLQLENFDERIERAREeqeaanAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELELQLFPQagtllH 538
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  725 LLKQQLKDFQNHLNHVVD-GLVRPEEVAARVDELRRKLKLGTGEMNIHSPSdVLGKSLADLQKQFSEILARSKWERDEAQ 803
Cdd:pfam12128  539 FLRKEAPDWEQSIGKVISpELLHRTDLDPEVWDGSVGGELNLYGVKLDLKR-IDVPEWAASEEELRERLDKAEEALQSAR 617
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  804 VRERKLQEEMAL---QQEKLATGQEEFRQACERALE--ARMNFDKRQHEARIQQ-MENEIHYLQENLKSMEEIQGLTDLQ 877
Cdd:pfam12128  618 EKQAAAEEQLVQangELEKASREETFARTALKNARLdlRRLFDEKQSEKDKKNKaLAERKDSANERLNSLEAQLKQLDKK 697
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  878 LQEADEEKER--------ILAQLRELEKKKKLEDAKSQEQVFGLDKELKKLKKAVATS-----DKLATAELTIAK--DQL 942
Cdd:pfam12128  698 HQAWLEEQKEqkrearteKQAYWQVVEGALDAQLALLKAAIAARRSGAKAELKALETWykrdlASLGVDPDVIAKlkREI 777
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  943 KSLHGTVMKINQERAEELQ----EAERFSRKAAQAARDLTRAEAEIELLQNLLRQKGEQFRLEMEKTGVGTGANSQVLei 1018
Cdd:pfam12128  778 RTLERKIERIAVRRQEVLRyfdwYQETWLQRRPRLATQLSNIERAISELQQQLARLIADTKLRRAKLEMERKASEKQQ-- 855
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217375254 1019 EKLNETMERQRTEIARLQNV-LDLTGSDNKGGFENVLEEIAELRREVSYQNDYISSMADPFK 1079
Cdd:pfam12128  856 VRLSENLRGLRCEMSKLATLkEDANSEQAQGSIGERLAQLEDLKLKRDYLSESVKKYVEHFK 917
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1698-1940 1.88e-07

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 55.93  E-value: 1.88e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1698 QKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQ 1777
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1778 ELDQLnRDKLSlhNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQdllhttkhqdvLLSEQTRLQKDISEWANRFE 1857
Cdd:COG4942     98 ELEAQ-KEELA--ELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKY-----------LAPARREQAEELRADLAELA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1858 DCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKV 1937
Cdd:COG4942    164 ALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERT 243

                   ...
gi 2217375254 1938 LAA 1940
Cdd:COG4942    244 PAA 246
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1259-2163 2.10e-07

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains. It is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tails from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 56.72  E-value: 2.10e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1259 HHNLENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKK----SLECEVEELHRTVQKRQQQKDF 1334
Cdd:pfam01576   35 HQQLCEEKNALQEQLQAETELCAEAEEMRARLAARKQELEEILHELESRLEEEEersqQLQNEKKKMQQHIQDLEEQLDE 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1335 IDGNVESLMTE-LEIEKSLKHHEDIVDEIECIEKTLLKRRSELRE-----ADRLLAEAESELSCTKEKTKNAVeKFTDAK 1408
Cdd:pfam01576  115 EEAARQKLQLEkVTTEAKIKKLEEDILLLEDQNSKLSKERKLLEErisefTSNLAEEEEKAKSLSKLKNKHEA-MISDLE 193
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1409 RSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEEL 1488
Cdd:pfam01576  194 ERLKKEEKGRQELEKAKRKLEGESTDLQEQIAELQAQIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQI 273
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1489 QKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEA 1568
Cdd:pfam01576  274 SELQEDLESERAARNKAEKQRRDLGEELEALKTELEDTLDTTAAQQELRSKREQEVTELKKALEEETRSHEAQLQEMRQK 353
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1569 LRLGETEVTEKCNHIREVKSLLEE----LSFQKGELNVQI---SERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKN 1641
Cdd:pfam01576  354 HTQALEELTEQLEQAKRNKANLEKakqaLESENAELQAELrtlQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAE 433
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1642 ILDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQS 1721
Cdd:pfam01576  434 KLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKR 513
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1722 CVEclsKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQ 1801
Cdd:pfam01576  514 NVE---RQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDLD 590
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1802 EKREAVNSLQEelanvqdhlnlaKQdllhttKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENK 1881
Cdd:pfam01576  591 HQRQLVSNLEK------------KQ------KKFDQMLAEEKAISARYAEERDRAEAEAREKETRALSLARALEEALEAK 652
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1882 LKLvqqemmfQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVlaaEERVRTLQEEERWCESLEKTL 1961
Cdd:pfam01576  653 EEL-------ERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVEEMKTQL---EELEDELQATEDAKLRLEVNM 722
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1962 ----SQTKRQLSEREQQLVEKSGELLALQKEADSMradfsllrnqfLTERKKAEKQVASLKEALKIQRSQLEKNLLTINE 2037
Cdd:pfam01576  723 qalkAQFERDLQARDEQGEEKRRQLVKQVRELEAE-----------LEDERKQRAQAVAAKKKLELDLKELEAQIDAANK 791
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2038 DSERDSSLLSKEQKQ----ENSCIQKEMATIELVAQ--DNHERARRLMKELNQMQYEYT---ELKKQMANQKDLERRQME 2108
Cdd:pfam01576  792 GREEAVKQLKKLQAQmkdlQRELEEARASRDEILAQskESEKKLKNLEAELLQLQEDLAaseRARRQAQQERDELADEIA 871
                          890       900       910       920       930
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 2109 ISDAMRTLKSEVKDEIRTSLKNLNQFLPELPADLEAILERNENLEGELESLKENL 2163
Cdd:pfam01576  872 SGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTEL 926
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involved in recombination, ...
275-896 2.33e-07

rad50; All proteins in this family for which functions are known are involved in recombination, recombinational repair, and/or non-homologous end joining. They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 56.59  E-value: 2.33e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  275 LERDLEKKMIETEELKSKQTRFLEEIKNQDKLnksLKEEAMLQKQSCEELKSDLNTKNELLKQKTIELtracqKQYELEQ 354
Cdd:TIGR00606  442 IELKKEILEKKQEELKFVIKELQQLEGSSDRI---LELDQELRKAERELSKAEKNSLTETLKKEVKSL-----QNEKADL 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  355 ELAFYKIDAKFEPLNYYPSEYAEI-----DKA-PDESPYIGKSRYKRNM------FATESYIIDSAQAVQIKKMEPDEQL 422
Cdd:TIGR00606  514 DRKLRKLDQEMEQLNHHTTTRTQMemltkDKMdKDEQIRKIKSRHSDELtsllgyFPNKKQLEDWLHSKSKEINQTRDRL 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  423 RNDHMNLRGHTPLDTQLEDKEKKISAAQTRLSELHDEI---EKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLYKQLS 499
Cdd:TIGR00606  594 AKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVcgsQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLT 673
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  500 GRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSH-MKAQKSGKEQQLDIMNKQYQQLESRL 578
Cdd:TIGR00606  674 DENQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDeMLGLAPGRQSIIDLKEKEIPELRNKL 753
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  579 DEILSRIAKETEEIKDLEEQLteGQIAANEALKKDLE---GVISGLQEYLGTIKGQATQAQNECRKLrDEKETLLQRLTE 655
Cdd:TIGR00606  754 QKVNRDIQRLKNDIEEQETLL--GTIMPEEESAKVCLtdvTIMERFQMELKDVERKIAQQAAKLQGS-DLDRTVQQVNQE 830
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  656 VEQERDQLEIVAMDAENMRKLEQSALQA---------ELEKERQALKNALGKAQF----SEEKEQENSELHAKLKHLQDD 722
Cdd:TIGR00606  831 KQEKQHELDTVVSKIELNRKLIQDQQEQiqhlksktnELKSEKLQIGTNLQRRQQfeeqLVELSTEVQSLIREIKDAKEQ 910
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  723 NNLLKQQLKDFQNHLNHVVDGLVRPEEVAA-RVDELRRKLKLGTGEMNihSPSDVLGKSLADLQKQFSEILARSKWERDE 801
Cdd:TIGR00606  911 DSPLETFLEKDQQEKEELISSKETSNKKAQdKVNDIKEKVKNIHGYMK--DIENKIQDGKDDYLKQKETELNTVNAQLEE 988
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  802 AQVRERKLQEEMALQQEKLATGQEE------------FRQACERALEARMNFDKRQHEARIQQMENEIHYLQENLKSMEE 869
Cdd:TIGR00606  989 CEKHQEKINEDMRLMRQDIDTQKIQerwlqdnltlrkRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDLIKR 1068
                          650       660
                   ....*....|....*....|....*..
gi 2217375254  870 IQGLTDLQLQEADEEKERILAQLRELE 896
Cdd:TIGR00606 1069 NHVLALGRQKGYEKEIKHFKKELREPQ 1095
LRR_8 pfam13855
Leucine rich repeat;
148-205 2.35e-07

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 49.45  E-value: 2.35e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254  148 KLRELNLSYNKISKIEG--IENMCNLQKLNLAGNEIEHI-PVWLGKkLKSLRVLNLKGNKI 205
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDgaFKGLSNLKVLDLSNNLLTTLsPGAFSG-LPSLRYLDLSGNRL 61
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
441-761 3.08e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 56.08  E-value: 3.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  441 DKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEaiqlkkiseagkdllykqlsgrlqlvnklRQEALDLELQME 520
Cdd:COG4913    607 DNRAKLAALEAELAELEEELAEAEERLEALEAELDALQE-----------------------------RREALQRLAEYS 657
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  521 KQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMkaqksgkEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKDLEEQLT 600
Cdd:COG4913    658 WDEIDVASAEREIAELEAELERLDASSDDLAAL-------EEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELD 730
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  601 EGQIAANEALKKDLEGVISGLQEYLGTIKGQATQA------QNECRKLRDEKETLLQRLTEVEQE---RDQLEIVAMDAE 671
Cdd:COG4913    731 ELQDRLEAAEDLARLELRALLEERFAAALGDAVERelrenlEERIDALRARLNRAEEELERAMRAfnrEWPAETADLDAD 810
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  672 NM---------RKLEQSALQAELEKERQALKNALG--KAQFSEEKEQENSELHAKLKHLqddNNLLKQ------------ 728
Cdd:COG4913    811 LEslpeylallDRLEEDGLPEYEERFKELLNENSIefVADLLSKLRRAIREIKERIDPL---NDSLKRipfgpgrylrle 887
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 2217375254  729 -------QLKDFQNHLNHVVDGLVRPEEVAA-----RVDELRRKL 761
Cdd:COG4913    888 arprpdpEVREFRQELRAVTSGASLFDEELSearfaALKRLIERL 932
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
551-872 3.73e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 55.83  E-value: 3.73e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  551 SHMKAQKSGKEQQLDIMNKQYQQLESRLDEI------LSRIAKETEEIKDLEEQLTEGQIAANEALKKDLEGVISGLQEy 624
Cdd:TIGR02168  168 SKYKERRKETERKLERTRENLDRLEDILNELerqlksLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQE- 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  625 lgtikgQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEiVAMDAENMRKLEQSALQAELEKERQALKNALGKAQFS-- 702
Cdd:TIGR02168  247 ------ELKEAEEELEELTAELQELEEKLEELRLEVSELE-EEIEELQKELYALANEISRLEQQKQILRERLANLERQle 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  703 ------EEKEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLV----RPEEVAARVDELRRKLKLGTGEMNIH- 771
Cdd:TIGR02168  320 eleaqlEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEelesRLEELEEQLETLRSKVAQLELQIASLn 399
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  772 ---SPSDVLGKSLADLQKQFSEILARSKWERDEAQVRErkLQEEMALQQEKLATGQEEFRQACERALEARMNFdkRQHEA 848
Cdd:TIGR02168  400 neiERLEARLERLEDRRERLQQEIEELLKKLEEAELKE--LQAELEELEEELEELQEELERLEEALEELREEL--EEAEQ 475
                          330       340
                   ....*....|....*....|....
gi 2217375254  849 RIQQMENEIHYLQENLKSMEEIQG 872
Cdd:TIGR02168  476 ALDAAERELAQLQARLDSLERLQE 499
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
259-900 3.95e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 55.84  E-value: 3.95e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  259 DRQEAFERFSLEEVERLERDLEKKmIETEELKSKqtrfLEEIKNQDKLNKSLKEEAMLqkqscEELKSDLNTKNELLKQK 338
Cdd:TIGR02169  187 ERLDLIIDEKRQQLERLRREREKA-ERYQALLKE----KREYEGYELLKEKEALERQK-----EAIERQLASLEEELEKL 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  339 TIELTRACQKQYELEQELAfyKIDAKFEPLNyyPSEYAEIdKAPDESPYIGKSRYKRNMFATESYIIDSAQAVQIKKMEP 418
Cdd:TIGR02169  257 TEEISELEKRLEEIEQLLE--ELNKKIKDLG--EEEQLRV-KEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEI 331
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  419 DEQLRNdhmnlrgHTPLDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQL-EEAIQLKKISEAGKDLLY-- 495
Cdd:TIGR02169  332 DKLLAE-------IEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETrDELKDYREKLEKLKREINel 404
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  496 -KQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSkdpKHSHMKAQKSGKEQQLDIMNKQYQQL 574
Cdd:TIGR02169  405 kRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEW---KLEQLAADLSKYEQELYDLKEEYDRV 481
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  575 ESRLDEILSRIAkETEEIKDLEEQLTEGQIAANEALKKDLEGVIsGLQEYLGTIKGQ----------------------- 631
Cdd:TIGR02169  482 EKELSKLQRELA-EAEAQARASEERVRGGRAVEEVLKASIQGVH-GTVAQLGSVGERyataievaagnrlnnvvveddav 559
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  632 ATQAQNECRKLRDEKETLLQrLTEVEQERDQLEIVAMDA------------------------------------ENMRK 675
Cdd:TIGR02169  560 AKEAIELLKRRKAGRATFLP-LNKMRDERRDLSILSEDGvigfavdlvefdpkyepafkyvfgdtlvvedieaarRLMGK 638
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  676 LEQSALQAELEKERQAL----KNALGKAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLVRPEE-- 749
Cdd:TIGR02169  639 YRMVTLEGELFEKSGAMtggsRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRki 718
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  750 ----------------VAARVDELRRKLK-LGTGEMNIHSPSDVLGKSLADLQKQFSEI-LARSKWERDEAQVRERKLQE 811
Cdd:TIGR02169  719 geiekeieqleqeeekLKERLEELEEDLSsLEQEIENVKSELKELEARIEELEEDLHKLeEALNDLEARLSHSRIPEIQA 798
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  812 EMALQQEKLATgQEEFRQACERALEARmNFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQ 891
Cdd:TIGR02169  799 ELSKLEEEVSR-IEARLREIEQKLNRL-TLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAA 876

                   ....*....
gi 2217375254  892 LRELEKKKK 900
Cdd:TIGR02169  877 LRDLESRLG 885
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
435-602 3.97e-07

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 53.39  E-value: 3.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLYKQLSGRLQLVNKLRqEALD 514
Cdd:COG1579     15 LDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNK-EYEA 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQKQEIAGKQKEIKDLQIAIDSLdskdpkhshmKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKD 594
Cdd:COG1579     94 LQKEIESLKRRISDLEDEILELMERIEEL----------EEELAELEAELAELEAELEEKKAELDEELAELEAELEELEA 163

                   ....*...
gi 2217375254  595 LEEQLTEG 602
Cdd:COG1579    164 EREELAAK 171
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
435-1063 4.04e-07

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 55.43  E-value: 4.04e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKK-----ISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQlkkiseagkdllykqlsgrlqlvnklr 509
Cdd:PRK02224   192 LKAQIEEKEEKdlherLNGLESELAELDEEIERYEEQREQARETRDEADEVLE--------------------------- 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  510 qealdlelqmekqkqEIAGKQKEIKDLQIAIDSLDSKdpkhshmkaqKSGKEQQLDIMNKQYQQLESRLDEILSRIaket 589
Cdd:PRK02224   245 ---------------EHEERREELETLEAEIEDLRET----------IAETEREREELAEEVRDLRERLEELEEER---- 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  590 EEIKDlEEQLTEGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAMD 669
Cdd:PRK02224   296 DDLLA-EAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEE 374
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  670 AENM---RKLEQSALQAELEKERQALKNA---LGKAQ-FSEEKEQENSELHAKLKHLQDDnnllkqqlkdfqnhlnhvvd 742
Cdd:PRK02224   375 AREAvedRREEIEELEEEIEELRERFGDApvdLGNAEdFLEELREERDELREREAELEAT-------------------- 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  743 gLVRPEEVAARVDELRRKLKLGTGEMNIHSPSDVlgKSLADLQKQFSEILA-RSKWERDEAQVRERKLQEEMALQQEKLA 821
Cdd:PRK02224   435 -LRTARERVEEAEALLEAGKCPECGQPVEGSPHV--ETIEEDRERVEELEAeLEDLEEEVEEVEERLERAEDLVEAEDRI 511
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  822 TGQEEFRQACERALEarmnfdkrQHEARIQQMENEIHYLQENLKSMEEiqgltdlqlqEADEEKERilAQLRELEKKKKL 901
Cdd:PRK02224   512 ERLEERREDLEELIA--------ERRETIEEKRERAEELRERAAELEA----------EAEEKREA--AAEAEEEAEEAR 571
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  902 EDAKSQEQVFGLDKELKKLKKAVATSDklatAELTIAKDQLKSLhgtvmkinQERAEELQEAERFSRKAAQAARDLTRA- 980
Cdd:PRK02224   572 EEVAELNSKLAELKERIESLERIRTLL----AAIADAEDEIERL--------REKREALAELNDERRERLAEKRERKREl 639
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  981 -----EAEIELLQNlLRQKGEQFrlemektgvgtgansqvleIEKLNETMERQRTEIARLQNVLdltgsdnkGGFENVLE 1055
Cdd:PRK02224   640 eaefdEARIEEARE-DKERAEEY-------------------LEQVEEKLDELREERDDLQAEI--------GAVENELE 691

                   ....*...
gi 2217375254 1056 EIAELRRE 1063
Cdd:PRK02224   692 ELEELRER 699
LRR_8 pfam13855
Leucine rich repeat;
126-181 4.62e-07

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 48.67  E-value: 4.62e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  126 KLEVLNLSYNLIGKIEK--LDKLLKLRELNLSYNKISKIEG--IENMCNLQKLNLAGNEI 181
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDgaFKGLSNLKVLDLSNNLLTTLSPgaFSGLPSLRYLDLSGNRL 61
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1258-1671 6.72e-07

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 54.73  E-value: 6.72e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1258 EHHNLENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDD---LLQEKKSLECEVEELHRTVQK-----RQ 1329
Cdd:pfam05483  368 EQQRLEKNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKKILAEdekLLDEKKQFEKIAEELKGKEQElifllQA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1330 QQKDFIDGNVESLMTELEIEKSLKHHEDIVDEiecIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKR 1409
Cdd:pfam05483  448 REKEIHDLEIQLTAIKTSEEHYLKEVEDLKTE---LEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIIN 524
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1410 SLLQTESDAEELErRAQETAVNLVKADQQLRSLQADAKDLEQHKI-KQEEILKEINKIVAAKDSDFQCLSKKKEKLTEEL 1488
Cdd:pfam05483  525 CKKQEERMLKQIE-NLEEKEMNLRDELESVREEFIQKGDEVKCKLdKSEENARSIEYEVLKKEKQMKILENKCNNLKKQI 603
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1489 QKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVL----DRQLGHKKEELHLLQGSMVQAKAD 1564
Cdd:pfam05483  604 ENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIidnyQKEIEDKKISEEKLLEEVEKAKAI 683
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1565 LQEALRLgETEVTEKCNH-IREVKSLLEELSFQKGELnvqISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNIL 1643
Cdd:pfam05483  684 ADEAVKL-QKEIDKRCQHkIAEMVALMEKHKHQYDKI---IEERDSELGLYKNKEQEQSSAKAALEIELSNIKAELLSLK 759
                          410       420
                   ....*....|....*....|....*...
gi 2217375254 1644 DMLQLENHELQGLKLQHDQRVSELEKTQ 1671
Cdd:pfam05483  760 KQLEIEKEEKEKLKMEAKENTAILKDKK 787
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1262-1959 7.43e-07

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 54.68  E-value: 7.43e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1262 LENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKrqqqkdfidgnves 1341
Cdd:PRK03918   160 YENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEK-------------- 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1342 lmteleIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELsctkEKTKNAVEKFTDAKRSLLQTESDAEEL 1421
Cdd:PRK03918   226 ------LEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERI----EELKKEIEELEEKVKELKELKEKAEEY 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1422 ERRAQEtavnLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKivaaKDSDFQCLSKKKEKLTEELQKLQKDIEMAERn 1501
Cdd:PRK03918   296 IKLSEF----YEEYLDELREIEKRLSRLEEEINGIEERIKELEE----KEERLEELKKKLKELEKRLEELEERHELYEE- 366
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1502 edhhlqvlkesevlLQAKRAELEKLKSQVTsqqqemavldrqlGHKKEELHLLQGSMVQAKADLQEALRLGETEVTEKCN 1581
Cdd:PRK03918   367 --------------AKAKKEELERLKKRLT-------------GLTPEKLEKELEELEKAKEEIEEEISKITARIGELKK 419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1582 HIREVKSLLEELSFQKGELNVQISErktqltlikqEIEKEEENLqvvlrqMSKHKTELKNILDMLQLENHELQGLKlqhd 1661
Cdd:PRK03918   420 EIKELKKAIEELKKAKGKCPVCGRE----------LTEEHRKEL------LEEYTAELKRIEKELKEIEEKERKLR---- 479
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1662 QRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKcdiwe 1741
Cdd:PRK03918   480 KELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEEL----- 554
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1742 kklaqtkrvlaaaeenskmeQSNLEKLELNVRKLQQELDQLNRDKLSL-HNDISAMQQQLQEKREAVNSLQEelanvqdh 1820
Cdd:PRK03918   555 --------------------KKKLAELEKKLDELEEELAELLKELEELgFESVEELEERLKELEPFYNEYLE-------- 606
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1821 LNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQvlQNEIEENKLKLVQQEMMFQRLQKERES 1900
Cdd:PRK03918   607 LKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYS--EEEYEELREEYLELSRELAGLRAELEE 684
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254 1901 EESKLETSKVTLKEQQHQLE--KELTDQKSKLDQVLSKVLAAEERVRTLQEEERwCESLEK 1959
Cdd:PRK03918   685 LEKRREEIKKTLEKLKEELEerEKAKKELEKLEKALERVEELREKVKKYKALLK-ERALSK 744
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
526-875 7.47e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 54.92  E-value: 7.47e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  526 IAGKQKEIKDLQIAIDSLDskdpkhshmkaqksgkeQQLDIMNKQYQQLESRLDEiLSRIAKETEEIKDLEEqlTEGQIA 605
Cdd:COG4913    612 LAALEAELAELEEELAEAE-----------------ERLEALEAELDALQERREA-LQRLAEYSWDEIDVAS--AEREIA 671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  606 ANEALKKDLE---GVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEivaMDAENMRKLEQSALQ 682
Cdd:COG4913    672 ELEAELERLDassDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQ---DRLEAAEDLARLELR 748
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  683 AELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDD-NNLLKQQLKDFQNHLNHVVDGLVRPEEVAARVDELRR-- 759
Cdd:COG4913    749 ALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEElERAMRAFNREWPAETADLDADLESLPEYLALLDRLEEdg 828
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  760 ------KLKLGTGEMNIHSPSDVLGK---SLADLQKQFSEI---LARSKWERDEA---QVRERKLQEEmalqqeklatgq 824
Cdd:COG4913    829 lpeyeeRFKELLNENSIEFVADLLSKlrrAIREIKERIDPLndsLKRIPFGPGRYlrlEARPRPDPEV------------ 896
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2217375254  825 EEFRQACERALEARMNFDKRQHEARIQQMENEIHYL--QENLKSMEEIQGLTD 875
Cdd:COG4913    897 REFRQELRAVTSGASLFDEELSEARFAALKRLIERLrsEEEESDRRWRARVLD 949
mukB PRK04863
chromosome partition protein MukB;
441-722 8.65e-07


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 54.58  E-value: 8.65e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  441 DKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQlkkiseagkdlLYKQLSGRLQL--VNKLRQEALDLELQ 518
Cdd:PRK04863   834 DPEAELRQLNRRRVELERALADHESQEQQQRSQLEQAKEGLS-----------ALNRLLPRLNLlaDETLADRVEEIREQ 902
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  519 M---EKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMKAQKsgkeQQLDIMNKQYQQLESRLDEILSRIAKETEEikDL 595
Cdd:PRK04863   903 LdeaEEAKRFVQQHGNALAQLEPIVSVLQSDPEQFEQLKQDY----QQAQQTQRDAKQQAFALTEVVQRRAHFSYE--DA 976
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  596 EEQLTEGQiAANEALKKDLEgvisGLQEYLGTIKGQATQAQNECRK-------LRDEKETLLQRLTEVEQERDQLEIVA- 667
Cdd:PRK04863   977 AEMLAKNS-DLNEKLRQRLE----QAEQERTRAREQLRQAQAQLAQynqvlasLKSSYDAKRQMLQELKQELQDLGVPAd 1051
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254  668 MDAENMRKLEQSALQAELEKERQAlKNALGKAQFSEEKEQENseLHAKLKHLQDD 722
Cdd:PRK04863  1052 SGAEERARARRDELHARLSANRSR-RNQLEKQLTFCEAEMDN--LTKKLRKLERD 1103
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1571-2240 9.25e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 54.69  E-value: 9.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1571 LGETEVTEKcnHIREVKSLLEELSFQKGELNvqiSERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLEN 1650
Cdd:TIGR02169  176 LEELEEVEE--NIERLDLIIDEKRQQLERLR---REREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLE 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1651 HELQGLKLQHDQRVSELEktqvavleeklelenlqqisqqqkgEIEwqkQLLERDKREIERMTA-ESRALQSCVECLSKE 1729
Cdd:TIGR02169  251 EELEKLTEEISELEKRLE-------------------------EIE---QLLEELNKKIKDLGEeEQLRVKEKIGELEAE 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1730 KEDLqekcdiwEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNS 1809
Cdd:TIGR02169  303 IASL-------ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEE 375
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1810 LQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEM 1889
Cdd:TIGR02169  376 VDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEW 455
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1890 MFQRLQKERESEESKLETskvtLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQL- 1968
Cdd:TIGR02169  456 KLEQLAADLSKYEQELYD----LKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQGVHGTVAQLg 531
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1969 --SEREQQLVEKSG----------------ELLALQKEADSMRADFSLL------------------------------- 1999
Cdd:TIGR02169  532 svGERYATAIEVAAgnrlnnvvveddavakEAIELLKRRKAGRATFLPLnkmrderrdlsilsedgvigfavdlvefdpk 611
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2000 -----------------------------------------------------RNQFLTERKKAEKQVASLKEALKIQRS 2026
Cdd:TIGR02169  612 yepafkyvfgdtlvvedieaarrlmgkyrmvtlegelfeksgamtggsraprgGILFSRSEPAELQRLRERLEGLKRELS 691
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2027 QLEKNLLTINEDSERDSSLLSKEQKqENSCIQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMAN-------- 2098
Cdd:TIGR02169  692 SLQSELRRIENRLDELSQELSDASR-KIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKElearieel 770
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2099 QKDLERRQMEISDAMRTLKSEVKDEIRTSLKNLNQFLPELPA---DLEAILERNENLEGELESLKENLpftmnegpfEEK 2175
Cdd:TIGR02169  771 EEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEArlrEIEQKLNRLTLEKEYLEKEIQEL---------QEQ 841
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 2176 LNFSQVHIMDEHWRGEALREKLRHREDRLKaqlrhcmSKQAEVlikgkRQTEGTLHSLRRQVDAL 2240
Cdd:TIGR02169  842 RIDLKEQIKSIEKEIENLNGKKEELEEELE-------ELEAAL-----RDLESRLGDLKKERDEL 894
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
435-662 1.01e-06


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 54.25  E-value: 1.01e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAA----QTRLSELHDEIEKAEQQIlratEEFKQLEEAIQLkkisEAGKDLLYKQLSGrlqlvnkLRQ 510
Cdd:COG3206    162 LEQNLELRREEARKAleflEEQLPELRKELEEAEAAL----EEFRQKNGLVDL----SEEAKLLLQQLSE-------LES 226
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  511 EALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLdSKDPKHSHMKAQKSGKEQQLDIMNKQY-------QQLESRLDEILS 583
Cdd:COG3206    227 QLAEARAELAEAEARLAALRAQLGSGPDALPEL-LQSPVIQQLRAQLAELEAELAELSARYtpnhpdvIALRAQIAALRA 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  584 RIAKETEEIKdleeQLTEGQIAANEALKKDLEGVISGLQEYLGTIKGQATQA---QNECRKLRDEKETLLQRLTEVEQER 660
Cdd:COG3206    306 QLQQEAQRIL----ASLEAELEALQAREASLQAQLAQLEARLAELPELEAELrrlEREVEVARELYESLLQRLEEARLAE 381

                   ..
gi 2217375254  661 DQ 662
Cdd:COG3206    382 AL 383
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1365-1576 1.02e-06


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 54.25  E-value: 1.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1365 IEKTLLKRRSELREADRLLAE--AESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSL 1442
Cdd:COG3206    180 LEEQLPELRKELEEAEAALEEfrQKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPEL 259
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1443 QADA--KDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEdhhLQVLKESEVLLQAKR 1520
Cdd:COG3206    260 LQSPviQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILASLEAE---LEALQAREASLQAQL 336
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217375254 1521 AELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKadLQEALRLGETEV 1576
Cdd:COG3206    337 AQLEARLAELPELEAELRRLEREVEVARELYESLLQRLEEAR--LAEALTVGNVRV 390
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
440-730 1.20e-06

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 54.19  E-value: 1.20e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  440 EDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEaiqlkkiseagkdllykqlsgRLQLVNKL----------- 508
Cdd:COG3096    832 PDPEAELAALRQRRSELERELAQHRAQEQQLRQQLDQLKE---------------------QLQLLNKLlpqanlladet 890
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  509 ---RQEALDLEL-QMEKQKQEIAGKQKEIKDLQIAIDSLDSkDPkhshmkaqksgkeQQLDIMNKQYQQLESRLDEILSR 584
Cdd:COG3096    891 ladRLEELREELdAAQEAQAFIQQHGKALAQLEPLVAVLQS-DP-------------EQFEQLQADYLQAKEQQRRLKQQ 956
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  585 IAKETEEIK--------DLEEQLTEGQiAANEALKKDLEGV---ISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRL 653
Cdd:COG3096    957 IFALSEVVQrrphfsyeDAVGLLGENS-DLNEKLRARLEQAeeaRREAREQLRQAQAQYSQYNQVLASLKSSRDAKQQTL 1035
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217375254  654 TEVEQERDQLEI-VAMDAENMRKLEQSALQAELEKERQAlKNALGKAQFSEEKEQENseLHAKLKHLQDDNNLLKQQL 730
Cdd:COG3096   1036 QELEQELEELGVqADAEAEERARIRRDELHEELSQNRSR-RSQLEKQLTRCEAEMDS--LQKRLRKAERDYKQEREQV 1110
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1384-1573 1.57e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 52.91  E-value: 1.57e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1384 AEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKE- 1462
Cdd:COG3883     12 AFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGEr 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1463 -------------INKIVAAKD-SDF-------QCLSKKKEKLTEELQKLQKDIEMAERNedhhlqvLKESEVLLQAKRA 1521
Cdd:COG3883     92 aralyrsggsvsyLDVLLGSESfSDFldrlsalSKIADADADLLEELKADKAELEAKKAE-------LEAKLAELEALKA 164
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217375254 1522 ELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEALRLGE 1573
Cdd:COG3883    165 ELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAA 216
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1660-2161 1.59e-06


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 53.51  E-value: 1.59e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1660 HDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLqekcdi 1739
Cdd:PRK02224   246 HEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREEL------ 319
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1740 wEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQD 1819
Cdd:PRK02224   320 -EDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRE 398
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1820 HLNLAKQDLLHTTKHQDVLLSEQTRLQKDIsewanrfedcqKEEETKqqqLQVLQNEIEENklklvqqemmfQRLQKE-- 1897
Cdd:PRK02224   399 RFGDAPVDLGNAEDFLEELREERDELRERE-----------AELEAT---LRTARERVEEA-----------EALLEAgk 453
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1898 -----RESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVlaaeERVRTLQEEERWCESLEKTLSQTKRQLSERE 1972
Cdd:PRK02224   454 cpecgQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERL----ERAEDLVEAEDRIERLEERREDLEELIAERR 529
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1973 QQLVEKSGELLALQKEADSMRADFSLLRNQFLTERKKAEK---QVASL---KEALKIQRSQLEK--NLLTINEDSERDSS 2044
Cdd:PRK02224   530 ETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEareEVAELnskLAELKERIESLERirTLLAAIADAEDEIE 609
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2045 LLS--KEQKQENSCIQKEMatielvAQDNHERARRLMKELNQMQYEYTELKKQMAnqkdlERRQMEISDAMRTLKSEvKD 2122
Cdd:PRK02224   610 RLRekREALAELNDERRER------LAEKRERKRELEAEFDEARIEEAREDKERA-----EEYLEQVEEKLDELREE-RD 677
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*.
gi 2217375254 2123 E-------IRTSLKNLNQFLPELPAdLEAILERNENLEGELESLKE 2161
Cdd:PRK02224   678 DlqaeigaVENELEELEELRERREA-LENRVEALEALYDEAEELES 722
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1294-1819 1.76e-06


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 53.51  E-value: 1.76e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1294 EKEMEELHHNIDDLLQEKKSLECEVEELH----RTVQKRQQQKDFIDGNVESLMTELEIEKSLKHHEDIVDEIECIEKTL 1369
Cdd:PRK02224   198 EKEEKDLHERLNGLESELAELDEEIERYEeqreQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETEREREEL 277
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1370 LKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLlqtESDAEELERRAQETAVNLVKADQQLRSLQADAKDL 1449
Cdd:PRK02224   278 AEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREEL---EDRDEELRDRLEECRVAAQAHNEEAESLREDADDL 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1450 EQHKIKQEEILKEINKIVAAKDSDfqcLSKKKEKLTEelqkLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQ 1529
Cdd:PRK02224   355 EERAEELREEAAELESELEEAREA---VEDRREEIEE----LEEEIEELRERFGDAPVDLGNAEDFLEELREERDELRER 427
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1530 VTSQQQEMAVLDRQLgHKKEELhLLQGSMVQAKADLQEALRLGETEvtEKCNHIREVKSLLEELSFQKGELNVQIsERKT 1609
Cdd:PRK02224   428 EAELEATLRTARERV-EEAEAL-LEAGKCPECGQPVEGSPHVETIE--EDRERVEELEAELEDLEEEVEEVEERL-ERAE 502
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1610 QLTLIKQEIEKEEENLQVVLRQMSKHKTELKNilDMLQLEnhELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQ 1689
Cdd:PRK02224   503 DLVEAEDRIERLEERREDLEELIAERRETIEE--KRERAE--ELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELN 578
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1690 QQKGEIEWQKQLLERdkreIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLA---QTKRVLAAAEENSKME--QSN 1764
Cdd:PRK02224   579 SKLAELKERIESLER----IRTLLAAIADAEDEIERLREKREALAELNDERRERLAekrERKRELEAEFDEARIEeaRED 654
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254 1765 LEKLELNVRKLQQELDQLNRDKLSLHNDISAMQ------QQLQEKREAVNSLQEELANVQD 1819
Cdd:PRK02224   655 KERAEEYLEQVEEKLDELREERDDLQAEIGAVEneleelEELRERREALENRVEALEALYD 715
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1262-1862 1.80e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 53.53  E-value: 1.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1262 LENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVES 1341
Cdd:TIGR02169  366 LEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1342 LmtELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELScTKEKTKNAVEK-----------FTDAKRS 1410
Cdd:TIGR02169  446 K--ALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA-EAEAQARASEErvrggraveevLKASIQG 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1411 LLQTESDAEELERRAQ----------------ETAVNLVKADQQLRSLQAD-AKDLEQHKIKQEEILKEINKIVAAKDS- 1472
Cdd:TIGR02169  523 VHGTVAQLGSVGERYAtaievaagnrlnnvvvEDDAVAKEAIELLKRRKAGrATFLPLNKMRDERRDLSILSEDGVIGFa 602
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1473 -DFQCLSKKKEKLTEELQK---LQKDIEMAERNEDHHLQVLKESEVL----------LQAKRAEL--EKLKSQVTSQQQE 1536
Cdd:TIGR02169  603 vDLVEFDPKYEPAFKYVFGdtlVVEDIEAARRLMGKYRMVTLEGELFeksgamtggsRAPRGGILfsRSEPAELQRLRER 682
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1537 MAVLDRQLGHKKEELHLLQGSMVQAKADLQEALRlgetEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQ 1616
Cdd:TIGR02169  683 LEGLKRELSSLQSELRRIENRLDELSQELSDASR----KIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKS 758
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1617 EIEKEEENLQVVLRQMSKHKTELKNILDMLQLE-----NHELQGLKLQH---DQRVSELEKTQVAVLEEKLELENLQQIS 1688
Cdd:TIGR02169  759 ELKELEARIEELEEDLHKLEEALNDLEARLSHSripeiQAELSKLEEEVsriEARLREIEQKLNRLTLEKEYLEKEIQEL 838
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1689 QQQKGEIEWQK----QLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSN 1764
Cdd:TIGR02169  839 QEQRIDLKEQIksieKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKR 918
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1765 LEKLELNVRKLQQELDQLNRDKLSLHNDISAMQ--QQLQEKREAVnslQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQ 1842
Cdd:TIGR02169  919 LSELKAKLEALEEELSEIEDPKGEDEEIPEEELslEDVQAELQRV---EEEIRALEPVNMLAIQEYEEVLKRLDELKEKR 995
                          650       660
                   ....*....|....*....|
gi 2217375254 1843 TRLQKDISEWANRFEDCQKE 1862
Cdd:TIGR02169  996 AKLEEERKAILERIEEYEKK 1015
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
435-609 2.03e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 52.52  E-value: 2.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQL--KKISEAGKDL-------------LYKQ-- 497
Cdd:COG3883     21 KQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKlqAEIAEAEAEIeerreelgeraraLYRSgg 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  498 ----------------LSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKdpkhshMKAQKSGKE 561
Cdd:COG3883    101 svsyldvllgsesfsdFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAE------LEAAKAELE 174
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 2217375254  562 QQLDIMNKQYQQLESRLDEILSRIAKETEEIKDLEEQLTEGQIAANEA 609
Cdd:COG3883    175 AQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAA 222
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
437-999 2.07e-06

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 53.43  E-value: 2.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  437 TQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLY-----KQLSGRLQLVNKLRQE 511
Cdd:TIGR00618  246 TQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQieqqaQRIHTELQSKMRSRAK 325
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  512 ALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMKAQKSGKEQQLDIMNKQYQQLESRLDeiLSRIAKETEE 591
Cdd:TIGR00618  326 LLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQ--KLQSLCKELD 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  592 IKDLEEQLTEGQIAANEALKKDL---EGVISGLQEYLgTIKGQATQAQNECRKLRD-EKETLLQRLTEVEQERDQLEIVA 667
Cdd:TIGR00618  404 ILQREQATIDTRTSAFRDLQGQLahaKKQQELQQRYA-ELCAAAITCTAQCEKLEKiHLQESAQSLKEREQQLQTKEQIH 482
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  668 MDAENMRKLEQSALQaELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLVRP 747
Cdd:TIGR00618  483 LQETRKKAVVLARLL-ELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASL 561
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  748 EEVAARVDElrRKLKLGTGEMNIHSPSDVLGKSLADLQKQFSEILARSKWERDEAQVRERKLQEEMALQQEKLATGQEEF 827
Cdd:TIGR00618  562 KEQMQEIQQ--SFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQ 639
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  828 RQACERALEARMNFDKRQHEARIQQM---ENEIHYLQENLKSMEEIQGLTDlQLQEADEEKERILAQLRELEKKKKLEDA 904
Cdd:TIGR00618  640 ELALKLTALHALQLTLTQERVREHALsirVLPKELLASRQLALQKMQSEKE-QLTYWKEMLAQCQTLLRELETHIEEYDR 718
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  905 KSQEQVFGLDKELKKLKKAVATSDKLATAELTIAKDQLKSLhgtvMKINQERAEELQEAERFSRKAAQAARDLTRAEAEI 984
Cdd:TIGR00618  719 EFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKAR----TEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLR 794
                          570
                   ....*....|....*
gi 2217375254  985 ELLQNLLRQKGEQFR 999
Cdd:TIGR00618  795 EEDTHLLKTLEAEIG 809
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1766-2125 2.32e-06

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 53.42  E-value: 2.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1766 EKLELNVRKLQQELDQLN-RDKLSLHND-ISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTK---HQDVL-- 1838
Cdd:COG3096    279 ERRELSERALELRRELFGaRRQLAEEQYrLVEMARELEELSARESDLEQDYQAASDHLNLVQTALRQQEKierYQEDLee 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1839 LSEQTRLQKDISEWAN--------RFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERE-SEESKLETSK 1909
Cdd:COG3096    359 LTERLEEQEEVVEEAAeqlaeaeaRLEAAEEEVDSLKSQLADYQQALDVQQTRAIQYQQAVQALEKARAlCGLPDLTPEN 438
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1910 V-----TLKEQQHQLEKELTDQKSKLD-------------QVLSKVL------AAEERVRTLQEEERWCESLEKTLSQTK 1965
Cdd:COG3096    439 AedylaAFRAKEQQATEEVLELEQKLSvadaarrqfekayELVCKIAgeversQAWQTARELLRRYRSQQALAQRLQQLR 518
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1966 RQLSEREQqlveksgeLLALQKEADSMRADFSLLRNQFLTerkkAEKQVASLKEALKIQRSQLEKNLltiNEDSERDSSl 2045
Cdd:COG3096    519 AQLAELEQ--------RLRQQQNAERLLEEFCQRIGQQLD----AAEELEELLAELEAQLEELEEQA---AEAVEQRSE- 582
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2046 LSKEQKQENSCIqKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMANQKDLERRQMEISDAMRTLKSEVKDEIR 2125
Cdd:COG3096    583 LRQQLEQLRARI-KELAARAPAWLAAQDALERLREQSGEALADSQEVTAAMQQLLEREREATVERDELAARKQALESQIE 661
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1277-1990 2.63e-06

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 and 1234 amino acids in length. This family contains a P-loop motif, suggesting it is a nucleotide-binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 52.92  E-value: 2.63e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1277 KSKKREERWMRasKRQSEKEMEELHHNIDDLLQE---KKSLECEVEELHRTVQKRQQQkdfidgnVESLMTELEIEKSLK 1353
Cdd:pfam12128  218 LNRQQVEHWIR--DIQAIAGIMKIRPEFTKLQQEfntLESAELRLSHLHFGYKSDETL-------IASRQEERQETSAEL 288
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1354 HHEDIVDEIECIEKTLlKRRSELREADRLLAEAESELSCTKEKT----KNAVEKFTDAKRSLLQTESDAEELERR--AQE 1427
Cdd:pfam12128  289 NQLLRTLDDQWKEKRD-ELNGELSAADAAVAKDRSELEALEDQHgaflDADIETAAADQEQLPSWQSELENLEERlkALT 367
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1428 TAVNLVKADQQLRSLQAD---AKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSkkkeklteelQKLQKDIEMAERNedh 1504
Cdd:pfam12128  368 GKHQDVTAKYNRRRSKIKeqnNRDIAGIKDKLAKIREARDRQLAVAEDDLQALE----------SELREQLEAGKLE--- 434
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1505 hlqvLKESEVLLQAKRAELEKLKSQVTSQQQEMAvldrQLGHKKEELHLLQGSMVQAKA---DLQEALRLGETEVTEKCN 1581
Cdd:pfam12128  435 ----FNEEEYRLKSRLGELKLRLNQATATPELLL----QLENFDERIERAREEQEAANAeveRLQSELRQARKRRDQASE 506
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1582 HIREVKSLLEELSFQKGELNVQISERK-TQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENH-ELQGLKLq 1659
Cdd:pfam12128  507 ALRQASRRLEERQSALDELELQLFPQAgTLLHFLRKEAPDWEQSIGKVISPELLHRTDLDPEVWDGSVGGElNLYGVKL- 585
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1660 HDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQScvecLSKEKEDLQEKCDI 1739
Cdd:pfam12128  586 DLKRIDVPEWAASEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTA----LKNARLDLRRLFDE 661
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1740 WEKKLAQTKRVLAAAEENSKMEQSNLEK-LELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREA-VNSLQEELANV 1817
Cdd:pfam12128  662 KQSEKDKKNKALAERKDSANERLNSLEAqLKQLDKKHQAWLEEQKEQKREARTEKQAYWQVVEGALDAqLALLKAAIAAR 741
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1818 QD----HLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEE-----------ETKQQQLQVLQNEIEENKL 1882
Cdd:pfam12128  742 RSgakaELKALETWYKRDLASLGVDPDVIAKLKREIRTLERKIERIAVRRqevlryfdwyqETWLQRRPRLATQLSNIER 821
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1883 KLVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSKLdqvlskvlaaeERVRTLQEEERwCESLEKTLS 1962
Cdd:pfam12128  822 AISELQQQLARLIADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEM-----------SKLATLKEDAN-SEQAQGSIG 889
                          730       740
                   ....*....|....*....|....*...
gi 2217375254 1963 QTKRQLserEQQLVEKSGELLALQKEAD 1990
Cdd:pfam12128  890 ERLAQL---EDLKLKRDYLSESVKKYVE 914
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
243-763 2.79e-06

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 52.81  E-value: 2.79e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  243 HLRSLESLEGQPVTTQDRQEAFERFSLEEVERLERDLEKKMIETEELKSKQTRFLEEIKNQDK--------LNKSLKEEA 314
Cdd:pfam15921  315 YMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDqlqklladLHKREKELS 394
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  315 MLQKQ-------------SCEELKSDLNTKN-ELLKQKTIELTRACQKQYELEQELAfyKIDAKFEPLNYYPSEYAEIDk 380
Cdd:pfam15921  395 LEKEQnkrlwdrdtgnsiTIDHLRRELDDRNmEVQRLEALLKAMKSECQGQMERQMA--AIQGKNESLEKVSSLTAQLE- 471
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  381 apdespyigksrykrnmfATESYIIDSAQAVQIKKM--EPDEQLRNDhmnlrghtpLDTQLEDKEKKISAAQTRLSELHD 458
Cdd:pfam15921  472 ------------------STKEMLRKVVEELTAKKMtlESSERTVSD---------LTASLQEKERAIEATNAEITKLRS 524
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  459 EIEKAEQQILRATEEFKQLEEA---IQLKKISEAGKD----LLYKQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQK 531
Cdd:pfam15921  525 RVDLKLQELQHLKNEGDHLRNVqteCEALKLQMAEKDkvieILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRRL 604
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  532 EIKDLQIAidsLDSKDPKHSHMKAQKSGKE-QQLDIMNKQYQQLES------RLDEILSRIAKETEEIKDLEEQLtegqi 604
Cdd:pfam15921  605 ELQEFKIL---KDKKDAKIRELEARVSDLElEKVKLVNAGSERLRAvkdikqERDQLLNEVKTSRNELNSLSEDY----- 676
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  605 aanEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEketllqrLTEVEQERDQLEIVAMDAENM---RKLEQSAL 681
Cdd:pfam15921  677 ---EVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNT-------LKSMEGSDGHAMKVAMGMQKQitaKRGQIDAL 746
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  682 QAELEKERQALKNALGKAQFSEEKEQ-----------ENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLVRPEEV 750
Cdd:pfam15921  747 QSKIQFLEEAMTNANKEKHFLKEEKNklsqelstvatEKNKMAGELEVLRSQERRLKEKVANMEVALDKASLQFAECQDI 826
                          570
                   ....*....|...
gi 2217375254  751 AARVDELRRKLKL 763
Cdd:pfam15921  827 IQRQEQESVRLKL 839
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1553-1930 2.93e-06

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 52.20  E-value: 2.93e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1553 LLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLtlikqeiekeEENLQVVLRQM 1632
Cdd:pfam07888   31 LLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEEL----------RQSREKHEELE 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1633 SKHKtELKNILDMLQLENHELQGLKLQHDQRVSELEktqvavleekleleNLQQISQQQKGEIEWQkqlLERDKREIERM 1712
Cdd:pfam07888  101 EKYK-ELSASSEELSEEKDALLAQRAAHEARIRELE--------------EDIKTLTQRVLERETE---LERMKERAKKA 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1713 TAESRALQScveclskEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKL---SL 1789
Cdd:pfam07888  163 GAQRKEEEA-------ERKQLQAKLQQTEEELRSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAeneAL 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1790 HNDISAMQQQLQEKREAVNSLQEELANVqdhlnLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQ--KEEETKQ 1867
Cdd:pfam07888  236 LEELRSLQERLNASERKVEGLGEELSSM-----AAQRDRTQAELHQARLQAAQLTLQLADASLALREGRARwaQERETLQ 310
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217375254 1868 QQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSKL 1930
Cdd:pfam07888  311 QSAEADKDRIEKLSAELQRLEERLQEERMEREKLEVELGREKDCNRVQLSESRRELQELKASL 373
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
566-999 3.00e-06


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 52.46  E-value: 3.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  566 IMNKQYQQLESRLDEILSRIAKETEEIKDLEEqlTEGQIAANEALKKDLEGVISGLQEYLG--TIKGQATQAQNECRKLR 643
Cdd:COG4717     68 LNLKELKELEEELKEAEEKEEEYAELQEELEE--LEEELEELEAELEELREELEKLEKLLQllPLYQELEALEAELAELP 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  644 DEKETLLQRLTEVEQERDQLEIVAMDAENMRKLEQSALQAELEKERQALKNALGKAqfsEEKEQENSELHAKLKHLQDDN 723
Cdd:COG4717    146 ERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEEL---EELQQRLAELEEELEEAQEEL 222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  724 NLLKQQLKDFQNHLnhvvdglvRPEEVAARVDELRRKLKLGTGEMNIHSPSDVLGKSLADLQK----------QFSEILA 793
Cdd:COG4717    223 EELEEELEQLENEL--------EAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGvlflvlgllaLLFLLLA 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  794 RSKWERDEAQVRERKLQEEMALQQEKLATGQEEFRqaCERALEARMNFDKRQHEARIQQMENEIHYLQENLK---SMEEI 870
Cdd:COG4717    295 REKASLGKEAEELQALPALEELEEEELEELLAALG--LPPDLSPEELLELLDRIEELQELLREAEELEEELQleeLEQEI 372
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  871 QGLTDLQLQEADEEKERILAQLREL-EKKKKLEDAKSQEQvfglDKELKKLKKAVATSDKLATAELTIAKDQLKSLHGTV 949
Cdd:COG4717    373 AALLAEAGVEDEEELRAALEQAEEYqELKEELEELEEQLE----ELLGELEELLEALDEEELEEELEELEEELEELEEEL 448
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|
gi 2217375254  950 MKINQERAEELQEaerfsRKAAQAARDLTRAEAEIELLQNLLRQKGEQFR 999
Cdd:COG4717    449 EELREELAELEAE-----LEQLEEDGELAELLQELEELKAELRELAEEWA 493
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1457-1966 3.41e-06


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 52.37  E-value: 3.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1457 EEILKEINKIvaakDSDFQCLSKKKEKLTEELQKLQK---DIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQ 1533
Cdd:PRK03918   203 EEVLREINEI----SSELPELREELEKLEKEVKELEElkeEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEEL 278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1534 QQEMAVLdRQLGHKKEELHLLQGSMVQAKADLQEaLRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTl 1613
Cdd:PRK03918   279 EEKVKEL-KELKEKAEEYIKLSEFYEEYLDELRE-IEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLE- 355
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1614 IKQEIEKEEENLQVVLRQMSKHKTELKNI-LDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQK 1692
Cdd:PRK03918   356 ELEERHELYEEAKAKKEELERLKKRLTGLtPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAK 435
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1693 GEI---------EWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLqEKCDIWEKKLAQTKRVLAAAEE-NSKMEQ 1762
Cdd:PRK03918   436 GKCpvcgrelteEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELREL-EKVLKKESELIKLKELAEQLKElEEKLKK 514
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1763 SNLEKLELN---VRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREavnsLQEELANVQDHL-NLAKQDLLHTTKHQDVL 1838
Cdd:PRK03918   515 YNLEELEKKaeeYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAE----LEKKLDELEEELaELLKELEELGFESVEEL 590
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1839 LSEQTRLQKDISEWaNRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKE--------RESEESKLETSKV 1910
Cdd:PRK03918   591 EERLKELEPFYNEY-LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKEleelekkySEEEYEELREEYL 669
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217375254 1911 TLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKR 1966
Cdd:PRK03918   670 ELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEE 725
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
516-708 3.63e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 51.75  E-value: 3.63e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  516 ELQMEKQKQEIAGKQKEIKDLQIAIDSLDskdpkhshmkaqksgkeQQLDIMNKQYQQLESRLDEILSRIAKETEEIKDL 595
Cdd:COG3883     15 DPQIQAKQKELSELQAELEAAQAELDALQ-----------------AELEELNEEYNELQAELEALQAEIDKLQAEIAEA 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  596 EEQLTEGQIAANEALK------------------KDLEGVISGLqEYLGTIKGQAT----QAQNECRKLRDEKETLLQRL 653
Cdd:COG3883     78 EAEIEERREELGERARalyrsggsvsyldvllgsESFSDFLDRL-SALSKIADADAdlleELKADKAELEAKKAELEAKL 156
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254  654 TEVEQERDQLEIVAMDAENmRKLEQSALQAELEKERQALKNALGKAQFSEEKEQE 708
Cdd:COG3883    157 AELEALKAELEAAKAELEA-QQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEA 210
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
435-601 4.18e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 51.69  E-value: 4.18e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEaiQLKKISEAGKDLLYK------------------ 496
Cdd:COG4942     53 LLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA--ELEAQKEELAELLRAlyrlgrqpplalllsped 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  497 --QLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSL-DSKDPKHSHMKAQKSGKEQQLDIMNKQYQQ 573
Cdd:COG4942    131 flDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALlAELEEERAALEALKAERQKLLARLEKELAE 210
                          170       180
                   ....*....|....*....|....*...
gi 2217375254  574 LESRLDEILSRIAKETEEIKDLEEQLTE 601
Cdd:COG4942    211 LAAELAELQQEAEELEALIARLEAEAAA 238
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1796-2003 6.75e-06


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 51.56  E-value: 6.75e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1796 MQQQLQEKREAVNS----LQEELANVQDHLNLAKQDLLH-TTKHQDVLLSEQTR-LQKDISEWANRFEDCQKEEETKQQQ 1869
Cdd:COG3206    162 LEQNLELRREEARKalefLEEQLPELRKELEEAEAALEEfRQKNGLVDLSEEAKlLLQQLSELESQLAEARAELAEAEAR 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1870 LQVLQNEIEENKLKL--VQQEMMFQRLQKERESEESKLETSKVTLKEQqH----QLEKELTDQKSKLDQVLSKVLA-AEE 1942
Cdd:COG3206    242 LAALRAQLGSGPDALpeLLQSPVIQQLRAQLAELEAELAELSARYTPN-HpdviALRAQIAALRAQLQQEAQRILAsLEA 320
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254 1943 RVRTLQEEErwcESLEKTLSQTKRQLsereQQLVEKSGELLALQKEADSMRADFSLLRNQF 2003
Cdd:COG3206    321 ELEALQARE---ASLQAQLAQLEARL----AELPELEAELRRLEREVEVARELYESLLQRL 374
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1686-1993 7.71e-06

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 51.28  E-value: 7.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1686 QISQQQKGEIEWQKQLlERDKREIERMTAESRALQSCVEclskEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNL 1765
Cdd:pfam17380  276 HIVQHQKAVSERQQQE-KFEKMEQERLRQEKEEKAREVE----RRRKLEEAEKARQAEMDRQAAIYAEQERMAMEREREL 350
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1766 EKLELNVRKlqQELDQLNRDKLSLH----NDISAMQQQLQEKREAVNSLQEELANV-------QDHLNLAKQDLLHTTKH 1834
Cdd:pfam17380  351 ERIRQEERK--RELERIRQEEIAMEisrmRELERLQMERQQKNERVRQELEAARKVkileeerQRKIQQQKVEMEQIRAE 428
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1835 QDVLLSEQTRLQKDisEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEmmfqrlQKERESEESKLETSKVTLKE 1914
Cdd:pfam17380  429 QEEARQREVRRLEE--ERAREMERVRLEEQERQQQVERLRQQEEERKRKKLELE------KEKRDRKRAEEQRRKILEKE 500
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1915 QQHQLEKELTDQKSKldQVLSKVLaaEERVRTLQEEERWCESLEKTLSQT----KRQLSEREQQLVEKSGELLALQKEAD 1990
Cdd:pfam17380  501 LEERKQAMIEEERKR--KLLEKEM--EERQKAIYEEERRREAEEERRKQQemeeRRRIQEQMRKATEERSRLEAMERERE 576

                   ...
gi 2217375254 1991 SMR 1993
Cdd:pfam17380  577 MMR 579
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1261-2051 8.45e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher-order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 51.51  E-value: 8.45e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1261 NLENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVE 1340
Cdd:pfam02463  301 ELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKK 380
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1341 SLMTELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEE 1420
Cdd:pfam02463  381 LESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKL 460
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1421 LERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAER 1500
Cdd:pfam02463  461 LKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVEN 540
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1501 NEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELhllqgsmvqakadlqeaLRLGETEVTEKC 1580
Cdd:pfam02463  541 YKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPL-----------------KSIAVLEIDPIL 603
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1581 NHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIekeeenlqVVLRQMSKHKTELKNILDMLQLENHELQGLKLQH 1660
Cdd:pfam02463  604 NLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESA--------KAKESGLRKGVSLEEGLAEKSEVKASLSELTKEL 675
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1661 DQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIW 1740
Cdd:pfam02463  676 LEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKS 755
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1741 EKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELdqlnrdklslhnDISAMQQQLQEKREAVNSLQEELANVQDH 1820
Cdd:pfam02463  756 RLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEE------------KLKAQEEELRALEEELKEEAELLEEEQLL 823
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1821 LNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANR-FEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERE 1899
Cdd:pfam02463  824 IEQEEKIKEEELEELALELKEEQKLEKLAEEELERlEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELE 903
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1900 SEESKletskvtlKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKS 1979
Cdd:pfam02463  904 EESQK--------LNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGKV 975
                          730       740       750       760       770       780       790
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217375254 1980 GELLALQKEADSMRADFSLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSLLSKEQK 2051
Cdd:pfam02463  976 NLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEFLELFVSINKGWNKVFFYLELGGSAE 1047
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1466-2162 8.85e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 51.17  E-value: 8.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1466 IVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLG 1545
Cdd:TIGR04523   27 IANKQDTEEKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLS 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1546 HKKEELHLLQGSMVQAK---ADLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEE 1622
Cdd:TIGR04523  107 KINSEIKNDKEQKNKLEvelNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQ 186
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1623 ENLQVVLRQMSKHKTELKNILDMLQLENH---ELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQK 1699
Cdd:TIGR04523  187 KNIDKIKNKLLKLELLLSNLKKKIQKNKSlesQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIK 266
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1700 QLLERDKREIERMTAESRALQSCVECLSKEKEDL-QEKCDIWEKKLaqtKRVLAAAEENSKMEQSNLEKLELNVRKLQQE 1778
Cdd:TIGR04523  267 KQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLnNQKEQDWNKEL---KSELKNQEKKLEEIQNQISQNNKIISQLNEQ 343
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1779 LDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEElanvqdhlnlaKQDLLHTTKHqdvLLSEQTRLQKDISEWANRFED 1858
Cdd:TIGR04523  344 ISQLKKELTNSESENSEKQRELEEKQNEIEKLKKE-----------NQSYKQEIKN---LESQINDLESKIQNQEKLNQQ 409
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1859 CQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEE---SKLETSKVTLKEQQHQLEKELTDQKSKLDQVLS 1935
Cdd:TIGR04523  410 KDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKEliiKNLDNTRESLETQLKVLSRSINKIKQNLEQKQK 489
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1936 KVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQL----VEKSGELLALQKEADSMraDFSLLRNQFLTERKKAE 2011
Cdd:TIGR04523  490 ELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLesekKEKESKISDLEDELNKD--DFELKKENLEKEIDEKN 567
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2012 KQVASLKEALK-IQRSQLEKNLLTINEDSERD---SSLLSKEQKQEN-----SCIQKEMATIELVAQDNHERARRLMKEL 2082
Cdd:TIGR04523  568 KEIEELKQTQKsLKKKQEEKQELIDQKEKEKKdliKEIEEKEKKISSlekelEKAKKENEKLSSIIKNIKSKKNKLKQEV 647
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2083 NQMQYEYTELKKQMAN----QKDLERRQMEISDAMRTLKSEVKDEIRTSLKNLNQFlpelpADLEAILERNENLEGELES 2158
Cdd:TIGR04523  648 KQIKETIKEIRNKWPEiikkIKESKTKIDDIIELMKDWLKELSLHYKKYITRMIRI-----KDLPKLEEKYKEIEKELKK 722

                   ....
gi 2217375254 2159 LKEN 2162
Cdd:TIGR04523  723 LDEF 726
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1764-1967 1.00e-05


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 51.07  E-value: 1.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1764 NLEKLELNVRKLQQELDQLNRdklslhndISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLhtTKHQDVLLSEQT 1843
Cdd:COG4913    236 DLERAHEALEDAREQIELLEP--------IRELAERYAAARERLAELEYLRAALRLWFAQRRLELL--EAELEELRAELA 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1844 RLQKDISEWANRFEDCQKEEETKQQQ--------LQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETSKVTLKEQ 1915
Cdd:COG4913    306 RLEAELERLEARLDALREELDELEAQirgnggdrLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAAL 385
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217375254 1916 QHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERwceSLEKTLSQTKRQ 1967
Cdd:COG4913    386 RAEAAALLEALEEELEALEEALAEAEAALRDLRRELR---ELEAEIASLERR 434
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1287-1500 1.30e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 49.76  E-value: 1.30e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1287 RASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVESLMTEL-EIEKSLKHHEDIVDEieci 1365
Cdd:COG4942     33 QQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIaELRAELEAQKEELAE---- 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1366 ektLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQT-ESDAEELERRAQETAVNLVKADQQLRSLQA 1444
Cdd:COG4942    109 ---LLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEElRADLAELAALRAELEAERAELEALLAELEE 185
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217375254 1445 DAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAER 1500
Cdd:COG4942    186 ERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1262-1532 1.34e-05

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 50.42  E-value: 1.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1262 LENEVSRLEDIMQHLKSKKREERWMR-------ASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDF 1334
Cdd:PRK02224   211 LESELAELDEEIERYEEQREQARETRdeadevlEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEE 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1335 IDGNVESLMTELEIEKSlkHHEDIVDEIEciekTLLKRRSELREAdrlLAEAESELSCTKEKTKNAVEKFTDAKRSLLQT 1414
Cdd:PRK02224   291 LEEERDDLLAEAGLDDA--DAEAVEARRE----ELEDRDEELRDR---LEECRVAAQAHNEEAESLREDADDLEERAEEL 361
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1415 ESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKD 1494
Cdd:PRK02224   362 REEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARER 441
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217375254 1495 IEMAER--------------NEDHHLQVLKESEVLLQAKRAELEKLKSQVTS 1532
Cdd:PRK02224   442 VEEAEAlleagkcpecgqpvEGSPHVETIEEDRERVEELEAELEDLEEEVEE 493
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1360-2157 1.63e-05

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains. It is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 50.56  E-value: 1.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1360 DEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQL 1439
Cdd:pfam01576   12 EELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQLQAETELCAEAEEMRARLAARKQELEEILHELESRLEEEEERS 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1440 RSLQADAKDLEQHKIKQEEILKEinkivaaKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHhlqvlkesevlLQAK 1519
Cdd:pfam01576   92 QQLQNEKKKMQQHIQDLEEQLDE-------EEAARQKLQLEKVTTEAKIKKLEEDILLLEDQNSK-----------LSKE 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1520 RAELEKLKSQVTSQQQEMAvldrqlgHKKEELHLLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGE 1599
Cdd:pfam01576  154 RKLLEERISEFTSNLAEEE-------EKAKSLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGESTDLQEQIAE 226
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1600 LNVQISERKTQLtlikqeiekeeenlqvvlrqmSKHKTELKNILDMLQLEnhelQGLKLQHDQRVSELEktqvavleekl 1679
Cdd:pfam01576  227 LQAQIAELRAQL---------------------AKKEEELQAALARLEEE----TAQKNNALKKIRELE----------- 270
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1680 elenlQQISQQQKGeiewqkqlLERDKREIERMTAESRALQSCVECLSKEKEDL-------QEKCDIWEKKLAQTKRVLa 1752
Cdd:pfam01576  271 -----AQISELQED--------LESERAARNKAEKQRRDLGEELEALKTELEDTldttaaqQELRSKREQEVTELKKAL- 336
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1753 aaEENSKMEQSNLEKLELN----VRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEelanvqdhlnlAKQDL 1828
Cdd:pfam01576  337 --EEETRSHEAQLQEMRQKhtqaLEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQ-----------AKQDS 403
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1829 LHTTKhqdvllseqtRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETS 1908
Cdd:pfam01576  404 EHKRK----------KLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDT 473
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1909 KVTLKEQqhqlekelTDQKSKLDqvlSKVLAAEERVRTLQ----EEERWCESLEKTLSQTKRQLSEREQQLVEKSGELLA 1984
Cdd:pfam01576  474 QELLQEE--------TRQKLNLS---TRLRQLEDERNSLQeqleEEEEAKRNVERQLSTLQAQLSDMKKKLEEDAGTLEA 542
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1985 LQKEADSMRADFSLLRNQFlterkkAEKQVASLKEALKIQRSQLEKNLLTINEDSERD-SSLLSKEQKQENSCIQKEMAt 2063
Cdd:pfam01576  543 LEEGKKRLQRELEALTQQL------EEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQlVSNLEKKQKKFDQMLAEEKA- 615
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2064 IELVAQDNHERARRLMKELNQMQYEYT-ELKKQMANQKDLERRQMEISDAMRTLKSEvKDEIRTSLKNLNQFLPELPADL 2142
Cdd:pfam01576  616 ISARYAEERDRAEAEAREKETRALSLArALEEALEAKEELERTNKQLRAEMEDLVSS-KDDVGKNVHELERSKRALEQQV 694
                          810
                   ....*....|....*
gi 2217375254 2143 EAILERNENLEGELE 2157
Cdd:pfam01576  695 EEMKTQLEELEDELQ 709
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
126-167 1.68e-05

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 43.77  E-value: 1.68e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 2217375254  126 KLEVLNLSYNLIGKIEKLDKLLKLRELNLSYN-KISKIEGIEN 167
Cdd:pfam12799    2 NLEVLDLSNNQITDIPPLAKLPNLETLDLSGNnKITDLSDLAN 44
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1413-1610 1.88e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 49.38  E-value: 1.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1413 QTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQ 1492
Cdd:COG4942     24 EAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQK 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1493 KDIE-----MAERNEDHHLQVLKESEVLLQAKRAeLEKLKSQVTSQQQEMAVLDRQLghkkEELHLLQGSMVQAKADLQE 1567
Cdd:COG4942    104 EELAellraLYRLGRQPPLALLLSPEDFLDAVRR-LQYLKYLAPARREQAEELRADL----AELAALRAELEAERAELEA 178
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2217375254 1568 ALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQ 1610
Cdd:COG4942    179 LLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1726-2163 2.78e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 49.63  E-value: 2.78e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1726 LSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKRE 1805
Cdd:TIGR04523   38 LEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKNDKE 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1806 AVNSLQEELANVqdhlnlaKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIE--ENKLK 1883
Cdd:TIGR04523  118 QKNKLEVELNKL-------EKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLniQKNID 190
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1884 LVQQEMMFQRLQ----KERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEE----RWCE 1955
Cdd:TIGR04523  191 KIKNKLLKLELLlsnlKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQnkikKQLS 270
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1956 SLEKTLSQTKRQLSEREQQLVEKSGELLAL--QKEADSMRADFSLLRNQ------FLTERKKAEKQVASLKEA---LKIQ 2024
Cdd:TIGR04523  271 EKQKELEQNNKKIKELEKQLNQLKSEISDLnnQKEQDWNKELKSELKNQekkleeIQNQISQNNKIISQLNEQisqLKKE 350
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2025 RSQLEKNLLTINEDSERDSSLLSKEQKQENSCIQK------EMATIELVAQDNHERARRLMKELNQMQYEYTELKKQman 2098
Cdd:TIGR04523  351 LTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEiknlesQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKE--- 427
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 2099 QKDLERRQMEISDAMRTLKSEvKDEIRTSLKNLNQFLPELPADLEAILERNENLEGELESLKENL 2163
Cdd:TIGR04523  428 IERLKETIIKNNSEIKDLTNQ-DSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKEL 491
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1365-2110 3.00e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 49.58  E-value: 3.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1365 IEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQA 1444
Cdd:TIGR00618  188 KKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRA 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1445 DAKDLEQHKIKQEEILKEINkivaakdsdfqcLSKKKEKLTEElqklQKDIEMAERNEDHHLQVLKESEVLLQAKRAELE 1524
Cdd:TIGR00618  268 RIEELRAQEAVLEETQERIN------------RARKAAPLAAH----IKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRA 331
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1525 KLKSQVTSQQQEMAVLDRQLghkKEELHLLQGSMVQAKADLQEALRLGETEVTEKCNHIREVksLLEELSFQKGELNVQI 1604
Cdd:TIGR00618  332 AHVKQQSSIEEQRRLLQTLH---SQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTT--LTQKLQSLCKELDILQ 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1605 SERKTQLTLIKQEIEKEeenlQVVLRQMSKHKTELKNILDMLQLENHELQGLKlqhdQRVSELEKTQVAVLEEKLELENL 1684
Cdd:TIGR00618  407 REQATIDTRTSAFRDLQ----GQLAHAKKQQELQQRYAELCAAAITCTAQCEK----LEKIHLQESAQSLKEREQQLQTK 478
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1685 QQISQQqkgeiewqkqllERDKREIERMTAESRALQSCVECLSkEKEDLQEKCDIWEKKlAQTKRVLAAAEENSKMEQSn 1764
Cdd:TIGR00618  479 EQIHLQ------------ETRKKAVVLARLLELQEEPCPLCGS-CIHPNPARQDIDNPG-PLTRRMQRGEQTYAQLETS- 543
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1765 leklelnVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTR 1844
Cdd:TIGR00618  544 -------EEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHA 616
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1845 LQKDISEWANRFEDCQkEEETKQQQLQVLQNEIEENKLKLVQQEMMfQRLQKERESEESKLETSKVTLKEQQHQLEkELT 1924
Cdd:TIGR00618  617 LLRKLQPEQDLQDVRL-HLQQCSQELALKLTALHALQLTLTQERVR-EHALSIRVLPKELLASRQLALQKMQSEKE-QLT 693
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1925 DQKSKLDQVLSKVLAAEERVRTLqeeERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFSLLRNQFL 2004
Cdd:TIGR00618  694 YWKEMLAQCQTLLRELETHIEEY---DREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAHFNNNEEV 770
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2005 TERKKAEKQVASLKEALKIQRSQLEknlltinEDSERDSSLLSKEQKQENSCIQKEMATIELVAQDN---HERARRLMKE 2081
Cdd:TIGR00618  771 TAALQTGAELSHLAAEIQFFNRLRE-------EDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEeqfLSRLEEKSAT 843
                          730       740
                   ....*....|....*....|....*....
gi 2217375254 2082 LNQMQYEYTELKKQMANQKDLERRQMEIS 2110
Cdd:TIGR00618  844 LGEITHQLLKYEECSKQLAQLTQEQAKII 872
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
270-791 3.03e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 49.25  E-value: 3.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  270 EEVERLERDLEKKMIETEELKSKQTRFLEEIKNQDKLNKSLKEEAMLQKQSCEELKSDLNT--KNELLKQKTIELTracq 347
Cdd:TIGR04523  117 EQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLleKEKLNIQKNIDKI---- 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  348 KQYELEQELAFYKIDAKFEPLNYYPSEYAEIDKapdespyigksrykrnmfaTESYIIDSAQAVQIKKMEPDEQLRNDHM 427
Cdd:TIGR04523  193 KNKLLKLELLLSNLKKKIQKNKSLESQISELKK-------------------QNNQLKDNIEKKQQEINEKTTEISNTQT 253
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  428 NLRG----HTPLDTQLEDKEKKISAAQTRLSELHDEIEKAEQQIL-----RATEEFKQLEEAI--QLKKISEAGKDLLY- 495
Cdd:TIGR04523  254 QLNQlkdeQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISdlnnqKEQDWNKELKSELknQEKKLEEIQNQISQn 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  496 -KQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSL-----------DSKDPKHSHMKAQKSGKEQQ 563
Cdd:TIGR04523  334 nKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYkqeiknlesqiNDLESKIQNQEKLNQQKDEQ 413
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  564 LDIMNKQYQQLESRLDEILSRIAKETEEIKDLEEQLTEGQIAANEALK---------KDLEGVISGLQEYLGTIKGQATQ 634
Cdd:TIGR04523  414 IKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNtresletqlKVLSRSINKIKQNLEQKQKELKS 493
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  635 AQNECRKLRDEKETLLQRLTEVEQERDQL--EIVAMDAENMRK-LEQSALQAELEKERQALKNALGKAQFSeEKEQENSE 711
Cdd:TIGR04523  494 KEKELKKLNEEKKELEEKVKDLTKKISSLkeKIEKLESEKKEKeSKISDLEDELNKDDFELKKENLEKEID-EKNKEIEE 572
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  712 LHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLvrpEEVAARVDELRRKL--------KLGTGEMNIHSPSDVLGKSLAD 783
Cdd:TIGR04523  573 LKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEI---EEKEKKISSLEKELekakkeneKLSSIIKNIKSKKNKLKQEVKQ 649

                   ....*...
gi 2217375254  784 LQKQFSEI 791
Cdd:TIGR04523  650 IKETIKEI 657
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1805-2010 3.04e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.61  E-value: 3.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1805 EAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKL 1884
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1885 VQQE-----------------------------------MMFQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSK 1929
Cdd:COG4942    100 EAQKeelaellralyrlgrqpplalllspedfldavrrlQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEAL 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1930 LDQvlskvlaaeervrtlQEEERwcESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFSLLRNQFLTERKK 2009
Cdd:COG4942    180 LAE---------------LEEER--AALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242

                   .
gi 2217375254 2010 A 2010
Cdd:COG4942    243 T 243
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
148-179 3.09e-05

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 43.00  E-value: 3.09e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2217375254  148 KLRELNLSYNKISKIEGIENMCNLQKLNLAGN 179
Cdd:pfam12799    2 NLEVLDLSNNQITDIPPLAKLPNLETLDLSGN 33
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1360-1570 3.81e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.61  E-value: 3.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1360 DEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQL 1439
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1440 RSLQADAKDL--EQHKIKQEEILKEINKIVAAKDS--DFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVL 1515
Cdd:COG4942    100 EAQKEELAELlrALYRLGRQPPLALLLSPEDFLDAvrRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEAL 179
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 1516 LQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQgsmvQAKADLQEALR 1570
Cdd:COG4942    180 LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQ----QEAEELEALIA 230
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1479-2065 4.61e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.91  E-value: 4.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1479 KKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVtsqqQEMAVLDRQLGHKKEELHLLQGSM 1558
Cdd:PRK03918   179 ERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV----KELEELKEEIEELEKELESLEGSK 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1559 VQAKADLQEalrlgetevTEKcnHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIekeeeNLQVVLRQMSKHKTE 1638
Cdd:PRK03918   255 RKLEEKIRE---------LEE--RIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE-----EYLDELREIEKRLSR 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1639 LKNILDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMtaesra 1718
Cdd:PRK03918   319 LEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKE------ 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1719 lqscVECLSKEKEDLQEKcdiwEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKL--SLHNDISAM 1796
Cdd:PRK03918   393 ----LEELEKAKEEIEEE----ISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELleEYTAELKRI 464
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1797 QQQLQEKREAVNSLQEELANVQDHLNLAKQdLLHTTKHQDVLLSEQTRLQK----DISEWANRFEDCQKEEETKQQQLQV 1872
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESE-LIKLKELAEQLKELEEKLKKynleELEKKAEEYEKLKEKLIKLKGEIKS 543
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1873 LQNEIEENKLklvqqemmfqrLQKERESEESKLETSKVTLKEQQHQLE-------KELTDQKSKLDQVLSKVLAAEERVR 1945
Cdd:PRK03918   544 LKKELEKLEE-----------LKKKLAELEKKLDELEEELAELLKELEelgfesvEELEERLKELEPFYNEYLELKDAEK 612
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1946 TLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADsmRADFSLLRNQFLTERKKAEKQVASLKEaLKIQR 2025
Cdd:PRK03918   613 ELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYS--EEEYEELREEYLELSRELAGLRAELEE-LEKRR 689
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|
gi 2217375254 2026 SQLEKNLLTINEDSERdssllSKEQKQENSCIQKEMATIE 2065
Cdd:PRK03918   690 EEIKKTLEKLKEELEE-----REKAKKELEKLEKALERVE 724
PRK11281 PRK11281
mechanosensitive channel MscK;
1698-1952 5.61e-05

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 48.75  E-value: 5.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1698 QKQLLERDKREIErmTAESRALQSCVEclskEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLE-------- 1769
Cdd:PRK11281    42 QAQLDALNKQKLL--EAEDKLVQQDLE----QTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKddndeetr 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1770 -----LNVRKLQQELDQLNRDKLSLHND---------------------ISAMQQQLQEKREAVNSLQEElanvQDHLNL 1823
Cdd:PRK11281   116 etlstLSLRQLESRLAQTLDQLQNAQNDlaeynsqlvslqtqperaqaaLYANSQRLQQIRNLLKGGKVG----GKALRP 191
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1824 AKQDLLHTtkhQDVLLSEQTRLQKDISEWANRFEDC---QKEEETK-----QQQLQVLQNEIEENKLKLVQQEMMfqrlQ 1895
Cdd:PRK11281   192 SQRVLLQA---EQALLNAQNDLQRKSLEGNTQLQDLlqkQRDYLTAriqrlEHQLQLLQEAINSKRLTLSEKTVQ----E 264
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2217375254 1896 KERESEESKLETSKVTLKEQQ--HQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEER 1952
Cdd:PRK11281   265 AQSQDEAARIQANPLVAQELEinLQLSQRLLKATEKLNTLTQQNLRVKNWLDRLTQSER 323
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
438-693 5.67e-05

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 48.53  E-value: 5.67e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  438 QLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAI---------QLKKISEAGKDLLYKQLSGRLQLVNKL 508
Cdd:TIGR02169  738 RLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALndlearlshSRIPEIQAELSKLEEEVSRIEARLREI 817
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  509 RQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDpkhSHMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKE 588
Cdd:TIGR02169  818 EQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI---ENLNGKKEELEEELEELEAALRDLESRLGDLKKERDEL 894
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  589 TEEIKDLEEQLTEGQIAANEALK--KDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEkETLLQRLTEVEQERDQLEIV 666
Cdd:TIGR02169  895 EAQLRELERKIEELEAQIEKKRKrlSELKAKLEALEEELSEIEDPKGEDEEIPEEELSL-EDVQAELQRVEEEIRALEPV 973
                          250       260       270
                   ....*....|....*....|....*....|...
gi 2217375254  667 AMDAEN------MRKLEQSALQAELEKERQALK 693
Cdd:TIGR02169  974 NMLAIQeyeevlKRLDELKEKRAKLEEERKAIL 1006
PRK01156 PRK01156
chromosome segregation protein; Provisional
1262-1814 8.28e-05

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 47.97  E-value: 8.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1262 LENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVES 1341
Cdd:PRK01156   164 LERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNE 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1342 LMTELEIEKslKHHEDIvdeiecieKTLLKRRSELREADRLLAEAESELsctKEKTKNAVEKFTDAKRSLLQTESDAEEL 1421
Cdd:PRK01156   244 LSSLEDMKN--RYESEI--------KTAESDLSMELEKNNYYKELEERH---MKIINDPVYKNRNYINDYFKYKNDIENK 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1422 ERRAQETAVNLVKAD---QQLRSLQADAKDLEQHKIKQEEILKEINKIvAAKDSDFQCLSKKKEKLTEELQKLQKDIEMA 1498
Cdd:PRK01156   311 KQILSNIDAEINKYHaiiKKLSVLQKDYNDYIKKKSRYDDLNNQILEL-EGYEMDYNSYLKSIESLKKKIEEYSKNIERM 389
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1499 ERNEDHHLQVLKESEVLLQAKRAE----LEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKAdlqeALRLGET 1574
Cdd:PRK01156   390 SAFISEILKIQEIDPDAIKKELNEinvkLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQSVCPVC----GTTLGEE 465
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1575 EVTEKCNHIREVKSLLEE-LSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDML-QLENHE 1652
Cdd:PRK01156   466 KSNHIINHYNEKKSRLEEkIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKInELKDKH 545
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1653 LQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQIS----QQQKGEIEWQKQLLERDKREIE---------------RMT 1713
Cdd:PRK01156   546 DKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDietnRSRSNEIKKQLNDLESRLQEIEigfpddksyidksirEIE 625
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1714 AESRALQSCVECLSKEK---EDLQEKCDIWEKKLAQTKRVlaaaEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLH 1790
Cdd:PRK01156   626 NEANNLNNKYNEIQENKiliEKLRGKIDNYKKQIAEIDSI----IPDLKEITSRINDIEDNLKKSRKALDDAKANRARLE 701
                          570       580
                   ....*....|....*....|....
gi 2217375254 1791 NDISAMQQQLQEKREAVNSLQEEL 1814
Cdd:PRK01156   702 STIEILRTRINELSDRINDINETL 725
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
435-900 8.74e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha-helical structures) and low complexity, rich in D, E, N, Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 47.71  E-value: 8.74e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLYKQLsgrlqlvNKLRQEALD 514
Cdd:TIGR04523  157 LNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQI-------SELKKQNNQ 229
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKdpkHSHMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIA-----KET 589
Cdd:TIGR04523  230 LKDNIEKKQQEINEKTTEISNTQTQLNQLKDE---QNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISdlnnqKEQ 306
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  590 EEIKDLEEQL---------TEGQIAANEALKKDLEGVISGLQEYLG-------TIKGQATQAQNECRKLRDEKETLLQRL 653
Cdd:TIGR04523  307 DWNKELKSELknqekkleeIQNQISQNNKIISQLNEQISQLKKELTnsesensEKQRELEEKQNEIEKLKKENQSYKQEI 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  654 TEVEQERDQLEIVAMDAENMRKLEQSALQA------ELEKERQALKNALGKAQFS-EEKEQENSELHAKLKHLQDDNNLL 726
Cdd:TIGR04523  387 KNLESQINDLESKIQNQEKLNQQKDEQIKKlqqekeLLEKEIERLKETIIKNNSEiKDLTNQDSVKELIIKNLDNTRESL 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  727 KQQLKDFQNHLNHVVDGLvrpeevaarvDELRRKLKLGTGEMN-IHSPSDVLGKSLADLQKQFSEILARSKWERDEAQVR 805
Cdd:TIGR04523  467 ETQLKVLSRSINKIKQNL----------EQKQKELKSKEKELKkLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEK 536
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  806 ERKLQE--------EMALQQEKLATGQEEFRQACERALEARMNFDKRQHEA--RIQQMENEIHYLQENLKSMEEIQGLTD 875
Cdd:TIGR04523  537 ESKISDledelnkdDFELKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKqeLIDQKEKEKKDLIKEIEEKEKKISSLE 616
                          490       500
                   ....*....|....*....|....*
gi 2217375254  876 LQLQEADEEKERILAQLRELEKKKK 900
Cdd:TIGR04523  617 KELEKAKKENEKLSSIIKNIKSKKN 641
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1860-2030 1.19e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 47.32  E-value: 1.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1860 QKEEETkQQQLQVLQNEIEENKLKLVQQEMMFQRLQKER-----ESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVL 1934
Cdd:COG3206    168 LRREEA-RKALEFLEEQLPELRKELEEAEAALEEFRQKNglvdlSEEAKLLLQQLSELESQLAEARAELAEAEARLAALR 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1935 SKVLAAEERVRTLQEEERWcESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFSLLRNQFL----TERKKA 2010
Cdd:COG3206    247 AQLGSGPDALPELLQSPVI-QQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILasleAELEAL 325
                          170       180
                   ....*....|....*....|
gi 2217375254 2011 EKQVASLKEALKIQRSQLEK 2030
Cdd:COG3206    326 QAREASLQAQLAQLEARLAE 345
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1659-1879 1.20e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 46.68  E-value: 1.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1659 QHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCD 1738
Cdd:COG4942     21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELE 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1739 IWEKKLAQTKRVLAAAEENSKME----QSNLEKLELNVRKLQqeldQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEEL 1814
Cdd:COG4942    101 AQKEELAELLRALYRLGRQPPLAlllsPEDFLDAVRRLQYLK----YLAPARREQAEELRADLAELAALRAELEAERAEL 176
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 1815 ANVQDHLNLAKQDLLHTTKHQDVLLseqTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEE 1879
Cdd:COG4942    177 EALLAELEEERAALEALKAERQKLL---ARLEKELAELAAELAELQQEAEELEALIARLEAEAAA 238
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
482-720 1.22e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 46.68  E-value: 1.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  482 QLKKISEAGKDL--LYKQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDskdpkhshmkAQKSG 559
Cdd:COG4942     18 QADAAAEAEAELeqLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALE----------AELAE 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  560 KEQQLDIMNKQYQQLESRLDEILSRIAKeTEEIKDLEEQLTEGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNEC 639
Cdd:COG4942     88 LEKEIAELRAELEAQKEELAELLRALYR-LGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALR 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  640 RKLRDEKETLLQRLTEVEQERDQLEivamdaenMRKLEQSALQAELEKERQALKNALgkaqfsEEKEQENSELHAKLKHL 719
Cdd:COG4942    167 AELEAERAELEALLAELEEERAALE--------ALKAERQKLLARLEKELAELAAEL------AELQQEAEELEALIARL 232

                   .
gi 2217375254  720 Q 720
Cdd:COG4942    233 E 233
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1624-2163 1.27e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.37  E-value: 1.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1624 NLQVVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEeklelenlqqiSQQQKGEIEWQKQLLE 1703
Cdd:PRK03918   190 NIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEE-----------LEKELESLEGSKRKLE 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1704 RDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKkLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLN 1783
Cdd:PRK03918   259 EKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKE 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1784 RDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEE 1863
Cdd:PRK03918   338 ERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1864 ETKQQQLQVLQNEIEENKLKL-----VQQEMMFQRLQKERESEESKLETSKVTLKEQQHQLEKELTDqkskldqvLSKVL 1938
Cdd:PRK03918   418 KKEIKELKKAIEELKKAKGKCpvcgrELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRE--------LEKVL 489
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1939 AAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEK-SGELLALQKEADSMRADFSLLrNQFLTERKKAEKQVASL 2017
Cdd:PRK03918   490 KKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKlKEKLIKLKGEIKSLKKELEKL-EELKKKLAELEKKLDEL 568
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2018 KEALK-IQRSQLEKNLLTINEDSERDSSLLSKEQKQensciqkematIELVAQDnheraRRLMKELNQMQYEYTELKKQM 2096
Cdd:PRK03918   569 EEELAeLLKELEELGFESVEELEERLKELEPFYNEY-----------LELKDAE-----KELEREEKELKKLEEELDKAF 632
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217375254 2097 ANQKDLERRQMEISDAMRTLKSEVKDEIRTSLKNLNQFLPELPADLEAILERNENLEGELESLKENL 2163
Cdd:PRK03918   633 EELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKL 699
PRK11281 PRK11281
mechanosensitive channel MscK;
1755-2131 1.51e-04

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 47.21  E-value: 1.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1755 EENSKMEQSNLEK-LEL--NVRKLQQELDQLNRDklslhndISAMQQQLQEKREAVNSLQEELANV--QDHLNLAKQDLl 1829
Cdd:PRK11281    55 EAEDKLVQQDLEQtLALldKIDRQKEETEQLKQQ-------LAQAPAKLRQAQAELEALKDDNDEEtrETLSTLSLRQL- 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1830 htTKHQDVLLSEQTRLQKDISEwANrfedcqKEEETKQQQLQVLQNEIEENKlklvqqemmfQRLQKERESEESKLETSK 1909
Cdd:PRK11281   127 --ESRLAQTLDQLQNAQNDLAE-YN------SQLVSLQTQPERAQAALYANS----------QRLQQIRNLLKGGKVGGK 187
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1910 VTLKEQQHQLEKELT--DQKSKLDQvlsKVLAAEErvrTLQEeerwceslektLSQTKRQLSEREQQLVEKsgELLALQk 1987
Cdd:PRK11281   188 ALRPSQRVLLQAEQAllNAQNDLQR---KSLEGNT---QLQD-----------LLQKQRDYLTARIQRLEH--QLQLLQ- 247
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1988 eadsmradfSLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINED-SERdsslLSKEQKQENSCIQKEMATIEL 2066
Cdd:PRK11281   248 ---------EAINSKRLTLSEKTVQEAQSQDEAARIQANPLVAQELEINLQlSQR----LLKATEKLNTLTQQNLRVKNW 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2067 VaqDNHERARRLMKE--------------LNQMQYEYTELK--KQMANQ-KDLERRQMEIS--------------DAMRT 2115
Cdd:PRK11281   315 L--DRLTQSERNIKEqisvlkgslllsriLYQQQQALPSADliEGLADRiADLRLEQFEINqqrdalfqpdayidKLEAG 392
                          410
                   ....*....|....*.
gi 2217375254 2116 LKSEVKDEIRTSLKNL 2131
Cdd:PRK11281   393 HKSEVTDEVRDALLQL 408
LRR_8 pfam13855
Leucine rich repeat;
170-220 1.65e-04

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 41.36  E-value: 1.65e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2217375254  170 NLQKLNLAGNEIEHIPVWLGKKLKSLRVLNLKGNKISSL--QDISKLKPLQDL 220
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLspGAFSGLPSLRYL 54
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
611-849 1.73e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 46.83  E-value: 1.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  611 KKDLEGVISGLQEYLGTIKGqatqAQNECRKLRdEKETLLQRLTEVEQERDQLEIVAMDAENMR--------KLEQSALQ 682
Cdd:COG4913    220 EPDTFEAADALVEHFDDLER----AHEALEDAR-EQIELLEPIRELAERYAAARERLAELEYLRaalrlwfaQRRLELLE 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  683 AELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNLlkQQLKDFQNHLNHVVDGLVRPEEVAARVDELRRKLK 762
Cdd:COG4913    295 AELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGG--DRLEQLEREIERLERELEERERRRARLEALLAALG 372
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  763 LgtgemnihspsdvlgkSLADLQKQFSEILARSKWERDEAQVRERKLQEEMALQQEKLATGQEEFRQACE--RALEAR-M 839
Cdd:COG4913    373 L----------------PLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAeiASLERRkS 436
                          250
                   ....*....|
gi 2217375254  840 NFDKRQHEAR 849
Cdd:COG4913    437 NIPARLLALR 446
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1467-1716 1.77e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 46.30  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1467 VAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGH 1546
Cdd:COG4942     15 AAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAE 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1547 KKEELHLLQGSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTlikqeiekeeeNLQ 1626
Cdd:COG4942     95 LRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLA-----------ELA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1627 VVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQvavleekleLENLQQISQQQKGEIEWQKQL--LER 1704
Cdd:COG4942    164 ALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKEL---------AELAAELAELQQEAEELEALIarLEA 234
                          250
                   ....*....|..
gi 2217375254 1705 DKREIERMTAES 1716
Cdd:COG4942    235 EAAAAAERTPAA 246
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
261-1040 1.88e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 47.04  E-value: 1.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  261 QEAFERFsLEEVERLERDLEKKMIETEELKSKQTRFLEE--IKNQDKLNK-SLKEEAML-----QKQSCEELKSDL-NTK 331
Cdd:pfam15921   73 KEHIERV-LEEYSHQVKDLQRRLNESNELHEKQKFYLRQsvIDLQTKLQEmQMERDAMAdirrrESQSQEDLRNQLqNTV 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  332 NELLKQKTIELTRACQKQYELEQelaFYKIDAKFEP-LNYYPSEYAEIDKAPdespyiGKSRYKRNMFATESYiidSAQA 410
Cdd:pfam15921  152 HELEAAKCLKEDMLEDSNTQIEQ---LRKMMLSHEGvLQEIRSILVDFEEAS------GKKIYEHDSMSTMHF---RSLG 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  411 VQIKKMEpdEQLRNDHMNLRGHT-PLDTQLED-KEKKISAAQTRLSELHDEIEKA----EQQILRATEEFKQLEEAI--- 481
Cdd:pfam15921  220 SAISKIL--RELDTEISYLKGRIfPVEDQLEAlKSESQNKIELLLQQHQDRIEQLisehEVEITGLTEKASSARSQAnsi 297
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  482 --QLKKISEAGKD---LLYKQLSGRLQLVNKLRQEALDLELQMEKQKQEIAgkqkeiKDLQIAIDSLDSKDPKHSHMkAQ 556
Cdd:pfam15921  298 qsQLEIIQEQARNqnsMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELE------KQLVLANSELTEARTERDQF-SQ 370
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  557 KSGKeqqldiMNKQYQQLESRLDEILSRIAKETEEIKDLEEQLTEGQIAANEaLKKDLEGVISGLQEYLGTIKGQATQAQ 636
Cdd:pfam15921  371 ESGN------LDDQLQKLLADLHKREKELSLEKEQNKRLWDRDTGNSITIDH-LRRELDDRNMEVQRLEALLKAMKSECQ 443
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  637 NECRKLRDEKETLLQRLTEVEQERDQLEIVAmdaENMRKL--EQSALQAELEKERQALKNALGKAQfseEKEQENSELHA 714
Cdd:pfam15921  444 GQMERQMAAIQGKNESLEKVSSLTAQLESTK---EMLRKVveELTAKKMTLESSERTVSDLTASLQ---EKERAIEATNA 517
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  715 KLKHLQDDNNLLKQQLKDFQNHLNHVvdglvrpEEVAARVDELRRKLklgtgemnihSPSDVLGKSLADLQKQFSEILAR 794
Cdd:pfam15921  518 EITKLRSRVDLKLQELQHLKNEGDHL-------RNVQTECEALKLQM----------AEKDKVIEILRQQIENMTQLVGQ 580
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  795 SKWERDEAQVRERKLQEEMALQQEKLatgqEEFRqacerALEARMNFDKRQHEARIQQMENE----IHYLQENLKSMEEI 870
Cdd:pfam15921  581 HGRTAGAMQVEKAQLEKEINDRRLEL----QEFK-----ILKDKKDAKIRELEARVSDLELEkvklVNAGSERLRAVKDI 651
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  871 QGLTDLQLQEADEEKERILAQLRELEKKKKLEDAKSQEQvfgldkelKKLKKAVATSDKLATAELTIAKDQLKSLHGT-- 948
Cdd:pfam15921  652 KQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEM--------ETTTNKLKMQLKSAQSELEQTRNTLKSMEGSdg 723
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  949 -----VMKINQERAEELQEAERFSRKAAQAARDLTRAEAEIELLqnllrqKGEQFRLEMEKTGVGTGANSQVLEIEKLNE 1023
Cdd:pfam15921  724 hamkvAMGMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKHFL------KEEKNKLSQELSTVATEKNKMAGELEVLRS 797
                          810
                   ....*....|....*..
gi 2217375254 1024 TMERQRTEIARLQNVLD 1040
Cdd:pfam15921  798 QERRLKEKVANMEVALD 814
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
76-205 1.93e-04

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 46.32  E-value: 1.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   76 SHAGVRYITEALikkltkQDNLALiKSLNLSLSKDGGKKFKYI-ENLEKCVKLEVLNLSYNLIGK------IEKLDKLLK 148
Cdd:COG5238    277 GAEGAIALAKAL------QGNTTL-TSLDLSVNRIGDEGAIALaEGLQGNKTLHTLNLAYNGIGAqgaialAKALQENTT 349
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217375254  149 LRELNLSYNKISK------IEGIENMCNLQKLNLAGNEI-EHIPVWLGKKLK--SLRVLNLKGNKI 205
Cdd:COG5238    350 LHSLDLSDNQIGDegaialAKYLEGNTTLRELNLGKNNIgKQGAEALIDALQtnRLHTLILDGNLI 415
HMMR_N pfam15905
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ...
1703-1933 2.16e-04

Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.


Pssm-ID: 464932 [Multi-domain]  Cd Length: 329  Bit Score: 45.96  E-value: 2.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1703 ERDKReIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRV--LAAAEENSKMEQSNLEKLELNVRKLQQELD 1780
Cdd:pfam15905   91 EQDKR-LQALEEELEKVEAKLNAAVREKTSLSASVASLEKQLLELTRVneLLKAKFSEDGTQKKMSSLSMELMKLRNKLE 169
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1781 QLNRDKLSLHNDisaMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQ 1860
Cdd:pfam15905  170 AKMKEVMAKQEG---MEGKLQVTQKNLEHSKGKVAQLEEKLVSTEKEKIEEKSETEKLLEYITELSCVSEQVEKYKLDIA 246
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217375254 1861 KEEET---KQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETSKvtlKEQQHQLEKELTDQKSKLDQV 1933
Cdd:pfam15905  247 QLEELlkeKNDEIESLKQSLEEKEQELSKQIKDLNEKCKLLESEKEELLREY---EEKEQTLNAELEELKEKLTLE 319
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1765-1923 2.39e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 44.92  E-value: 2.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1765 LEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLS--EQ 1842
Cdd:COG1579     12 LQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNnkEY 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1843 TRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQLEKE 1922
Cdd:COG1579     92 EALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAAK 171

                   .
gi 2217375254 1923 L 1923
Cdd:COG1579    172 I 172
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1290-1542 2.60e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 46.16  E-value: 2.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1290 KRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDGNVESlmteleiekslkhhEDIVDEIECIEKTL 1369
Cdd:COG3206    163 EQNLELRREEARKALEFLEEQLPELRKELEEAEAALEEFRQKNGLVDLSEEA--------------KLLLQQLSELESQL 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1370 LKRRSELREADRLLAEAESELSCTKEKTKNAVEK--FTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQAdak 1447
Cdd:COG3206    229 AEARAELAEAEARLAALRAQLGSGPDALPELLQSpvIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRA--- 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1448 dleqhkikqeEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEV---LLQAKRAELE 1524
Cdd:COG3206    306 ----------QLQQEAQRILASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLEREVEVareLYESLLQRLE 375
                          250
                   ....*....|....*...
gi 2217375254 1525 KLKSQVTSQQQEMAVLDR 1542
Cdd:COG3206    376 EARLAEALTVGNVRVIDP 393
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1698-2121 2.65e-04

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex, which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 46.35  E-value: 2.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1698 QKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEK-KLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQ 1776
Cdd:pfam10174  174 KKSGEEDWERTRRIAEAEMQLGHLEVLLDQKEKENIHLREELHRRnQLQPDPAKTKALQTVIEMKDTKISSLERNIRDLE 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1777 QELDQLNrdklslHNDISAMQQQLQEKR--EAVNSLQEELANVQDHLnlaKQDLlhtTKHQDVLLSEQTRLQKdiseWAN 1854
Cdd:pfam10174  254 DEVQMLK------TNGLLHTEDREEEIKqmEVYKSHSKFMKNKIDQL---KQEL---SKKESELLALQTKLET----LTN 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1855 RFEDCQKEEE-------TKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESeeskletskvtLKEQQHQLEKELTDQK 1927
Cdd:pfam10174  318 QNSDCKQHIEvlkesltAKEQRAAILQTEVDALRLRLEEKESFLNKKTKQLQD-----------LTEEKSTLAGEIRDLK 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1928 SKLDQVLSKVLAAEERVRTLQEE-----------ERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSmradf 1996
Cdd:pfam10174  387 DMLDVKERKINVLQKKIENLQEQlrdkdkqlaglKERVKSLQTDSSNTDTALTTLEEALSEKERIIERLKEQRER----- 461
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1997 slLRNQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSLLSKEQKQENSCIQKEMATielvaQDNHERAR 2076
Cdd:pfam10174  462 --EDRERLEELESLKKENKDLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSLEIAV-----EQKKEECS 534
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*
gi 2217375254 2077 RLMKELNQMQyeytelkkqmaNQKDLERRQMEISDAMRTLKSEVK 2121
Cdd:pfam10174  535 KLENQLKKAH-----------NAEEAVRTNPEINDRIRLLEQEVA 568
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
270-597 2.65e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 46.60  E-value: 2.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  270 EEVERLERDLEKKMIETEELK----------SKQTRFLEEIKNQDKLnksLKEEAMLQKQSCEELKSDlntknelLKQKT 339
Cdd:TIGR02169  681 ERLEGLKRELSSLQSELRRIEnrldelsqelSDASRKIGEIEKEIEQ---LEQEEEKLKERLEELEED-------LSSLE 750
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  340 IELTRACQKQYELEQElafykIDAKFEPLNYYPSEYAEIDKAPDESPYIGKSRYKRNMFATESYIIDSAQAVQ--IKKME 417
Cdd:TIGR02169  751 QEIENVKSELKELEAR-----IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEqkLNRLT 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  418 PDEQLRNDHMN--LRGHTPLDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEeaiqlKKISEAGKDLly 495
Cdd:TIGR02169  826 LEKEYLEKEIQelQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLK-----KERDELEAQL-- 898
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  496 KQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMKAQKSGKEQQLDIMN------- 568
Cdd:TIGR02169  899 RELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEpvnmlai 978
                          330       340
                   ....*....|....*....|....*....
gi 2217375254  569 KQYQQLESRLDEILSRIAKETEEIKDLEE 597
Cdd:TIGR02169  979 QEYEEVLKRLDELKEKRAKLEEERKAILE 1007
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1848-2044 2.67e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 45.59  E-value: 2.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1848 DISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQK 1927
Cdd:COG3883     17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALY 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1928 ------SKLDQVL---------------SKVLAAEERVRTLQEEERwcESLEKTLSQTKRQLSEREQQLveksGELLALQ 1986
Cdd:COG3883     97 rsggsvSYLDVLLgsesfsdfldrlsalSKIADADADLLEELKADK--AELEAKKAELEAKLAELEALK----AELEAAK 170
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217375254 1987 KEADSMRADFSLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSS 2044
Cdd:COG3883    171 AELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAA 228
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
76-181 2.78e-04

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 45.42  E-value: 2.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   76 SHAGVRYITEALikklTKQDNLALIKSLNLSLSKDGGKKFKyiENLEKCVKLEVLNLSYNLIGK--IEKLDKLLK----- 148
Cdd:cd00116    178 GDAGIRALAEGL----KANCNLEVLDLNNNGLTDEGASALA--ETLASLKSLEVLNLGDNNLTDagAAALASALLspnis 251
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 2217375254  149 LRELNLSYNKI------SKIEGIENMCNLQKLNLAGNEI 181
Cdd:cd00116    252 LLTLSLSCNDItddgakDLAEVLAEKESLLELDLRGNKF 290
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1583-1809 2.98e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.53  E-value: 2.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1583 IREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQ 1662
Cdd:COG4942     22 AAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEA 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1663 RVSELEKTQVAVLEEKLELENLQQISQQQKGEI----EWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKcd 1738
Cdd:COG4942    102 QKEELAELLRALYRLGRQPPLALLLSPEDFLDAvrrlQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEAL-- 179
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254 1739 iwEKKLAQTKRVLAAAEENskmEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNS 1809
Cdd:COG4942    180 --LAELEEERAALEALKAE---RQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPA 245
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1295-2022 4.12e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 45.68  E-value: 4.12e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1295 KEMEELHHNIDDLLQEKKSLEcEVEELHRTVQKRQQQKDfidgnveslmtELEIEKSLKHHEDIVDEIECIEKTLLKRRS 1374
Cdd:COG4913    235 DDLERAHEALEDAREQIELLE-PIRELAERYAAARERLA-----------ELEYLRAALRLWFAQRRLELLEAELEELRA 302
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1375 ELREADRLLAEAESELsctkektKNAVEKFTDAKRSLLQTESDA-EELERRAQETAVNLVKADQQLRSLQADAKDLEQHK 1453
Cdd:COG4913    303 ELARLEAELERLEARL-------DALREELDELEAQIRGNGGDRlEQLEREIERLERELEERERRRARLEALLAALGLPL 375
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1454 IKQEEILKEINKIVAAKdsdfqclskkKEKLTEELQKLQKDIEMAERNEDHHLQVLKEsevlLQAKRAELEKLKSQVTSQ 1533
Cdd:COG4913    376 PASAEEFAALRAEAAAL----------LEALEEELEALEEALAEAEAALRDLRRELRE----LEAEIASLERRKSNIPAR 441
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1534 QQEM-AVLDRQLGHKKEELHLLqGSMVQAKADLQE------------ALRLgetEVTEKcnHIREVKSLLEELSFqKGEL 1600
Cdd:COG4913    442 LLALrDALAEALGLDEAELPFV-GELIEVRPEEERwrgaiervlggfALTL---LVPPE--HYAAALRWVNRLHL-RGRL 514
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1601 NVQiserktqltlikqeiekeeeNLQVVLRQMSKHKTELKNILDMLQLENHELQG-LKLQHDQR--------VSELEKTQ 1671
Cdd:COG4913    515 VYE--------------------RVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAwLEAELGRRfdyvcvdsPEELRRHP 574
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1672 VAVLEEKlelenlqQISQ-QQKGEIEWQKQL-------------LERDKREIERMTAESRALQSCVECLSKEKEDLQEKC 1737
Cdd:COG4913    575 RAITRAG-------QVKGnGTRHEKDDRRRIrsryvlgfdnrakLAALEAELAELEEELAEAEERLEALEAELDALQERR 647
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1738 DIWEKKLAQTkrvlaaaeenskMEQSNLEKLELNVRKLQQELDQLNRDklslHNDISAMQQQLQEKREAVNSLQEELAnv 1817
Cdd:COG4913    648 EALQRLAEYS------------WDEIDVASAEREIAELEAELERLDAS----SDDLAALEEQLEELEAELEELEEELD-- 709
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1818 qdhlnlakqdllhttkhqdvllseqtRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVqQEMMFQRLQKE 1897
Cdd:COG4913    710 --------------------------ELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALL-EERFAAALGDA 762
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1898 RESEESK-LETSKVTLKEQQHQLEKELTDQKSKLDQV-LSKVLAAEERVRTLQEEERWCESLEktlsqtkrqlserEQQL 1975
Cdd:COG4913    763 VERELREnLEERIDALRARLNRAEEELERAMRAFNREwPAETADLDADLESLPEYLALLDRLE-------------EDGL 829
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*..
gi 2217375254 1976 VEKSGELLALQKEAdsMRADFSLLRNQFLTERKKAEKQVASLKEALK 2022
Cdd:COG4913    830 PEYEERFKELLNEN--SIEFVADLLSKLRRAIREIKERIDPLNDSLK 874
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1797-2085 4.15e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 45.68  E-value: 4.15e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1797 QQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVL--LSEQTRLQKDISEWanrfedcQKEEETKQQQLQvlq 1874
Cdd:COG4913    609 RAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALqrLAEYSWDEIDVASA-------EREIAELEAELE--- 678
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1875 nEIEENKLKLVQQEMMFQRLQKERESEESKLETskvtLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQE---EE 1951
Cdd:COG4913    679 -RLDASSDDLAALEEQLEELEAELEELEEELDE----LKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRallEE 753
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1952 RWCESL-EKTLSQTKRQLSEREQQLVEKsgellaLQKEADSMRADFSLLRNQFLTERKKAEKQVASLKEALKIqRSQLEK 2030
Cdd:COG4913    754 RFAAALgDAVERELRENLEERIDALRAR------LNRAEEELERAMRAFNREWPAETADLDADLESLPEYLAL-LDRLEE 826
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 2031 NLLTINEDserdssllsKEQKQENSCIQKEMATIELVAQDNHERARRLMKELNQM 2085
Cdd:COG4913    827 DGLPEYEE---------RFKELLNENSIEFVADLLSKLRRAIREIKERIDPLNDS 872
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1895-2112 4.21e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.14  E-value: 4.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1895 QKERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLS---KVLAAEERVRTLQEEERwceSLEKTLSQTKRQLSER 1971
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAAlerRIAALARRIRALEQELA---ALEAELAELEKEIAEL 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1972 EQQLVEKSGE----LLALQKEADSMRADFSLLRNQFLTERKKAE------KQVASLKEALKIQRSQLEKNLLTINEDSER 2041
Cdd:COG4942     96 RAELEAQKEElaelLRALYRLGRQPPLALLLSPEDFLDAVRRLQylkylaPARREQAEELRADLAELAALRAELEAERAE 175
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254 2042 DSSLLSKEQKQENScIQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMANQKDLERRQMEISDA 2112
Cdd:COG4942    176 LEALLAELEEERAA-LEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPA 245
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1735-2163 4.33e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 45.73  E-value: 4.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1735 EKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEEL 1814
Cdd:TIGR00618  173 FPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQ 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1815 ANVQDHLNLAKQDLLHTTKHQDVLLS-EQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQR 1893
Cdd:TIGR00618  253 EEQLKKQQLLKQLRARIEELRAQEAVlEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAA 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1894 LQKERESEESKlETSKVTLKEQQHQLEKELTDQKSKLDQvLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQ 1973
Cdd:TIGR00618  333 HVKQQSSIEEQ-RRLLQTLHSQEIHIRDAHEVATSIREI-SCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQA 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1974 QLVEKSGELLALQKEADSMRADFSLLRNQFLTERKKAEKQVASLKEALKIQ---------RSQLEKNLLTINEDSERDSS 2044
Cdd:TIGR00618  411 TIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLqesaqslkeREQQLQTKEQIHLQETRKKA 490
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2045 LLSK--EQKQENSCIQKEMAT---IELVAQDNHE-----------RARRLMKELNQMQYEYTELKKQMANQKDLERRQME 2108
Cdd:TIGR00618  491 VVLArlLELQEEPCPLCGSCIhpnPARQDIDNPGpltrrmqrgeqTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQ 570
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217375254 2109 ISDAMRTLKSEVKDEIRTSLKNLNQFLPELPADLEAILERNENLEGELESLKENL 2163
Cdd:TIGR00618  571 SFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQ 625
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1276-2120 4.81e-04

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tails from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 45.55  E-value: 4.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1276 LKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLEceveelhrtvqkrqqqkdfidgnvESLMTELEiekslkhh 1355
Cdd:pfam01576    7 MQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQ------------------------EQLQAETE-------- 54
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1356 edIVDEIECIEKTLLKRRSELREadrLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKA 1435
Cdd:pfam01576   55 --LCAEAEEMRARLAARKQELEE---ILHELESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEEAARQKLQLEKVTT 129
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1436 DQQLRSLQADAKDLEQhkikQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIE-MAERNEDHHLQVLKESEV 1514
Cdd:pfam01576  130 EAKIKKLEEDILLLED----QNSKLSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEaMISDLEERLKKEEKGRQE 205
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1515 LLQAKR---AELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQ---AKADLQEALRLGETEVTEKCNHIREVKS 1588
Cdd:pfam01576  206 LEKAKRkleGESTDLQEQIAELQAQIAELRAQLAKKEEELQAALARLEEetaQKNNALKKIRELEAQISELQEDLESERA 285
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1589 LLEELSFQKGELNVQISERKTQLtlikqEIEKEEENLQVVLRqmSKHKTELKNILDMLQLE--NHE--LQGLKLQHDQRV 1664
Cdd:pfam01576  286 ARNKAEKQRRDLGEELEALKTEL-----EDTLDTTAAQQELR--SKREQEVTELKKALEEEtrSHEaqLQEMRQKHTQAL 358
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1665 SEL--------------EKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESralqscveclSKEK 1730
Cdd:pfam01576  359 EELteqleqakrnkanlEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSES----------ERQR 428
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1731 EDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLS-------LHNDISAMQQQLQEK 1803
Cdd:pfam01576  429 AELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQELLQEETRQKLNlstrlrqLEDERNSLQEQLEEE 508
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1804 REAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLK 1883
Cdd:pfam01576  509 EEAKRNVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVD 588
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1884 LVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCES------- 1956
Cdd:pfam01576  589 LDHQRQLVSNLEKKQKKFDQMLAEEKAISARYAEERDRAEAEAREKETRALSLARALEEALEAKEELERTNKQlraemed 668
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1957 --------------LEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFSLLRNQFLTERKKAEKQVASLKEALK 2022
Cdd:pfam01576  669 lvsskddvgknvheLERSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERDLQARDEQGEEKRRQLV 748
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2023 IQRSQLEKNLltinEDSERDSSLLSKEQKQENSCIQKEMATIELVAQDNHERARRLMKELNQMQYEYTELKKQMANQKDL 2102
Cdd:pfam01576  749 KQVRELEAEL----EDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQRELEEARASRDEI 824
                          890
                   ....*....|....*...
gi 2217375254 2103 ERRQMEISDAMRTLKSEV 2120
Cdd:pfam01576  825 LAQSKESEKKLKNLEAEL 842
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1253-1894 5.08e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 45.34  E-value: 5.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1253 HCNVPEHHNLENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECE---VEELHRTVQKRQ 1329
Cdd:TIGR00618  279 LEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQrrlLQTLHSQEIHIR 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1330 QQKDFIDGNVESLMTELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKR 1409
Cdd:TIGR00618  359 DAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQR 438
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1410 SLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKiVAAKDSDFQCLSKKKEK------ 1483
Cdd:TIGR00618  439 YAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLA-RLLELQEEPCPLCGSCIhpnpar 517
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1484 -LTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAK 1562
Cdd:TIGR00618  518 qDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQ 597
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1563 ADLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTEL--- 1639
Cdd:TIGR00618  598 DLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELlas 677
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1640 -KNILDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESra 1718
Cdd:TIGR00618  678 rQLALQKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTV-- 755
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1719 lqscveclSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNdisAMQQ 1798
Cdd:TIGR00618  756 --------LKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILN---LQCE 824
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1799 QLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQkdisewanrfedcqkEEETKQQQLQVLQNEIE 1878
Cdd:TIGR00618  825 TLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKII---------------QLSDKLNGINQIKIQFD 889
                          650
                   ....*....|....*.
gi 2217375254 1879 ENKLKLVQQEMMFQRL 1894
Cdd:TIGR00618  890 GDALIKFLHEITLYAN 905
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
438-861 5.89e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 45.14  E-value: 5.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  438 QLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDL--LYKQLSGRLQLVNKLRQEALDL 515
Cdd:COG4717     82 EAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALeaELAELPERLEELEERLEELREL 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  516 ELQMEKQKQEIAGKQKEIKDLqiaidsldskdpkhshMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKDL 595
Cdd:COG4717    162 EEELEELEAELAELQEELEEL----------------LEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEEL 225
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  596 EEQL--TEGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAMDAENM 673
Cdd:COG4717    226 EEELeqLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAE 305
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  674 RKLEQSALQAELEKERQALKNALG-KAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLK--DFQNHLNHVVD--GLVRPE 748
Cdd:COG4717    306 ELQALPALEELEEEELEELLAALGlPPDLSPEELLELLDRIEELQELLREAEELEEELQleELEQEIAALLAeaGVEDEE 385
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  749 EVAARVDELRRKLKLGTGEMNIHSPSDVLGKSLADLQKQFSEILARSKWERDEAQVRErkLQEEMALQQEKLATGQEEFR 828
Cdd:COG4717    386 ELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEELEE--LEEELEELREELAELEAELE 463
                          410       420       430
                   ....*....|....*....|....*....|...
gi 2217375254  829 QACERALEARMNFDKRQHEARIQQMENEIHYLQ 861
Cdd:COG4717    464 QLEEDGELAELLQELEELKAELRELAEEWAALK 496
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
1428-1613 5.97e-04

Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 44.82  E-value: 5.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1428 TAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQ 1507
Cdd:COG3883      7 AAPTPAFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERRE 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1508 VLKES--------------EVLLQAK-------RAE-LEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADL 1565
Cdd:COG3883     87 ELGERaralyrsggsvsylDVLLGSEsfsdfldRLSaLSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAEL 166
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 2217375254 1566 QEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTL 1613
Cdd:COG3883    167 EAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAA 214
COG5022 COG5022
Myosin heavy chain [General function prediction only];
298-891 6.19e-04

Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 45.45  E-value: 6.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  298 EEIKNQDKLNKSLKEEAMLQKQSCEELKSDLNTKNELLKQKTIELTRACQKQYELEQELAFYKIDAKFEPLNYYPSEYAE 377
Cdd:COG5022    810 KEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKI 889
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  378 IDKapdespyigksryKRNMFATESYIIDSaQAVQIKK-MEPDEQLRNDHmnlrghtpldtqledKEKKIsaaqTRLSEL 456
Cdd:COG5022    890 DVK-------------SISSLKLVNLELES-EIIELKKsLSSDLIENLEF---------------KTELI----ARLKKL 936
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  457 HDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDL--LYKQLsgrlqlvNKLRQEALDLELQMEKQKQEIAGKQKEIK 534
Cdd:COG5022    937 LNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYedLLKKS-------TILVREGNKANSELKNFKKELAELSKQYG 1009
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  535 DLQIAIDSLDSKD----PKHSHMK---AQKSGKEQQLDI------MNKQYQQLESRLDEILSRiaKETEEIKDLEE---Q 598
Cdd:COG5022   1010 ALQESTKQLKELPvevaELQSASKiisSESTELSILKPLqklkglLLLENNQLQARYKALKLR--RENSLLDDKQLyqlE 1087
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  599 LTEGQIAANEALKKDLEGVIS-GLQEYLGTIKGQATQAQNecrkLRDEKETLLQRLTEVEQERDQLEIVAMDAENMRKle 677
Cdd:COG5022   1088 STENLLKTINVKDLEVTNRNLvKPANVLQFIVAQMIKLNL----LQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFW-- 1161
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  678 QSALQAELEKERQALKnalgkaqfSEEKEQENSELHAKLKHLQDDNNLLKQQLKdfqNHLNHVVDGLVRpeevAARVDEL 757
Cdd:COG5022   1162 EANLEALPSPPPFAAL--------SEKRLYQSALYDEKSKLSSSEVNDLKNELI---ALFSKIFSGWPR----GDKLKKL 1226
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  758 RRKLKLGTGEMNIHSPSDVLGKSLADLQKQFSEILaRSKWERDEAQVRERKLQEEM-------ALQQEKLATGQEEFRQA 830
Cdd:COG5022   1227 ISEGWVPTEYSTSLKGFNNLNKKFDTPASMSNEKL-LSLLNSIDNLLSSYKLEEEVlpatinsLLQYINVGLFNALRTKA 1305
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217375254  831 CERALEARMNFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLQLQEADEEKERILAQ 891
Cdd:COG5022   1306 SSLRWKSATEVNYNSEELDDWCREFEISDVDEELEELIQAVKVLQLLKDDLNKLDELLDAC 1366
46 PHA02562
endonuclease subunit; Provisional
439-667 6.63e-04

Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 45.01  E-value: 6.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  439 LEDKEKKISAAQTRLSELHDE----IEKAEQQILRATEEFKQLEeaIQLKKISEAgkdllykqlsgrlqlVNKLRQEALD 514
Cdd:PHA02562   204 IEEQRKKNGENIARKQNKYDElveeAKTIKAEIEELTDELLNLV--MDIEDPSAA---------------LNKLNTAAAK 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQkqeiagkQKEIKDLQ------IAIDSLDSKDPKHSHMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKE 588
Cdd:PHA02562   267 IKSKIEQF-------QKVIKMYEkggvcpTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKL 339
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217375254  589 TEEIKDLEEQltEGQIAANEALKKDLEGVISGLQEYLGTIKGqatqaqnECRKLRDEKETLLQRLTEVEQERDQLEIVA 667
Cdd:PHA02562   340 LELKNKISTN--KQSLITLVDKAKKVKAAIEELQAEFVDNAE-------ELAKLQDELDKIVKTKSELVKEKYHRGIVT 409
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1293-1607 7.51e-04

Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 45.03  E-value: 7.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1293 SEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDfiDGNVESLMTELE-------IEKSLKHHEDIVDEIECI 1365
Cdd:PRK02224   410 AEDFLEELREERDELREREAELEATLRTARERVEEAEALLE--AGKCPECGQPVEgsphvetIEEDRERVEELEAELEDL 487
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1366 EKTLLKRRSELREADRLlAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAvnlVKADQQlrslQAD 1445
Cdd:PRK02224   488 EEEVEEVEERLERAEDL-VEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELE---AEAEEK----REA 559
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1446 AKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSK------KKEKLTEELQKLQ-KDIEMAERNEDH--HLQVLKESEVLL 1516
Cdd:PRK02224   560 AAEAEEEAEEAREEVAELNSKLAELKERIESLERirtllaAIADAEDEIERLReKREALAELNDERreRLAEKRERKREL 639
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1517 QAK--RAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGS--MVQAKADLQEALRLGETEVTEKCNHIREVKSLLEE 1592
Cdd:PRK02224   640 EAEfdEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEigAVENELEELEELRERREALENRVEALEALYDEAEE 719
                          330
                   ....*....|....*
gi 2217375254 1593 LSFQKGELNVQISER 1607
Cdd:PRK02224   720 LESMYGDLRAELRQR 734
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
1715-1943 7.54e-04

Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 44.44  E-value: 7.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1715 ESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLnRDKLSlhNDIS 1794
Cdd:COG3883     17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEER-REELG--ERAR 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1795 AMQQQlqekrEAVNSLQEELANVQDHlnlakQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQ 1874
Cdd:COG3883     94 ALYRS-----GGSVSYLDVLLGSESF-----SDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALK 163
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217375254 1875 NEIEENKLKLvqqemmfQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEER 1943
Cdd:COG3883    164 AELEAAKAEL-------EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAA 225
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1700-2122 8.00e-04

Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 44.76  E-value: 8.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1700 QLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQsnleklelnvrkLQQEL 1779
Cdd:COG4717     74 KELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEA------------LEAEL 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1780 DQLNRDKLSLHNDIsamqQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQdvllseqtrlqkdISEWANRFEDC 1859
Cdd:COG4717    142 AELPERLEELEERL----EELRELEEELEELEAELAELQEELEELLEQLSLATEEE-------------LQDLAEELEEL 204
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1860 QKEEETKQQQLQVLQNEIE--ENKLKLVQQEMMFQRLQKERESEESKLE-TSKVTLKEQQHQLEKELTDQKSKLDQVLSK 1936
Cdd:COG4717    205 QQRLAELEEELEEAQEELEelEEELEQLENELEAAALEERLKEARLLLLiAAALLALLGLGGSLLSLILTIAGVLFLVLG 284
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1937 VLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQlveksgELLALQKEADSMRADFSLLRNQFLTERKKAEKQVAS 2016
Cdd:COG4717    285 LLALLFLLLAREKASLGKEAEELQALPALEELEEEELE------ELLAALGLPPDLSPEELLELLDRIEELQELLREAEE 358
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2017 LKEALKIQRSQLEKNLLTINEDSERDSSLLSK-EQKQENSCIQKEMATIE--LVAQDNHERARRLMKELNQMQYEYTELK 2093
Cdd:COG4717    359 LEEELQLEELEQEIAALLAEAGVEDEEELRAAlEQAEEYQELKEELEELEeqLEELLGELEELLEALDEEELEEELEELE 438
                          410       420
                   ....*....|....*....|....*....
gi 2217375254 2094 KQManqKDLERRQMEISDAMRTLKSEVKD 2122
Cdd:COG4717    439 EEL---EELEEELEELREELAELEAELEQ 464
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1746-2092 8.37e-04

Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.12  E-value: 8.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1746 QTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAK 1825
Cdd:COG4372     21 KTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQ 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1826 QDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKL 1905
Cdd:COG4372    101 EELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAE 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1906 ETSKVTLKEQQHQLEKELTDQKSKLDQ---VLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSGEL 1982
Cdd:COG4372    181 AEQALDELLKEANRNAEKEEELAEAEKlieSLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEI 260
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1983 LALQKEADSMRADFSLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSLLSKEQKQENSCIQKEMA 2062
Cdd:COG4372    261 EELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELA 340
                          330       340       350
                   ....*....|....*....|....*....|
gi 2217375254 2063 TIELVAQDNHERARRLMKELNQMQYEYTEL 2092
Cdd:COG4372    341 DLLQLLLVGLLDNDVLELLSKGAEAGVADG 370
COG5022 COG5022
Myosin heavy chain [General function prediction only];
1763-2136 8.66e-04

Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 44.68  E-value: 8.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1763 SNLEKLELNVRKLQQELDQLNRDKLSLhnDISAMQQQLQEKREAVNSLQEELANVQDHlnlaKQDLLHTTKHQDVLLSEQ 1842
Cdd:COG5022    787 VDYELKWRLFIKLQPLLSLLGSRKEYR--SYLACIIKLQKTIKREKKLRETEEVEFSL----KAEVLIQKFGRSLKAKKR 860
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1843 TRLQKDisewANRFEDCQKEEETKQQQLQVLQNEIEE-NKLKLVQQEMMFQRLQKERESEESKLETSKVTLKE----QQH 1917
Cdd:COG5022    861 FSLLKK----ETIYLQSAQRVELAERQLQELKIDVKSiSSLKLVNLELESEIIELKKSLSSDLIENLEFKTELiarlKKL 936
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1918 QLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFS 1997
Cdd:COG5022    937 LNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTK 1016
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1998 LL--RNQFLTERKKAEKQVASLKEALKIQRSQLE---KNLLTINEDSERDSSLLskeqkqenscIQKEMATIELVAQDNH 2072
Cdd:COG5022   1017 QLkeLPVEVAELQSASKIISSESTELSILKPLQKlkgLLLLENNQLQARYKALK----------LRRENSLLDDKQLYQL 1086
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217375254 2073 ERARRLMKELNqmqyeytELKKQMANQKDLERRQMEISDAMRTLKSEVKDEIRTSLKNLNQFLP 2136
Cdd:COG5022   1087 ESTENLLKTIN-------VKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQEISKFLSQLVNTLE 1143
PRK12704 PRK12704
phosphodiesterase; Provisional
442-537 8.78e-04

Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 44.38  E-value: 8.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  442 KEKKISAAQtrlSELHDEIEKAEQQILRATEEFKQLEEAIQLKKiseagkdllyKQLSGRLQLVNKLRQEALDLELQMEK 521
Cdd:PRK12704    55 KKEALLEAK---EEIHKLRNEFEKELRERRNELQKLEKRLLQKE----------ENLDRKLELLEKREEELEKKEKELEQ 121
                           90
                   ....*....|....*.
gi 2217375254  522 QKQEIAGKQKEIKDLQ 537
Cdd:PRK12704   122 KQQELEKKEEELEELI 137
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1463-1922 1.11e-03

Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.99  E-value: 1.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1463 INKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKEsevlLQAKRAELEKLKSQVTSQQQEMAVLDR 1542
Cdd:COG4717     48 LERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEE----LEELEEELEELEAELEELREELEKLEK 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1543 QLGHKkeelhllqgSMVQAKADLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEE 1622
Cdd:COG4717    124 LLQLL---------PLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEEL 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1623 ENLQVVLRQMSKHKTELKNILDMLQ---------LENHELQGLKLQHDQRVSELEKTQVAV----------LEEKLELEN 1683
Cdd:COG4717    195 QDLAEELEELQQRLAELEEELEEAQeeleeleeeLEQLENELEAAALEERLKEARLLLLIAaallallglgGSLLSLILT 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1684 LQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQS 1763
Cdd:COG4717    275 IAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLR 354
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1764 NLEKL--ELNVRKLQQELDQL-NRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQdvLLS 1840
Cdd:COG4717    355 EAEELeeELQLEELEQEIAALlAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEE--LEE 432
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1841 EQTRLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLvQQEMMFQRLQkERESEESKLETSKVTLKEQQHQLE 1920
Cdd:COG4717    433 ELEELEEELEELEEELEELREELAELEAELEQLEEDGELAELLQ-ELEELKAELR-ELAEEWAALKLALELLEEAREEYR 510

                   ..
gi 2217375254 1921 KE 1922
Cdd:COG4717    511 EE 512
PRK12704 PRK12704
phosphodiesterase; Provisional
1434-1537 1.29e-03

Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 44.00  E-value: 1.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1434 KADQQLRSLQADAKD-LEQHKIKQEEILKEinKIVAAKD------SDFQCLSKKKEKlteELQKLQKDIEMAERNEDHHL 1506
Cdd:PRK12704    28 IAEAKIKEAEEEAKRiLEEAKKEAEAIKKE--ALLEAKEeihklrNEFEKELRERRN---ELQKLEKRLLQKEENLDRKL 102
                           90       100       110
                   ....*....|....*....|....*....|.
gi 2217375254 1507 QVLKESEVLLQAKRAELEKLKSQVTSQQQEM 1537
Cdd:PRK12704   103 ELLEKREEELEKKEKELEQKQQELEKKEEEL 133
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
1377-1931 1.30e-03

Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 43.94  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1377 READRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLE---QHK 1453
Cdd:pfam05483  229 EEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDIKmslQRS 308
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1454 IKQEEILKEinkivaakdsDFQCLSKKKEKLTEElqklqKDIEMAERNEDH--HLQVLKESEVLLQAKRAELEKLKSQVT 1531
Cdd:pfam05483  309 MSTQKALEE----------DLQIATKTICQLTEE-----KEAQMEELNKAKaaHSFVVTEFEATTCSLEELLRTEQQRLE 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1532 SQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEALR-LGETEVTEKCNhiREVKSLLEELSFQKGELNVQISERKTQ 1610
Cdd:pfam05483  374 KNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKKiLAEDEKLLDEK--KQFEKIAEELKGKEQELIFLLQAREKE 451
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1611 LTLIKQEIEKEEENLQVVLRQMSKHKTELknildmlqlENHELQGLKLQHDQRVSELEK---TQVAVLEEKLELENLQQI 1687
Cdd:pfam05483  452 IHDLEIQLTAIKTSEEHYLKEVEDLKTEL---------EKEKLKNIELTAHCDKLLLENkelTQEASDMTLELKKHQEDI 522
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1688 SQQQKGEIEWQKQLLERDKREiermTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEK 1767
Cdd:pfam05483  523 INCKKQEERMLKQIENLEEKE----MNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNN 598
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1768 LELNVRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQ----DHLNLAKQDLLHTTKHQDVLLSEQT 1843
Cdd:pfam05483  599 LKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELELASAKqkfeEIIDNYQKEIEDKKISEEKLLEEVE 678
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1844 RLQKDISEWANRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLetsKVTLKEQQHQLEKEL 1923
Cdd:pfam05483  679 KAKAIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEERDSELGLYKNKEQEQSSA---KAALEIELSNIKAEL 755

                   ....*...
gi 2217375254 1924 TDQKSKLD 1931
Cdd:pfam05483  756 LSLKKQLE 763
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
1699-2163 1.31e-03

Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 44.04  E-value: 1.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1699 KQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQE 1778
Cdd:pfam10174  288 KNKIDQLKQELSKKESELLALQTKLETLTNQNSDCKQHIEVLKESLTAKEQRAAILQTEVDALRLRLEEKESFLNKKTKQ 367
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1779 LDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLnlakqdllhttKHQDvllseqtrlqKDISEWANRFED 1858
Cdd:pfam10174  368 LQDLTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIENLQEQL-----------RDKD----------KQLAGLKERVKS 426
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1859 CQKEEETKQQQLQVLQNEIEEnklklvqQEMMFQRLQKERESEESKletskvtLKEQQHQLEKELTDQKSKLDQVLSKVL 1938
Cdd:pfam10174  427 LQTDSSNTDTALTTLEEALSE-------KERIIERLKEQREREDRE-------RLEELESLKKENKDLKEKVSALQPELT 492
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1939 AAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMrADFSLLRNQFLTERKKAEKQVASLK 2018
Cdd:pfam10174  493 EKESSLIDLKEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKLENQLKKAHNA-EEAVRTNPEINDRIRLLEQEVARYK 571
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2019 EALKIQRSQLEKnLLTINEDSERDSSLLSKEQKQENSCIQKEMATiELVAQDNHERARRLMKELNQMQYEYTELKKQMAN 2098
Cdd:pfam10174  572 EESGKAQAEVER-LLGILREVENEKNDKDKKIAELESLTLRQMKE-QNKKVANIKHGQQEMKKKGAQLLEEARRREDNLA 649
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217375254 2099 QKDLERRQMEISDAMRTLKSEVkDEIRTSLKNLNQFLPELPADLEAI-LERNENLEGELESLKENL 2163
Cdd:pfam10174  650 DNSQQLQLEELMGALEKTRQEL-DATKARLSSTQQSLAEKDGHLTNLrAERRKQLEEILEMKQEAL 714
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
475-1064 1.39e-03

Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 43.90  E-value: 1.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  475 KQLEEAIQLKKISEAGKDL--LYKQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHS- 551
Cdd:PRK03918   149 KVVRQILGLDDYENAYKNLgeVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEk 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  552 ------HMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKDLEEQ---LTEGQIAANE-----ALKKDLEGV 617
Cdd:PRK03918   229 evkeleELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkeLKELKEKAEEyiklsEFYEEYLDE 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  618 ISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAMDAENMRKLEQSALQAELEKERQALKNALG 697
Cdd:PRK03918   309 LREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  698 KAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLKDFQNHLNHVVDGLVRPEEVAARVDELRRK--LKLGTGEM-NIHSPS 774
Cdd:PRK03918   389 LEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKelLEEYTAELkRIEKEL 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  775 DVLGKSLADLQKQFSEIlaRSKWERDEAQVRERKLQEEMALQQEKLAT-GQEEFRQACERALEARMNFDKRqhEARIQQM 853
Cdd:PRK03918   469 KEIEEKERKLRKELREL--EKVLKKESELIKLKELAEQLKELEEKLKKyNLEELEKKAEEYEKLKEKLIKL--KGEIKSL 544
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  854 ENEIHYLQENLKSMEEIQGltdlQLQEADEEKERILAQLREL------EKKKKLEDAKSQEQVFgldkelkklkkavaTS 927
Cdd:PRK03918   545 KKELEKLEELKKKLAELEK----KLDELEEELAELLKELEELgfesveELEERLKELEPFYNEY--------------LE 606
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  928 DKLATAELTIAKDQLKSLHGTVMKINQERAEELQEAERFSRKAAQAARDLTRAEAEiELLQNLLRQKGEQFRLEMEKtgv 1007
Cdd:PRK03918   607 LKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYE-ELREEYLELSRELAGLRAEL--- 682
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2217375254 1008 gTGANSQVLEIEKLNETMERQRTEIARLQNVLDLtgsdnkggFENVLEEIAELRREV 1064
Cdd:PRK03918   683 -EELEKRREEIKKTLEKLKEELEEREKAKKELEK--------LEKALERVEELREKV 730
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
1365-1525 1.41e-03

Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 42.60  E-value: 1.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1365 IEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQ--ETAVNLVKADQQLRSL 1442
Cdd:COG1579     15 LDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKkyEEQLGNVRNNKEYEAL 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1443 QADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIEMAernedhhLQVLKESEVLLQAKRAE 1522
Cdd:COG1579     95 QKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEE-------LAELEAELEELEAEREE 167

                   ...
gi 2217375254 1523 LEK 1525
Cdd:COG1579    168 LAA 170
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1258-1453 1.42e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.14  E-value: 1.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1258 EHHNLENEVSRLEDIMQHLKSKKREERWmraskRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTVQKRQQQKDFIDG 1337
Cdd:COG4913    263 RYAAARERLAELEYLRAALRLWFAQRRL-----ELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGG 337
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1338 NveslmtelEIEkSLKHhedivdEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKftdAKRSLLQTESD 1417
Cdd:COG4913    338 D--------RLE-QLER------EIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAE---AAALLEALEEE 399
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 2217375254 1418 AEELERRAQETAVNLVKADQQLRSLQADAKDLEQHK 1453
Cdd:COG4913    400 LEALEEALAEAEAALRDLRRELRELEAEIASLERRK 435
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1869-2034 1.60e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 42.60  E-value: 1.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1869 QLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETSKVTLKEqqhqLEKELTDQKSKLDQVLSKVLAAEER----- 1943
Cdd:COG1579     11 DLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELED----LEKEIKRLELEIEEVEARIKKYEEQlgnvr 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1944 ----VRTLQEEErwcESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFSLLRNQFLTERKKAEKQVASLKE 2019
Cdd:COG1579     87 nnkeYEALQKEI---ESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEA 163
                          170
                   ....*....|....*
gi 2217375254 2020 ALKIQRSQLEKNLLT 2034
Cdd:COG1579    164 EREELAAKIPPELLA 178
mukB PRK04863
chromosome partition protein MukB;
427-861 1.62e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 43.79  E-value: 1.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  427 MNLRGHTPLDTQLEDKEKKISAAQTRLSELHDEIEKA-------EQQILRATEEFKQLEEA-IQLKKISEAGKDL--LYK 496
Cdd:PRK04863   283 VHLEEALELRRELYTSRRQLAAEQYRLVEMARELAELneaesdlEQDYQAASDHLNLVQTAlRQQEKIERYQADLeeLEE 362
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  497 QLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMKAQKSGKEQQLDIMNKQYQQLES 576
Cdd:PRK04863   363 RLEEQNEVVEEADEQQEENEARAEAAEEEVDELKSQLADYQQALDVQQTRAIQYQQAVQALERAKQLCGLPDLTADNAED 442
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  577 RLDEILSRIAKETEEIKDLEEQLTEGQiAANEALKKDLEGVISglqeylgtIKGQ--ATQAQNECRKLRDEKETLLQRLT 654
Cdd:PRK04863   443 WLEEFQAKEQEATEELLSLEQKLSVAQ-AAHSQFEQAYQLVRK--------IAGEvsRSEAWDVARELLRRLREQRHLAE 513
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  655 EVEQERDQLeivamdaenmRKLEQS-ALQAELEKERQALKNALGKA-QFSEEKEQENSELHAKLKHLQDdnnllkqqlkd 732
Cdd:PRK04863   514 QLQQLRMRL----------SELEQRlRQQQRAERLLAEFCKRLGKNlDDEDELEQLQEELEARLESLSE----------- 572
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  733 fqnhlnhvvdglvRPEEVAARVDELRRKLKlgtgemnihspsdvlgkslaDLQKQFSEILARS-KWErdEAQVRERKLQE 811
Cdd:PRK04863   573 -------------SVSEARERRMALRQQLE--------------------QLQARIQRLAARApAWL--AAQDALARLRE 617
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2217375254  812 EMAlqqEKLATGQ--EEFRQA-CERALEARMNfdKRQHEARIQQMENEIHYLQ 861
Cdd:PRK04863   618 QSG---EEFEDSQdvTEYMQQlLERERELTVE--RDELAARKQALDEEIERLS 665
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1772-2162 1.66e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.60  E-value: 1.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1772 VRKLQQELDQLNRDKLSLHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISE 1851
Cdd:COG4717     48 LERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQL 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1852 WA--NRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQLEKELTDQKSK 1929
Cdd:COG4717    128 LPlyQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQR 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1930 LDQvlskvlaAEERVRTLQEE-----ERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFS------- 1997
Cdd:COG4717    208 LAE-------LEEELEEAQEEleeleEELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILtiagvlf 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1998 ------------LLRNQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDS--SLLSKEQKQENSCIQKEMAT 2063
Cdd:COG4717    281 lvlgllallfllLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEEllELLDRIEELQELLREAEELE 360
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2064 IELVAQDNHERARRLMKELN-QMQYEYTELKKQMANQKDLERRQMEISDAMRTLKSEV--------KDEIRTSLKNLNQF 2134
Cdd:COG4717    361 EELQLEELEQEIAALLAEAGvEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELeellealdEEELEEELEELEEE 440
                          410       420
                   ....*....|....*....|....*...
gi 2217375254 2135 LPELPADLEAILERNENLEGELESLKEN 2162
Cdd:COG4717    441 LEELEEELEELREELAELEAELEQLEED 468
PRK01156 PRK01156
chromosome segregation protein; Provisional
1727-2239 1.67e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 43.74  E-value: 1.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1727 SKEKEDLQEKCDIweKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRdklSLHNDISAMQQQLQEKREa 1806
Cdd:PRK01156   149 AQRKKILDEILEI--NSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKK---QIADDEKSHSITLKEIER- 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1807 vnsLQEELANVQDHLNLAKQDLlhttKHQDVLLSEQTRLQKDISEWANRFEDcQKEEETKQQQLQVLQNEIEENKLKLVQ 1886
Cdd:PRK01156   223 ---LSIEYNNAMDDYNNLKSAL----NELSSLEDMKNRYESEIKTAESDLSM-ELEKNNYYKELEERHMKIINDPVYKNR 294
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1887 QEMM-FQRLQKERESEESKLETSKVTLkEQQHQLEKELTDQKSKLDQVLSKVLAAEE---RVRTLQEEERWCESLEKTLS 1962
Cdd:PRK01156   295 NYINdYFKYKNDIENKKQILSNIDAEI-NKYHAIIKKLSVLQKDYNDYIKKKSRYDDlnnQILELEGYEMDYNSYLKSIE 373
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1963 QTKRQLSEREQQLVEKSGELLALQKEADSMRADFSLLRNQFLTERKKAEKQVASL---KEALKIQRSQLEKNLLTINEDS 2039
Cdd:PRK01156   374 SLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLnqrIRALRENLDELSRNMEMLNGQS 453
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2040 ----------ERDSSLLSKEQKQENSCIQKEMATIELVAQDNHERARRLMK--------ELNQMQYEYTELKKQMANQKD 2101
Cdd:PRK01156   454 vcpvcgttlgEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKrkeyleseEINKSINEYNKIESARADLED 533
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2102 LERRQMEISDA----------MRTLKSEVKDEIRTSLKNLNQFLPELpaDLEAILERNENLEGELESLKENLPFTMNEGP 2171
Cdd:PRK01156   534 IKIKINELKDKhdkyeeiknrYKSLKLEDLDSKRTSWLNALAVISLI--DIETNRSRSNEIKKQLNDLESRLQEIEIGFP 611
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2172 FEEKLNFSQVHIMDEHWRG-----------EALREKLRHREDRLK---AQLRHCMSKQAEVLIKGKrQTEGTLHSLRRQV 2237
Cdd:PRK01156   612 DDKSYIDKSIREIENEANNlnnkyneiqenKILIEKLRGKIDNYKkqiAEIDSIIPDLKEITSRIN-DIEDNLKKSRKAL 690

                   ..
gi 2217375254 2238 DA 2239
Cdd:PRK01156   691 DD 692
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
170-213 1.72e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 38.00  E-value: 1.72e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 2217375254  170 NLQKLNLAGNEIEHIPvwLGKKLKSLRVLNLKGN-KISSLQDISK 213
Cdd:pfam12799    2 NLEVLDLSNNQITDIP--PLAKLPNLETLDLSGNnKITDLSDLAN 44
PLN03150 PLN03150
hypothetical protein; Provisional
145-206 1.88e-03

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 43.27  E-value: 1.88e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217375254  145 KLLKLRELNLSYNKIskiEG-----IENMCNLQKLNLAGNEIE-HIPVWLGKkLKSLRVLNLKGNKIS 206
Cdd:PLN03150   440 KLRHLQSINLSGNSI---RGnippsLGSITSLEVLDLSYNSFNgSIPESLGQ-LTSLRILNLNGNSLS 503
COG5022 COG5022
Myosin heavy chain [General function prediction only];
1338-1602 1.91e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 43.53  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1338 NVESLMTELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEaesELSCTKEKTKNAVEKFTDAKRSLLqtesd 1417
Cdd:COG5022    798 KLQPLLSLLGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAE---VLIQKFGRSLKAKKRFSLLKKETI----- 869
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1418 aeelerrAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDfqcLSKKKEKLTEELQKLQKDIEM 1497
Cdd:COG5022    870 -------YLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSD---LIENLEFKTELIARLKKLLNN 939
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1498 AERNEDHHLQVLKESEVL-LQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQ---GSMVQAKADLQEALRLGE 1573
Cdd:COG5022    940 IDLEEGPSIEYVKLPELNkLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKkelAELSKQYGALQESTKQLK 1019
                          250       260       270
                   ....*....|....*....|....*....|..
gi 2217375254 1574 ---TEVTEKCNHIREVKSLLEELSFQKGELNV 1602
Cdd:COG5022   1020 elpVEVAELQSASKIISSESTELSILKPLQKL 1051
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1393-1828 1.94e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.22  E-value: 1.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1393 TKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQhKIKQEEILKEINKIVAAKDS 1472
Cdd:COG4717     65 KPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEK-LLQLLPLYQELEALEAELAE 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1473 DFQCLSKKKEKLtEELQKLQKDIEMAERNEDHHLQVLKES-EVLLQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEEL 1551
Cdd:COG4717    144 LPERLEELEERL-EELRELEEELEELEAELAELQEELEELlEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEEL 222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1552 HLLQGSMVQAKADLQealRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQ 1631
Cdd:COG4717    223 EELEEELEQLENELE---AAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKAS 299
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1632 MSKHKTELKNILDMLQLENHELQGLKlqHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIer 1711
Cdd:COG4717    300 LGKEAEELQALPALEELEEEELEELL--AALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAAL-- 375
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1712 mtaesralqsCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLE-LNVRKLQQELDQLNRDKLSLH 1790
Cdd:COG4717    376 ----------LAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEaLDEEELEEELEELEEELEELE 445
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 2217375254 1791 NDISAMQQQLQEKREAVNSL--QEELANVQDHLNLAKQDL 1828
Cdd:COG4717    446 EELEELREELAELEAELEQLeeDGELAELLQELEELKAEL 485
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1914-2163 1.94e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 43.51  E-value: 1.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1914 EQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMR 1993
Cdd:PRK03918   186 KRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELE 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1994 ADFSLLRNQfLTERKKAEKQVASLKEALKiQRSQLEKNLLTINEDSERDSSLLSKEQKQENScIQKEMATIElvaqDNHE 2073
Cdd:PRK03918   266 ERIEELKKE-IEELEEKVKELKELKEKAE-EYIKLSEFYEEYLDELREIEKRLSRLEEEING-IEERIKELE----EKEE 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2074 RARRLMKELNQMQYEYTELKKQMANQKDLERRQMEISDAMRTLKSEVKDEIRTSLKNLNQFLPELPADLEAILERNENLE 2153
Cdd:PRK03918   339 RLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELK 418
                          250
                   ....*....|
gi 2217375254 2154 GELESLKENL 2163
Cdd:PRK03918   419 KEIKELKKAI 428
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
458-665 2.14e-03

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in Plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 43.50  E-value: 2.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  458 DEIEKAEQQILRATEEFKQLEEAIQ--LKKISE-AGKDLLYKQLSGRLQLVNKLRQEALDLELQMEKQKQEIagkqKEIK 534
Cdd:TIGR01612 1500 DEADKNAKAIEKNKELFEQYKKDVTelLNKYSAlAIKNKFAKTKKDSEIIIKEIKDAHKKFILEAEKSEQKI----KEIK 1575
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  535 DLQIAIDsldskDPKHSHMKAQKSGKEQQLDIMNKQYQQLesRLDEILSRIAKETEEIKDLEEQLTEGQIAANEALKKDL 614
Cdd:TIGR01612 1576 KEKFRIE-----DDAAKNDKSNKAAIDIQLSLENFENKFL--KISDIKKKINDCLKETESIEKKISSFSIDSQDTELKEN 1648
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2217375254  615 EGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLEI 665
Cdd:TIGR01612 1649 GDNLNSLQEFLESLKDQKKNIEDKKKELDELDSEIEKIEIDVDQHKKNYEI 1699
PTZ00121 PTZ00121
MAEBL; Provisional
1265-1522 2.20e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 2.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1265 EVSRLEDIMQHLKSKKREERWMRASKRQSE--KEMEELH--HNIDDLLQEKKSLECEVEELHRTVQKRQQQKdfidGNVE 1340
Cdd:PTZ00121  1520 EAKKADEAKKAEEAKKADEAKKAEEKKKADelKKAEELKkaEEKKKAEEAKKAEEDKNMALRKAEEAKKAEE----ARIE 1595
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1341 SLMTELEIEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEE 1420
Cdd:PTZ00121  1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK 1675
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1421 LERRAQETAVNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQK-DIEMAE 1499
Cdd:PTZ00121  1676 KAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEaKKDEEE 1755
                          250       260
                   ....*....|....*....|...
gi 2217375254 1500 RNEDHHLQVLKESEVLLQAKRAE 1522
Cdd:PTZ00121  1756 KKKIAHLKKEEEKKAEEIRKEKE 1778
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1759-2052 2.28e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 43.50  E-value: 2.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1759 KMEQSNLEKLELNVRKLQQ-----------ELDQLNRDKLSLHNDISAMQ----QQLQEKREAVNSLQEELANVQDHLNL 1823
Cdd:PTZ00108   995 KRKEYLLGKLERELARLSNkvrfikhvingELVITNAKKKDLVKELKKLGyvrfKDIIKKKSEKITAEEEEGAEEDDEAD 1074
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1824 AKQDLLHT--TKHQDVLLSEQ-TRLQKDisewanRFEDCQKEEETKQQQLQVLQN-EIEE------NKLKLVQQEMMFQR 1893
Cdd:PTZ00108  1075 DEDDEEELgaAVSYDYLLSMPiWSLTKE------KVEKLNAELEKKEKELEKLKNtTPKDmwledlDKFEEALEEQEEVE 1148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1894 LQKERESEESKLETSKVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQEEERWCES---LEKTLSQTKRQLSE 1970
Cdd:PTZ00108  1149 EKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDkpdNKKSNSSGSDQEDD 1228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1971 REQQLVEKSGELLALQKEAD-----SMRADFSLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSL 2045
Cdd:PTZ00108  1229 EEQKTKPKKSSVKRLKSKKNnssksSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKK 1308

                   ....*..
gi 2217375254 2046 LSKEQKQ 2052
Cdd:PTZ00108  1309 KVKKRLE 1315
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1408-2131 2.34e-03

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 43.17  E-value: 2.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1408 KRSLLQTESDAEELERRAQEtavNLVKADQQLRSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEE 1487
Cdd:pfam05483   91 KKWKVSIEAELKQKENKLQE---NRKIIEAQRKAIQELQFENEKVSLKLEEEIQENKDLIKENNATRHLCNLLKETCARS 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1488 LQKLQKDIEMAERNEDHHLQVLKESEVLLQAkraeLEKLKSQVTSQQQEMAVldrQLGHKKEELHLLQGSMVQAKADLQE 1567
Cdd:pfam05483  168 AEKTKKYEYEREETRQVYMDLNNNIEKMILA----FEELRVQAENARLEMHF---KLKEDHEKIQHLEEEYKKEINDKEK 240
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1568 ALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQ 1647
Cdd:pfam05483  241 QVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQ 320
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1648 LENHELQGLKLQHDQRVSELEKTQVAvleeklelenLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLS 1727
Cdd:pfam05483  321 IATKTICQLTEEKEAQMEELNKAKAA----------HSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKS 390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1728 KEKEDLQEKCDIWEKKLAQTKRVLAAaEENSKMEQSNLEKLELNVRKLQQELDQLNRDKlslHNDISAMQQQLQEKREAV 1807
Cdd:pfam05483  391 SELEEMTKFKNNKEVELEELKKILAE-DEKLLDEKKQFEKIAEELKGKEQELIFLLQAR---EKEIHDLEIQLTAIKTSE 466
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1808 NSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWAnrfedcqkeEETKQQQLQVLQNEIEEnklklvqq 1887
Cdd:pfam05483  467 EHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMT---------LELKKHQEDIINCKKQE-------- 529
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1888 emmfQRLQKERESeeskLETSKVTLKEQQHQLEKELTDQKsklDQVLSKVLAAEERVRTLQEE----ERWCESLEKTLSQ 1963
Cdd:pfam05483  530 ----ERMLKQIEN----LEEKEMNLRDELESVREEFIQKG---DEVKCKLDKSEENARSIEYEvlkkEKQMKILENKCNN 598
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1964 TKRQLSEREQQLVEKSGELLALQKEADSMRADFsllrNQFLTERKKAEKQVASLKEALK--IQRSQLEKNLLTINEDSER 2041
Cdd:pfam05483  599 LKKQIENKNKNIEELHQENKALKKKGSAENKQL----NAYEIKVNKLELELASAKQKFEeiIDNYQKEIEDKKISEEKLL 674
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2042 DSSLLSKEQKQENSCIQKEmatIELVAQDNHERARRLM-KELNQMQYEYTELKKQMANQKDLERRQMEISDAMRTLKSEV 2120
Cdd:pfam05483  675 EEVEKAKAIADEAVKLQKE---IDKRCQHKIAEMVALMeKHKHQYDKIIEERDSELGLYKNKEQEQSSAKAALEIELSNI 751
                          730
                   ....*....|.
gi 2217375254 2121 KDEIRTSLKNL 2131
Cdd:pfam05483  752 KAELLSLKKQL 762
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1792-1950 2.45e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.83  E-value: 2.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1792 DISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEWANRFEDCQK--EEETKQQQ 1869
Cdd:COG1579     11 DLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEqlGNVRNNKE 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1870 LQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLEtskvTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTLQE 1949
Cdd:COG1579     91 YEALQKEIESLKRRISDLEDEILELMERIEELEEELA----ELEAELAELEAELEEKKAELDEELAELEAELEELEAERE 166

                   .
gi 2217375254 1950 E 1950
Cdd:COG1579    167 E 167
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1292-2167 2.53e-03

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in Plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 43.50  E-value: 2.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1292 QSEKEMEELHHNIDDLLQEKKSLE-CE--VEELHRTVQKRQQQKDFIDGNVESLMTELEIEKSLKHHedivdeiecIEKT 1368
Cdd:TIGR01612  901 EINKSIEEEYQNINTLKKVDEYIKiCEntKESIEKFHNKQNILKEILNKNIDTIKESNLIEKSYKDK---------FDNT 971
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1369 LLKRRSELREADRllaeaESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVkaDQQLRSLQADAKD 1448
Cdd:TIGR01612  972 LIDKINELDKAFK-----DASLNDYEAKNNELIKYFNDLKANLGKNKENMLYHQFDEKEKATNDI--EQKIEDANKNIPN 1044
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1449 LEQ------HKIkQEEILKEINK--------IVAAKDSDFQCLSKKKEKL----------------TEELQKLQKDIEMA 1498
Cdd:TIGR01612 1045 IEIaihtsiYNI-IDEIEKEIGKniellnkeILEEAEINITNFNEIKEKLkhynfddfgkeenikyADEINKIKDDIKNL 1123
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1499 ERNEDHHLQVLKESEvllqaKRAE--LEKLKSQVTSQQQemaVLDRQLGHKK-EELHLLQGSMVqakadlqealrlgeTE 1575
Cdd:TIGR01612 1124 DQKIDHHIKALEEIK-----KKSEnyIDEIKAQINDLED---VADKAISNDDpEEIEKKIENIV--------------TK 1181
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1576 VTEKCNHIREVKSLLEELSfqkgelnvQISERKTQLTLIKQEIEKEEENL-QVVLRQMSKHKTELKNILDMLQLENHELQ 1654
Cdd:TIGR01612 1182 IDKKKNIYDEIKKLLNEIA--------EIEKDKTSLEEVKGINLSYGKNLgKLFLEKIDEEKKKSEHMIKAMEAYIEDLD 1253
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1655 GLKLQHDQRVSELektqvavleeklelenlqQISQQQKGEIEWQKQLLERDKREIERMTAESRALQSCVECLSKEKEDLQ 1734
Cdd:TIGR01612 1254 EIKEKSPEIENEM------------------GIEMDIKAEMETFNISHDDDKDHHIISKKHDENISDIREKSLKIIEDFS 1315
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1735 EKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLEL-NVRKLQQeldqlnrdklsLHNDISAMQQQLQEKREAVNSLQEE 1813
Cdd:TIGR01612 1316 EESDINDIKKELQKNLLDAQKHNSDINLYLNEIANIyNILKLNK-----------IKKIIDEVKEYTKEIEENNKNIKDE 1384
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1814 LANVQDHLNLAKQDL-LHTTKHQdvllSEQTRLQKDISEWANRFEDCQK---EEETKQQQLQVLQNEIEENKLKLVQQ-E 1888
Cdd:TIGR01612 1385 LDKSEKLIKKIKDDInLEECKSK----IESTLDDKDIDECIKKIKELKNhilSEESNIDTYFKNADENNENVLLLFKNiE 1460
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1889 MMFQR----LQKERESEESKLETSKVTLKEQQHQLEKELTD-QKSKLDQVLSKVLAAEERVRTLQEEERWCE-SLEKTLS 1962
Cdd:TIGR01612 1461 MADNKsqhiLKIKKDNATNDHDFNINELKEHIDKSKGCKDEaDKNAKAIEKNKELFEQYKKDVTELLNKYSAlAIKNKFA 1540
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1963 QTKRQLSEREQQLVEKSGELLAlqkEADSMRADFSLLRNQFLTERKKAEKQVASLKEALKIQRS--QLEKNLLTINEDSE 2040
Cdd:TIGR01612 1541 KTKKDSEIIIKEIKDAHKKFIL---EAEKSEQKIKEIKKEKFRIEDDAAKNDKSNKAAIDIQLSleNFENKFLKISDIKK 1617
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2041 RDSSLLSkeqkqENSCIQKEMATIELVAQDNheRARRLMKELNQMQYEYTELKKQmanQKDLERRQMEISDAMRTLKSeV 2120
Cdd:TIGR01612 1618 KINDCLK-----ETESIEKKISSFSIDSQDT--ELKENGDNLNSLQEFLESLKDQ---KKNIEDKKKELDELDSEIEK-I 1686
                          890       900       910       920
                   ....*....|....*....|....*....|....*....|....*..
gi 2217375254 2121 KDEIRTSLKNLNQFLpelpadLEAILERNENLEGELESLKENLPFTM 2167
Cdd:TIGR01612 1687 EIDVDQHKKNYEIGI------IEKIKEIAIANKEEIESIKELIEPTI 1727
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
435-888 2.53e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 43.24  E-value: 2.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLEEA---IQLKKISEAGK-------DLLYKQLSGRLQL 504
Cdd:pfam01576   73 LEEILHELESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEEAArqkLQLEKVTTEAKikkleedILLLEDQNSKLSK 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  505 VNKLRQEAL-----DLELQMEKQKQEIAGKQKEIKDLQIAIDSLDSKDPKHSHMKAQKSGKEQQLDIMNKQYQQLESRLD 579
Cdd:pfam01576  153 ERKLLEERIseftsNLAEEEEKAKSLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGESTDLQEQIAELQAQIA 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  580 EILSRIAKETEEIKDLEEQLTEGQIAANEALKK--DLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETL-------- 649
Cdd:pfam01576  233 ELRAQLAKKEEELQAALARLEEETAQKNNALKKirELEAQISELQEDLESERAARNKAEKQRRDLGEELEALkteledtl 312
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  650 ----------LQRLTEVEQERDQLEIVAMDAE----NMRKLEQSALQAELEKERQALKNALGKAQFSEEKEQENSELHAK 715
Cdd:pfam01576  313 dttaaqqelrSKREQEVTELKKALEEETRSHEaqlqEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAE 392
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  716 LKHLQD-------DNNLLKQQLKDFQNHLNhvvDGLVRPEEVAARVDELRRKLKLGTGEMN-IHSPSDVLGKSLADLQKQ 787
Cdd:pfam01576  393 LRTLQQakqdsehKRKKLEGQLQELQARLS---ESERQRAELAEKLSKLQSELESVSSLLNeAEGKNIKLSKDVSSLESQ 469
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  788 F---SEILARSKWERDEAQVRERKLQEEMALQQEKLATgQEEFRQACERALEArMNFDKRQHEARIQQMENEIHYLQENL 864
Cdd:pfam01576  470 LqdtQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEE-EEEAKRNVERQLST-LQAQLSDMKKKLEEDAGTLEALEEGK 547
                          490       500
                   ....*....|....*....|....
gi 2217375254  865 KSMEEIQGLTDLQLQEADEEKERI 888
Cdd:pfam01576  548 KRLQRELEALTQQLEEKAAAYDKL 571
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1625-2013 2.86e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 2.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1625 LQVVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQVAVLEEKLELENlQQISQQQKGEIEWQKQLLER 1704
Cdd:COG4717     76 LEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQEL-EALEAELAELPERLEELEER 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1705 DKREIERMTAESRALQSCVECLSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNR 1784
Cdd:COG4717    155 LEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLEN 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1785 DKLSLhndisAMQQQLQEKR---------------------------EAVNSLQEELANVQDHLNLAKQDLLHTTKHQDV 1837
Cdd:COG4717    235 ELEAA-----ALEERLKEARlllliaaallallglggsllsliltiaGVLFLVLGLLALLFLLLAREKASLGKEAEELQA 309
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1838 LLSEQTRLQKDISEWANRFEDCQKEEET-------KQQQLQVLQNEIE--ENKLKLVQQEMMFQRLQKERESEESKLETS 1908
Cdd:COG4717    310 LPALEELEEEELEELLAALGLPPDLSPEellelldRIEELQELLREAEelEEELQLEELEQEIAALLAEAGVEDEEELRA 389
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1909 KVTLKEQQHQLEKELTDQKSKLDQVLSKVLAAEERVRTlqeeerwcESLEKTLSQTKRQLSEREQQLVEKSGELLALQKE 1988
Cdd:COG4717    390 ALEQAEEYQELKEELEELEEQLEELLGELEELLEALDE--------EELEEELEELEEELEELEEELEELREELAELEAE 461
                          410       420
                   ....*....|....*....|....*
gi 2217375254 1989 ADSMRADFSLLRNQFLTERKKAEKQ 2013
Cdd:COG4717    462 LEQLEEDGELAELLQELEELKAELR 486
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1264-1668 2.93e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 2.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1264 NEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLEcEVEELHRTVQKRQQqkdfIDGNVESLM 1343
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLE-KLLQLLPLYQELEA----LEAELAELP 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1344 TELE-IEKSLKHHEDIVDEIECIEKTLLKRRSELREA-DRLLAEAESELSCTKEKTKNAVEKFTDAKRSLLQTESDAEEL 1421
Cdd:COG4717    146 ERLEeLEERLEELRELEEELEELEAELAELQEELEELlEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEEL 225
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1422 ERRAQETAVNLVKADQQLR---------------SLQADAKDLEQHKIKQEEILKEINKIVAAkdsDFQCLSKKKEKLTE 1486
Cdd:COG4717    226 EEELEQLENELEAAALEERlkearlllliaaallALLGLGGSLLSLILTIAGVLFLVLGLLAL---LFLLLAREKASLGK 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1487 ELQKLQKDIEMAERNEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQemavLDRQLGHKKEELhLLQGSMVQAKADLQ 1566
Cdd:COG4717    303 EAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQE----LLREAEELEEEL-QLEELEQEIAALLA 377
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1567 EALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISER--------KTQLTLIKQEIEKEEENLQVVLRQMSKHKTE 1638
Cdd:COG4717    378 EAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELeellealdEEELEEELEELEEELEELEEELEELREELAE 457
                          410       420       430
                   ....*....|....*....|....*....|
gi 2217375254 1639 LKNILDMLQlENHELQGLKLQHDQRVSELE 1668
Cdd:COG4717    458 LEAELEQLE-EDGELAELLQELEELKAELR 486
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
86-216 3.34e-03

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 41.96  E-value: 3.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254   86 ALIKKLTKQDNLALIKSLNLSLSKDGGKKFkyIENLEKCV-KLEVLNLSYNLI--GKIEKLDKLL----KLRELNLSYNK 158
Cdd:cd00116     99 GVLESLLRSSSLQELKLNNNGLGDRGLRLL--AKGLKDLPpALEKLVLGRNRLegASCEALAKALranrDLKELNLANNG 176
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217375254  159 ISKiEGIENMC-------NLQKLNLAGNEIEHIPV----WLGKKLKSLRVLNLKGNKISSLqDISKLKP 216
Cdd:cd00116    177 IGD-AGIRALAeglkancNLEVLDLNNNGLTDEGAsalaETLASLKSLEVLNLGDNNLTDA-GAAALAS 243
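
The LRR_RI description above names the beta-strand consensus LxxLxLxxN/CxL. As a rough illustration that is not part of the CDD output, the consensus can be written as a regular expression and scanned over a sequence. The function name `find_lrr_motifs` and the strict leucine-only reading of the pattern are assumptions made for this sketch. Real LRRs often carry other hydrophobic residues at the L positions, so a strict scan is expected to miss many genuine repeats.

```python
import re

# Strict reading of the consensus LxxLxLxxN/CxL: "x" is any residue and
# "N/C" is either asparagine or cysteine. The lookahead lets overlapping
# candidates be reported instead of only non-overlapping ones.
LRR_STRICT = re.compile(r"(?=(L..L.L..[NC].L))")


def find_lrr_motifs(sequence):
    """Return (0-based offset, matched text) for every strict-consensus hit.

    Alignment gap characters ("-") are removed and residues are upper-cased
    first, so offsets refer to the ungapped sequence.
    """
    ungapped = sequence.replace("-", "").upper()
    return [(m.start(), m.group(1)) for m in LRR_STRICT.finditer(ungapped)]
```

Offsets are relative to the string passed in, so a caller working with the query rows above would add the starting residue number of the row to map a hit back to protein coordinates.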
PRK12704 PRK12704
phosphodiesterase; Provisional
1740-1902 3.44e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 42.46  E-value: 3.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1740 WEKKLAQTKRVLAAAE-----ENSKMEQSNLEKL-ELNVR----KLQQELDQLNRDKLslhNDISAMQQQLQEKREAVNS 1809
Cdd:PRK12704    24 VRKKIAEAKIKEAEEEakrilEEAKKEAEAIKKEaLLEAKeeihKLRNEFEKELRERR---NELQKLEKRLLQKEENLDR 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1810 LQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEwanRFEDCQK--EEETKQQQLQVLQNEIEENKLKLVQQ 1887
Cdd:PRK12704   101 KLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ---ELERISGltAEEAKEILLEKVEEEARHEAAVLIKE 177
                          170
                   ....*....|....*
gi 2217375254 1888 EMMFQRLQKERESEE 1902
Cdd:PRK12704   178 IEEEAKEEADKKAKE 192
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
435-734 3.80e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 42.31  E-value: 3.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQILRATEEFKQLE-----------EAIQLKKISEAGKDLLYKQLSGRLQ 503
Cdd:TIGR04523  108 INSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEkeleklnnkynDLKKQKEELENELNLLEKEKLNIQK 187
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  504 LVNKLRQEALDLELQM---EKQKQEIAGKQKEIKDLQ----IAIDSLDSKDPKHSHMKAQKSGKEQQLDIMNKQYQQLES 576
Cdd:TIGR04523  188 NIDKIKNKLLKLELLLsnlKKKIQKNKSLESQISELKkqnnQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKK 267
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  577 RLDEILSRIAKETEEIKDLEEQLTEGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEV 656
Cdd:TIGR04523  268 QLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQL 347
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  657 EQERDQLEivamdaenmrkLEQSALQAELEKERQALKNAL----GKAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLKD 732
Cdd:TIGR04523  348 KKELTNSE-----------SENSEKQRELEEKQNEIEKLKkenqSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKK 416

                   ..
gi 2217375254  733 FQ 734
Cdd:TIGR04523  417 LQ 418
PTZ00121 PTZ00121
MAEBL; Provisional
515-1034 4.16e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 42.44  E-value: 4.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQKQEIAGKQKEIKDLQIAiDSLDSKDPKHSHMKAQKSGKEQQLDimNKQYQQLESRLDEILSRIAKETEEIKD 594
Cdd:PTZ00121  1029 IEELTEYGNNDDVLKEKDIIDEDID-GNHEGKAEAKAHVGQDEGLKPSYKD--FDFDAKEDNRADEATEEAFGKAEEAKK 1105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  595 LEEQLTEGQIAANEALKKDLEgvisgLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEveqERDQLEIvAMDAENMR 674
Cdd:PTZ00121  1106 TETGKAEEARKAEEAKKKAED-----ARKAEEARKAEDARKAEEARKAEDAKRVEIARKAE---DARKAEE-ARKAEDAK 1176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  675 KLEQsALQAELEKERQALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNllkqqlkdfqnhlnhvVDGLVRPEEVAARV 754
Cdd:PTZ00121  1177 KAEA-ARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKK----------------AEAVKKAEEAKKDA 1239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  755 DELRRKLKlgtgEMNIHSPSDVLGKSLADLQKQFSEILARSKWERDEAQVRERKLQEEMALQQEKLATGQEEFRQACE-- 832
Cdd:PTZ00121  1240 EEAKKAEE----ERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEak 1315
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  833 RALEARMNFDKRQHEARIQQMENEIHYLQENLKSMEEIQGLTDLqlqEADEEKERILAQLRELEKKKKLEDAKSQEQVFG 912
Cdd:PTZ00121  1316 KADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEA---EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  913 LDKELKKLKKAVATSDKLATAELTIAK-DQLKSLHGTVMKINQ--ERAEELQEAERFSRKAAQAAR-DLTRAEAEIELLQ 988
Cdd:PTZ00121  1393 ADEAKKKAEEDKKKADELKKAAAAKKKaDEAKKKAEEKKKADEakKKAEEAKKADEAKKKAEEAKKaEEAKKKAEEAKKA 1472
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*.
gi 2217375254  989 NLLRQKGEQFRLEMEKTGVGTGANSQVLEIEKLNEtmERQRTEIAR 1034
Cdd:PTZ00121  1473 DEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAE--AKKKADEAK 1516
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1256-1608 4.40e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 42.36  E-value: 4.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1256 VPEHHNLENEVSRLEDIMQHLKSKKREERWMRASKRQSEKEMEELHHNIDDLLQEKKSLECEVEELHRTV---------- 1325
Cdd:PRK03918   285 LKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLeeleerhely 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1326 --------QKRQQQKDFIDGNVESLMTELE-IEKSLKHHEDIVDEIECIEKTLLKRRSELREADRLLAEAESEL-----S 1391
Cdd:PRK03918   365 eeakakkeELERLKKRLTGLTPEKLEKELEeLEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCpvcgrE 444
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1392 CTKEKTKNAVEKFTDAKRSLLQTESDAEELERRAQETAVNLVKADQQLRSLQADAKDLEQHKiKQEEILKEINKIVAAKD 1471
Cdd:PRK03918   445 LTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLK-ELEEKLKKYNLEELEKK 523
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1472 sdfqclSKKKEKLTEELQKLQKDIEMAERnEDHHLQVLKESEVLLQAKRAELEKLKSQVTSQQQEMAVLD-RQLGHKKEE 1550
Cdd:PRK03918   524 ------AEEYEKLKEKLIKLKGEIKSLKK-ELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESvEELEERLKE 596
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217375254 1551 LHLLQGSMVQAKA------DLQEALRLGETEVTEKCNHIREVKSLLEELSFQKGELNVQISERK 1608
Cdd:PRK03918   597 LEPFYNEYLELKDaekeleREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEE 660
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
2012-2163 5.10e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.06  E-value: 5.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 2012 KQVASLKEALKIQRSQLEKNLLTINEDSERDSSLLSKEQKQENSCIQKEMATIELVAQDNHERARRLMKELNQMQYEYTE 2091
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPERLEE 150
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217375254 2092 LKKQMANQKDLERRQMEISDAMRTLKSEVKDEIRTSLKNLNQFLPELPADLEAILERNENLEGELESLKENL 2163
Cdd:COG4717    151 LEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEEL 222
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1516-1736 6.26e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 41.29  E-value: 6.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1516 LQAKRAELEKLKSQVTSQQQEMAVLDRQLGHKKEELHLLQGSMVQAKADLQEA---LRLGETEVTEKCNHIREVKSLLEE 1592
Cdd:COG4942     22 AAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALeqeLAALEAELAELEKEIAELRAELEA 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1593 LSFQKGELNVQISERKTQLTLIKQEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGLKLQHDQRVSELEKTQV 1672
Cdd:COG4942    102 QKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLA 181
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217375254 1673 AVLEEKLELENLQQISQQQKGEIEwqkQLLERDKREIERMTAESRALQSCVECLSKEKEDLQEK 1736
Cdd:COG4942    182 ELEEERAALEALKAERQKLLARLE---KELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1438-1656 6.31e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 41.29  E-value: 6.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1438 QLRSLQADAKDLEQHKIKQEEILKEINKivaakdsdfqcLSKKKEKLTEELQKLQKDIEMAERNEDHHLQVLKESEVLLQ 1517
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIAELEKELAA-----------LKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1518 AKRAELEKLKSQVTSQQQEMA----VLDRQLGHKKEELHLLQGSMVQAKAD---LQEALRLGETEVTEKCNHIREVKSLL 1590
Cdd:COG4942     87 ELEKEIAELRAELEAQKEELAellrALYRLGRQPPLALLLSPEDFLDAVRRlqyLKYLAPARREQAEELRADLAELAALR 166
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217375254 1591 EELSFQKGELNVQISERKTQLTlikqEIEKEEENLQVVLRQMSKHKTELKNILDMLQLENHELQGL 1656
Cdd:COG4942    167 AELEAERAELEALLAELEEERA----ALEALKAERQKLLARLEKELAELAAELAELQQEAEELEAL 228
GAS pfam13851
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ...
550-664 6.65e-03

Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes, suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.


Pssm-ID: 464001 [Multi-domain]  Cd Length: 200  Bit Score: 40.27  E-value: 6.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  550 HSHMKAQKSGKEQQLDiMNKQYQQLESRLDEILSRIAKETEEIKDLEEQLTEgqiaaneaLKKDLEGVISGLQEyLGTIK 629
Cdd:pfam13851   22 RNNLELIKSLKEEIAE-LKKKEERNEKLMSEIQQENKRLTEPLQKAQEEVEE--------LRKQLENYEKDKQS-LKNLK 91
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 2217375254  630 GQATQAQNECRKLRDEKETLLQRLTEVEQERDQLE 664
Cdd:pfam13851   92 ARLKVLEKELKDLKWEHEVLEQRFEKVERERDELY 126
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1369-2047 7.65e-03

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 and 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 41.75  E-value: 7.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1369 LLKRRSELREADRLLAEAESELSCTK-EKTKNAVE----KFTDAKRSLLQTESDAE--ELERRAQETAVNLVKADQQL-- 1439
Cdd:pfam12128  155 LGRERVELRSLARQFALCDSESPLRHiDKIAKAMHskegKFRDVKSMIVAILEDDGvvPPKSRLNRQQVEHWIRDIQAia 234
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1440 ---------RSLQADAKDLEQHKIKQEEILKEINKIVAAKDSDFQCLSKKKEKLTEELQKLQKDIE--MAERNEDhhlqv 1508
Cdd:pfam12128  235 gimkirpefTKLQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQWKekRDELNGE----- 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1509 LKESEVLLQAKRAELEKLKSQVTSQQQEmavldrqlGHKKEELHLLQGSMVQAKADLQEALRLGETE----VTEKCNHIR 1584
Cdd:pfam12128  310 LSAADAAVAKDRSELEALEDQHGAFLDA--------DIETAAADQEQLPSWQSELENLEERLKALTGkhqdVTAKYNRRR 381
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1585 EVKS--LLEELSFQKGELNVQISERKTQLTLIKQEIEKEEENLQvvlRQMSKHKTELKNILDMLQLENHELQG------- 1655
Cdd:pfam12128  382 SKIKeqNNRDIAGIKDKLAKIREARDRQLAVAEDDLQALESELR---EQLEAGKLEFNEEEYRLKSRLGELKLrlnqata 458
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1656 ---LKLQHDQRVSELEKTQVAVLEEKLELENLQQISQQQKGEIEWQKQLLERDKREIERMTAESRALQ--------SCVE 1724
Cdd:pfam12128  459 tpeLLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELElqlfpqagTLLH 538
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1725 CLSKEKEDlqekcdiWEKKLAqtkRVLAAA-----------EENSKMEQSNLEKLELNVRKLQ-----QELDQLNRDKLS 1788
Cdd:pfam12128  539 FLRKEAPD-------WEQSIG---KVISPEllhrtdldpevWDGSVGGELNLYGVKLDLKRIDvpewaASEEELRERLDK 608
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1789 LHNDISAMQQQLQEKREAVNSLQEELANVQDHLNLAKQDLLHTTKHQDVLLSEQTRLQKDISEwanrfedcQKEEETKQQ 1868
Cdd:pfam12128  609 AEEALQSAREKQAAAEEQLVQANGELEKASREETFARTALKNARLDLRRLFDEKQSEKDKKNK--------ALAERKDSA 680
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1869 QLQVLQNEIEENKLKLVQQEMM-----------FQRLQKERESEESK----------LETSKVTLKEQQHQLEKELTDQK 1927
Cdd:pfam12128  681 NERLNSLEAQLKQLDKKHQAWLeeqkeqkrearTEKQAYWQVVEGALdaqlallkaaIAARRSGAKAELKALETWYKRDL 760
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1928 SKLDQVLSKVLAAEERVRTL-----------QEEERWCESLEKTLSQTKRQLSEreqQLVEKSGELLALQKEADSMRADF 1996
Cdd:pfam12128  761 ASLGVDPDVIAKLKREIRTLerkieriavrrQEVLRYFDWYQETWLQRRPRLAT---QLSNIERAISELQQQLARLIADT 837
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2217375254 1997 SLLRNQFLTERKKAEKQVASLKEALKIQRSQLEKnLLTINEDSERDSSLLS 2047
Cdd:pfam12128  838 KLRRAKLEMERKASEKQQVRLSENLRGLRCEMSK-LATLKEDANSEQAQGS 887
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
435-910 7.70e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 41.35  E-value: 7.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  435 LDTQLEDKEKKISAAQTRLSELHDEIEKAEQQIlrateefkqleeaiqlkkiseagkDLLYKQLSGRLQLVNKLRQEALD 514
Cdd:pfam10174  294 LKQELSKKESELLALQTKLETLTNQNSDCKQHI------------------------EVLKESLTAKEQRAAILQTEVDA 349
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  515 LELQMEKQKQEIAGKQKEIKDLQIAIDSLDSkdpKHSHMKAQKSGKEQQLDIMNKQYQQLESRLDEILSRIAKETEEIKD 594
Cdd:pfam10174  350 LRLRLEEKESFLNKKTKQLQDLTEEKSTLAG---EIRDLKDMLDVKERKINVLQKKIENLQEQLRDKDKQLAGLKERVKS 426
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  595 LEEQL--TEGQIAANEALKKDLEGVISGLQEylgtikgqatQAQNECRKLRDEKETLLQRLTEVEQERDQLEIVAMD--- 669
Cdd:pfam10174  427 LQTDSsnTDTALTTLEEALSEKERIIERLKE----------QREREDRERLEELESLKKENKDLKEKVSALQPELTEkes 496
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  670 -----AENMRKLEQSA---------LQAELEKERQ---ALKNALGKAQFSEEKEQENSELHAKLKHLQDDNNLLKQQLKD 732
Cdd:pfam10174  497 slidlKEHASSLASSGlkkdsklksLEIAVEQKKEecsKLENQLKKAHNAEEAVRTNPEINDRIRLLEQEVARYKEESGK 576
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  733 FQNHLNHVVDGLVRPEEvaarvDELRRKLKLGTGEMNIHSPSDVLGKSLADLQKQFSEILARSKWERDEAQVRERKLQEE 812
Cdd:pfam10174  577 AQAEVERLLGILREVEN-----EKNDKDKKIAELESLTLRQMKEQNKKVANIKHGQQEMKKKGAQLLEEARRREDNLADN 651
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  813 MALQQ-EKLATGQEEFRQACErALEARMNFDKRQHEARIQQMENeihYLQENLKSMEEIQGLTDLQLQEADEEKERILAQ 891
Cdd:pfam10174  652 SQQLQlEELMGALEKTRQELD-ATKARLSSTQQSLAEKDGHLTN---LRAERRKQLEEILEMKQEALLAAISEKDANIAL 727
                          490
                   ....*....|....*....
gi 2217375254  892 LRELEKKKKledaKSQEQV 910
Cdd:pfam10174  728 LELSSSKKK----KTQEEV 742
46 PHA02562
endonuclease subunit; Provisional
507-742 7.78e-03


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 41.15  E-value: 7.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  507 KLRQEALDLEL--QMEK-QKQEIAGKQKEIKDLQIAIDSLDSKdpkhshMKAQKSGKEQQldimNKQYQQLESRLDEILS 583
Cdd:PHA02562   154 KLVEDLLDISVlsEMDKlNKDKIRELNQQIQTLDMKIDHIQQQ------IKTYNKNIEEQ----RKKNGENIARKQNKYD 223
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  584 RIAKETEEIKDLEEQLTEgQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNE------------CR----------- 640
Cdd:PHA02562   224 ELVEEAKTIKAEIEELTD-ELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVikmyekggvcptCTqqisegpdrit 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  641 KLRDEKETLLQRLTEVEQERDQLEiVAMDAENMRKLEQSALQAELEKERQALKNALGKAqfseekeqenSELHAKLKHLQ 720
Cdd:PHA02562   303 KIKDKLKELQHSLEKLDTAIDELE-EIMDEFNEQSKKLLELKNKISTNKQSLITLVDKA----------KKVKAAIEELQ 371
                          250       260
                   ....*....|....*....|..
gi 2217375254  721 DDNNLLKQQLKDFQNHLNHVVD 742
Cdd:PHA02562   372 AEFVDNAEELAKLQDELDKIVK 393
46 PHA02562
endonuclease subunit; Provisional
304-580 7.78e-03


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 41.15  E-value: 7.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  304 DKLNKSLKEEAMLQKQSCEELKSDLNTK--------NELLKQKTIELTRAcQKQYELEQELAfYKIDAKFEPLNyypSEY 375
Cdd:PHA02562   169 DKLNKDKIRELNQQIQTLDMKIDHIQQQiktynkniEEQRKKNGENIARK-QNKYDELVEEA-KTIKAEIEELT---DEL 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  376 AEIDKaPDESPyiGKSRYKRNMFATESyiidSAQAVQIKKMEpdeqlrndHMNLRGHT-PLDTQ-LEDKEKKISAAQTRL 453
Cdd:PHA02562   244 LNLVM-DIEDP--SAALNKLNTAAAKI----KSKIEQFQKVI--------KMYEKGGVcPTCTQqISEGPDRITKIKDKL 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  454 SELhdeiekaeqqilraTEEFKQLEEAIQlkkiseagkdllykQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEI 533
Cdd:PHA02562   309 KEL--------------QHSLEKLDTAID--------------ELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKA 360
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 2217375254  534 KDLQIAIDSLDSKDPKHshmKAQKSGKEQQLDIMNKQYQQLESRLDE 580
Cdd:PHA02562   361 KKVKAAIEELQAEFVDN---AEELAKLQDELDKIVKTKSELVKEKYH 404
Surf_Exclu_PgrA TIGR04320
SEC10/PgrA surface exclusion domain; This model describes a conserved domain found in surface ...
1726-1828 8.05e-03

SEC10/PgrA surface exclusion domain; This model describes a conserved domain found in surface proteins of a number of Firmicutes. Many members have LPXTG C-terminal anchoring motifs and a substantial number have the KxYKxGKxW putative sorting signal at the N-terminus. The tetracycline resistance plasmid pCF10 in Enterococcus faecalis promotes conjugal plasmid transfer in response to sex pheromones, but PgrA/Sec10 encoded by that plasmid, a member of this family, specifically inhibits the ability of cells to receive homologous plasmids. The phenomenon is called surface exclusion.


Pssm-ID: 275124 [Multi-domain]  Cd Length: 356  Bit Score: 40.87  E-value: 8.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1726 LSKEKEDLQEKCDIWEKKLAQTKRVLAAAEENSKMEQSNLEKLELNVRKLQQELDQLNRDKLSL-HNDISAMQQQLQEKR 1804
Cdd:TIGR04320  252 PPNSLAALQAKLATAQADLAAAQTALNTAQAALTSAQTAYAAAQAALATAQKELANAQAQALQTaQNNLATAQAALANAE 331
                           90       100
                   ....*....|....*....|....
gi 2217375254 1805 EAVNSLQEELANVQDHLNLAKQDL 1828
Cdd:TIGR04320  332 ARLAKAKEALANLNADLAKKQAAL 355
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1854-2085 8.29e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 41.03  E-value: 8.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1854 NRFEDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLETSKVTLKEQQHQLEkeltdQKSKLDQV 1933
Cdd:pfam07888   34 NRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELE-----EKYKELSA 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1934 LSKVLAAEERVRTLQEEE-----RWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFSLLRNQFLTERK 2008
Cdd:pfam07888  109 SSEELSEEKDALLAQRAAheariRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELR 188
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217375254 2009 KAEKQVASLKEALKIQRSQLEKNLLTINEDSERDSSLLSKEQKQENSCiqKEMATIELVAQDNHERARRLMKELNQM 2085
Cdd:pfam07888  189 SLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEALL--EELRSLQERLNASERKVEGLGEELSSM 263
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
245-664 8.94e-03


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 41.20  E-value: 8.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  245 RSLESLEGQPVTTQDRQEAFERfSLEEVERLERDLEKKMIETEELKSKQTRFLEEIKNQDKLNKSL----KEEAML--QK 318
Cdd:PRK03918   245 KELESLEGSKRKLEEKIRELEE-RIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELreieKRLSRLeeEI 323
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  319 QSCEELKSDLNTKNELLKQKTIELTRACQKQYELEQ-ELAFYKIDAKFEPLNYYPSEYAEIDKAPDESPYIGKSRYKRNM 397
Cdd:PRK03918   324 NGIEERIKELEEKEERLEELKKKLKELEKRLEELEErHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEI 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  398 FATESYIIDsaqavQIKKMEPDEQLRNDHMN----LRGHTPL-----------------DTQLEDKEKKISAAQTRLSEL 456
Cdd:PRK03918   404 EEEISKITA-----RIGELKKEIKELKKAIEelkkAKGKCPVcgrelteehrkelleeyTAELKRIEKELKEIEEKERKL 478
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  457 HDEIEKAEQQILRATEEFKQLEEAIQLKKISEAGKDLLYKQLSGRLQLVNKLRQEALDLELQMEKQKQEIAGKQKEIKDL 536
Cdd:PRK03918   479 RKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKL 558
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  537 QIAIDSLDSKDPKHSHMKAQKS-----------GKEQQLDIMNKQYQQL---ESRLDEILSRIAKETEEIKDLEEQLTEG 602
Cdd:PRK03918   559 AELEKKLDELEEELAELLKELEelgfesveeleERLKELEPFYNEYLELkdaEKELEREEKELKKLEEELDKAFEELAET 638
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217375254  603 QIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKETLLQRLTEVEQERDQLE 664
Cdd:PRK03918   639 EKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLK 700
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
572-722 9.72e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 39.91  E-value: 9.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254  572 QQLESRLDEILSRIAKETEEIKDLEEQLT--EGQIAANEALKKDLEGVISGLQEYLGTIKGQATQAQNECRKLRDEKE-- 647
Cdd:COG1579     13 QELDSELDRLEHRLKELPAELAELEDELAalEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNKEye 92
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217375254  648 TLLQRLTEVEQERDQLEIVAMDAenMRKLEQ-SALQAELEKERQALKNALGKAQfsEEKEQENSELHAKLKHLQDD 722
Cdd:COG1579     93 ALQKEIESLKRRISDLEDEILEL--MERIEElEEELAELEAELAELEAELEEKK--AELDEELAELEAELEELEAE 164
46 PHA02562
endonuclease subunit; Provisional
1778-2030 9.79e-03


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 41.15  E-value: 9.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1778 ELDQLNRDKlslhndISAMQQQLQEKREAVNSLQEELANVQDHLNlakqdllhttkhqdvllsEQTRLQKD-ISEWANRF 1856
Cdd:PHA02562   167 EMDKLNKDK------IRELNQQIQTLDMKIDHIQQQIKTYNKNIE------------------EQRKKNGEnIARKQNKY 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1857 EDCQKEEETKQQQLQVLQNEIEENKLKLVQQEMMFQRLQKERESEESKLET-SKV-----------TLKEQQHQLEKELT 1924
Cdd:PHA02562   223 DELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQfQKVikmyekggvcpTCTQQISEGPDRIT 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217375254 1925 DQKSKLDQVLSKVLAAEERVRTLQEEERWCESLEKTLSQTKRQLSEREQQLVEKSGELLALQKEADSMRADFsllrnqfl 2004
Cdd:PHA02562   303 KIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEF-------- 374
                          250       260
                   ....*....|....*....|....*.
gi 2217375254 2005 TERKKAEKQVASLKEALKIQRSQLEK 2030
Cdd:PHA02562   375 VDNAEELAKLQDELDKIVKTKSELVK 400
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
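
The per-hit summary lines in this report ("Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...") can be read programmatically and checked against the E-value threshold listed above. The following is a minimal sketch, not an NCBI tool: the function names `parse_summary` and `passes_threshold` are hypothetical, the regular expression assumes the line layout seen in this text dump, and treating the threshold as an inclusive upper bound is an assumption.

```python
import re

# Layout assumed from the summary lines in this report, e.g.
# "Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 41.35  E-value: 7.70e-03"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold (inclusive bound is an assumption)."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 41.35  E-value: 7.70e-03"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Every hit listed in this report has an E-value below 0.01, which is consistent with the threshold in the preset options.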

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.