|
Name |
Accession |
Description |
Interval |
E-value |
| HD_SAS6_N |
cd10142 |
N-terminal head domain found in spindle assembly abnormal protein 6 and similar proteins; ... |
4-144 |
8.04e-58 |
|
N-terminal head domain found in spindle assembly abnormal protein 6 and similar proteins; Spindle assembly abnormal protein 6 (SAS6) is a central scaffolding component of the centrioles that ensures their 9-fold symmetry. It is required for centrosome biogenesis and duplication, and is required for both mother-centriole-dependent centriole duplication and deuterosome-dependent centriole amplification in multiciliated cells. It is also required for the recruitment of microcephaly protein STIL to the procentriole and for STIL-mediated centriole amplification. SAS6 is comprised of an N-terminal globular head domain, a centrally located coiled-coil domain, and a disordered C-terminus. These monomers homodimerize symmetrically, through two dimerization domains, the N-terminal globular head domains and long extended alpha-helical coiled-coil regions. These homodimers can self-assemble into a 9-fold symmetric cartwheel structure comprised of nine SAS6 homodimers associated via their head domains; the dimerized coiled-coil domains being the spokes, the central hub being the head domains. This model corresponds to the N-terminal head domain of SAS6, which is structurally related to other XRCC4-superfamily members, XRCC4, PAXX, XLF and CCDC61.
Pssm-ID: 408998 Cd Length: 137 Bit Score: 191.23 E-value: 8.04e-58
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 4 VLFQQLVPLLVKCKDCEERRGSVRVSIELQSlSNPVHRKDLVIRLTDDTDPFFLYNLVISEEDFQSLKLQQGLLVDFLAF 83
Cdd:cd10142 1 VLYSRELPVEVKSQDREERLEVLRVKIEILS-SGSGHKKELRVRLTDETDLFFLYTLELNEEDFQSLKEQQGLLVDFSAF 79
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 170650645 84 PQKFIDLLQQCMQEHAKETPRFLLQLLSSAtllENSPVLLNVVETNPFKHLIHLSLKLLPG 144
Cdd:cd10142 80 PQKLIDLLELCIKEESKEPPKFLLVLVISS---DKGRATLEIVETNPFKHLTHLSLKFLPG 137
|
|
| SAS-6_N |
pfam16531 |
Centriolar protein SAS N-terminal; SAS-6_N is the N-terminal domain of the SAS-6 centriolar ... |
44-141 |
6.92e-28 |
|
Centriolar protein SAS N-terminal; SAS-6_N is the N-terminal domain of the SAS-6 centriolar protein, both in C.elegans and in humans. The N-terminal domain is the region through which the 9 rod-shaped homodimers that SAS-6 forms on oligomerization interact with each other. Proper functioning of the centriole requires this correct oligomerization.
Pssm-ID: 465163 Cd Length: 88 Bit Score: 107.28 E-value: 6.92e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 44 LVIRLTDDTDPFFLYNLVISEEDFQSLKLQQGLLVDFLAFPQKFIDLLQQCMQEhaketprfllQLLSSATLLENSPVLL 123
Cdd:pfam16531 1 LRIELTDDSDLFFLYILEIDEEDFESLKQEQNLLVDFEDFPQKLIKLLNNCIKE----------PNLLVFLIQDDGTATL 70
|
90
....*....|....*...
gi 170650645 124 NVVETNPFKHLIHLSLKL 141
Cdd:pfam16531 71 VFIENNEFKNLEHLSLDF 88
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
165-450 |
1.46e-15 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 80.87 E-value: 1.46e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 165 LSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEwashtasltnkhSQELTAEKEKALQTQVQCQQQHEQQKKELETLHQ 244
Cdd:TIGR02168 673 LERRREIEELEEKIEELEEKIAELEKALAELRKE------------LEELEEELEQLRKELEELSRQISALRKDLARLEA 740
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 245 R------NIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHIN 318
Cdd:TIGR02168 741 EveqleeRIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 319 QLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLM 398
Cdd:TIGR02168 821 NLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELS 900
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 170650645 399 GKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAgqflrakEQEVCRLQEQL 450
Cdd:TIGR02168 901 EELRELESKRSELRRELEELREKLAQLELRLEGL-------EVRIDNLQERL 945
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
184-482 |
1.06e-13 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 74.59 E-value: 1.06e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 184 TLSEKMQELDK----LRSEWASHTASLTNKHSQELTAEKEKALQTQVQCQQQHEQQKKELETLHQRnIHQLQSRLSELEA 259
Cdd:COG1196 217 ELKEELKELEAelllLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELE-LEEAQAEEYELLA 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 260 ANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQTKVAVLEQEIKDKDQLVL 339
Cdd:COG1196 296 ELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELA 375
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 340 RTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVTIQQEKLLAEKE 419
Cdd:COG1196 376 EAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELE 455
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 170650645 420 EMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNE-KLITWLNKELNENQ 482
Cdd:COG1196 456 EEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEgFLEGVKAALLLAGL 519
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
243-482 |
1.84e-12 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 70.86 E-value: 1.84e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 243 HQRNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQT 322
Cdd:TIGR02168 237 LREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLER 316
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 323 KVAVLEQEI------KDKDQLVL-RTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLK 395
Cdd:TIGR02168 317 QLEELEAQLeeleskLDELAEELaELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIA 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 396 TLMGKLK-LKNTVTI---QQEKLLAEKEEMLQK-ERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKL 470
Cdd:TIGR02168 397 SLNNEIErLEARLERledRRERLQQEIEELLKKlEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA 476
|
250
....*....|..
gi 170650645 471 ITWLNKELNENQ 482
Cdd:TIGR02168 477 LDAAERELAQLQ 488
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
167-470 |
2.19e-12 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 70.47 E-value: 2.19e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 167 LTRsLDDVTRQLHITQETL---SEKMQELDKLRSEWASHTASLTNKHSQELTAEKEKALQTQVQCQQQHEQQKKELETL- 242
Cdd:TIGR02168 188 LDR-LEDILNELERQLKSLerqAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELe 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 243 -----HQRNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHI 317
Cdd:TIGR02168 267 ekleeLRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 318 NQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQ------KVA-LEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKL 390
Cdd:TIGR02168 347 EELKEELESLEAELEELEAELEELESRLEELEEQletlrsKVAqLELQIASLNNEIERLEARLERLEDRRERLQQEIEEL 426
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 391 Q-----GDLKTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEE------ 459
Cdd:TIGR02168 427 LkkleeAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENlegfse 506
|
330
....*....|..
gi 170650645 460 -SKQLLKNNEKL 470
Cdd:TIGR02168 507 gVKALLKNQSGL 518
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
236-490 |
1.45e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 67.65 E-value: 1.45e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 236 KKELETL-HQRNIHQLQSRLSELEAANKELTERKykgdSTVRELKAKLAGVEEELQRAKQEVLSLRRENctldtecHEKE 314
Cdd:COG1196 219 KEELKELeAELLLLKLRELEAELEELEAELEELE----AELEELEAELAELEAELEELRLELEELELEL-------EEAQ 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 315 KHINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEEngeknqiQLGKLEATIKSLSAELLKANEIIKKLQGDL 394
Cdd:COG1196 288 AEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEE-------ELEELEEELEELEEELEEAEEELEEAEAEL 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 395 KTLMGKLKLKNTVTIQ-QEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITW 473
Cdd:COG1196 361 AEAEEALLEAEAELAEaEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEE 440
|
250
....*....|....*..
gi 170650645 474 LNKELNENQLVRKQDTL 490
Cdd:COG1196 441 EEALEEAAEEEAELEEE 457
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
162-480 |
5.37e-11 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 65.86 E-value: 5.37e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 162 EEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEwashtasltnkhsqELTAEKEKALQTQVQCQQQHEQQKKELEt 241
Cdd:TIGR02169 170 RKKEKALEELEEVEENIERLDLIIDEKRQQLERLRRE--------------REKAERYQALLKEKREYEGYELLKEKEA- 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 242 lHQRNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEvlslrrENCTLDTECHEKEKHINQLQ 321
Cdd:TIGR02169 235 -LERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEE------EQLRVKEKIGELEAEIASLE 307
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 322 TKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKL 401
Cdd:TIGR02169 308 RSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDEL 387
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 402 KlknTVTIQQEKLLAEKEEM------LQKERKESQDAGQFLRAK-----------EQEVCRLQEQLETTVQKLEESKQLL 464
Cdd:TIGR02169 388 K---DYREKLEKLKREINELkreldrLQEELQRLSEELADLNAAiagieakinelEEEKEDKALEIKKQEWKLEQLAADL 464
|
330
....*....|....*.
gi 170650645 465 KNNEKLITWLNKELNE 480
Cdd:TIGR02169 465 SKYEQELYDLKEEYDR 480
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
248-487 |
7.26e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 65.73 E-value: 7.26e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 248 HQLQSRLSELEAankELTERKYkgdstvRELKAKLAGVEEELQRAKQEVLSLRRENctldtecHEKEKHINQLQTKVAVL 327
Cdd:COG1196 216 RELKEELKELEA---ELLLLKL------RELEAELEELEAELEELEAELEELEAEL-------AELEAELEELRLELEEL 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 328 EQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTV 407
Cdd:COG1196 280 ELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAE 359
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 408 TIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKELNENQLVRKQ 487
Cdd:COG1196 360 LAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEE 439
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
161-426 |
1.52e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 64.69 E-value: 1.52e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 161 KEEKLSLTRSLDDVTRQLHITQETLSEKM-------QELDKLRSEWASHTASLTNKHSQ-ELTAEKEKALQTQVQCQQQH 232
Cdd:TIGR02168 245 QEELKEAEEELEELTAELQELEEKLEELRlevseleEEIEELQKELYALANEISRLEQQkQILRERLANLERQLEELEAQ 324
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 233 EQQKKELETLHQRNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHE 312
Cdd:TIGR02168 325 LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIER 404
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 313 KEKHINQLQTKVAVLEQEIKDKDQLVLRTKeaFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQG 392
Cdd:TIGR02168 405 LEARLERLEDRRERLQQEIEELLKKLEEAE--LKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAER 482
|
250 260 270
....*....|....*....|....*....|....*
gi 170650645 393 DLKTLMGKLKLKNTVTIQQEKL-LAEKEEMLQKER 426
Cdd:TIGR02168 483 ELAQLQARLDSLERLQENLEGFsEGVKALLKNQSG 517
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
262-487 |
2.74e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 63.92 E-value: 2.74e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 262 KELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENctldtecHEKEKHINQLQTKVAVLEQEIKDKDQLVLRT 341
Cdd:TIGR02168 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKEL-------EELSRQISALRKDLARLEAEVEQLEERIAQL 752
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 342 KEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAEllkaneiIKKLQGDLKTlmgklkLKNTVTIQQEKLLAEKEEM 421
Cdd:TIGR02168 753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQ-------IEQLKEELKA------LREALDELRAELTLLNEEA 819
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 170650645 422 LQKERKESQDAGQfLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKELNENQLVRKQ 487
Cdd:TIGR02168 820 ANLRERLESLERR-IAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERAS 884
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
161-426 |
2.77e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 63.80 E-value: 2.77e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 161 KEEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLTNKHSQELTAEKEKALQtqvqcqqqheqqkKELE 240
Cdd:COG1196 245 EAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARL-------------EERR 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 241 TLHQRNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENctldtecHEKEKHINQL 320
Cdd:COG1196 312 RELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAEL-------AEAEEELEEL 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 321 QTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGK 400
Cdd:COG1196 385 AEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLEL 464
|
250 260
....*....|....*....|....*.
gi 170650645 401 LKLKNTVTIQQEKLLAEKEEMLQKER 426
Cdd:COG1196 465 LAELLEEAALLEAALAELLEELAEAA 490
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
236-486 |
1.26e-09 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 61.62 E-value: 1.26e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 236 KKELETLHQ------RNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTE 309
Cdd:TIGR02169 687 KRELSSLQSelrrieNRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR 766
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 310 CHEKEKHINQLQTKVAVLEQEIKDkdqlvlrtkEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKK 389
Cdd:TIGR02169 767 IEELEEDLHKLEEALNDLEARLSH---------SRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQE 837
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 390 LQGDLKTLMGKlklKNTVTIQQEKLLAEKEEMLQKERKesqdagqfLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEK 469
Cdd:TIGR02169 838 LQEQRIDLKEQ---IKSIEKEIENLNGKKEELEEELEE--------LEAALRDLESRLGDLKKERDELEAQLRELERKIE 906
|
250
....*....|....*..
gi 170650645 470 LITWLNKELNENQLVRK 486
Cdd:TIGR02169 907 ELEAQIEKKRKRLSELK 923
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
244-487 |
2.73e-09 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 60.34 E-value: 2.73e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 244 QRNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRREnctLDTECHEKEKHINQLQTK 323
Cdd:COG1196 238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAE---LARLEQDIARLEERRREL 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 324 VAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLM----- 398
Cdd:COG1196 315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLealra 394
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 399 --GKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNK 476
Cdd:COG1196 395 aaELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAAL 474
|
250
....*....|.
gi 170650645 477 ELNENQLVRKQ 487
Cdd:COG1196 475 LEAALAELLEE 485
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
307-555 |
7.16e-09 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 58.30 E-value: 7.16e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 307 DTECHEKEKHINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELlkaNEI 386
Cdd:COG3883 15 DPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREEL---GER 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 387 IKKLQ---------------GDLKTLMGKLKLKNTVTIQQEKLLAE---KEEMLQKERKESQDAGQFLRAKEQEVCRLQE 448
Cdd:COG3883 92 ARALYrsggsvsyldvllgsESFSDFLDRLSALSKIADADADLLEElkaDKAELEAKKAELEAKLAELEALKAELEAAKA 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 449 QLETTVQKLEESKQLLKNNEK-LITWLNKELNENQLVRKQDTLGTSATPHSTSNSTIRSGLSPNLNVVDRLNYPSCGIGY 527
Cdd:COG3883 172 ELEAQQAEQEALLAQLSAEEAaAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGA 251
|
250 260
....*....|....*....|....*...
gi 170650645 528 PVSSALTFQNAFPHVVAAKNTSHPISGP 555
Cdd:COG3883 252 AGAAGAAAGSAGAAGAAAGAAGAGAAAA 279
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
249-478 |
1.41e-08 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 57.08 E-value: 1.41e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 249 QLQSRLSELEAANKELTERKYKgdstVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQTKVAVLE 328
Cdd:COG4942 21 AAAEAEAELEQLQQEIAELEKE----LAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELR 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 329 QEIKDKDQLvlrtkeafdtIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVT 408
Cdd:COG4942 97 AELEAQKEE----------LAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALR 166
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 409 IQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKEL 478
Cdd:COG4942 167 AELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEA 236
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
157-490 |
4.75e-08 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 56.18 E-value: 4.75e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 157 LKCSKEEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLTNkhsqeLTAEKEKALQTQVQCQQQHEQQK 236
Cdd:TIGR04523 206 LKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQ-----LKDEQNKIKKQLSEKQKELEQNN 280
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 237 KELETLhQRNIHQLQSRLSEL-----EAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECH 311
Cdd:TIGR04523 281 KKIKEL-EKQLNQLKSEISDLnnqkeQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENS 359
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 312 EKEKHINQLQTKVAVLEQEIKDKDQ----LVLRTKEAFDTIQEQKVA---LEENGEKNQIQLGKLEATIKSLSAELLKAN 384
Cdd:TIGR04523 360 EKQRELEEKQNEIEKLKKENQSYKQeiknLESQINDLESKIQNQEKLnqqKDEQIKKLQQEKELLEKEIERLKETIIKNN 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 385 EIIKKLQGDLKTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQ---LETTVQKLEESK 461
Cdd:TIGR04523 440 SEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEkkeLEEKVKDLTKKI 519
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 170650645 462 QLLKNN-----------EKLITWLNKELNENQLVRKQDTL 490
Cdd:TIGR04523 520 SSLKEKieklesekkekESKISDLEDELNKDDFELKKENL 559
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
250-478 |
6.90e-08 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 55.84 E-value: 6.90e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 250 LQSRLSELEAANKELTERKYkgdstvrELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQTKVAVLEQ 329
Cdd:TIGR02169 665 GILFSRSEPAELQRLRERLE-------GLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKE 737
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 330 EIKDK-------DQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEAT-----IKSLSAELLKANEIIKKLQGDLKTL 397
Cdd:TIGR02169 738 RLEELeedlsslEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARlshsrIPEIQAELSKLEEEVSRIEARLREI 817
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 398 MGKLKlkntvtiqqeKLLAEKeEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETtvqKLEESKQLLKNNEKLITWLNKE 477
Cdd:TIGR02169 818 EQKLN----------RLTLEK-EYLEKEIQELQEQRIDLKEQIKSIEKEIENLNG---KKEELEEELEELEAALRDLESR 883
|
.
gi 170650645 478 L 478
Cdd:TIGR02169 884 L 884
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
249-462 |
8.53e-08 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 55.41 E-value: 8.53e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 249 QLQSRLSELEAANKELTERkykgdstVRELKAKLAGVEEELQ--RAKQEVLSLRRENCTLDTEchekekhINQLQTKVAV 326
Cdd:COG3206 165 NLELRREEARKALEFLEEQ-------LPELRKELEEAEAALEefRQKNGLVDLSEEAKLLLQQ-------LSELESQLAE 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 327 LEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQI--QLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLK-- 402
Cdd:COG3206 231 ARAELAEAEARLAALRAQLGSGPDALPELLQSPVIQQLraQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQqe 310
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 170650645 403 -------LKNTVTI--QQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQ 462
Cdd:COG3206 311 aqrilasLEAELEAlqAREASLQAQLAQLEARLAELPELEAELRRLEREVEVARELYESLLQRLEEARL 379
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
160-488 |
1.07e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 55.07 E-value: 1.07e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 160 SKEEKLS-LTRSLDDVTRQL------HITQETLSEKMQELDKLRSEWASHTASLTNKHSQELTAEKEKalqtqvqcqqqh 232
Cdd:PRK03918 335 EKEERLEeLKKKLKELEKRLeeleerHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEE------------ 402
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 233 eqQKKELETLHQRnIHQLQSRLSELEAANKELTERKYKGDSTVREL------------KAKLAGVEEELQRAKQEVLSLR 300
Cdd:PRK03918 403 --IEEEISKITAR-IGELKKEIKELKKAIEELKKAKGKCPVCGRELteehrkelleeyTAELKRIEKELKEIEEKERKLR 479
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 301 RENCTLDTECHEKEKHINQLQTkvavLEQEIKDKDQLvlrtkEAFDtIQEQKVALEENgEKNQIQLGKLEATIKSLSAEL 380
Cdd:PRK03918 480 KELRELEKVLKKESELIKLKEL----AEQLKELEEKL-----KKYN-LEELEKKAEEY-EKLKEKLIKLKGEIKSLKKEL 548
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 381 LKANEIIKKL----------QGDLKTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQL 450
Cdd:PRK03918 549 EKLEELKKKLaelekkldelEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEEL 628
|
330 340 350
....*....|....*....|....*....|....*...
gi 170650645 451 ETTVQKLEESKQLLKNNEKLITWLNKELNENQLVRKQD 488
Cdd:PRK03918 629 DKAFEELAETEKRLEELRKELEELEKKYSEEEYEELRE 666
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
243-443 |
2.07e-07 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 52.23 E-value: 2.07e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 243 HQRNIHQLQ---SRLSELEAANKELTERkykgdstVRELKAKLAGVEEELQRAKQEVLSLRREnctldtechekekhinq 319
Cdd:COG1579 5 DLRALLDLQeldSELDRLEHRLKELPAE-------LAELEDELAALEARLEAAKTELEDLEKE----------------- 60
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 320 lqtkVAVLEQEIKDKDQLVLRTKEAFDTIQEQK--VALeengeknQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTL 397
Cdd:COG1579 61 ----IKRLELEIEEVEARIKKYEEQLGNVRNNKeyEAL-------QKEIESLKRRISDLEDEILELMERIEELEEELAEL 129
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 170650645 398 MGKLKlkntvtiQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEV 443
Cdd:COG1579 130 EAELA-------ELEAELEEKKAELDEELAELEAELEELEAEREEL 168
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
260-419 |
2.93e-07 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 53.63 E-value: 2.93e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 260 ANKELTERKYKGDSTVRELKAKL-AGVEEELQRAKQEVLSLRREnctLDTECHEKEKHINQLQTKVAVLEQEIKDKDQLV 338
Cdd:PRK12704 29 AEAKIKEAEEEAKRILEEAKKEAeAIKKEALLEAKEEIHKLRNE---FEKELRERRNELQKLEKRLLQKEENLDRKLELL 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 339 LRTKEAFDTIQEQKVALEEngeknqiQLGKLEATIKSLSAELLKANEIIKKLQGD------LKTLMGKLKLKNTVTIQQE 412
Cdd:PRK12704 106 EKREEELEKKEKELEQKQQ-------ELEKKEEELEELIEEQLQELERISGLTAEeakeilLEKVEEEARHEAAVLIKEI 178
|
....*..
gi 170650645 413 KLLAEKE 419
Cdd:PRK12704 179 EEEAKEE 185
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
250-470 |
3.33e-07 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 53.36 E-value: 3.33e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 250 LQSRLSELEAANKELterkykgdstVRELKAKLAGVEEELQRAKqevlslrRENCTLDTECHEKEKHINQLQTKVAVLEQ 329
Cdd:pfam07888 32 LQNRLEECLQERAEL----------LQAQEAANRQREKEKERYK-------RDREQWERQRRELESRVAELKEELRQSRE 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 330 EIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVTI 409
Cdd:pfam07888 95 KHEELEEKYKELSASSEELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERK 174
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 170650645 410 QQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKL 470
Cdd:pfam07888 175 QLQAKLQQTEEELRSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEAL 235
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
160-471 |
6.57e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 52.76 E-value: 6.57e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 160 SKEEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEwashtasLTNKHSQELTAEKEKAlqtqvqcqqQHEQQKKEL 239
Cdd:PRK03918 201 ELEEVLREINEISSELPELREELEKLEKEVKELEELKEE-------IEELEKELESLEGSKR---------KLEEKIREL 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 240 EtlhqRNIHQLQSRLSELEAANKELTERKYKGDSTVR------ELKAKLAGVEEELQRAKQEVLSLRRENctldTECHEK 313
Cdd:PRK03918 265 E----ERIEELKKEIEELEEKVKELKELKEKAEEYIKlsefyeEYLDELREIEKRLSRLEEEINGIEERI----KELEEK 336
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 314 EKHINQLQTKvavlEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQI-QLGKLEATIKSLSAELLKANEIIKKLQG 392
Cdd:PRK03918 337 EERLEELKKK----LKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGlTPEKLEKELEELEKAKEEIEEEISKITA 412
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 170650645 393 DLKTLMGKLKLKNTVTIQQEKllAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLI 471
Cdd:PRK03918 413 RIGELKKEIKELKKAIEELKK--AKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVL 489
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
249-492 |
1.46e-06 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 51.06 E-value: 1.46e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 249 QLQSRLSELEAANKELterkykgdstvRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQTKVAVLE 328
Cdd:COG4372 32 QLRKALFELDKLQEEL-----------EQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQ 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 329 QEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVT 408
Cdd:COG4372 101 EELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAE 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 409 IQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKELNENQLVRKQD 488
Cdd:COG4372 181 AEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEI 260
|
....
gi 170650645 489 TLGT 492
Cdd:COG4372 261 EELE 264
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
153-394 |
1.52e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 51.61 E-value: 1.52e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 153 LAGCLKCSKEEKLSLTRSLDDVTRQLHITQETLSEK---MQELDKLRSEWASHTASLTNKHSQELTAEKEKALQTQVQCQ 229
Cdd:TIGR02169 228 LLKEKEALERQKEAIERQLASLEEELEKLTEEISELekrLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLE 307
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 230 QQHEQQKKELETLHQR------NIHQLQSRLSELEAANKELTERK------YKGDSTVRE-LKAKLAGVEEELQRAKQEV 296
Cdd:TIGR02169 308 RSIAEKERELEDAEERlakleaEIDKLLAEIEELEREIEEERKRRdklteeYAELKEELEdLRAELEEVDKEFAETRDEL 387
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 297 LSLRREnctLDTECHEKEKHIN----------QLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQL 366
Cdd:TIGR02169 388 KDYREK---LEKLKREINELKReldrlqeelqRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADL 464
|
250 260
....*....|....*....|....*...
gi 170650645 367 GKLEATIKSLSAELLKANEIIKKLQGDL 394
Cdd:TIGR02169 465 SKYEQELYDLKEEYDRVEKELSKLQREL 492
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
170-480 |
2.43e-06 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 50.81 E-value: 2.43e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 170 SLDDVTRQLHITQETlsEKMQELDKLRSEWASHTASLTNKHSQELTAEKekALQTQVQCQQQHEQQKKELETLHQrNIHQ 249
Cdd:PRK02224 188 SLDQLKAQIEEKEEK--DLHERLNGLESELAELDEEIERYEEQREQARE--TRDEADEVLEEHEERREELETLEA-EIED 262
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 250 LQSRLSELEAANKELTERKYKGDSTVRELKAKLAGV--EEELQRAKQEVLSLRRE--------------NCTLDTECHEK 313
Cdd:PRK02224 263 LRETIAETEREREELAEEVRDLRERLEELEEERDDLlaEAGLDDADAEAVEARREeledrdeelrdrleECRVAAQAHNE 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 314 E-----KHINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQ-------IQLGKLEATIKSLSAELL 381
Cdd:PRK02224 343 EaeslrEDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRerfgdapVDLGNAEDFLEELREERD 422
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 382 KANEIIKKLQGDLKTLMGKLKLKNTV--------------------TI-----QQEKLLAEKEEM--LQKERKESQDAGQ 434
Cdd:PRK02224 423 ELREREAELEATLRTARERVEEAEALleagkcpecgqpvegsphveTIeedreRVEELEAELEDLeeEVEEVEERLERAE 502
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 170650645 435 FLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKELNE 480
Cdd:PRK02224 503 DLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAE 548
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
217-385 |
3.12e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 50.54 E-value: 3.12e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 217 EKEKALQTQVQCQQQHEQQKKELETLHQRnIHQLQSRLSELEAANKELT--ERKYKGDSTVRELKAKLAGVEEELQRAKQ 294
Cdd:COG4717 75 ELEEELKEAEEKEEEYAELQEELEELEEE-LEELEAELEELREELEKLEklLQLLPLYQELEALEAELAELPERLEELEE 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 295 EVLSLRRenctLDTECHEKEKHINQLQTKVAVLEQEIK-DKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATI 373
Cdd:COG4717 154 RLEELRE----LEEELEELEAELAELQEELEELLEQLSlATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEEL 229
|
170
....*....|..
gi 170650645 374 KSLSAELLKANE 385
Cdd:COG4717 230 EQLENELEAAAL 241
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
190-452 |
3.77e-06 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 50.30 E-value: 3.77e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 190 QELDKLRSEWASHTASLtnkhsQELTAEKEKALQTQVQCQQQHEQQKKELETL-HQRNIHQLQSRLSELEAANKELterk 268
Cdd:COG4913 617 AELAELEEELAEAEERL-----EALEAELDALQERREALQRLAEYSWDEIDVAsAEREIAELEAELERLDASSDDL---- 687
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 269 ykgdstvRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQTKVAVLEQEIkdKDQLVLRTKEAFDTI 348
Cdd:COG4913 688 -------AALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLA--RLELRALLEERFAAA 758
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 349 QEQKV------ALEENGEKNQIQLGKLEATI---------------KSLSAELLKANEIIKKLQgdlktlmgklklkntv 407
Cdd:COG4913 759 LGDAVerelreNLEERIDALRARLNRAEEELeramrafnrewpaetADLDADLESLPEYLALLD---------------- 822
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 170650645 408 TIQQEKLLAEKEEMLQ-KERKESQDAGQFLRAKEQEVCRLQEQLET 452
Cdd:COG4913 823 RLEEDGLPEYEERFKElLNENSIEFVADLLSKLRRAIREIKERIDP 868
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
237-476 |
8.04e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 49.40 E-value: 8.04e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 237 KELETLHQRNIHQLQSRLSELEAANKELTERKYKGD---STVRELKAKLAGVEEELQRAKQEvlslrrenctLDTECHEK 313
Cdd:pfam01576 18 KERQQKAESELKELEKKHQQLCEEKNALQEQLQAETelcAEAEEMRARLAARKQELEEILHE----------LESRLEEE 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 314 EKHINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGD 393
Cdd:pfam01576 88 EERSQQLQNEKKKMQQHIQDLEEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLLEDQNSKLSKERKLLEERISEFTSN 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 394 LKTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESK-QLLKNNEKLIT 472
Cdd:pfam01576 168 LAEEEEKAKSLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGESTDLQEQIAELQAQIAELRaQLAKKEEELQA 247
|
....
gi 170650645 473 WLNK 476
Cdd:pfam01576 248 ALAR 251
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
236-461 |
8.13e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 49.29 E-value: 8.13e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 236 KKELETLHQRNiHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQR----------AKQEVLSLRRENCT 305
Cdd:PRK03918 178 IERLEKFIKRT-ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKEleelkeeieeLEKELESLEGSKRK 256
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 306 LDTECHEKEKHINQLQTKVAVLEQ------EIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAE 379
Cdd:PRK03918 257 LEEKIRELEERIEELKKEIEELEEkvkelkELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK 336
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 380 LLKANEIIKKLQG----------------DLKTLMGKL-KLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQE 442
Cdd:PRK03918 337 EERLEELKKKLKElekrleeleerhelyeEAKAKKEELeRLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGE 416
|
250
....*....|....*....
gi 170650645 443 VCRLQEQLETTVQKLEESK 461
Cdd:PRK03918 417 LKKEIKELKKAIEELKKAK 435
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
317-487 |
8.21e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.61 E-value: 8.21e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 317 INQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKT 396
Cdd:COG4942 22 AAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEA 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 397 LMGKLKlKNTVTIQQ------EKLLAEKEEMLQKER---------KESQDAGQFLRAKEQEVCRLQEQLETTVQKLEesk 461
Cdd:COG4942 102 QKEELA-ELLRALYRlgrqppLALLLSPEDFLDAVRrlqylkylaPARREQAEELRADLAELAALRAELEAERAELE--- 177
|
170 180
....*....|....*....|....*.
gi 170650645 462 QLLKNNEKLITWLNKELNENQLVRKQ 487
Cdd:COG4942 178 ALLAELEEERAALEALKAERQKLLAR 203
|
|
| HD_XRCC4-like_N |
cd22210 |
N-terminal head domain found in the XRCC4 superfamily of proteins; The XRCC4 superfamily ... |
41-144 |
1.08e-05 |
|
N-terminal head domain found in the XRCC4 superfamily of proteins; The XRCC4 superfamily includes five families: XRCC4, XLF, PAXX, SAS6 and CCDC61. XRCC4 (X-ray repair cross-complementing protein 4), XLF (XRCC4-like factor) and PAXX (paralog of XRCC4 and XLF) play crucial roles in the non-homologous end-joining (NHEJ) DNA repair pathway. SAS6 (spindle assembly abnormal protein 6) and CCDC61 (coiled-coil domain-containing protein 61) have a centrosomal/centriolar function. Members of this superfamily have an N-terminal globular head domain, a centrally located coiled-coil, and a C-terminal low-complexity region. They form homodimers through two homodimerization domains: an N-terminal globular head domain and a parallel coiled-coil domain. In addition, some members such as XRCC4 and XLF form symmetric heterodimers that interact through their globular head domains at the opposite end of the homodimer interface, and may form XLF-XRCC4 filaments. This model corresponds to the N-terminal head domain of XRCC4 superfamily proteins.
Pssm-ID: 408999 Cd Length: 115 Bit Score: 44.84 E-value: 1.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 41 RKDLVIRLTDDtdpFFLYNLVISEEDFQSLKLQQGLLVDFLAFPQKFIDLLQQCmqehAKETPRFLLQLlssaTLLENSP 120
Cdd:cd22210 25 ERGLRLHVSDD---AFLWTGEVSESDISQLKNDQGILVDFASFPGKLRSALEKC----ILASDRFTFVL----TIRGDEA 93
|
90 100
....*....|....*....|....
gi 170650645 121 VLlnVVETNPFKHLIHLSLKLLPG 144
Cdd:cd22210 94 YL--KLVEILDEQLPHITFALRKV 115
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
166-401 |
1.40e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 47.84 E-value: 1.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 166 SLTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLtnkhsqeltAEKEKALQTQVQCQQQHEQQKKELETLHQR 245
Cdd:COG4942 59 ALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL---------EAQKEELAELLRALYRLGRQPPLALLLSPE 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 246 NIHQLQSRLSELEAANKELTERkykgdstVRELKAKLagveEELQRAKQEvlslrrenctLDTECHEKEKHINQLQTKVA 325
Cdd:COG4942 130 DFLDAVRRLQYLKYLAPARREQ-------AEELRADL----AELAALRAE----------LEAERAELEALLAELEEERA 188
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 170650645 326 VLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKnqiqlgkLEATIKSLSAELLKANEiiKKLQGDLKTLMGKL 401
Cdd:COG4942 189 ALEALKAERQKLLARLEKELAELAAELAELQQEAEE-------LEALIARLEAEAAAAAE--RTPAAGFAALKGKL 255
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
236-485 |
1.84e-05 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 48.00 E-value: 1.84e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 236 KKELETLHQRNIHQLQSRLSEL-----EAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLslrrenctldtec 310
Cdd:PRK05771 37 KEELSNERLRKLRSLLTKLSEAldklrSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIK------------- 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 311 hEKEKHINQLQTKVAVLEQEIKDKDQLvlrtkEAFDTiqeqKVALEENGEKNQIQLGKLEATIKS-LSAELLKANEIIKK 389
Cdd:PRK05771 104 -ELEEEISELENEIKELEQEIERLEPW-----GNFDL----DLSLLLGFKYVSVFVGTVPEDKLEeLKLESDVENVEYIS 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 390 lqgdlktlmgKLKLKNTVTIQQEKLLAEK-EEMLQK---ERKESQDAG---QFLRAKEQEVCRLQEQLETTVQKLEESKQ 462
Cdd:PRK05771 174 ----------TDKGYVYVVVVVLKELSDEvEEELKKlgfERLELEEEGtpsELIREIKEELEEIEKERESLLEELKELAK 243
|
250 260
....*....|....*....|...
gi 170650645 463 llKNNEKLITWlnKELNENQLVR 485
Cdd:PRK05771 244 --KYLEELLAL--YEYLEIELER 262
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
211-482 |
1.87e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 48.04 E-value: 1.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 211 SQELTAEKEKALQTQVQCQQQHEQQKKELETLHQRNIHQLQSRLSELEAANKELTERKYKGDSTVRELKaklagVEEELQ 290
Cdd:pfam02463 165 SRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLK-----LNEERI 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 291 RAKQEVLSLRRENCTLDTECHEKEKHINQLQTKVAVLE-QEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKL 369
Cdd:pfam02463 240 DLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEeKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKES 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 370 EATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVTIQ----QEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCR 445
Cdd:pfam02463 320 EKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEEleklQEKLEQLEEELLAKKKLESERLSSAAKLKEEELEL 399
|
250 260 270
....*....|....*....|....*....|....*..
gi 170650645 446 LQEQLETTVQKLEESKQLLKNNEKLITWLNKELNENQ 482
Cdd:pfam02463 400 KSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEE 436
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
186-484 |
2.11e-05 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 47.83 E-value: 2.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 186 SEKMQELDKlRSEWASHTASLTNKHSQELTAE--KEKA--LQTQVQCQQQHEQQKKELETLHQRNIHQLQSRLSELEAAN 261
Cdd:PTZ00121 1456 AKKAEEAKK-KAEEAKKADEAKKKAEEAKKADeaKKKAeeAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAK 1534
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 262 KELTERKYKGDSTVREL-KAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQTKVAVLEQEIKDKDQLVLR 340
Cdd:PTZ00121 1535 KADEAKKAEEKKKADELkKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKK 1614
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 341 TKEAfdTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVTIQQEKLLAEKEE 420
Cdd:PTZ00121 1615 AEEA--KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 170650645 421 MLQKERKESQDAGQfLRAKEQEVCRLQEQL----ETTVQKLEESKQLLKNNEKLITWLNKELNENQLV 484
Cdd:PTZ00121 1693 ALKKEAEEAKKAEE-LKKKEAEEKKKAEELkkaeEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKI 1759
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
215-487 |
2.81e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.37 E-value: 2.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 215 TAEKEKALQTQVQCQQQHEQQKKELETLHQRnIHQLQSRLSELEAANKELTERKYKGDSTVRE----LKAKLAGVEEELQ 290
Cdd:PRK03918 471 IEEKERKLRKELRELEKVLKKESELIKLKEL-AEQLKELEEKLKKYNLEELEKKAEEYEKLKEklikLKGEIKSLKKELE 549
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 291 RAKQevlsLRRENCTLDTECHEKEKHINQLQTKVAVLEQE-IKDKDQLVLRTKEAFDTIQEQKVALEEngeknqiqLGKL 369
Cdd:PRK03918 550 KLEE----LKKKLAELEKKLDELEEELAELLKELEELGFEsVEELEERLKELEPFYNEYLELKDAEKE--------LERE 617
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 370 EATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNtVTIQQEKLLAEKEEMLQKERKESQdagqfLRAKEQEVCRLQEQ 449
Cdd:PRK03918 618 EKELKKLEEELDKAFEELAETEKRLEELRKELEELE-KKYSEEEYEELREEYLELSRELAG-----LRAELEELEKRREE 691
|
250 260 270
....*....|....*....|....*....|....*...
gi 170650645 450 LETTVQKLEESKQLLKNNEKLITWLNKELNENQLVRKQ 487
Cdd:PRK03918 692 IKKTLEKLKEELEEREKAKKELEKLEKALERVEELREK 729
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
183-482 |
4.55e-05 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 46.65 E-value: 4.55e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 183 ETL-SEKMQELDKLRSEWASHTASLTNKHSQELTAEKEKALQTQVQCQQQHEQQK------KELETLHQRNIHQLQSRLS 255
Cdd:pfam15921 248 EALkSESQNKIELLLQQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEiiqeqaRNQNSMYMRQLSDLESTVS 327
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 256 ELEAankELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDtechekekhiNQLQTKVAVLEQeikdkd 335
Cdd:pfam15921 328 QLRS---ELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLD----------DQLQKLLADLHK------ 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 336 qlvlRTKEaFDTIQEQKVALEENGEKNQIqlgkleaTIKSLSAELLKANEIIKKLQGDLKTlmgklkLKNTVTIQQEKLL 415
Cdd:pfam15921 389 ----REKE-LSLEKEQNKRLWDRDTGNSI-------TIDHLRRELDDRNMEVQRLEALLKA------MKSECQGQMERQM 450
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 170650645 416 AEKEEMLQKERKESQDAGQFLRAKeqevcrlqEQLETTVQKLEESKQLLKNNEKLITWLNKELNENQ 482
Cdd:pfam15921 451 AAIQGKNESLEKVSSLTAQLESTK--------EMLRKVVEELTAKKMTLESSERTVSDLTASLQEKE 509
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
162-469 |
5.93e-05 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 46.32 E-value: 5.93e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 162 EEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLR----SEWASHTASLTN---KHSQELTAEKEKaLQTQVQCQQQHEQ 234
Cdd:pfam01576 299 EELEALKTELEDTLDTTAAQQELRSKREQEVTELKkaleEETRSHEAQLQEmrqKHTQALEELTEQ-LEQAKRNKANLEK 377
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 235 QKKELETLHQrnihQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKE 314
Cdd:pfam01576 378 AKQALESENA----ELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAE 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 315 KHINQLQTKVAVLEQEIKDKDQLV---LRTKEAFDT----IQEQKVALEENGEKNQIQLGKLEATIKSLSAELLkanEII 387
Cdd:pfam01576 454 GKNIKLSKDVSSLESQLQDTQELLqeeTRQKLNLSTrlrqLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLS---DMK 530
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 388 KKLQGDLKTLMGKLKLKntvtiqqEKLLAEKEEMLQKERKESQDAGQFLRAKEqevcRLQEQLETTVQKLEESKQLLKNN 467
Cdd:pfam01576 531 KKLEEDAGTLEALEEGK-------KRLQRELEALTQQLEEKAAAYDKLEKTKN----RLQQELDDLLVDLDHQRQLVSNL 599
|
..
gi 170650645 468 EK 469
Cdd:pfam01576 600 EK 601
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
236-426 |
6.47e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.91 E-value: 6.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 236 KKELETLHQRNIHQLQSRLSELEAANKELTERKykgdSTVRELKAKLAGVEEELQRAKQEVLSLRRE---------NCTL 306
Cdd:COG4717 55 ADELFKPQGRKPELNLKELKELEEELKEAEEKE----EEYAELQEELEELEEELEELEAELEELREEleklekllqLLPL 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 307 DTECHEKEKHINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQI----QLGKLEATIKSLSAELLK 382
Cdd:COG4717 131 YQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLateeELQDLAEELEELQQRLAE 210
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 170650645 383 ANEIIKKLQGDLKTLMGKLKlkntvTIQQEKLLAEKEEMLQKER 426
Cdd:COG4717 211 LEEELEEAQEELEELEEELE-----QLENELEAAALEERLKEAR 249
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
182-433 |
7.51e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 45.53 E-value: 7.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 182 QETLSEKMQELDKLRSEwashtASLTNKHSQELTAEKEKALqtqvqcqqqheqqkKELETLhQRNIHQLQSRLSELEAAN 261
Cdd:COG4942 19 ADAAAEAEAELEQLQQE-----IAELEKELAALKKEEKALL--------------KQLAAL-ERRIAALARRIRALEQEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 262 KELTERkykgdstVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKE----KHINQLQTKVAVLEQEIKDKDQL 337
Cdd:COG4942 79 AALEAE-------LAELEKEIAELRAELEAQKEELAELLRALYRLGRQPPLALllspEDFLDAVRRLQYLKYLAPARREQ 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 338 VLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVTIQQEKLLAE 417
Cdd:COG4942 152 AEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231
|
250
....*....|....*.
gi 170650645 418 KEEMLQKERKESQDAG 433
Cdd:COG4942 232 LEAEAAAAAERTPAAG 247
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
204-462 |
7.70e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 45.53 E-value: 7.70e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 204 ASLTNKHSQELTAEKEKALQTQVQCQQQHEQQKKELetlhQRNIHQLQSRLSELEAANKELTERkykgdstVRELKAKLA 283
Cdd:COG4942 11 LALAAAAQADAAAEAEAELEQLQQEIAELEKELAAL----KKEEKALLKQLAALERRIAALARR-------IRALEQELA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 284 GVEEELQRAKQEVLSLRREnctLDTECHEKEKHINQLQTkvavleQEIKDKDQLVLRTKEAFDTIQEQKV------ALEE 357
Cdd:COG4942 80 ALEAELAELEKEIAELRAE---LEAQKEELAELLRALYR------LGRQPPLALLLSPEDFLDAVRRLQYlkylapARRE 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 358 NGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLmgklklkntvtiqqEKLLAEKEEMLQKERKESQDAGQFLR 437
Cdd:COG4942 151 QAEELRADLAELAALRAELEAERAELEALLAELEEERAAL--------------EALKAERQKLLARLEKELAELAAELA 216
|
250 260
....*....|....*....|....*
gi 170650645 438 AKEQEVCRLQEQLETTVQKLEESKQ 462
Cdd:COG4942 217 ELQQEAEELEALIARLEAEAAAAAE 241
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
148-478 |
9.40e-05 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 45.78 E-value: 9.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 148 EIKKFLAGCLKCSKEEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLTNKHSQELTAEKEKAlqtqvq 227
Cdd:TIGR04523 307 DWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQ------ 380
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 228 cqqqheqQKKEletlhqrNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLD 307
Cdd:TIGR04523 381 -------SYKQ-------EIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLT 446
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 308 TECHEKEKHINQLQTKVAVLEQEIKD---------------KDQLVLRTKEaFDTIQEQKVALEENGEKNQIQLGKLEAT 372
Cdd:TIGR04523 447 NQDSVKELIIKNLDNTRESLETQLKVlsrsinkikqnleqkQKELKSKEKE-LKKLNEEKKELEEKVKDLTKKISSLKEK 525
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 373 IKSLSAELLKANEIIKKLQGDLKTL---MGKLKLKNTVTIQQEKLL-------------AEKEEMLQKERKESQDAGQFL 436
Cdd:TIGR04523 526 IEKLESEKKEKESKISDLEDELNKDdfeLKKENLEKEIDEKNKEIEelkqtqkslkkkqEEKQELIDQKEKEKKDLIKEI 605
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 170650645 437 RAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKEL 478
Cdd:TIGR04523 606 EEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEV 647
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
161-356 |
1.05e-04 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 45.70 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 161 KEEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLTNKHSQELTAEKEKAlqtqvQCQQQHEQQKKELE 240
Cdd:COG1196 336 EEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAA-----ELAAQLEELEEAEE 410
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 241 TLHQRnIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQL 320
Cdd:COG1196 411 ALLER-LERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEA 489
|
170 180 190
....*....|....*....|....*....|....*.
gi 170650645 321 QTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALE 356
Cdd:COG1196 490 AARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGA 525
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
157-458 |
1.09e-04 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 45.49 E-value: 1.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 157 LKCSKEEKLSLTRSLDDVTRQLHITQETLSE-------KMQELDKLRSEwashtasltNKHSQELTAEKEKALQTQVQCQ 229
Cdd:pfam15921 491 LESSERTVSDLTASLQEKERAIEATNAEITKlrsrvdlKLQELQHLKNE---------GDHLRNVQTECEALKLQMAEKD 561
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 230 QQHEQQKKELETL------HQRNIHQLQSRLSELEaanKELTER----------KYKGDSTVRELKAKLAGVEEE---LQ 290
Cdd:pfam15921 562 KVIEILRQQIENMtqlvgqHGRTAGAMQVEKAQLE---KEINDRrlelqefkilKDKKDAKIRELEARVSDLELEkvkLV 638
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 291 RAKQE----VLSLRRENCTLDTECHEKEKHINQLQTKVAVLEQEIKDK--------DQLVLRTKEAFDTIQEQKVALEEN 358
Cdd:pfam15921 639 NAGSErlraVKDIKQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKseemetttNKLKMQLKSAQSELEQTRNTLKSM 718
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 359 ------------GEKNQI-----QLGKLEATIKSLSAELLKAN-------EIIKKLQGDLKTLM---GKLKLKNTVTIQQ 411
Cdd:pfam15921 719 egsdghamkvamGMQKQItakrgQIDALQSKIQFLEEAMTNANkekhflkEEKNKLSQELSTVAtekNKMAGELEVLRSQ 798
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 170650645 412 EKLLAEK----EEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLE 458
Cdd:pfam15921 799 ERRLKEKvanmEVALDKASLQFAECQDIIQRQEQESVRLKLQHTLDVKELQ 849
|
|
| DUF724 |
pfam05266 |
Protein of unknown function (DUF724); This family contains several uncharacterized proteins ... |
238-333 |
1.63e-04 |
|
Protein of unknown function (DUF724); This family contains several uncharacterized proteins found in Arabidopsis thaliana and other plants. This region is often found associated with Agenet domains and may contain coiled-coil.
Pssm-ID: 428400 [Multi-domain] Cd Length: 188 Bit Score: 43.03 E-value: 1.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 238 ELETlHQRNIHQLQSRLSEL----EAANKELTERKyKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEK 313
Cdd:pfam05266 79 ELEK-HGFDVKAPQSRINKLlslkDRQTKLLEELK-KLEKKIAEEESEKRKLEEEIDELEKKILELERQLALAKEKKEAA 156
|
90 100
....*....|....*....|
gi 170650645 314 EKHINQLQTKVAVLEQEIKD 333
Cdd:pfam05266 157 DKEIARLKSEAEKLEQEIQD 176
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
166-366 |
1.79e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 44.62 E-value: 1.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 166 SLTRSLDDVTRQLHITQETLSEKMQELDKLRSE----WASHTASLTNKHSQELTAEKEKALQTQVQCQQQHEQQKKELET 241
Cdd:COG3206 172 EARKALEFLEEQLPELRKELEEAEAALEEFRQKnglvDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGS 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 242 LHQ--------RNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVL-SLRRENCTLDTECHE 312
Cdd:COG3206 252 GPDalpellqsPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILaSLEAELEALQAREAS 331
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 170650645 313 KEKHINQLQTKVAVL---EQEIKDKDQLVLRTKEAFDTIQE--QKVALEENGEKNQIQL 366
Cdd:COG3206 332 LQAQLAQLEARLAELpelEAELRRLEREVEVARELYESLLQrlEEARLAEALTVGNVRV 390
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
251-459 |
3.33e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 44.14 E-value: 3.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 251 QSRLSELEAANKELTERkykgdstVRELKAKLAGVEEELQRAKQevlslRRENCTLDTECHEKEKHINQLQTKVAVLEQE 330
Cdd:COG4913 609 RAKLAALEAELAELEEE-------LAEAEERLEALEAELDALQE-----RREALQRLAEYSWDEIDVASAEREIAELEAE 676
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 331 IKDkdqlVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLmgklklkntVTIQ 410
Cdd:COG4913 677 LER----LDASSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAA---------EDLA 743
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 170650645 411 QEKLLAEKEEMLQKERKEsQDAGQFLRAKEQEVCRLQEQLETTVQKLEE 459
Cdd:COG4913 744 RLELRALLEERFAAALGD-AVERELRENLEERIDALRARLNRAEEELER 791
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
168-366 |
3.41e-04 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 43.90 E-value: 3.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 168 TRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLtNKHSQELTAEKEKALQTQVQCQQQHEQQKKELETLhQRNI 247
Cdd:TIGR02169 814 LREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSI-EKEIENLNGKKEELEEELEELEAALRDLESRLGDL-KKER 891
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 248 HQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELqrakQEVLSLRREnctlDTECHEKEKHINQLQTKVAVL 327
Cdd:TIGR02169 892 DELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEEL----SEIEDPKGE----DEEIPEEELSLEDVQAELQRV 963
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 170650645 328 EQEIKDKDQLVLR-------TKEAFDTIQEQKVALEEngEKNQIQL 366
Cdd:TIGR02169 964 EEEIRALEPVNMLaiqeyeeVLKRLDELKEKRAKLEE--ERKAILE 1007
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
160-451 |
3.80e-04 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 43.35 E-value: 3.80e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 160 SKEEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASL--TNKHSQELTAEKEKALQTQVQCQQQHEQQKK 237
Cdd:COG4372 29 LSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELeqLEEELEELNEQLQAAQAELAQAQEELESLQE 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 238 ELETLHQRnIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKE--K 315
Cdd:COG4372 109 EAEELQEE-LEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQAldE 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 316 HINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLK 395
Cdd:COG4372 188 LLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAI 267
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*.
gi 170650645 396 TLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLE 451
Cdd:COG4372 268 LVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLE 323
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
272-492 |
3.94e-04 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 43.35 E-value: 3.94e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 272 DSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQ 351
Cdd:COG4372 30 SEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEE 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 352 KVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLK-LKNTVTIQQEKLLAEKEEMLQKERKESQ 430
Cdd:COG4372 110 AEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLEsLQEELAALEQELQALSEAEAEQALDELL 189
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 170650645 431 DAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKELNENQLVRKQDTLGT 492
Cdd:COG4372 190 KEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEEL 251
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
254-477 |
4.87e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 43.22 E-value: 4.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 254 LSELEAANKELteRKYKGDSTVRELKaKLAGVEEELQRAKQEVLSLRRenctLDTECHEKEKHINQLQTKVAVLEQEIKD 333
Cdd:COG4717 48 LERLEKEADEL--FKPQGRKPELNLK-ELKELEEELKEAEEKEEEYAE----LQEELEELEEELEELEAELEELREELEK 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 334 KDQLvlrtKEAFDTIQEQKvALEENGEKNQIQLGKLEA---TIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVTIQ 410
Cdd:COG4717 121 LEKL----LQLLPLYQELE-ALEAELAELPERLEELEErleELRELEEELEELEAELAELQEELEELLEQLSLATEEELQ 195
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 170650645 411 QeklLAEKEEMLQKERKEsqdagqflrakeqevcrLQEQLETTVQKLEESKQLLKNNEKLITWLNKE 477
Cdd:COG4717 196 D---LAEELEELQQRLAE-----------------LEEELEEAQEELEELEEELEQLENELEAAALE 242
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
243-482 |
5.51e-04 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 43.19 E-value: 5.51e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 243 HQRNI--HQLQSRLSELEAAN-KELTERKYKGDSTVRELKAKLAGVEEELQR-----AKQEVLSLRRENCTLDTECHEKE 314
Cdd:pfam17380 280 HQKAVseRQQQEKFEKMEQERlRQEKEEKAREVERRRKLEEAEKARQAEMDRqaaiyAEQERMAMERERELERIRQEERK 359
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 315 KHINQL-QTKVAVLEQEIKDKDQLVL-------RTKEAFDTIQEQKVALEENGEKNQIQLGKLEatikSLSAELLKANEI 386
Cdd:pfam17380 360 RELERIrQEEIAMEISRMRELERLQMerqqkneRVRQELEAARKVKILEEERQRKIQQQKVEME----QIRAEQEEARQR 435
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 387 -IKKLQGDLKTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQdagqflraKEQEVCRLQEQLETTV--QKLEESKQL 463
Cdd:pfam17380 436 eVRRLEEERAREMERVRLEEQERQQQVERLRQQEEERKRKKLELE--------KEKRDRKRAEEQRRKIleKELEERKQA 507
|
250
....*....|....*....
gi 170650645 464 LKNNEKLITWLNKELNENQ 482
Cdd:pfam17380 508 MIEEERKRKLLEKEMEERQ 526
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
348-480 |
6.24e-04 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 42.84 E-value: 6.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 348 IQEQKVALEENGEKNQIQLGKLEAtiKSLSAE-LLKANEIIKKLQGDLKTlmgKLKLKNTVTIQQEKLLAEKEEMLqKER 426
Cdd:PRK12704 28 IAEAKIKEAEEEAKRILEEAKKEA--EAIKKEaLLEAKEEIHKLRNEFEK---ELRERRNELQKLEKRLLQKEENL-DRK 101
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 170650645 427 KESqdagqfLRAKEQEvcrLQEQLETTVQKLEESKQLLKNNEKLITWLNKELNE 480
Cdd:PRK12704 102 LEL------LEKREEE---LEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
183-488 |
6.77e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 43.11 E-value: 6.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 183 ETLSEKMQELDKLRSEWASHTASLTNKHS-QELTAEKEKALQTQVQCQQQHEQQKKELETlHQRNIHQLQSRLSELEAAN 261
Cdd:TIGR00606 795 ERFQMELKDVERKIAQQAAKLQGSDLDRTvQQVNQEKQEKQHELDTVVSKIELNRKLIQD-QQEQIQHLKSKTNELKSEK 873
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 262 KELTERKYKGDSTVRELKAKLAGVEE---ELQRAKQEVLSLRRencTLDTECHEKEKHINQLQTKVAVLEQEIKDK---- 334
Cdd:TIGR00606 874 LQIGTNLQRRQQFEEQLVELSTEVQSlirEIKDAKEQDSPLET---FLEKDQQEKEELISSKETSNKKAQDKVNDIkekv 950
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 335 DQLVLRTKEAFDTIQEQKvaleengeknQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNtvtiQQEKL 414
Cdd:TIGR00606 951 KNIHGYMKDIENKIQDGK----------DDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQK----IQERW 1016
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 170650645 415 LAEkEEMLQKERKESQDAGQFLRAKEQEVCRLQ-EQLETTVQKLEESKQLLKNNEKLITWLNKELNENQLVRKQD 488
Cdd:TIGR00606 1017 LQD-NLTLRKRENELKEVEEELKQHLKEMGQMQvLQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKE 1090
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
148-482 |
9.05e-04 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 42.65 E-value: 9.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 148 EIKKFLAGCLKCSKEEKLSLTRSLDDVTRQLHITQETLSEKMQEL-DKLRSEWASHTASLTNKHSQELTAEKEKALQTQV 226
Cdd:TIGR00618 187 AKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLrEALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLR 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 227 QCQQQHEQQKKELETLHQR-NIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCT 305
Cdd:TIGR00618 267 ARIEELRAQEAVLEETQERiNRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRL 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 306 LDTeCHEKEKHINQlQTKVAVLEQEIKDKdQLVLRtkEAFDTIQEQKVALEENgeknqiqlgkleatIKSLSAELlkanE 385
Cdd:TIGR00618 347 LQT-LHSQEIHIRD-AHEVATSIREISCQ-QHTLT--QHIHTLQQQKTTLTQK--------------LQSLCKEL----D 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 386 IIKKLQGDLKTLMGKLKlkntvTIQQEKLLAEKEEMLQKERKE-----SQDAGQFLRAKEQEVCRLQEQLETTVQKLEES 460
Cdd:TIGR00618 404 ILQREQATIDTRTSAFR-----DLQGQLAHAKKQQELQQRYAElcaaaITCTAQCEKLEKIHLQESAQSLKEREQQLQTK 478
|
330 340
....*....|....*....|..
gi 170650645 461 KQLLKNNEKLITWLNKELNENQ 482
Cdd:TIGR00618 479 EQIHLQETRKKAVVLARLLELQ 500
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
253-480 |
2.00e-03 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 41.59 E-value: 2.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 253 RLSELEAANKELTERKykgdstvRELKAKLAGVEEELQRaKQEVLSLRRENctlDTECHEKEKHINQLQTKVAVLEQEIK 332
Cdd:PRK03918 156 GLDDYENAYKNLGEVI-------KEIKRRIERLEKFIKR-TENIEELIKEK---EKELEEVLREINEISSELPELREELE 224
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 333 DKDQLVLR---TKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKklqgDLKTLMGKLKlkntvti 409
Cdd:PRK03918 225 KLEKEVKEleeLKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVK----ELKELKEKAE------- 293
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 170650645 410 QQEKLLAEKEEMLQKERKESQDAGQFlrakEQEVCRLQEQLEttvqKLEESKQLLKNNEKLITWLNKELNE 480
Cdd:PRK03918 294 EYIKLSEFYEEYLDELREIEKRLSRL----EEEINGIEERIK----ELEEKEERLEELKKKLKELEKRLEE 356
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
162-472 |
2.01e-03 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 41.26 E-value: 2.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 162 EEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLTNKHSQELTAEKEKALQTQVQCQQQHEQQKKELE- 240
Cdd:pfam05557 48 DRNQELQKRIRLLEKREAEAEEALREQAELNRLKKKYLEALNKKLNEKESQLADAREVISCLKNELSELRRQIQRAELEl 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 241 --------------TLHQRNIHQLQSRLSELEAANKELTERKYKgdstVRELKAKLAGVE---EELQRAKQEVLSLRREN 303
Cdd:pfam05557 128 qstnseleelqerlDLLKAKASEAEQLRQNLEKQQSSLAEAEQR----IKELEFEIQSQEqdsEIVKNSKSELARIPELE 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 304 CTLDtECHEKEKHINQLQTKVAVLEQEIKDkdqlvLRTK-EAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLK 382
Cdd:pfam05557 204 KELE-RLREHNKHLNENIENKLLLKEEVED-----LKRKlEREEKYREEAATLELEKEKLEQELQSWVKLAQDTGLNLRS 277
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 383 ----ANEIIKKLQGDLkTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLE 458
Cdd:pfam05557 278 pedlSRRIEQLQQREI-VLKEENSSLTSSARQLEKARRELEQELAQYLKKIEDLNKKLKRHKALVRRLQRRVLLLTKERD 356
|
330
....*....|....
gi 170650645 459 ESKQLLKNNEKLIT 472
Cdd:pfam05557 357 GYRAILESYDKELT 370
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
182-464 |
2.21e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 41.35 E-value: 2.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 182 QETLSEKMQELDKLRSEWASHTASLTNKHS-----QELTAEKEKALqtqvqcqqqheqqkkelETLHQRNIHQLQSRLSE 256
Cdd:pfam10174 407 QEQLRDKDKQLAGLKERVKSLQTDSSNTDTalttlEEALSEKERII-----------------ERLKEQREREDRERLEE 469
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 257 LEAANKELterkykgdstvRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLqtkvavlEQEIKDKDQ 336
Cdd:pfam10174 470 LESLKKEN-----------KDLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSL-------EIAVEQKKE 531
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 337 LVLRTKEAFDTIQEQKVALEENGEKNQiqlgkleaTIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVTIQQEKLLA 416
Cdd:pfam10174 532 ECSKLENQLKKAHNAEEAVRTNPEIND--------RIRLLEQEVARYKEESGKAQAEVERLLGILREVENEKNDKDKKIA 603
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 170650645 417 EKEEMLQKERKESQDAGQFLRAKEQEV--------------------CRLQEQLETTVQKLEESKQLL 464
Cdd:pfam10174 604 ELESLTLRQMKEQNKKVANIKHGQQEMkkkgaqlleearrrednladNSQQLQLEELMGALEKTRQEL 671
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
236-487 |
2.21e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 41.16 E-value: 2.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 236 KKELETLHQRNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLR----------RENCT 305
Cdd:TIGR04523 136 NKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLElllsnlkkkiQKNKS 215
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 306 LDTECHEKEKHINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAEL----- 380
Cdd:TIGR04523 216 LESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLnqlks 295
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 381 -------LKANEIIKKLQGDLKTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETT 453
Cdd:TIGR04523 296 eisdlnnQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKL 375
|
250 260 270
....*....|....*....|....*....|....
gi 170650645 454 VQKLEESKQLLKNNEKLITWLNKELNENQLVRKQ 487
Cdd:TIGR04523 376 KKENQSYKQEIKNLESQINDLESKIQNQEKLNQQ 409
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
237-402 |
2.49e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 41.05 E-value: 2.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 237 KELETLHQRnIHQLQSRLSELEAANKELteRKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKH 316
Cdd:COG4913 255 EPIRELAER-YAAARERLAELEYLRAAL--RLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQ 331
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 317 INQ--------LQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQ----LGKLEATIKSLSAELLKAN 384
Cdd:COG4913 332 IRGnggdrleqLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEaaalLEALEEELEALEEALAEAE 411
|
170
....*....|....*...
gi 170650645 385 EIIKKLQGDLKTLMGKLK 402
Cdd:COG4913 412 AALRDLRRELRELEAEIA 429
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
157-481 |
2.61e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 41.16 E-value: 2.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 157 LKCSKEEKLS--LTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLTNKHSQELTAEKEkalqtQVQCQQQHEQ 234
Cdd:TIGR04523 300 LNNQKEQDWNkeLKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRE-----LEEKQNEIEK 374
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 235 QKKELETLHQrNIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRREnctldtecheke 314
Cdd:TIGR04523 375 LKKENQSYKQ-EIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSE------------ 441
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 315 khINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDL 394
Cdd:TIGR04523 442 --IKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKI 519
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 395 KTLMGKLKLKNTVTIQQEKLLAEKEEMLQK--ERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLIT 472
Cdd:TIGR04523 520 SSLKEKIEKLESEKKEKESKISDLEDELNKddFELKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKK 599
|
....*....
gi 170650645 473 WLNKELNEN 481
Cdd:TIGR04523 600 DLIKEIEEK 608
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
219-361 |
2.62e-03 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 41.00 E-value: 2.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 219 EKALQTQVQCQQQHEQQKKELETLH-QRNIHQLQSRLSELEAANKELteRKYkgdstVRELKAKLAGVEEELQRAKQEVL 297
Cdd:COG2433 379 EEALEELIEKELPEEEPEAEREKEHeERELTEEEEEIRRLEEQVERL--EAE-----VEELEAELEEKDERIERLERELS 451
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 170650645 298 SLRREnctLDTEcHEKEKHINQLQTKVAVLEQEIKDKDQlvlRTKEAFDTIQEQK--VALEENGEK 361
Cdd:COG2433 452 EARSE---ERRE-IRKDREISRLDREIERLERELEEERE---RIEELKRKLERLKelWKLEHSGEL 510
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
176-422 |
2.63e-03 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 40.98 E-value: 2.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 176 RQLHITQETLSEKMQELDKLRSEWASHTASLTNKHSQELTAEKEKALQTQVQCQQQHEQQKKELETL--HQRNIHQLQSR 253
Cdd:pfam12128 276 SRQEERQETSAELNQLLRTLDDQWKEKRDELNGELSAADAAVAKDRSELEALEDQHGAFLDADIETAaaDQEQLPSWQSE 355
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 254 LSELEAANKELT------ERKYKG--DSTVRELKAKLAGVEEELQRAKQE-VLSLRRENCTLDTECHEKEKHINQLQTKV 324
Cdd:pfam12128 356 LENLEERLKALTgkhqdvTAKYNRrrSKIKEQNNRDIAGIKDKLAKIREArDRQLAVAEDDLQALESELREQLEAGKLEF 435
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 325 AVLEQEIKDK-DQLVLRTKEAfdTIQEQkvaLEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKL 403
Cdd:pfam12128 436 NEEEYRLKSRlGELKLRLNQA--TATPE---LLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQ 510
|
250
....*....|....*....
gi 170650645 404 KNTVTIQQEKLLAEKEEML 422
Cdd:pfam12128 511 ASRRLEERQSALDELELQL 529
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
246-490 |
2.70e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 40.77 E-value: 2.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 246 NIHQLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQTkva 325
Cdd:TIGR04523 69 KINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLT--- 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 326 vleqEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKAN---EIIKKLQGDLKTLMGKL- 401
Cdd:TIGR04523 146 ----EIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLElllSNLKKKIQKNKSLESQIs 221
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 402 KLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVcRLQEQLETTVQKLEESKQLLKNNEKLITWLNKELNEN 481
Cdd:TIGR04523 222 ELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQN-KIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDL 300
|
....*....
gi 170650645 482 QLVRKQDTL 490
Cdd:TIGR04523 301 NNQKEQDWN 309
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
283-469 |
2.76e-03 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 41.11 E-value: 2.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 283 AGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKN 362
Cdd:pfam02463 162 AAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDL 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 363 QIQLGKLE-ATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQ 441
Cdd:pfam02463 242 LQELLRDEqEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEK 321
|
170 180
....*....|....*....|....*...
gi 170650645 442 EVCRLQEQLETTVQKLEESKQLLKNNEK 469
Cdd:pfam02463 322 EKKKAEKELKKEKEEIEELEKELKELEI 349
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
317-489 |
2.83e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 41.05 E-value: 2.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 317 INQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKN---------QIQLGKLEATIKSL---SAELLKAN 384
Cdd:COG4913 612 LAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSwdeidvasaEREIAELEAELERLdasSDDLAALE 691
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 385 EIIKKLQGDLKTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEvcRLQEQLETTVQKlEESKQLL 464
Cdd:COG4913 692 EQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRA--LLEERFAAALGD-AVERELR 768
|
170 180
....*....|....*....|....*..
gi 170650645 465 KNNEKLITWLNKELN--ENQLVRKQDT 489
Cdd:COG4913 769 ENLEERIDALRARLNraEEELERAMRA 795
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
169-476 |
2.88e-03 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 40.79 E-value: 2.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 169 RSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLT--NKHSQELTAEKEKALQTQVQCQQQHEQQKKELETLHQrN 246
Cdd:PRK02224 272 REREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEavEARREELEDRDEELRDRLEECRVAAQAHNEEAESLRE-D 350
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 247 IHQLQSRL-----------SELEAANKELTERKYKG---DSTVRELKAKLAGVEEELQRAkQEVLSLRRENctlDTECHE 312
Cdd:PRK02224 351 ADDLEERAeelreeaaeleSELEEAREAVEDRREEIeelEEEIEELRERFGDAPVDLGNA-EDFLEELREE---RDELRE 426
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 313 KEKhinQLQTKVAVLEQEIKDKDQLVLRTK-----------EAFDTIQEQKVALEENGEknqiQLGKLEATIKSLSAELL 381
Cdd:PRK02224 427 REA---ELEATLRTARERVEEAEALLEAGKcpecgqpvegsPHVETIEEDRERVEELEA----ELEDLEEEVEEVEERLE 499
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 382 KANEIiKKLQGDLKTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESK 461
Cdd:PRK02224 500 RAEDL-VEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELN 578
|
330
....*....|....*
gi 170650645 462 QLLKNNEKLITWLNK 476
Cdd:PRK02224 579 SKLAELKERIESLER 593
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
249-470 |
3.41e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 39.89 E-value: 3.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 249 QLQSRLSELEAANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDT---ECHEKEKHINQL----Q 321
Cdd:COG1340 47 ELNAQVKELREEAQELREKRDELNEKVKELKEERDELNEKLNELREELDELRKELAELNKaggSIDKLRKEIERLewrqQ 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 322 TKVAVLEQEIkdkdQLVLRTKEAFDTIQEQKVALEENGEKNQI---------QLGKLEATIKSLSAELLKANEIIKKLQG 392
Cdd:COG1340 127 TEVLSPEEEK----ELVEKIKELEKELEKAKKALEKNEKLKELraelkelrkEAEEIHKKIKELAEEAQELHEEMIELYK 202
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 393 DLKTLMGKL-KLKNTVTIQQEKLLAEKEEM--LQKERKESQDAGQFLRAKEQEVcRLQEQLETTVQKLEESKQLLKNNEK 469
Cdd:COG1340 203 EADELRKEAdELHKEIVEAQEKADELHEEIieLQKELRELRKELKKLRKKQRAL-KREKEKEELEEKAEEIFEKLKKGEK 281
|
.
gi 170650645 470 L 470
Cdd:COG1340 282 L 282
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
183-475 |
4.06e-03 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 40.41 E-value: 4.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 183 ETLSEKMQELDKLRSEWAS---HTASLTNKHsqeltaEKEKALQTQVQCQQQHEQQKKELETLhqrnIHQLQSRLSELEA 259
Cdd:PRK02224 468 ETIEEDRERVEELEAELEDleeEVEEVEERL------ERAEDLVEAEDRIERLEERREDLEEL----IAERRETIEEKRE 537
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 260 ANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTEchekekhINQLQTKVAVLEqEIKDKDQLVL 339
Cdd:PRK02224 538 RAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKER-------IESLERIRTLLA-AIADAEDEIE 609
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 340 RTKEAFDTIQEQ----KVALEENGEKNQIQLGKL-EATIKSLSAELLKANEIIKKLQGDLKTLMGKL-KLKNTVTIQQEK 413
Cdd:PRK02224 610 RLREKREALAELnderRERLAEKRERKRELEAEFdEARIEEAREDKERAEEYLEQVEEKLDELREERdDLQAEIGAVENE 689
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 170650645 414 LlaEKEEMLQKERKESQDAGQFLRAKEQEVcrlqEQLETTVQKLE-ESKQllKNNEKLITWLN 475
Cdd:PRK02224 690 L--EELEELRERREALENRVEALEALYDEA----EELESMYGDLRaELRQ--RNVETLERMLN 744
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
190-331 |
4.37e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 39.14 E-value: 4.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 190 QELDKLRSEWASHTASLTNKHSQELTAEKEKALQTQVQCQQQHEQQK-----------KELETLhQRNIHQLQSRLSELE 258
Cdd:COG1579 31 AELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKyeeqlgnvrnnKEYEAL-QKEIESLKRRISDLE 109
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 170650645 259 AANKELTERKYKGDSTVRELKAKLAGVEEELQRAKQEvlslrrenctLDTECHEKEKHINQLQTKVAVLEQEI 331
Cdd:COG1579 110 DEILELMERIEELEEELAELEAELAELEAELEEKKAE----------LDEELAELEAELEELEAEREELAAKI 172
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
157-479 |
4.67e-03 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 40.09 E-value: 4.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 157 LKCSKEEKLSLTRSLDDVTRQLH---ITQETLSEKMQeldklrsewashtasLTNKHSQELTAEKEKALQTQVQCQQQHE 233
Cdd:pfam05483 284 LKELIEKKDHLTKELEDIKMSLQrsmSTQKALEEDLQ---------------IATKTICQLTEEKEAQMEELNKAKAAHS 348
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 234 QQKKELETLH--------------QRNIHQLQSRLSELEAANKELTE-RKYKGDSTVR--ELKAKLAGVEEELQRAKQ-- 294
Cdd:pfam05483 349 FVVTEFEATTcsleellrteqqrlEKNEDQLKIITMELQKKSSELEEmTKFKNNKEVEleELKKILAEDEKLLDEKKQfe 428
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 295 ---EVLSLRRENCTLDTECHEKEKHINQLQTKVAVLEQEIKDKDQLVLRTKeaFDTIQEQKVALEENGEKNQIQLGKLEA 371
Cdd:pfam05483 429 kiaEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTE--LEKEKLKNIELTAHCDKLLLENKELTQ 506
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 372 TIKSLSAELLKANEII-------KKLQGDLKTLMGK-LKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQF-------- 435
Cdd:pfam05483 507 EASDMTLELKKHQEDIinckkqeERMLKQIENLEEKeMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIeyevlkke 586
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 170650645 436 --LRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKELN 479
Cdd:pfam05483 587 kqMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLN 632
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
161-471 |
5.01e-03 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 40.34 E-value: 5.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 161 KEEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLTNKHSQELTAEKEKALQTQVQCQQQHEQQKKE-- 238
Cdd:pfam02463 222 EEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKse 301
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 239 ---LETLHQRNIHQLQSRLSELEAANKELTERKykgdstvrELKAKLAGVEEELQRAKQEVLslrrenctldtechEKEK 315
Cdd:pfam02463 302 llkLERRKVDDEEKLKESEKEKKKAEKELKKEK--------EEIEELEKELKELEIKREAEE--------------EEEE 359
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 316 HINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLK-ANEIIKKLQGDL 394
Cdd:pfam02463 360 ELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKeELEILEEEEESI 439
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 170650645 395 KTLMGKLKLKNTVTIQQEKLLAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLI 471
Cdd:pfam02463 440 ELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALI 516
|
|
| PLN03229 |
PLN03229 |
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional |
189-399 |
5.72e-03 |
|
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
Pssm-ID: 178768 [Multi-domain] Cd Length: 762 Bit Score: 39.84 E-value: 5.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 189 MQELDKLRSEWASHTASLTNKHSQELTAEKEKALQTQVQCQQQheqqKKELETLHQRNIHQLQSRLSELEAANKELTERK 268
Cdd:PLN03229 534 KYKLDMLNEFSRAKALSEKKSKAEKLKAEINKKFKEVMDRPEI----KEKMEALKAEVASSGASSGDELDDDLKEKVEKM 609
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 269 YKgdstvrELKAKLAGVeeeLQRAKQEVLSLRRENCTLDTECHEKEkhinqLQTKVAVLEQEIKDKDQLVLRTKEAFDTI 348
Cdd:PLN03229 610 KK------EIELELAGV---LKSMGLEVIGVTKKNKDTAEQTPPPN-----LQEKIESLNEEINKKIERVIRSSDLKSKI 675
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 349 QEQKVALEENG------EKNQIQlgKLEATIKSLSAELLKANEIIKK---LQGDLKTLMG 399
Cdd:PLN03229 676 ELLKLEVAKASktpdvtEKEKIE--ALEQQIKQKIAEALNSSELKEKfeeLEAELAAARE 733
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
160-469 |
6.06e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 39.72 E-value: 6.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 160 SKEEKLSLTRSLDDVTRQLHITQETLSEKMQELDKLRSEWASHTASLTnkhsqeltaEKEKALQTQVQCQQQHEQ----Q 235
Cdd:pfam15921 459 SLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQ---------EKERAIEATNAEITKLRSrvdlK 529
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 236 KKELETLHQRNIHqLQSRLSELEAANKELTERkykgDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHE--- 312
Cdd:pfam15921 530 LQELQHLKNEGDH-LRNVQTECEALKLQMAEK----DKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDrrl 604
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 313 -----------KEKHINQLQTKVAVLEQE--------------IKD----KDQL---VLRTKEAFDTIQEQKVALEEN-- 358
Cdd:pfam15921 605 elqefkilkdkKDAKIRELEARVSDLELEkvklvnagserlraVKDikqeRDQLlneVKTSRNELNSLSEDYEVLKRNfr 684
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 359 --GEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGD----LKTLMGKLKlKNTVTIQQEKLLAEKEEMLQKERKESQDA 432
Cdd:pfam15921 685 nkSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSdghaMKVAMGMQK-QITAKRGQIDALQSKIQFLEEAMTNANKE 763
|
330 340 350
....*....|....*....|....*....|....*..
gi 170650645 433 GQFLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEK 469
Cdd:pfam15921 764 KHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQER 800
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
147-480 |
6.13e-03 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 40.03 E-value: 6.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 147 VEIKKFLAGCLKCSKEEKLSLTRSLDDVTRQLHITQETLSE-------KMQELDKLRSEWASHTASLTNK-----HSQEL 214
Cdd:TIGR00606 496 TETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQmemltkdKMDKDEQIRKIKSRHSDELTSLlgyfpNKKQL 575
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 215 TAEKEKALQTQVQCQQQHEQQKKELETLHQrNIHQLQSRLSELEAANKELTERKYKGDSTvRELKAKLAGVEEELQRAKQ 294
Cdd:TIGR00606 576 EDWLHSKSKEINQTRDRLAKLNKELASLEQ-NKNHINNELESKEEQLSSYEDKLFDVCGS-QDEESDLERLKEEIEKSSK 653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 295 E--------------VLSLRREN---CTLDTECHEKEKHINQ----LQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKV 353
Cdd:TIGR00606 654 QramlagatavysqfITQLTDENqscCPVCQRVFQTEAELQEfisdLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAP 733
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 354 ALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKN------TVTIQQEKLLAEKEEMLQKERK 427
Cdd:TIGR00606 734 GRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKvcltdvTIMERFQMELKDVERKIAQQAA 813
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 170650645 428 ESQDAGQFLRAKE--QEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKELNE 480
Cdd:TIGR00606 814 KLQGSDLDRTVQQvnQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNE 868
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
237-470 |
6.84e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 39.77 E-value: 6.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 237 KELETLhQRNIHQLQSRLSELEAANKELTerkykgdSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKH 316
Cdd:pfam01576 798 KQLKKL-QAQMKDLQRELEEARASRDEIL-------AQSKESEKKLKNLEAELLQLQEDLAASERARRQAQQERDELADE 869
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 317 INQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATI---KSLSAELLKANEIIKKLQGD 393
Cdd:pfam01576 870 IASGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTELaaeRSTSQKSESARQQLERQNKE 949
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 394 LKTLMGKL------KLKNTVTIQQEKLlAEKEEMLQKERKESQDAGQFLRAKEQEVCRLQEQLETTVQKLEESK-QLLKN 466
Cdd:pfam01576 950 LKAKLQEMegtvksKFKSSIAALEAKI-AQLEEQLEQESRERQAANKLVRRTEKKLKEVLLQVEDERRHADQYKdQAEKG 1028
|
....
gi 170650645 467 NEKL 470
Cdd:pfam01576 1029 NSRM 1032
|
|
| Sas6_CC |
pfam18594 |
Sas6/XLF/XRCC4 coiled-coil domain; This is a coiled-coil domain found at the C-terminal of ... |
146-171 |
7.49e-03 |
|
Sas6/XLF/XRCC4 coiled-coil domain; This is a coiled-coil domain found at the C-terminal of spindle assembly abnormal protein 6 (Sas6). The highly conserved protein SAS-6 constitutes the center of the cartwheel assembly that scaffolds centrioles early in their biogenesis.Structural analysis of Sas6 show that similar to XLF, and XRCC4 it forms a parallel coiled-coil dimer.
Pssm-ID: 408377 [Multi-domain] Cd Length: 30 Bit Score: 34.57 E-value: 7.49e-03
10 20
....*....|....*....|....*.
gi 170650645 146 DVEIKKFLAGCLKCSKEEKLSLTRSL 171
Cdd:pfam18594 1 DSEVKKYLADCLKSLKEEKQLLEQKL 26
|
|
| ClyA_Cry6Aa-like |
cd22656 |
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ... |
286-407 |
8.31e-03 |
|
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.
Pssm-ID: 439154 [Multi-domain] Cd Length: 309 Bit Score: 38.89 E-value: 8.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 286 EEELQRAKQEVLSLrrenctLDTEchekEKHINQLQTKVAVLEQEIKDKDQLVLRTKEAFDTIQEQKVAL--EENGEKNQ 363
Cdd:cd22656 109 DEELEEAKKTIKAL------LDDL----LKEAKKYQDKAAKVVDKLTDFENQTEKDQTALETLEKALKDLltDEGGAIAR 178
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 170650645 364 IQLGKLEATIKSLSAELL-KANEIIKKLQGDLKTLMGKLKLKNTV 407
Cdd:cd22656 179 KEIKDLQKELEKLNEEYAaKLKAKIDELKALIADDEAKLAAALRL 223
|
|
| Lebercilin |
pfam15619 |
Ciliary protein causing Leber congenital amaurosis disease; Lebercilin is a family of ... |
247-422 |
8.99e-03 |
|
Ciliary protein causing Leber congenital amaurosis disease; Lebercilin is a family of eukaryotic ciliary proteins. Mutations in the gene, LCA5, are implicated in the disease Leber congenital amaurosis. In photoreceptors, lebercilin is uniquely localized at the cilium that bridges the inner and outer segments. Lebercilin functions as an integral element of selective protein transport through photoreceptor cilia. Lebercilin specifically interacts with the intraflagellar transport (IFT), and disruption of IFT can lead to Leber congenital amaurosis.
Pssm-ID: 464776 [Multi-domain] Cd Length: 193 Bit Score: 37.96 E-value: 8.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 247 IHQLQSRLSELEAANKELTERKY-------KGDSTVRELKAKLAGVEEELQRAKQEVLSLRRENCTLDTECHEKEKHINQ 319
Cdd:pfam15619 20 LAELQSKLEELRKENRLLKRLQKrqekalgKYEGTESELPQLIARHNEEVRVLRERLRRLQEKERDLERKLKEKEAELLR 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 320 LQTKVAVLEQEIKDKDqlVLRTKEAFDTIQEQKVALEENGEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMG 399
Cdd:pfam15619 100 LRDQLKRLEKLSEDKN--LAEREELQKKLEQLEAKLEDKDEKIQDLERKLELENKSFRRQLAAEKKKHKEAQEEVKILQE 177
|
170 180
....*....|....*....|...
gi 170650645 400 KLKLKNTvtiqqekLLAEKEEML 422
Cdd:pfam15619 178 EIERLQQ-------KLKEKEREL 193
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
359-482 |
9.98e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 39.13 E-value: 9.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 170650645 359 GEKNQIQLGKLEATIKSLSAELLKANEIIKKLQGDLKTLMGKLKLKNTV---------TIQQEKLLAEKEEMLQKERKES 429
Cdd:COG4913 605 GFDNRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLaeyswdeidVASAEREIAELEAELERLDASS 684
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 170650645 430 QDagqfLRAKEQEVCRLQEQLETTVQKLEESKQLLKNNEKLITWLNKELNENQ 482
Cdd:COG4913 685 DD----LAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQ 733
|
|
|