NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907152066|ref|XP_036019193|]
View 

structural maintenance of chromosomes protein 4 isoform X3 [Mus musculus]

Protein Classification

chromosome segregation protein SMC( domain architecture ID 12035156)

chromosome segregation protein SMC is an ATPase required for chromosome condensation and partitioning

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1-1038 1.12e-129

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


:

Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 422.84  E-value: 1.12e-129
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066    1 MMKPKGQTEHDEGMLEYLEDIIGCGRLNEPIKVLCRRVEILNEHRGEKLNRVKMVEKEKDALEGEKNIAIEFLTLENEMF 80
Cdd:pfam02463  149 MMKPERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLY 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   81 KKKNHICQYYIYDLQNRIAEITTQKEKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLE 160
Cdd:pfam02463  229 LDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  161 DVQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRNNSLEKEREKEEKKLKEVMDSLKQETQGLQK 240
Cdd:pfam02463  309 KVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSS 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  241 EKEIQEKELMGFNKSVNEARSKMEVAQSELDIYLSRHNTAVSQLSKAKEALITASETLKERKAAIKDINTKLPQTQQELK 320
Cdd:pfam02463  389 AAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELK 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  321 EKEKELQKLTQEEINLKSLVHDLFQKVEEAKSSLAMNRSRGKVLDAIIQEKKSGRIPGIYGRLGDLGAIDEKYDIAISSC 400
Cdd:pfam02463  469 KSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTA 548
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  401 CHALDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKMTVWAKKMSKIQTPENTPRLfdlvkvkneeirqafyfalrdtl 480
Cdd:pfam02463  549 VIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNL----------------------- 605
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  481 vaNNLDQATRVAYQRDRRWRVVTLQGQIIEQSGTMSGGGSKVMRGRMGSSVIDEISVEEVNKMESQLERHSKQAMQIQEQ 560
Cdd:pfam02463  606 --AQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQE 683
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  561 KVQHEEAVVKLRHSERDMRNTLEKFAASIQGLSEQEEYLCVQIKELEANVLTTAPDRKQQKLLEENVSVFKKEYDAVAEK 640
Cdd:pfam02463  684 KAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKE 763
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  641 AGKVEAEIKRLHNTIIDINNRKLKAQQNKLDTINKQLDECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEIN 720
Cdd:pfam02463  764 EEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELK 843
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  721 DLKTELKNIEDKAEEVINNTKTAETSLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQ 800
Cdd:pfam02463  844 EEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERI 923
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  801 KEISKIKLHPVEDNPVETVAVLSQEELEAIKNPESITNEIALLEAQCREMKPNLGAIAEYKKKEDLYLQRVAELDKITSE 880
Cdd:pfam02463  924 KEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEE 1003
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  881 RDNFRQAYEDLRKQRLNEFMAGFYVITNKLKENYQMLTLGGDAELELVDSLDPFSEGIMFSVRPPKKSWKKIFNLSGGEK 960
Cdd:pfam02463 1004 KKKLIRAIIEETCQRLKEFLELFVSINKGWNKVFFYLELGGSAELRLEDPDDPFSGGIEISARPPGKGVKNLDLLSGGEK 1083
                          970       980       990      1000      1010      1020      1030
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907152066  961 TLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGIYKTYNSTKS 1038
Cdd:pfam02463 1084 TLVALALIFAIQKYKPAPFYLLDEIDAALDDQNVSRVANLLKELSKNAQFIVISLREEMLEKADKLVGVTMVENGVST 1161
 
Name Accession Description Interval E-value
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1-1038 1.12e-129

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 422.84  E-value: 1.12e-129
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066    1 MMKPKGQTEHDEGMLEYLEDIIGCGRLNEPIKVLCRRVEILNEHRGEKLNRVKMVEKEKDALEGEKNIAIEFLTLENEMF 80
Cdd:pfam02463  149 MMKPERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLY 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   81 KKKNHICQYYIYDLQNRIAEITTQKEKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLE 160
Cdd:pfam02463  229 LDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  161 DVQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRNNSLEKEREKEEKKLKEVMDSLKQETQGLQK 240
Cdd:pfam02463  309 KVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSS 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  241 EKEIQEKELMGFNKSVNEARSKMEVAQSELDIYLSRHNTAVSQLSKAKEALITASETLKERKAAIKDINTKLPQTQQELK 320
Cdd:pfam02463  389 AAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELK 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  321 EKEKELQKLTQEEINLKSLVHDLFQKVEEAKSSLAMNRSRGKVLDAIIQEKKSGRIPGIYGRLGDLGAIDEKYDIAISSC 400
Cdd:pfam02463  469 KSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTA 548
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  401 CHALDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKMTVWAKKMSKIQTPENTPRLfdlvkvkneeirqafyfalrdtl 480
Cdd:pfam02463  549 VIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNL----------------------- 605
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  481 vaNNLDQATRVAYQRDRRWRVVTLQGQIIEQSGTMSGGGSKVMRGRMGSSVIDEISVEEVNKMESQLERHSKQAMQIQEQ 560
Cdd:pfam02463  606 --AQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQE 683
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  561 KVQHEEAVVKLRHSERDMRNTLEKFAASIQGLSEQEEYLCVQIKELEANVLTTAPDRKQQKLLEENVSVFKKEYDAVAEK 640
Cdd:pfam02463  684 KAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKE 763
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  641 AGKVEAEIKRLHNTIIDINNRKLKAQQNKLDTINKQLDECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEIN 720
Cdd:pfam02463  764 EEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELK 843
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  721 DLKTELKNIEDKAEEVINNTKTAETSLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQ 800
Cdd:pfam02463  844 EEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERI 923
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  801 KEISKIKLHPVEDNPVETVAVLSQEELEAIKNPESITNEIALLEAQCREMKPNLGAIAEYKKKEDLYLQRVAELDKITSE 880
Cdd:pfam02463  924 KEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEE 1003
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  881 RDNFRQAYEDLRKQRLNEFMAGFYVITNKLKENYQMLTLGGDAELELVDSLDPFSEGIMFSVRPPKKSWKKIFNLSGGEK 960
Cdd:pfam02463 1004 KKKLIRAIIEETCQRLKEFLELFVSINKGWNKVFFYLELGGSAELRLEDPDDPFSGGIEISARPPGKGVKNLDLLSGGEK 1083
                          970       980       990      1000      1010      1020      1030
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907152066  961 TLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGIYKTYNSTKS 1038
Cdd:pfam02463 1084 TLVALALIFAIQKYKPAPFYLLDEIDAALDDQNVSRVANLLKELSKNAQFIVISLREEMLEKADKLVGVTMVENGVST 1161
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
46-1029 2.21e-94

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 325.87  E-value: 2.21e-94
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   46 GEKLNRVKMVEKEK----DALEGEKNIAIEFLTLENEMFKKKNHICQYYIYDLQNRIAEITTQKEKIHEDTKEITEKSNV 121
Cdd:TIGR02169  183 EENIERLDLIIDEKrqqlERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISE 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  122 LSNEMKAKNSAVKDVEKKLNKVTK-FIEQNKEKFTQLDLEDVQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSK 200
Cdd:TIGR02169  263 LEKRLEEIEQLLEELNKKIKDLGEeEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  201 TVINETTTRNNSLEKEREKEEKklkeVMDSLKQETQGLQKEKEIQEKELMGFNKSVNEARSKMEVAQSELDIYLSRHNTA 280
Cdd:TIGR02169  343 REIEEERKRRDKLTEEYAELKE----ELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRL 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  281 VSQLSKAKEAL------ITASETLKERKAA-IKDINTKLPQTQQELKEKEKELQKLTQEEINLKSLVHDLFQKVEEAKSS 353
Cdd:TIGR02169  419 SEELADLNAAIagieakINELEEEKEDKALeIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ 498
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  354 LAMNRSRGKVLDAIIQEKKSGrIPGIYGRLGDLGAIDEKYDIAISSCCHA-LDYIVVDSIDTAQECVNFLKKHNIGIATF 432
Cdd:TIGR02169  499 ARASEERVRGGRAVEEVLKAS-IQGVHGTVAQLGSVGERYATAIEVAAGNrLNNVVVEDDAVAKEAIELLKRRKAGRATF 577
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  433 IGLDKMTVWAKKMSKIQTPENTPRLFDLVKVkNEEIRQAFYFALRDTLVANNLDQATRVAYQrdrrWRVVTLQGQIIEQS 512
Cdd:TIGR02169  578 LPLNKMRDERRDLSILSEDGVIGFAVDLVEF-DPKYEPAFKYVFGDTLVVEDIEAARRLMGK----YRMVTLEGELFEKS 652
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  513 GTMSGGgSKVMRGRMGSSVIDEISVEEVNKMESQLERHSKQAMQ-IQEQKVQHEEAVVKLRHSER---DMRNTLEKFAAS 588
Cdd:TIGR02169  653 GAMTGG-SRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSeLRRIENRLDELSQELSDASRkigEIEKEIEQLEQE 731
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  589 IQGLSEQEEYLCVQIKELEANVltTAPDRKQQKLleenvsvfKKEYDAVAEKAGKVEAEI----KRLHNTIIDINNRKLK 664
Cdd:TIGR02169  732 EEKLKERLEELEEDLSSLEQEI--ENVKSELKEL--------EARIEELEEDLHKLEEALndleARLSHSRIPEIQAELS 801
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  665 AQQNKLDTINKQLDECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNTKTAE 744
Cdd:TIGR02169  802 KLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLE 881
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  745 TSLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQKEISKIKLHPVEDNPVETVAVLSQ 824
Cdd:TIGR02169  882 SRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQ 961
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  825 EELEAIKNPESItneialleaqcremkpNLGAIAEYKKKEDLYLQRVAELDKITSERDNFRQAYEDLRKQRLNEFMAGFY 904
Cdd:TIGR02169  962 RVEEEIRALEPV----------------NMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEAFE 1025
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  905 VITNKLKENYQMLTlGGDAELELVDSLDPFSEGIMFSVRPPKKSWKKIFNLSGGEKTLSSLALVFALHHYKPTPLYFMDE 984
Cdd:TIGR02169 1026 AINENFNEIFAELS-GGTGELILENPDDPFAGGLELSAKPKGKPVQRLEAMSGGEKSLTALSFIFAIQRYKPSPFYAFDE 1104
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*
gi 1907152066  985 IDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGI 1029
Cdd:TIGR02169 1105 VDMFLDGVNVERVAKLIREKAGEAQFIVVSLRSPMIEYADRAIGV 1149
ABC_SMC4_euk cd03274
ATP-binding cassette domain of eukaryotic SMC4 proteins; The structural maintenance of ...
945-1039 1.99e-65

ATP-binding cassette domain of eukaryotic SMC4 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213241 [Multi-domain]  Cd Length: 212  Bit Score: 219.47  E-value: 1.99e-65
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  945 PKKSWKKIFNLSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISD 1024
Cdd:cd03274    118 PKKSWKNISNLSGGEKTLSSLALVFALHHYKPTPLYVMDEIDAALDFRNVSIVANYIKERTKNAQFIVISLRNNMFELAD 197
                           90
                   ....*....|....*
gi 1907152066 1025 RLIGIYKTYNSTKSV 1039
Cdd:cd03274    198 RLVGIYKTNNCTKSV 212
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
47-1029 3.36e-44

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 173.97  E-value: 3.36e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   47 EKLNRVK--MVEKEK--DALEGEKNIAIEFLTLENEMFKKKNHICQYYIYDLQNRIAEITTQKEkihedtkEITEKSNVL 122
Cdd:COG1196    186 ENLERLEdiLGELERqlEPLERQAEKAERYRELKEELKELEAELLLLKLRELEAELEELEAELE-------ELEAELEEL 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  123 SNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLEDVQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTV 202
Cdd:COG1196    259 EAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEE 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  203 INETTTRNNSLEKEREKEEKKLKEV---MDSLKQETQGLQKEKEIQEKELMGFNKSVNEARSKMEVAQSELDIYLSRHNT 279
Cdd:COG1196    339 LEELEEELEEAEEELEEAEAELAEAeeaLLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLER 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  280 AVSQLSKAKEALITASETLKERKAAIKDINTKLPQTQQELKEKEKELQKLTQEEINLKSLVHDLFQKVEEAKSS----LA 355
Cdd:COG1196    419 LEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARllllLE 498
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  356 MNRSRGKVLDAIIQEKKSGRIPGIYGRLGDLGAIDEKYDIAISSCCHA-LDYIVVDSIDTAQECVNFLKKHNIGIATFIG 434
Cdd:COG1196    499 AEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAaLQNIVVEDDEVAAAAIEYLKAAKAGRATFLP 578
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  435 LDKMTVwAKKMSKIQTPENTPRLFDLVKVKNEEiRQAFYFALRDTLVANNLDQATRvayqRDRRWRVVTLQGQIIEQSGT 514
Cdd:COG1196    579 LDKIRA-RAALAAALARGAIGAAVDLVASDLRE-ADARYYVLGDTLLGRTLVAARL----EAALRRAVTLAGRLREVTLE 652
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  515 MSGGGSKVMRGRMGSSVIDEISVEEVNKMESQLERHSKQAMQIQEQKVQHEEAVVKLRHSERDMRNTLEKFAASIQGLSE 594
Cdd:COG1196    653 GEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEA 732
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  595 QEEYLCVQIKELEANVLTTAPdrkqqklleenvsvfkkEYDAVAEKAGKVEAEIKRLhntiidinnrklkaqqnkldtin 674
Cdd:COG1196    733 EREELLEELLEEEELLEEEAL-----------------EELPEPPDLEELERELERL----------------------- 772
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  675 kqldecasaitkaqvaiktadrnlkkaqdsvcrtekeikdtEKEINDLktelkniedkaEEVinntktaetslpeiqkeh 754
Cdd:COG1196    773 -----------------------------------------EREIEAL-----------GPV------------------ 782
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  755 rNL--LQELKVIQEnehalQKDALSiklklEQIDghisehnskikywqkeiskiklhpveDnpvetvavlsqeeleaikn 832
Cdd:COG1196    783 -NLlaIEEYEELEE-----RYDFLS-----EQRE--------------------------D------------------- 806
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  833 pesitneiaLLEAqcremkpnlgaiaeykkKEDLyLQRVAELDKITSERdnFRQAYEDLRKQrlnefmagfyvitnkLKE 912
Cdd:COG1196    807 ---------LEEA-----------------RETL-EEAIEEIDRETRER--FLETFDAVNEN---------------FQE 842
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  913 NYQMLTLGGDAELELVDSLDPFSEGIMFSVRPPKKSWKKIFNLSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFK 992
Cdd:COG1196    843 LFPRLFGGGEAELLLTDPDDPLETGIEIMAQPPGKKLQRLSLLSGGEKALTALALLFAIFRLNPSPFCVLDEVDAPLDDA 922
                          970       980       990
                   ....*....|....*....|....*....|....*..
gi 1907152066  993 NVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGI 1029
Cdd:COG1196    923 NVERFAELLKEMSEDTQFIVITHNKRTMEAADRLYGV 959
SMC_hinge smart00968
SMC proteins Flexible Hinge Domain; This entry represents the hinge region of the SMC ...
377-491 5.00e-32

SMC proteins Flexible Hinge Domain; This entry represents the hinge region of the SMC (Structural Maintenance of Chromosomes) family of proteins. The hinge region is responsible for formation of the DNA interacting dimer. It is also possible that the precise structure of it is an essential determinant of the specificity of the DNA-protein interaction.


Pssm-ID: 214944 [Multi-domain]  Cd Length: 120  Bit Score: 120.80  E-value: 5.00e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   377 PGIYGRLGDLGAIDEKYDIAISSCCHA-LDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKMTV-----WAKKMSKIQT 450
Cdd:smart00968    1 PGVLGRVADLISVDPKYETALEAALGGrLQAVVVDTEETAKKAIEFLKKNRLGRATFLPLDKIKPrspagSKLREALLPE 80
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|.
gi 1907152066   451 PENTPRLFDLVKVKnEEIRQAFYFALRDTLVANNLDQATRV 491
Cdd:smart00968   81 PGFVGPAIDLVEYD-PELRPALEYLLGNTLVVDDLETARRL 120
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
25-352 2.49e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 58.15  E-value: 2.49e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   25 GRLNEPIKVLCRRVEILNEHR---GEKLNRVKMVEKEKDALEGEKNIAIEFLTLENEMFKKKNHICQYYIYDLQNRIAEI 101
Cdd:PRK03918   317 SRLEEEINGIEERIKELEEKEerlEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEEL 396
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  102 TTQKEKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKKlnkvtkfieQNKEKFTQLDLEDVQVREKLKHATSKAKKLEKQ 181
Cdd:PRK03918   397 EKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKA---------KGKCPVCGRELTEEHRKELLEEYTAELKRIEKE 467
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  182 LQKDKEKVEELKSVPAKSKTVINETTTrnnslekerEKEEKKLKEVMDSLKQETQGLQKEK-EIQEKELMGFNKSVNEAR 260
Cdd:PRK03918   468 LKEIEEKERKLRKELRELEKVLKKESE---------LIKLKELAEQLKELEEKLKKYNLEElEKKAEEYEKLKEKLIKLK 538
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  261 SKMEVAQSELDiylsrhntAVSQLSKAKEALITASETLKERKAAIKDINTKLP-QTQQELKEKEKELQKLTQEEINLKSL 339
Cdd:PRK03918   539 GEIKSLKKELE--------KLEELKKKLAELEKKLDELEEELAELLKELEELGfESVEELEERLKELEPFYNEYLELKDA 610
                          330
                   ....*....|...
gi 1907152066  340 VHDLFQKVEEAKS 352
Cdd:PRK03918   611 EKELEREEKELKK 623
MARTX_Nterm NF012221
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ...
444-838 3.66e-04

MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.


Pssm-ID: 467957 [Multi-domain]  Cd Length: 1848  Bit Score: 44.83  E-value: 3.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  444 KMSKIQTPENTPRLFDLVKVK-NEEIRQAFYFALRDTLVANN----LDQATRV--AYQRDRRWRVVTLQGQIIEQSGTMS 516
Cdd:NF012221  1438 RVSELDTYTNTSLYQDLSNLTaGEVIALSFDFARRAGLSTNNgievLWNGEVVfaSSGDASAWQQKTLKLTAKAGSNRLE 1517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  517 GGGSKVMRGrMGSSVIDEISVEEVNKMESQLERHSKQAMQIQEQKVQHEEAVVKLRHSERDMRNTLEKFAASiqglseqe 596
Cdd:NF012221  1518 FKGTGHNDG-LGYILDNVVATSESSQQADAVSKHAKQDDAAQNALADKERAEADRQRLEQEKQQQLAAISGS-------- 1588
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  597 eylcvQiKELEANvlttapdrkQQKLLEENVSVfkkEYDAVAEKAGKVEAEIKRLHNTIIDINNRKLKAQQNK------- 669
Cdd:NF012221  1589 -----Q-SQLEST---------DQNALETNGQA---QRDAILEESRAVTKELTTLAQGLDALDSQATYAGESGdqwrnpf 1650
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  670 ----LDTINKQLDECASAITKAQVAIKTA-DRNLKKAQDSVC-------RTEKEIKDTEKEINDLKTELKNIEDKAEEVI 737
Cdd:NF012221  1651 agglLDRVQEQLDDAKKISGKQLADAKQRhVDNQQKVKDAVAkseagvaQGEQNQANAEQDIDDAKADAEKRKDDALAKQ 1730
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  738 NNTKTAET-SLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIK---YWQKEISKIKLHPVED 813
Cdd:NF012221  1731 NEAQQAESdANAAANDAQSRGEQDASAAENKANQAQADAKGAKQDESDKPNRQGAAGSGLSgkaYSVEGVAEPGSHINPD 1810
                          410       420
                   ....*....|....*....|....*....
gi 1907152066  814 NPVET----VAVLSQEELEAIKNPESITN 838
Cdd:NF012221  1811 SPAAAdgrfSEGLTEQEQEALEGATNAVN 1839
 
Name Accession Description Interval E-value
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1-1038 1.12e-129

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 422.84  E-value: 1.12e-129
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066    1 MMKPKGQTEHDEGMLEYLEDIIGCGRLNEPIKVLCRRVEILNEHRGEKLNRVKMVEKEKDALEGEKNIAIEFLTLENEMF 80
Cdd:pfam02463  149 MMKPERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLY 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   81 KKKNHICQYYIYDLQNRIAEITTQKEKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLE 160
Cdd:pfam02463  229 LDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  161 DVQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRNNSLEKEREKEEKKLKEVMDSLKQETQGLQK 240
Cdd:pfam02463  309 KVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSS 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  241 EKEIQEKELMGFNKSVNEARSKMEVAQSELDIYLSRHNTAVSQLSKAKEALITASETLKERKAAIKDINTKLPQTQQELK 320
Cdd:pfam02463  389 AAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELK 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  321 EKEKELQKLTQEEINLKSLVHDLFQKVEEAKSSLAMNRSRGKVLDAIIQEKKSGRIPGIYGRLGDLGAIDEKYDIAISSC 400
Cdd:pfam02463  469 KSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTA 548
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  401 CHALDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKMTVWAKKMSKIQTPENTPRLfdlvkvkneeirqafyfalrdtl 480
Cdd:pfam02463  549 VIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNL----------------------- 605
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  481 vaNNLDQATRVAYQRDRRWRVVTLQGQIIEQSGTMSGGGSKVMRGRMGSSVIDEISVEEVNKMESQLERHSKQAMQIQEQ 560
Cdd:pfam02463  606 --AQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQE 683
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  561 KVQHEEAVVKLRHSERDMRNTLEKFAASIQGLSEQEEYLCVQIKELEANVLTTAPDRKQQKLLEENVSVFKKEYDAVAEK 640
Cdd:pfam02463  684 KAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKE 763
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  641 AGKVEAEIKRLHNTIIDINNRKLKAQQNKLDTINKQLDECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEIN 720
Cdd:pfam02463  764 EEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELK 843
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  721 DLKTELKNIEDKAEEVINNTKTAETSLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQ 800
Cdd:pfam02463  844 EEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERI 923
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  801 KEISKIKLHPVEDNPVETVAVLSQEELEAIKNPESITNEIALLEAQCREMKPNLGAIAEYKKKEDLYLQRVAELDKITSE 880
Cdd:pfam02463  924 KEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEE 1003
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  881 RDNFRQAYEDLRKQRLNEFMAGFYVITNKLKENYQMLTLGGDAELELVDSLDPFSEGIMFSVRPPKKSWKKIFNLSGGEK 960
Cdd:pfam02463 1004 KKKLIRAIIEETCQRLKEFLELFVSINKGWNKVFFYLELGGSAELRLEDPDDPFSGGIEISARPPGKGVKNLDLLSGGEK 1083
                          970       980       990      1000      1010      1020      1030
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907152066  961 TLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGIYKTYNSTKS 1038
Cdd:pfam02463 1084 TLVALALIFAIQKYKPAPFYLLDEIDAALDDQNVSRVANLLKELSKNAQFIVISLREEMLEKADKLVGVTMVENGVST 1161
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
46-1029 2.21e-94

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 325.87  E-value: 2.21e-94
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   46 GEKLNRVKMVEKEK----DALEGEKNIAIEFLTLENEMFKKKNHICQYYIYDLQNRIAEITTQKEKIHEDTKEITEKSNV 121
Cdd:TIGR02169  183 EENIERLDLIIDEKrqqlERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISE 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  122 LSNEMKAKNSAVKDVEKKLNKVTK-FIEQNKEKFTQLDLEDVQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSK 200
Cdd:TIGR02169  263 LEKRLEEIEQLLEELNKKIKDLGEeEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  201 TVINETTTRNNSLEKEREKEEKklkeVMDSLKQETQGLQKEKEIQEKELMGFNKSVNEARSKMEVAQSELDIYLSRHNTA 280
Cdd:TIGR02169  343 REIEEERKRRDKLTEEYAELKE----ELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRL 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  281 VSQLSKAKEAL------ITASETLKERKAA-IKDINTKLPQTQQELKEKEKELQKLTQEEINLKSLVHDLFQKVEEAKSS 353
Cdd:TIGR02169  419 SEELADLNAAIagieakINELEEEKEDKALeIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ 498
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  354 LAMNRSRGKVLDAIIQEKKSGrIPGIYGRLGDLGAIDEKYDIAISSCCHA-LDYIVVDSIDTAQECVNFLKKHNIGIATF 432
Cdd:TIGR02169  499 ARASEERVRGGRAVEEVLKAS-IQGVHGTVAQLGSVGERYATAIEVAAGNrLNNVVVEDDAVAKEAIELLKRRKAGRATF 577
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  433 IGLDKMTVWAKKMSKIQTPENTPRLFDLVKVkNEEIRQAFYFALRDTLVANNLDQATRVAYQrdrrWRVVTLQGQIIEQS 512
Cdd:TIGR02169  578 LPLNKMRDERRDLSILSEDGVIGFAVDLVEF-DPKYEPAFKYVFGDTLVVEDIEAARRLMGK----YRMVTLEGELFEKS 652
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  513 GTMSGGgSKVMRGRMGSSVIDEISVEEVNKMESQLERHSKQAMQ-IQEQKVQHEEAVVKLRHSER---DMRNTLEKFAAS 588
Cdd:TIGR02169  653 GAMTGG-SRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSeLRRIENRLDELSQELSDASRkigEIEKEIEQLEQE 731
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  589 IQGLSEQEEYLCVQIKELEANVltTAPDRKQQKLleenvsvfKKEYDAVAEKAGKVEAEI----KRLHNTIIDINNRKLK 664
Cdd:TIGR02169  732 EEKLKERLEELEEDLSSLEQEI--ENVKSELKEL--------EARIEELEEDLHKLEEALndleARLSHSRIPEIQAELS 801
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  665 AQQNKLDTINKQLDECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNTKTAE 744
Cdd:TIGR02169  802 KLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLE 881
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  745 TSLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQKEISKIKLHPVEDNPVETVAVLSQ 824
Cdd:TIGR02169  882 SRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQ 961
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  825 EELEAIKNPESItneialleaqcremkpNLGAIAEYKKKEDLYLQRVAELDKITSERDNFRQAYEDLRKQRLNEFMAGFY 904
Cdd:TIGR02169  962 RVEEEIRALEPV----------------NMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEAFE 1025
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  905 VITNKLKENYQMLTlGGDAELELVDSLDPFSEGIMFSVRPPKKSWKKIFNLSGGEKTLSSLALVFALHHYKPTPLYFMDE 984
Cdd:TIGR02169 1026 AINENFNEIFAELS-GGTGELILENPDDPFAGGLELSAKPKGKPVQRLEAMSGGEKSLTALSFIFAIQRYKPSPFYAFDE 1104
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*
gi 1907152066  985 IDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGI 1029
Cdd:TIGR02169 1105 VDMFLDGVNVERVAKLIREKAGEAQFIVVSLRSPMIEYADRAIGV 1149
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
55-1029 1.02e-81

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 289.65  E-value: 1.02e-81
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   55 VEKEKDALEGEKNIAIEFLTLENEMFKKKNHICQYYIYDLQNRIAEITTQKEKIHEDTKEITEKSNVLSNEMKAKNSAVK 134
Cdd:TIGR02168  198 LERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVS 277
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  135 DVEKKLNKVTKFIEQNKEKFTQLDLEDVQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRNNSLE 214
Cdd:TIGR02168  278 ELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLE 357
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  215 KErekeekklkevMDSLKQETQGLQKEKEIQEKELMGFNKSVNEARSKMEVAQSELDIYLSRHNTAVSQLSKAKEALITA 294
Cdd:TIGR02168  358 AE-----------LEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEEL 426
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  295 SETLKErkAAIKDINTKLPQTQQELKEKEKELQKLTQEEINLKSLVHDLFQKVEEAKSSLAMNRSRGKVLDAIIQE---- 370
Cdd:TIGR02168  427 LKKLEE--AELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENlegf 504
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  371 --------KKSGRIPGIYGRLGDLGAIDEKYDIAISSCCHA-LDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKMTVW 441
Cdd:TIGR02168  505 segvkallKNQSGLSGILGVLSELISVDEGYEAAIEAALGGrLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGT 584
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  442 AKKMSKIQTPENTPR----LFDLVKVKnEEIRQAFYFALRDTLVANNLDQATRVAYQRDRRWRVVTLQGQIIEQSGTMSG 517
Cdd:TIGR02168  585 EIQGNDREILKNIEGflgvAKDLVKFD-PKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITG 663
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  518 GGSKvmrgrmGSSVIDEISVEevnkmesqLERHSKQAMQIQEQKVQHEEAVVKLRHSERDMRNTLEKFAASIQGLSEQEE 597
Cdd:TIGR02168  664 GSAK------TNSSILERRRE--------IEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS 729
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  598 YLCVQIKELEANVLTTAPDRKQQKLLEENVSVFKKEYDAVAEKAGKVEAEIKRlhntiidinnrKLKAQQNKLDTINKQL 677
Cdd:TIGR02168  730 ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA-----------EIEELEAQIEQLKEEL 798
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  678 DECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNTKTAETSLPEIQKEHRNL 757
Cdd:TIGR02168  799 KALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEAL 878
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  758 LQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQKEISKIKLHPVE-----DNPVETVAVLSQEELEAIKN 832
Cdd:TIGR02168  879 LNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGlevriDNLQERLSEEYSLTLEEAEA 958
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  833 --------PESITNEIALLEAQCREMKP-NLGAIAEYKKKEDLYLQRVAELDKITSERDNFRQAYEDLRKQRLNEFMAGF 903
Cdd:TIGR02168  959 lenkieddEEEARRRLKRLENKIKELGPvNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREARERFKDTF 1038
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  904 YVITNKLKENYQMLTLGGDAELELVDSLDPFSEGIMFSVRPPKKSWKKIFNLSGGEKTLSSLALVFALHHYKPTPLYFMD 983
Cdd:TIGR02168 1039 DQVNENFQRVFPKLFGGGEAELRLTDPEDLLEAGIEIFAQPPGKKNQNLSLLSGGEKALTALALLFAIFKVKPAPFCILD 1118
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*.
gi 1907152066  984 EIDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGI 1029
Cdd:TIGR02168 1119 EVDAPLDDANVERFANLLKEFSKNTQFIVITHNKGTMEVADQLYGV 1164
ABC_SMC4_euk cd03274
ATP-binding cassette domain of eukaryotic SMC4 proteins; The structural maintenance of ...
945-1039 1.99e-65

ATP-binding cassette domain of eukaryotic SMC4 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213241 [Multi-domain]  Cd Length: 212  Bit Score: 219.47  E-value: 1.99e-65
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  945 PKKSWKKIFNLSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISD 1024
Cdd:cd03274    118 PKKSWKNISNLSGGEKTLSSLALVFALHHYKPTPLYVMDEIDAALDFRNVSIVANYIKERTKNAQFIVISLRNNMFELAD 197
                           90
                   ....*....|....*
gi 1907152066 1025 RLIGIYKTYNSTKSV 1039
Cdd:cd03274    198 RLVGIYKTNNCTKSV 212
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
47-1029 3.36e-44

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 173.97  E-value: 3.36e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   47 EKLNRVK--MVEKEK--DALEGEKNIAIEFLTLENEMFKKKNHICQYYIYDLQNRIAEITTQKEkihedtkEITEKSNVL 122
Cdd:COG1196    186 ENLERLEdiLGELERqlEPLERQAEKAERYRELKEELKELEAELLLLKLRELEAELEELEAELE-------ELEAELEEL 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  123 SNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLEDVQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTV 202
Cdd:COG1196    259 EAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEE 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  203 INETTTRNNSLEKEREKEEKKLKEV---MDSLKQETQGLQKEKEIQEKELMGFNKSVNEARSKMEVAQSELDIYLSRHNT 279
Cdd:COG1196    339 LEELEEELEEAEEELEEAEAELAEAeeaLLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLER 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  280 AVSQLSKAKEALITASETLKERKAAIKDINTKLPQTQQELKEKEKELQKLTQEEINLKSLVHDLFQKVEEAKSS----LA 355
Cdd:COG1196    419 LEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARllllLE 498
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  356 MNRSRGKVLDAIIQEKKSGRIPGIYGRLGDLGAIDEKYDIAISSCCHA-LDYIVVDSIDTAQECVNFLKKHNIGIATFIG 434
Cdd:COG1196    499 AEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAaLQNIVVEDDEVAAAAIEYLKAAKAGRATFLP 578
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  435 LDKMTVwAKKMSKIQTPENTPRLFDLVKVKNEEiRQAFYFALRDTLVANNLDQATRvayqRDRRWRVVTLQGQIIEQSGT 514
Cdd:COG1196    579 LDKIRA-RAALAAALARGAIGAAVDLVASDLRE-ADARYYVLGDTLLGRTLVAARL----EAALRRAVTLAGRLREVTLE 652
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  515 MSGGGSKVMRGRMGSSVIDEISVEEVNKMESQLERHSKQAMQIQEQKVQHEEAVVKLRHSERDMRNTLEKFAASIQGLSE 594
Cdd:COG1196    653 GEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEA 732
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  595 QEEYLCVQIKELEANVLTTAPdrkqqklleenvsvfkkEYDAVAEKAGKVEAEIKRLhntiidinnrklkaqqnkldtin 674
Cdd:COG1196    733 EREELLEELLEEEELLEEEAL-----------------EELPEPPDLEELERELERL----------------------- 772
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  675 kqldecasaitkaqvaiktadrnlkkaqdsvcrtekeikdtEKEINDLktelkniedkaEEVinntktaetslpeiqkeh 754
Cdd:COG1196    773 -----------------------------------------EREIEAL-----------GPV------------------ 782
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  755 rNL--LQELKVIQEnehalQKDALSiklklEQIDghisehnskikywqkeiskiklhpveDnpvetvavlsqeeleaikn 832
Cdd:COG1196    783 -NLlaIEEYEELEE-----RYDFLS-----EQRE--------------------------D------------------- 806
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  833 pesitneiaLLEAqcremkpnlgaiaeykkKEDLyLQRVAELDKITSERdnFRQAYEDLRKQrlnefmagfyvitnkLKE 912
Cdd:COG1196    807 ---------LEEA-----------------RETL-EEAIEEIDRETRER--FLETFDAVNEN---------------FQE 842
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  913 NYQMLTLGGDAELELVDSLDPFSEGIMFSVRPPKKSWKKIFNLSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFK 992
Cdd:COG1196    843 LFPRLFGGGEAELLLTDPDDPLETGIEIMAQPPGKKLQRLSLLSGGEKALTALALLFAIFRLNPSPFCVLDEVDAPLDDA 922
                          970       980       990
                   ....*....|....*....|....*....|....*..
gi 1907152066  993 NVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGI 1029
Cdd:COG1196    923 NVERFAELLKEMSEDTQFIVITHNKRTMEAADRLYGV 959
ABC_SMC_head cd03239
The SMC head domain belongs to the ATP-binding cassette superfamily; The structural ...
955-1037 1.01e-32

The SMC head domain belongs to the ATP-binding cassette superfamily; The structural maintenance of chromosomes (SMC) proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and each has five distinct domains: amino- and carboxy-terminal globular domains, which contain sequences characteristic of ATPases, two coiled-coil regions separating the terminal domains , and a central flexible hinge. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair, and epigenetic silencing of gene expression.


Pssm-ID: 213206 [Multi-domain]  Cd Length: 178  Bit Score: 125.11  E-value: 1.01e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  955 LSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTKN-AQFIIISLRNNMFEISDRLIGIYKTY 1033
Cdd:cd03239     95 LSGGEKSLSALALIFALQEIKPSPFYVLDEIDAALDPTNRRRVSDMIKEMAKHtSQFIVITLKKEMFENADKLIGVLFVH 174

                   ....
gi 1907152066 1034 NSTK 1037
Cdd:cd03239    175 GVST 178
SMC_hinge smart00968
SMC proteins Flexible Hinge Domain; This entry represents the hinge region of the SMC ...
377-491 5.00e-32

SMC proteins Flexible Hinge Domain; This entry represents the hinge region of the SMC (Structural Maintenance of Chromosomes) family of proteins. The hinge region is responsible for formation of the DNA interacting dimer. It is also possible that the precise structure of it is an essential determinant of the specificity of the DNA-protein interaction.


Pssm-ID: 214944 [Multi-domain]  Cd Length: 120  Bit Score: 120.80  E-value: 5.00e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   377 PGIYGRLGDLGAIDEKYDIAISSCCHA-LDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKMTV-----WAKKMSKIQT 450
Cdd:smart00968    1 PGVLGRVADLISVDPKYETALEAALGGrLQAVVVDTEETAKKAIEFLKKNRLGRATFLPLDKIKPrspagSKLREALLPE 80
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|.
gi 1907152066   451 PENTPRLFDLVKVKnEEIRQAFYFALRDTLVANNLDQATRV 491
Cdd:smart00968   81 PGFVGPAIDLVEYD-PELRPALEYLLGNTLVVDDLETARRL 120
ABC_SMC1_euk cd03275
ATP-binding cassette domain of eukaryotic SMC1 proteins; The structural maintenance of ...
944-1031 1.03e-29

ATP-binding cassette domain of eukaryotic SMC1 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213242 [Multi-domain]  Cd Length: 247  Bit Score: 118.83  E-value: 1.03e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  944 PPKKSWKKIFNLSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQT-KNAQFIIISLRNNMFEI 1022
Cdd:cd03275    145 PPGKRFRDMDNLSGGEKTMAALALLFAIHSYQPAPFFVLDEVDAALDNTNVGKVASYIREQAgPNFQFIVISLKEEFFSK 224

                   ....*....
gi 1907152066 1023 SDRLIGIYK 1031
Cdd:cd03275    225 ADALVGVYR 233
SMC_hinge pfam06470
SMC proteins Flexible Hinge Domain; This family represents the hinge region of the SMC ...
376-492 4.17e-28

SMC proteins Flexible Hinge Domain; This family represents the hinge region of the SMC (Structural Maintenance of Chromosomes) family of proteins. The hinge region is responsible for formation of the DNA interacting dimer. It is also possible that the precise structure of it is an essential determinant of the specificity of the DNA-protein interaction.


Pssm-ID: 461926 [Multi-domain]  Cd Length: 116  Bit Score: 109.66  E-value: 4.17e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  376 IPGIYGRLGDLGAIDEKYDIAISSCC-HALDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKMTVWAKKMSKIQTpENT 454
Cdd:pfam06470    1 LKGVLGRLADLIEVDEGYEKAVEAALgGRLQAVVVDDEDDAKRAIEFLKKNKLGRATFLPLDRLKPRPRRPGADLK-GGA 79
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 1907152066  455 PRLFDLVKVKnEEIRQAFYFALRDTLVANNLDQATRVA 492
Cdd:pfam06470   80 GPLLDLVEYD-DEYRKALRYLLGNTLVVDDLDEALELA 116
ABC_SMC_barmotin cd03278
ATP-binding cassette domain of barmotin, a member of the SMC protein family; Barmotin is a ...
950-1029 8.98e-28

ATP-binding cassette domain of barmotin, a member of the SMC protein family; Barmotin is a tight junction-associated protein expressed in rat epithelial cells which is thought to have an important regulatory role in tight junction barrier function. Barmotin belongs to the SMC protein family. SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213245 [Multi-domain]  Cd Length: 197  Bit Score: 111.40  E-value: 8.98e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  950 KKIFN---LSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRL 1026
Cdd:cd03278    106 KKVQRlslLSGGEKALTALALLFAIFRVRPSPFCVLDEVDAALDDANVERFARLLKEFSKETQFIVITHRKGTMEAADRL 185

                   ...
gi 1907152066 1027 IGI 1029
Cdd:cd03278    186 YGV 188
ABC_Class2 cd03227
ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems ...
952-1032 9.15e-24

ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems involved in cellular processes other than transport. These families are characterized by the fact that the ABC subunit is made up of duplicated, fused ABC modules (ABC2). No known transmembrane proteins or domains are associated with these proteins.


Pssm-ID: 213194 [Multi-domain]  Cd Length: 162  Bit Score: 98.97  E-value: 9.15e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  952 IFNLSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQT-KNAQFIIISLRNNMFEISDRLIGIY 1030
Cdd:cd03227     75 RLQLSGGEKELSALALILALASLKPRPLYILDEIDRGLDPRDGQALAEAILEHLvKGAQVIVITHLPELAELADKLIHIK 154

                   ..
gi 1907152066 1031 KT 1032
Cdd:cd03227    155 KV 156
ABC_SMC3_euk cd03272
ATP-binding cassette domain of eukaryotic SMC3 proteins; The structural maintenance of ...
950-1029 1.09e-16

ATP-binding cassette domain of eukaryotic SMC3 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213239 [Multi-domain]  Cd Length: 243  Bit Score: 80.77  E-value: 1.09e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  950 KKIFNLSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGI 1029
Cdd:cd03272    154 QEMQQLSGGQKSLVALALIFAIQKCDPAPFYLFDEIDAALDAQYRTAVANMIKELSDGAQFITTTFRPELLEVADKFYGV 233
ABC_SMC2_euk cd03273
ATP-binding cassette domain of eukaryotic SMC2 proteins; The structural maintenance of ...
949-1034 1.68e-16

ATP-binding cassette domain of eukaryotic SMC2 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213240 [Multi-domain]  Cd Length: 251  Bit Score: 80.42  E-value: 1.68e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  949 WKK-IFNLSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTKNAQFIIISLRNNMFEISDRLi 1027
Cdd:cd03273    160 WKEsLTELSGGQRSLVALSLILALLLFKPAPMYILDEVDAALDLSHTQNIGRMIKTHFKGSQFIVVSLKEGMFNNANVL- 238

                   ....*..
gi 1907152066 1028 giYKTYN 1034
Cdd:cd03273    239 --FRTRF 243
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
94-353 1.82e-09

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 61.96  E-value: 1.82e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   94 LQNRIAEITTQKEKIHEDTKEITEKSNVLSNEMKAK--------NSAVKDVEKKLNKVTKFIEQNKEKFTQLDLEDVQVR 165
Cdd:TIGR04523  269 LSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKeqdwnkelKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLK 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  166 EKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRNNSLEKEREKEEKKLKEvmdsLKQETQGLQKEKEIQ 245
Cdd:TIGR04523  349 KELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQ----KDEQIKKLQQEKELL 424
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  246 EKE---LMGFNKSVNEARSKMEVAQSELDIYLSRHNTAVSQLSKakealitaseTLKERKAAIKDINTKLPQTQQELKEK 322
Cdd:TIGR04523  425 EKEierLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLET----------QLKVLSRSINKIKQNLEQKQKELKSK 494
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1907152066  323 EKELQKLTQEEINLKSLVHDLFQKVEEAKSS 353
Cdd:TIGR04523  495 EKELKKLNEEKKELEEKVKDLTKKISSLKEK 525
ABC_ATPase cd00267
ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large ...
952-1031 2.33e-08

ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.


Pssm-ID: 213179 [Multi-domain]  Cd Length: 157  Bit Score: 54.17  E-value: 2.33e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  952 IFNLSGGEKTLSSLALVFALhhyKPtPLYFMDEIDAALDFKNVSIVAFYIYEQT-KNAQFIIISLRNNMFE-ISDRLIGI 1029
Cdd:cd00267     78 VPQLSGGQRQRVALARALLL---NP-DLLLLDEPTSGLDPASRERLLELLRELAeEGRTVIIVTHDPELAElAADRVIVL 153

                   ..
gi 1907152066 1030 YK 1031
Cdd:cd00267    154 KD 155
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
25-352 2.49e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 58.15  E-value: 2.49e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   25 GRLNEPIKVLCRRVEILNEHR---GEKLNRVKMVEKEKDALEGEKNIAIEFLTLENEMFKKKNHICQYYIYDLQNRIAEI 101
Cdd:PRK03918   317 SRLEEEINGIEERIKELEEKEerlEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEEL 396
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  102 TTQKEKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKKlnkvtkfieQNKEKFTQLDLEDVQVREKLKHATSKAKKLEKQ 181
Cdd:PRK03918   397 EKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKA---------KGKCPVCGRELTEEHRKELLEEYTAELKRIEKE 467
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  182 LQKDKEKVEELKSVPAKSKTVINETTTrnnslekerEKEEKKLKEVMDSLKQETQGLQKEK-EIQEKELMGFNKSVNEAR 260
Cdd:PRK03918   468 LKEIEEKERKLRKELRELEKVLKKESE---------LIKLKELAEQLKELEEKLKKYNLEElEKKAEEYEKLKEKLIKLK 538
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  261 SKMEVAQSELDiylsrhntAVSQLSKAKEALITASETLKERKAAIKDINTKLP-QTQQELKEKEKELQKLTQEEINLKSL 339
Cdd:PRK03918   539 GEIKSLKKELE--------KLEELKKKLAELEKKLDELEEELAELLKELEELGfESVEELEERLKELEPFYNEYLELKDA 610
                          330
                   ....*....|...
gi 1907152066  340 VHDLFQKVEEAKS 352
Cdd:PRK03918   611 EKELEREEKELKK 623
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
96-374 8.03e-07

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 53.10  E-value: 8.03e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   96 NRIAE--ITTQKEKIHEDTKEITEksnVLSNEMKAKNSAVKDVEKKLNKvtkFIEQNKekFTQLDLEDVQVREKLKHATS 173
Cdd:COG3206    155 NALAEayLEQNLELRREEARKALE---FLEEQLPELRKELEEAEAALEE---FRQKNG--LVDLSEEAKLLLQQLSELES 226
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  174 KAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTrnnslekerekeekklKEVMDSLKQETQGLQKEKEIQEKELMGFN 253
Cdd:COG3206    227 QLAEARAELAEAEARLAALRAQLGSGPDALPELLQ----------------SPVIQQLRAQLAELEAELAELSARYTPNH 290
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  254 KSVNEARSKMEVAQSELDiylSRHNTAVSQLSKAKEALITASETLKERKAAIKDINTKLPQTQQELKEKEKELQkltqee 333
Cdd:COG3206    291 PDVIALRAQIAALRAQLQ---QEAQRILASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLEREVE------ 361
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 1907152066  334 iNLKSLVHDLFQKVEEAKSSLAMNRSRGKVLD-AIIQEKKSG 374
Cdd:COG3206    362 -VARELYESLLQRLEEARLAEALTVGNVRVIDpAVVPLKPVS 402
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
537-851 3.89e-06

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 51.27  E-value: 3.89e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  537 VEEVNKMESQLErHSKQAMQIQEQKVQHEEAVVKLRHSE-RDMRNTLEKFAASIQGLSEQEEYLCVQIKELEANVLTTAP 615
Cdd:pfam15921  481 VEELTAKKMTLE-SSERTVSDLTASLQEKERAIEATNAEiTKLRSRVDLKLQELQHLKNEGDHLRNVQTECEALKLQMAE 559
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  616 DRKQQKLLEENVSVFKKEYDAVAEKAGKVEAEIKRLHNtiiDINNRKLKAQQ-----NKLDTINKQLDECASAITKAQVA 690
Cdd:pfam15921  560 KDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEK---EINDRRLELQEfkilkDKKDAKIRELEARVSDLELEKVK 636
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  691 IKTAD-------RNLKKAQDSVCrteKEIKDTEKEINDLKTEL----KNIEDKAEEVINNTKTAETSLPEIQKEHRNLLQ 759
Cdd:pfam15921  637 LVNAGserlravKDIKQERDQLL---NEVKTSRNELNSLSEDYevlkRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRN 713
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  760 ELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQKEISKI--KLHPVEdnpvETVAVLSQEELEAIKNPESIT 837
Cdd:pfam15921  714 TLKSMEGSDGHAMKVAMGMQKQITAKRGQIDALQSKIQFLEEAMTNAnkEKHFLK----EEKNKLSQELSTVATEKNKMA 789
                          330
                   ....*....|....
gi 1907152066  838 NEIALLEAQCREMK 851
Cdd:pfam15921  790 GELEVLRSQERRLK 803
ABC_SMC6_euk cd03276
ATP-binding cassette domain of eukaryotic SM6 proteins; The structural maintenance of ...
955-1031 1.46e-05

ATP-binding cassette domain of eukaryotic SM6 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213243 [Multi-domain]  Cd Length: 198  Bit Score: 46.82  E-value: 1.46e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  955 LSGGEKTLSSLALVFALHHYKPTPLYFMDEIDAALDFKNVSIVAFYIYEQTK---NAQFIIISLRNNMFEISDRLIGIYK 1031
Cdd:cd03276    110 LSGGERSFSTVCLLLSLWEVMESPFRCLDEFDVFMDMVNRKISTDLLVKEAKkqpGRQFIFITPQDISGLASSDDVKVFR 189
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
105-352 1.95e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.91  E-value: 1.95e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  105 KEKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKkLNKVTKFIEQNKEKFTQLDLEDVQVR-------EKLKHATSKAKK 177
Cdd:PRK03918   192 EELIKEKEKELEEVLREINEISSELPELREELEK-LEKEVKELEELKEEIEELEKELESLEgskrkleEKIRELEERIEE 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  178 LEKQLQKDKEKVEELKSVPAKSKTVInetttrnnslekerekeekKLKEVMDSLKQETQGLQKEKEIQEKELMGFNKSVN 257
Cdd:PRK03918   271 LKKEIEELEEKVKELKELKEKAEEYI-------------------KLSEFYEEYLDELREIEKRLSRLEEEINGIEERIK 331
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  258 EARSKMEVAQ------SELDIYLSRHNTAVSQLSKAKeALITASETLKERKA--AIKDINTKLPQTQQELKEKEKELQKL 329
Cdd:PRK03918   332 ELEEKEERLEelkkklKELEKRLEELEERHELYEEAK-AKKEELERLKKRLTglTPEKLEKELEELEKAKEEIEEEISKI 410
                          250       260
                   ....*....|....*....|...
gi 1907152066  330 TQEEINLKSLVHDLFQKVEEAKS 352
Cdd:PRK03918   411 TARIGELKKEIKELKKAIEELKK 433
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
113-806 1.98e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 48.86  E-value: 1.98e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  113 KEITEKSNVLSNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLEDVQVREKLKHATSKAKKLEKQLQK-------D 185
Cdd:TIGR04523   36 KQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKinseiknD 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  186 KEKVEELKSVPAKSKTVINETT---TRNNSLEKEREKEEKKLKEVMDSLKQETQGLQKEKEIQEKELMGFNKSVNEARSK 262
Cdd:TIGR04523  116 KEQKNKLEVELNKLEKQKKENKkniDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNK 195
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  263 ---MEVAQSELDIYLSRHNTAVSQLSKAKEALITASETLKERkaaikdiNTKLPQTQQELKEKEKELQKLTQEEINLKSL 339
Cdd:TIGR04523  196 llkLELLLSNLKKKIQKNKSLESQISELKKQNNQLKDNIEKK-------QQEINEKTTEISNTQTQLNQLKDEQNKIKKQ 268
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  340 VHDLFQKVEEAKSSLAMNRSRGKVLDAIIQEKKSGRIPGIYGRL-GDLGAIDEKYDIAISSCCHAldyivVDSIDTAQEC 418
Cdd:TIGR04523  269 LSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKELkSELKNQEKKLEEIQNQISQN-----NKIISQLNEQ 343
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  419 VNFLKKhnigiatfiglDKMTVWAKKMSK-IQTPENTPRLFDLVKVKNEEIRQAFYFALRDTLVANNLDQATRVAYQRDR 497
Cdd:TIGR04523  344 ISQLKK-----------ELTNSESENSEKqRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDE 412
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  498 RWRVVTLQGQIIEQsgtmsgggskvmrgrmgssvideisveEVNKMESQLERHSKQAMQIQEQKVQHEEAVVKLRHSERD 577
Cdd:TIGR04523  413 QIKKLQQEKELLEK---------------------------EIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRES 465
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  578 MRNTLEKFAASIQGLSEQEEYLCVQIKELEANVLTTapdRKQQKLLEENVSVFKKEYDAVAEKAGKVEAEIKRLHNTIID 657
Cdd:TIGR04523  466 LETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKL---NEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISD 542
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  658 INNRKLKAQQN-KLDTINKQLDECASAITKaqvaIKTADRNLKKAQDSVcrtEKEIKDTEKEINDLKTELKNIEDKAEEV 736
Cdd:TIGR04523  543 LEDELNKDDFElKKENLEKEIDEKNKEIEE----LKQTQKSLKKKQEEK---QELIDQKEKEKKDLIKEIEEKEKKISSL 615
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  737 inntktaETSLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQKEISKI 806
Cdd:TIGR04523  616 -------EKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKWPEIIKKIKESKTKIDDI 678
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
91-373 2.55e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 48.48  E-value: 2.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   91 IYDLQNRIAEITTQKEKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLEDVQVREKLKh 170
Cdd:TIGR04523  323 LEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQ- 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  171 atsKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRNNSLEKEREKEEKKLKEVMDSLKqetqglqKEKEIQEKELM 250
Cdd:TIGR04523  402 ---NQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLD-------NTRESLETQLK 471
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  251 GFNKSVNEARSKMEVAQSELDIYLSRHNTAVSQLSKAKEALITASETLKERKAAIKDINTKLPQTQQELKEKEKELQKLT 330
Cdd:TIGR04523  472 VLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDD 551
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907152066  331 QE--EINLKSLVHDLFQKVEEAKSSLAMNRSRGKVLDAIIQEKKS 373
Cdd:TIGR04523  552 FElkKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEK 596
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
556-971 2.77e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 48.43  E-value: 2.77e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  556 QIQEQKVQHEEAVVKLRHSERDMRNTLEKFAASIQGLSEQEEYLCVQIKELEANV-LTTAPDRKQQKLLEENVSVFKKEy 634
Cdd:TIGR00618  532 RGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIpNLQNITVRLQDLTEKLSEAEDML- 610
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  635 dAVAEKAGKVEAEIKRLHNTIIDINNRKLKAQQNKL-------DTINKQLDECASAITKAQVAIKTADR-NLKKAQDSVC 706
Cdd:TIGR00618  611 -ACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLtalhalqLTLTQERVREHALSIRVLPKELLASRqLALQKMQSEK 689
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  707 RTEKEIKDTEKEINDLKTELKNIEDKAEEVIN---------------NTKTAETSLPEIQKEHRNLLQELKVIQENEHAL 771
Cdd:TIGR00618  690 EQLTYWKEMLAQCQTLLRELETHIEEYDREFNeienassslgsdlaaREDALNQSLKELMHQARTVLKARTEAHFNNNEE 769
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  772 QKDALSIKLKLEQIDGHISEHNSKIKYWQKEISKIKLHPVEDNPVETVAVLSQEELEAiKNPESITNEIALLEAQCREMK 851
Cdd:TIGR00618  770 VTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLV-QEEEQFLSRLEEKSATLGEIT 848
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  852 PNLGAIAEYKKKEDLYLQRVAELDKITSERDNFRQAYEDLRKQRLNEFMAGFYVITNKLKENYQMLTLGGDAELELVDSL 931
Cdd:TIGR00618  849 HQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQIKIQFDGDALIKFLHEITLYANVRLANQSEGRFHGRYADSHVNAR 928
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1907152066  932 DPFSEGIMF------SVRPPKkswkkifNLSGGEKTLSSLALVFAL 971
Cdd:TIGR00618  929 KYQGLALLVadaytgSVRPSA-------TLSGGETFLASLSLALAL 967
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
147-370 2.80e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 47.84  E-value: 2.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  147 IEQNKEKFTQLDLEDVQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRnnslekerekeekklke 226
Cdd:COG4942     22 AAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAE----------------- 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  227 vMDSLKQETQGLQKEKEIQEKELMGFNKSV--NEARSKMEV---AQSELDIYlsRHNTAVSQLSKAKEALITA-SETLKE 300
Cdd:COG4942     85 -LAELEKEIAELRAELEAQKEELAELLRALyrLGRQPPLALllsPEDFLDAV--RRLQYLKYLAPARREQAEElRADLAE 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  301 RKAAIKDINTKLPQTQQELKEKEKELQKLTQEEINLKSLVHDLFQKVEEAKSSLAMNRSRGKVLDAIIQE 370
Cdd:COG4942    162 LAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
538-784 4.64e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 47.60  E-value: 4.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  538 EEVNKMESQLERHSKQAM---QIQEQKVQHEEAVVKLRHSERdMRNTLEKFAAsiqglseqeeylcvqikELEANVLtta 614
Cdd:COG4913    235 DDLERAHEALEDAREQIEllePIRELAERYAAARERLAELEY-LRAALRLWFA-----------------QRRLELL--- 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  615 pdRKQQKLLEENVSVFKKEYDAVAEKAGKVEAEIKRLHNTIIDINNRKLKAQQNKLDTINKQLDECASAITKAQVAIKTA 694
Cdd:COG4913    294 --EAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGGDRLEQLEREIERLERELEERERRRARLEALLAAL 371
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  695 DRnlkkaqdSVCRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNTKTAETslpEIQKEHRNLLQELKVIQENEHALQKD 774
Cdd:COG4913    372 GL-------PLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALR---DLRRELRELEAEIASLERRKSNIPAR 441
                          250
                   ....*....|
gi 1907152066  775 ALSIKLKLEQ 784
Cdd:COG4913    442 LLALRDALAE 451
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
662-895 5.98e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 46.68  E-value: 5.98e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  662 KLKAQQNKLDTINKQLDECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKNIEDKAEEvinntk 741
Cdd:COG4942     21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAE------ 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  742 tAETSLPEIQKEHRNLLQELKVIQENEHA---LQKDALSIKLKLEQIDGHISEHNskikywQKEISKIKlhpvednpvet 818
Cdd:COG4942     95 -LRAELEAQKEELAELLRALYRLGRQPPLallLSPEDFLDAVRRLQYLKYLAPAR------REQAEELR----------- 156
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907152066  819 vavLSQEELEAIKnpESITNEIALLEAQCREMKpnlgaiAEYKKKEDLYLQRVAELDKITSERDNFRQAYEDLRKQR 895
Cdd:COG4942    157 ---ADLAELAALR--AELEAERAELEALLAELE------EERAALEALKAERQKLLARLEKELAELAAELAELQQEA 222
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
603-900 1.13e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 46.58  E-value: 1.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  603 IKELEaNVLTTAPDrkQQKLLEENVSVFKKEYDavaEKAGKVEAEikrlhNTIIDINNRKLKAQQNKLDTINKQLDECAS 682
Cdd:TIGR00606  697 ISDLQ-SKLRLAPD--KLKSTESELKKKEKRRD---EMLGLAPGR-----QSIIDLKEKEIPELRNKLQKVNRDIQRLKN 765
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  683 AITKAQVAIKTADRNLKKAQDSVC------RTEKEIKDTEKEINDLKTELKNIE---------DKAEEVINNTKTAETSL 747
Cdd:TIGR00606  766 DIEEQETLLGTIMPEEESAKVCLTdvtimeRFQMELKDVERKIAQQAAKLQGSDldrtvqqvnQEKQEKQHELDTVVSKI 845
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  748 PEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQKEISKIKLHPVEDNPVETVAVLSQEEL 827
Cdd:TIGR00606  846 ELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEK 925
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  828 EAIKNPESITNEIALLEAQcrEMKPNLGAIAEYKK---------KEDLYLQRVAELDKITSERDNFRQayedlRKQRLNE 898
Cdd:TIGR00606  926 EELISSKETSNKKAQDKVN--DIKEKVKNIHGYMKdienkiqdgKDDYLKQKETELNTVNAQLEECEK-----HQEKINE 998

                   ..
gi 1907152066  899 FM 900
Cdd:TIGR00606  999 DM 1000
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
579-897 1.32e-04

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 45.98  E-value: 1.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  579 RNTLEKFAA--SIQGLSEQEEYLCVQIKE-LEANVLTTAPDRKQQKLLEENVSVFKKEYDAVAEKAGKVEAEIKRLHNTI 655
Cdd:pfam12128  305 ELNGELSAAdaAVAKDRSELEALEDQHGAfLDADIETAAADQEQLPSWQSELENLEERLKALTGKHQDVTAKYNRRRSKI 384
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  656 IDINNRKL----KAQQNKLDTINKQLDECASA------------------ITKAQVAIKTADRNLKKAQDSVCRTEKEIK 713
Cdd:pfam12128  385 KEQNNRDIagikDKLAKIREARDRQLAVAEDDlqaleselreqleagkleFNEEEYRLKSRLGELKLRLNQATATPELLL 464
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  714 DTE---KEINDLKTEL----KNIEDKAEEVINNTKTAETSLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQID 786
Cdd:pfam12128  465 QLEnfdERIERAREEQeaanAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELELQLFPQAGTLLHFLRKEA 544
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  787 GHISEHNSKIkywqkeISKIKLHPVEDNPVETVAVLSQE--------ELEAIKNPESITNEIAlLEAQCREMKPNLGAIA 858
Cdd:pfam12128  545 PDWEQSIGKV------ISPELLHRTDLDPEVWDGSVGGElnlygvklDLKRIDVPEWAASEEE-LRERLDKAEEALQSAR 617
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 1907152066  859 EYKKKEDLYL-QRVAELDKITSERDNFRQAYE--DLRKQRLN 897
Cdd:pfam12128  618 EKQAAAEEQLvQANGELEKASREETFARTALKnaRLDLRRLF 659
46 PHA02562
endonuclease subunit; Provisional
589-831 1.46e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 45.78  E-value: 1.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  589 IQGLSEQEEYLCVQIKELEANVLttapdrKQQKLLEENVSVFKKEYDAVAEKAGKVEAEIKRLHNTIIDINnRKLKAQQN 668
Cdd:PHA02562   183 IQTLDMKIDHIQQQIKTYNKNIE------EQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLV-MDIEDPSA 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  669 KLDTINkqldecaSAITKAQVAIKTADRNLKKAQD-SVCRTEKE-IKDTEKEINDLKTELKNIEDKAEEVinntktaets 746
Cdd:PHA02562   256 ALNKLN-------TAAAKIKSKIEQFQKVIKMYEKgGVCPTCTQqISEGPDRITKIKDKLKELQHSLEKL---------- 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  747 lpeiqKEHRnllQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQKEISKIKLHPVEDNpvETVAVLSQEE 826
Cdd:PHA02562   319 -----DTAI---DELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNA--EELAKLQDEL 388

                   ....*
gi 1907152066  827 LEAIK 831
Cdd:PHA02562   389 DKIVK 393
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
100-359 1.61e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 45.87  E-value: 1.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  100 EITTQKEKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLEDVQVREKLKHatsKAKKLE 179
Cdd:pfam05483  489 ELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQ---KGDEVK 565
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  180 KQLQKDKEKVEELKSVPAKSKTVINETTTRNNSLEKEREKEEKKlkevMDSLKQETQGLQKEKEIQEKELMGFNKSVNEA 259
Cdd:pfam05483  566 CKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKN----IEELHQENKALKKKGSAENKQLNAYEIKVNKL 641
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  260 RSKMEVAQSELDIYLSRHNTAVSQ--------LSKAKEALITASETLKERKAAIKDINTKLPQTQQELKEKEKELQKLTQ 331
Cdd:pfam05483  642 ELELASAKQKFEEIIDNYQKEIEDkkiseeklLEEVEKAKAIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIE 721
                          250       260
                   ....*....|....*....|....*...
gi 1907152066  332 EEINLKSLVHDLFQKVEEAKSSLAMNRS 359
Cdd:pfam05483  722 ERDSELGLYKNKEQEQSSAKAALEIELS 749
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
622-901 2.27e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.89  E-value: 2.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  622 LLEENVSVFKKEYDAVAEKAGKVEAEIKRLhNTIIDINNRKLKAQQNKLDTINKQLDECASAITKAQVAIKTADRNLKKA 701
Cdd:COG4372     28 ALSEQLRKALFELDKLQEELEQLREELEQA-REELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESL 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  702 QDSVCRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNTKTAETSLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLK 781
Cdd:COG4372    107 QEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALD 186
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  782 LEQIDGHISEHNSKIKYWQKEISKIKLHPVEDNPVETVAVLSQEELEAIKNPESITNEIALLEAQCREMKPNLGAIAEYK 861
Cdd:COG4372    187 ELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELA 266
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1907152066  862 KKEDLYLQRVAELDKITSERDNFRQAYEDLRKQRLNEFMA 901
Cdd:COG4372    267 ILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAA 306
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
668-862 2.64e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.76  E-value: 2.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  668 NKLDTINKQLDECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNtktaetsl 747
Cdd:COG1579     17 SELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNN-------- 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  748 peiqKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQKEISKIKlhpvednpvetvavlsqEEL 827
Cdd:COG1579     89 ----KEYEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKK-----------------AEL 147
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1907152066  828 EAIKnpESITNEIALLEAQCREMKPNLGA--IAEYKK 862
Cdd:COG1579    148 DEEL--AELEAELEELEAEREELAAKIPPelLALYER 182
MARTX_Nterm NF012221
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ...
444-838 3.66e-04

MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.


Pssm-ID: 467957 [Multi-domain]  Cd Length: 1848  Bit Score: 44.83  E-value: 3.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  444 KMSKIQTPENTPRLFDLVKVK-NEEIRQAFYFALRDTLVANN----LDQATRV--AYQRDRRWRVVTLQGQIIEQSGTMS 516
Cdd:NF012221  1438 RVSELDTYTNTSLYQDLSNLTaGEVIALSFDFARRAGLSTNNgievLWNGEVVfaSSGDASAWQQKTLKLTAKAGSNRLE 1517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  517 GGGSKVMRGrMGSSVIDEISVEEVNKMESQLERHSKQAMQIQEQKVQHEEAVVKLRHSERDMRNTLEKFAASiqglseqe 596
Cdd:NF012221  1518 FKGTGHNDG-LGYILDNVVATSESSQQADAVSKHAKQDDAAQNALADKERAEADRQRLEQEKQQQLAAISGS-------- 1588
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  597 eylcvQiKELEANvlttapdrkQQKLLEENVSVfkkEYDAVAEKAGKVEAEIKRLHNTIIDINNRKLKAQQNK------- 669
Cdd:NF012221  1589 -----Q-SQLEST---------DQNALETNGQA---QRDAILEESRAVTKELTTLAQGLDALDSQATYAGESGdqwrnpf 1650
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  670 ----LDTINKQLDECASAITKAQVAIKTA-DRNLKKAQDSVC-------RTEKEIKDTEKEINDLKTELKNIEDKAEEVI 737
Cdd:NF012221  1651 agglLDRVQEQLDDAKKISGKQLADAKQRhVDNQQKVKDAVAkseagvaQGEQNQANAEQDIDDAKADAEKRKDDALAKQ 1730
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  738 NNTKTAET-SLPEIQKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIK---YWQKEISKIKLHPVED 813
Cdd:NF012221  1731 NEAQQAESdANAAANDAQSRGEQDASAAENKANQAQADAKGAKQDESDKPNRQGAAGSGLSgkaYSVEGVAEPGSHINPD 1810
                          410       420
                   ....*....|....*....|....*....
gi 1907152066  814 NPVET----VAVLSQEELEAIKNPESITN 838
Cdd:NF012221  1811 SPAAAdgrfSEGLTEQEQEALEGATNAVN 1839
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
636-883 4.01e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.98  E-value: 4.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  636 AVAEKAGKVEAEIKRLhntiidinNRKLKAQQNKLDTINKQLDECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDT 715
Cdd:COG4942     17 AQADAAAEAEAELEQL--------QQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAEL 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  716 EKEINDLKTELKNIEDKAEEVINNT-KTAETSLPEI---QKEHRNLLQELKVIQENEHALQKDALSIKLKLEQIDGHISE 791
Cdd:COG4942     89 EKEIAELRAELEAQKEELAELLRALyRLGRQPPLALllsPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAE 168
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  792 HNSKIKYWQKEISKIKlhpvednpvETVAVLSQEELEAIKNPESITNEIALLEAQcremkpnlgaIAEYKKKEDLYLQRV 871
Cdd:COG4942    169 LEAERAELEALLAELE---------EERAALEALKAERQKLLARLEKELAELAAE----------LAELQQEAEELEALI 229
                          250
                   ....*....|..
gi 1907152066  872 AELDKITSERDN 883
Cdd:COG4942    230 ARLEAEAAAAAE 241
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
646-898 4.86e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 44.24  E-value: 4.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  646 AEIKRLHNTIIDI---NNRKLKAQQNK--LDTINKQLDECASAITKAQVAIktadRNLKKAQDSVcrtekeikDTEKEIN 720
Cdd:COG3206    148 ELAAAVANALAEAyleQNLELRREEARkaLEFLEEQLPELRKELEEAEAAL----EEFRQKNGLV--------DLSEEAK 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  721 DLKTELKNIEDKAEEVINNTKTAETSLPEIQKEHRNLLQELKVIQENEH--ALQKDALSIKLKLEQIDGHISEHNSKIKY 798
Cdd:COG3206    216 LLLQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPELLQSPViqQLRAQLAELEAELAELSARYTPNHPDVIA 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  799 WQKEISKIKlhpvednpvetvAVLSQEELEAIknpESITNEIALLEAQCREMKpnlGAIAEYKKKEDLYLQRVAELDKIT 878
Cdd:COG3206    296 LRAQIAALR------------AQLQQEAQRIL---ASLEAELEALQAREASLQ---AQLAQLEARLAELPELEAELRRLE 357
                          250       260
                   ....*....|....*....|
gi 1907152066  879 SERDNFRQAYEDLRkQRLNE 898
Cdd:COG3206    358 REVEVARELYESLL-QRLEE 376
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
602-797 6.06e-04

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 43.69  E-value: 6.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  602 QIKELEANVlttapdrkqQKLLEENvsvFKKEYDAVAEKAGKVEaeiKRLHNTIIDINNRKLKAQQNKLDTINKQLDECA 681
Cdd:pfam06160  212 QLEELKEGY---------REMEEEG---YALEHLNVDKEIQQLE---EQLEENLALLENLELDEAEEALEEIEERIDQLY 276
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  682 SAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEI-----------------NDLKTELKNIEDKAEEVINNTKTAE 744
Cdd:pfam06160  277 DLLEKEVDAKKYVEKNLPEIEDYLEHAEEQNKELKEELervqqsytlnenelervRGLEKQLEELEKRYDEIVERLEEKE 356
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  745 TSLPEIQKEHRNLLQELKVIQENE-------HALQKDALSIKLKLEQIDGHISEHNSKIK 797
Cdd:pfam06160  357 VAYSELQEELEEILEQLEEIEEEQeefkeslQSLRKDELEAREKLDEFKLELREIKRLVE 416
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
617-893 6.79e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 43.90  E-value: 6.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  617 RKQQKLLEENVSVFKKEYDAVAEKAGKVEAEIKRLHNTI--IDINNRKLKAQQNKLDTINKQLDECASAITKAQVAIKTA 694
Cdd:PRK03918   171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREIneISSELPELREELEKLEKEVKELEELKEEIEELEKELESL 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  695 DRNLKKAQDSVCRTEKEIKDTEKEINDLKT---ELKNIEDKAEEVInntktaetslpEIQKEHRNLLQELKVIQENEHAL 771
Cdd:PRK03918   251 EGSKRKLEEKIRELEERIEELKKEIEELEEkvkELKELKEKAEEYI-----------KLSEFYEEYLDELREIEKRLSRL 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  772 QKDALSIKLKLEQidghISEHNSKIKYWQKEISKIklhpvednpvetvavlsQEELEAIKNPESITNEIALLEAQCREMK 851
Cdd:PRK03918   320 EEEINGIEERIKE----LEEKEERLEELKKKLKEL-----------------EKRLEELEERHELYEEAKAKKEELERLK 378
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1907152066  852 PNLG------AIAEYKKKEDLYLQRVAELDKITSERDNFRQAYEDLRK 893
Cdd:PRK03918   379 KRLTgltpekLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKK 426
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
602-747 8.12e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 42.22  E-value: 8.12e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  602 QIKELEaNVLTTAPDRKQQklLEENVSVFKKEYDAVAEKAGKVEAEIKRLHNTIIDINNRKLKAQQnKLDTI--NKQLDE 679
Cdd:COG1579     18 ELDRLE-HRLKELPAELAE--LEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEE-QLGNVrnNKEYEA 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907152066  680 CASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNTKTAETSL 747
Cdd:COG1579     94 LQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEEL 161
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
590-772 1.05e-03

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 43.20  E-value: 1.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  590 QGLSEQEEYLCVQIKEL---EANVLTTAPDRKQQKLLEENVSVfKKEYDAVAEKAGKVEAEIKRLHNTIIDINNRKL-KA 665
Cdd:pfam07111   59 QALSQQAELISRQLQELrrlEEEVRLLRETSLQQKMRLEAQAM-ELDALAVAEKAGQAEAEGLRAALAGAEMVRKNLeEG 137
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  666 QQNKLDTINKQLDECASAITKAQ----VAIKTADRNLKKAQDSV----CRTEKEIKDTEKEINDLKTELKNIEDKAEEVI 737
Cdd:pfam07111  138 SQRELEEIQRLHQEQLSSLTQAHeealSSLTSKAEGLEKSLNSLetkrAGEAKQLAEAQKEAELLRKQLSKTQEELEAQV 217
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907152066  738 NNTKT-----AETSLPEIQK-----EHRNLLQELKVIQENEHALQ 772
Cdd:pfam07111  218 TLVESlrkyvGEQVPPEVHSqtwelERQELLDTMQHLQEDRADLQ 262
46 PHA02562
endonuclease subunit; Provisional
643-1037 1.16e-03

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 42.69  E-value: 1.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  643 KVEAEIKRLHNTIIDINNRKLKAQQNKLDTINKQLDECASAITKAQvaiKTADRNLKKAQDSVCRTEKEIKDTEKEINDL 722
Cdd:PHA02562   163 SVLSEMDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQR---KKNGENIARKQNKYDELVEEAKTIKAEIEEL 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  723 KTELKNIEDKAEEVINNTKTAETSLPEIQKEHRNLLQELKVIQENEHalqkdALSIKLKLEQIDGHISEHNSKIKYWQKE 802
Cdd:PHA02562   240 TDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGV-----CPTCTQQISEGPDRITKIKDKLKELQHS 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  803 ISKIKLHpvednpVETVAVLSQEELEAIKNPESITNEIALLEAQ-CREMKPNLGAIAEYKKKEDLYLQRVAELDKITSER 881
Cdd:PHA02562   315 LEKLDTA------IDELEEIMDEFNEQSKKLLELKNKISTNKQSlITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDEL 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  882 DNFRQAYEDLRKQRlneFMAGFyvITNKLKE-------------------NYQMLTLGGDAELELVDSldpFSEGImfsv 942
Cdd:PHA02562   389 DKIVKTKSELVKEK---YHRGI--VTDLLKDsgikasiikkyipyfnkqiNHYLQIMEADYNFTLDEE---FNETI---- 456
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  943 rppKKSWKKIF---NLSGGEKTLSSLALVFA------LHHYKPTPLYFMDEI-DAALDFKNVSIVaFYIYEQTKNAQFII 1012
Cdd:PHA02562   457 ---KSRGREDFsyaSFSQGEKARIDLALLFTwrdvasKVSGVDTNLLILDEVfDGALDAEGTKAL-LSILDSLKDTNVFV 532
                          410       420
                   ....*....|....*....|....*
gi 1907152066 1013 ISLRNNMFEISDRLIGIYKTYNSTK 1037
Cdd:PHA02562   533 ISHKDHDPQKFDRHLKMEKVGRFSV 557
PRK01156 PRK01156
chromosome segregation protein; Provisional
116-369 1.18e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 42.97  E-value: 1.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  116 TEKSNVL----SNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLEDVqvrEKLKHATSKAKKLEKQLQKDKEKVEE 191
Cdd:PRK01156   464 EEKSNHIinhyNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEI---NKSINEYNKIESARADLEDIKIKINE 540
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  192 LKSVPAKSKTVINETTTRNNSLEKEREKEEKKLKEVMDSLKQETqgLQKEKEIQEKELMGFNKSVNEARSKMEVAQSELD 271
Cdd:PRK01156   541 LKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIET--NRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYID 618
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  272 IYLSRHNTAVSQLSKAK---EALITASETL--------------KERKAAIKDINTKLPQTQQELKEKEKELQKLTQEEI 334
Cdd:PRK01156   619 KSIREIENEANNLNNKYneiQENKILIEKLrgkidnykkqiaeiDSIIPDLKEITSRINDIEDNLKKSRKALDDAKANRA 698
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907152066  335 NLKSLVHDLFQKVEEAKSSLAMNRSRGKVLDAIIQ 369
Cdd:PRK01156   699 RLESTIEILRTRINELSDRINDINETLESMKKIKK 733
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
651-868 1.81e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.06  E-value: 1.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  651 LHNTIIDINNRKLKAQQNKLDTINKQLDE-CASAITKAQVAIKTAD---RNLKKAQDSVCRTEKEIKDTEKEINDLKTEL 726
Cdd:COG4717     39 LLAFIRAMLLERLEKEADELFKPQGRKPElNLKELKELEEELKEAEekeEEYAELQEELEELEEELEELEAELEELREEL 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  727 KNIED--KAEEVINNTKTAETSLPEIQKEHRNLLQELKVIQENEHalqkdalsiklKLEQIDGHISEHNSKIKYWQKEIS 804
Cdd:COG4717    119 EKLEKllQLLPLYQELEALEAELAELPERLEELEERLEELRELEE-----------ELEELEAELAELQEELEELLEQLS 187
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907152066  805 KIKLHPVEDNpVETVAVLSQEELEAIKNPESITNEIALLEAQCREMKPNLGAIAEYKKKEDLYL 868
Cdd:COG4717    188 LATEEELQDL-AEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARL 250
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
654-728 2.17e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 41.74  E-value: 2.17e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907152066  654 TIIDINNRKLKAQQNKLDTINKQLDECASAITKAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKN 728
Cdd:COG3883     16 PQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGE 90
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
45-372 2.29e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 41.96  E-value: 2.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   45 RGEKLNRVKMVEKEKDALEgEKNIAIEFLTlENEMFKKKNHICQYYIYDLQNRIAEITTQKEKIHEDTKEITEKSNVLSN 124
Cdd:TIGR00606  507 QNEKADLDRKLRKLDQEME-QLNHHTTTRT-QMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDWLHSKSK 584
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  125 EMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLEDVQVREKLKHA------TSKAKKLEKQLQKDKEKVEELKSVPAK 198
Cdd:TIGR00606  585 EINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVcgsqdeESDLERLKEEIEKSSKQRAMLAGATAV 664
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  199 SKTVINETTTRNNS---LEKEREKEEKKLKEVMDSLKQETQGLQKEKEIQEKELMGFNKSVNEARSKMEVAQSELDIY-- 273
Cdd:TIGR00606  665 YSQFITQLTDENQSccpVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKek 744
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  274 --------LSRHNTAVSQLSKAKEALITASETLKERKAAIKDINTK---LPQTQQELKEKEKELQKLTQE--EINLKSLV 340
Cdd:TIGR00606  745 eipelrnkLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDvtiMERFQMELKDVERKIAQQAAKlqGSDLDRTV 824
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1907152066  341 HDLFQKVEEAKSSLAMNRSRGKVLDAIIQEKK 372
Cdd:TIGR00606  825 QQVNQEKQEKQHELDTVVSKIELNRKLIQDQQ 856
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
545-753 2.34e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 41.67  E-value: 2.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  545 SQLERHSKQAMQIQEQKVQHEEAVVKLRHSERDMRNTLEKFAASIQGLSEQEEYLCVQIKELEANVLTTapdRKQQKLLE 624
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAEL---EKEIAELR 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  625 ENVSVFKKEYD---AVAEKAGKVEAEIKRLHNTIIDINNRKL-------KAQQNKLDTINKQLDECASAITKAQVAIKTA 694
Cdd:COG4942     97 AELEAQKEELAellRALYRLGRQPPLALLLSPEDFLDAVRRLqylkylaPARREQAEELRADLAELAALRAELEAERAEL 176
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907152066  695 DRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNTKTAETSLPEIQKE 753
Cdd:COG4942    177 EALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAE 235
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
547-807 2.37e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 41.97  E-value: 2.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  547 LERHSKQAMQIQEQKVQHEEAVVKLRHSERDMRNTLEKFAASIQglseqEEYLCVQIKELEanvlttapdRKQQKLLEEN 626
Cdd:PRK03918   454 LEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK-----LKELAEQLKELE---------EKLKKYNLEE 519
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  627 VSVFKKEYDAVAEKAGKVEAEIKRLHNTIIDIN--NRKLKAQQNKLDTINKQLDECASAITKAQV-AIKTADRNLKKAQD 703
Cdd:PRK03918   520 LEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEelKKKLAELEKKLDELEEELAELLKELEELGFeSVEELEERLKELEP 599
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  704 ------SVCRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNTKTAETSLPEIQKEHRNllQELKVIQENEHALQKDALS 777
Cdd:PRK03918   600 fyneylELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE--EEYEELREEYLELSRELAG 677
                          250       260       270
                   ....*....|....*....|....*....|
gi 1907152066  778 IKLKLEQIDGHISEHNSKIKYWQKEISKIK 807
Cdd:PRK03918   678 LRAELEELEKRREEIKKTLEKLKEELEERE 707
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
537-911 2.50e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 42.03  E-value: 2.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  537 VEEVNKMESQLERHSKQAMQIQEQKVQHEEAVVKLRHSE-RDMRNTLEKFAASIQGLSEQEEYLCVQIKELEANVlttAP 615
Cdd:pfam15921  319 LSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLANSElTEARTERDQFSQESGNLDDQLQKLLADLHKREKEL---SL 395
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  616 DRKQQKLLEE-------NVSVFKKEYDAVAEKAGKVEAEIKRLHNTIIDINNRKLKAQQNKLDTINKqldecasaITKAQ 688
Cdd:pfam15921  396 EKEQNKRLWDrdtgnsiTIDHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQGKNESLEK--------VSSLT 467
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  689 VAIKTADRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKNIEdKAEEVINNTKTAETSLPEIQKEHrnlLQELKVIQENE 768
Cdd:pfam15921  468 AQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKE-RAIEATNAEITKLRSRVDLKLQE---LQHLKNEGDHL 543
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  769 HALQKDALSIKLKLEQIDGHISEHNSKIKYWQK------------EISKIKLHP-VEDNPVE--TVAVLSQEELEAIKNP 833
Cdd:pfam15921  544 RNVQTECEALKLQMAEKDKVIEILRQQIENMTQlvgqhgrtagamQVEKAQLEKeINDRRLElqEFKILKDKKDAKIREL 623
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907152066  834 ESITNEIALLEAQ-CREMKPNLGAIAEYKKKEDlylQRVAELDKITSERDNFRQAYEDLRKQRLNEfMAGFYVITNKLK 911
Cdd:pfam15921  624 EARVSDLELEKVKlVNAGSERLRAVKDIKQERD---QLLNEVKTSRNELNSLSEDYEVLKRNFRNK-SEEMETTTNKLK 698
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
93-332 2.53e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 41.29  E-value: 2.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   93 DLQNRIAEITTQKEKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKKLNKVTKFIEQNKEkftqldledvqvreklkhat 172
Cdd:COG4942     31 QLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEK-------------------- 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  173 sKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRNNSLEKEREKEEKKLKEVMDSLKQETQGLQKEKEiqekelmgf 252
Cdd:COG4942     91 -EIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLA--------- 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  253 nkSVNEARSKMEVAQSELDIYLSRHNTAVSQLSKAKealitasetlKERKAAIKDINTKLPQTQQELKEKEKELQKLTQE 332
Cdd:COG4942    161 --ELAALRAELEAERAELEALLAELEEERAALEALK----------AERQKLLARLEKELAELAAELAELQQEAEELEAL 228
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
534-892 4.51e-03

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 41.18  E-value: 4.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  534 EISVEEVNKMESQLERHSKQAMQIQEQkvqhEEAVVKLRHSERDMRNTLEKFAASIQGLSEQEEYLCVQIKELEANVLTT 613
Cdd:PRK02224   230 EQARETRDEADEVLEEHEERREELETL----EAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLD 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  614 APDRK----QQKLLEENVSVFKKEYDAVAEKAGKVEAEIKRLHNTIIDINNRKLKAQqNKLDTINKQLDECASAITKAQV 689
Cdd:PRK02224   306 DADAEaveaRREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELR-EEAAELESELEEAREAVEDRRE 384
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  690 AIKTadrnlkkaqdsvcrTEKEIKDTEKEINDLKTELKNIEDKAEEVINN-------TKTAETSLPEIQK---EHRNLLQ 759
Cdd:PRK02224   385 EIEE--------------LEEEIEELRERFGDAPVDLGNAEDFLEELREErdelrerEAELEATLRTARErveEAEALLE 450
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  760 ELKV------IQENEHA------------LQKDALSIKLKLEQIDGHIsEHNSKIKYWQKEISKIKlhpvedNPVETVav 821
Cdd:PRK02224   451 AGKCpecgqpVEGSPHVetieedrerveeLEAELEDLEEEVEEVEERL-ERAEDLVEAEDRIERLE------ERREDL-- 521
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907152066  822 lsqEELEAIKnpESITNEIALLEAQCREMKPNLGAIAEYKKKE--------DLYLQRVAELDKITSERDNFRQAYEDLR 892
Cdd:PRK02224   522 ---EELIAER--RETIEEKRERAEELRERAAELEAEAEEKREAaaeaeeeaEEAREEVAELNSKLAELKERIESLERIR 595
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
36-325 4.52e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.19  E-value: 4.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   36 RRVEILNEHRGEKLNRVKMVEKEKDAL--EGEKNIAIEFLTLENEMFKKKNHICQYYIYDLQNRIAEITtqKEKIHEDTK 113
Cdd:PTZ00108  1032 KKKDLVKELKKLGYVRFKDIIKKKSEKitAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLT--KEKVEKLNA 1109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  114 EITEKSNVLSnemKAKNSAVKDVEKK-LNKVTKFIEQNKEKftqlDLEDVQVREKLKHATS-KAKKLEKQLQKDKEKVEE 191
Cdd:PTZ00108  1110 ELEKKEKELE---KLKNTTPKDMWLEdLDKFEEALEEQEEV----EEKEIAKEQRLKSKTKgKASKLRKPKLKKKEKKKK 1182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  192 LKSVPAKSKTVINETTTRNNSLEKEREKEEKKLKEVMDSLKQETQGLQKEKEIQEKELMGFNKSVNEARSKMEVAQSELD 271
Cdd:PTZ00108  1183 KSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSS 1262
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907152066  272 IYLSRHN---TAVSQLSKAKEALITASETLKERK-------AAIKDINTKLPQTQQELKEKEKE 325
Cdd:PTZ00108  1263 DDLSKEGkpkNAPKRVSAVQYSPPPPSKRPDGESnggskpsSPTKKKVKKRLEGSLAALKKKKK 1326
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
54-335 4.95e-03

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 40.71  E-value: 4.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   54 MVEKEKDALEGEKNIAIEFLTLENEMFKKKNHIcqyyiydLQNRIAEITTQKEKIHEDTKEIteksnvlsnEMKAKNSAV 133
Cdd:COG5185    251 TSDKLEKLVEQNTDLRLEKLGENAESSKRLNEN-------ANNLIKQFENTKEKIAEYTKSI---------DIKKATESL 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  134 KDVEKKLNKVTKFIEQNKEKFTQLDledvQVREKLKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRNNSL 213
Cdd:COG5185    315 EEQLAAAEAEQELEESKRETETGIQ----NLTAEIEQGQESLTENLEAIKEEIENIVGEVELSKSSEELDSFKDTIESTK 390
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  214 EKEREKEEKKLKEVMDSLKQetqgLQKEKEIQEKELMGFNKSVNEARSKMEVAQSELD-----IYLSRHNTAVSQLSKAK 288
Cdd:COG5185    391 ESLDEIPQNQRGYAQEILAT----LEDTLKAADRQIEELQRQIEQATSSNEEVSKLLNeliseLNKVMREADEESQSRLE 466
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1907152066  289 EALITASETLKERKAAIKDINTKLPQTQQELKEKEKELQKLTQEEIN 335
Cdd:COG5185    467 EAYDEINRSVRSKKEDLNEELTQIESRVSTLKATLEKLRAKLERQLE 513
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
545-787 5.75e-03

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 40.49  E-value: 5.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  545 SQLERHSKQamQIQEQKVQHEEAVVKLRHSERDMRNTLEK---FAASIQGLSEQEEylcvqikELEANVLTTAPDRKQQK 621
Cdd:pfam05557   12 SQLQNEKKQ--MELEHKRARIELEKKASALKRQLDRESDRnqeLQKRIRLLEKREA-------EAEEALREQAELNRLKK 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  622 LLEENVSVFKKEYDAVAEKAGKVEAEIK---RLHNTIIDINNRKLKAQQNKLDTINKQLDECASAITKAQVAIKtadrNL 698
Cdd:pfam05557   83 KYLEALNKKLNEKESQLADAREVISCLKnelSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLRQ----NL 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  699 KKAQDSVCRTEKEIKDTEKEI---NDLKTELKNIEDKAEevinntktaetSLPEIQKEHRNLLQE---LKVIQENEHALQ 772
Cdd:pfam05557  159 EKQQSSLAEAEQRIKELEFEIqsqEQDSEIVKNSKSELA-----------RIPELEKELERLREHnkhLNENIENKLLLK 227
                          250
                   ....*....|....*
gi 1907152066  773 KDALSIKLKLEQIDG 787
Cdd:pfam05557  228 EEVEDLKRKLEREEK 242
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
170-355 5.77e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 40.20  E-value: 5.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  170 HATSKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRnnslekerekeekklkevMDSLKQETQGLQKEKEIQEKEL 249
Cdd:COG3883     13 FADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEE------------------YNELQAELEALQAEIDKLQAEI 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  250 MGFNKSVNEARSKME----------VAQSELDIYLSRHN--------TAVSQLSKAKEALI----TASETLKERKAAIKD 307
Cdd:COG3883     75 AEAEAEIEERREELGeraralyrsgGSVSYLDVLLGSESfsdfldrlSALSKIADADADLLeelkADKAELEAKKAELEA 154
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1907152066  308 INTKLPQTQQELKEKEKELQKLTQEEINLKSLVHDLFQKVEEAKSSLA 355
Cdd:COG3883    155 KLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELE 202
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
602-735 7.88e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 40.28  E-value: 7.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  602 QIKELEANVLTTAPDRKQQKLLEENVSVFKKEYDAVAE------KAGKVEAEIKRLHNTIidinnRKLKAQQNKLDTINK 675
Cdd:COG4913    618 ELAELEEELAEAEERLEALEAELDALQERREALQRLAEyswdeiDVASAEREIAELEAEL-----ERLDASSDDLAALEE 692
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  676 QLDEcasaitkAQVAIKTADRNLKKAQDSVCRTEKEIKDTEKEINDLKTELKNIEDKAEE 735
Cdd:COG4913    693 QLEE-------LEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARL 745
COG5022 COG5022
Myosin heavy chain [General function prediction only];
8-376 7.98e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 40.45  E-value: 7.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066    8 TEHDEGMLEYLEDIIGCGRLNEPIKVLCRRVEILNEHRGEKL-NRVKMVEKEKDALEGEKNIAIEFLTLENEMFKKKnhi 86
Cdd:COG5022    806 LGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLiQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAER--- 882
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   87 cqyYIYDLQNRIAEITTQKEKIHEDTKEITEKSNVLSNEMkaknsaVKDVEKKLNKVTKfIEQNKEKftqLDLEDVQVRE 166
Cdd:COG5022    883 ---QLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDL------IENLEFKTELIAR-LKKLLNN---IDLEEGPSIE 949
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  167 klKHATSKAKKLEKQLQKDKEKVEELKSVPAKSKTVINETTTRNNSlekerekeekklkevMDSLKQETQGLQKEKEIQE 246
Cdd:COG5022    950 --YVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSE---------------LKNFKKELAELSKQYGALQ 1012
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  247 KELmgfnKSVNEARSKMEVAQSELDIYLSRHnTAVSQLSKAKEALITASETLKERKAAIKDINTKLPQTQQELKEKEK-E 325
Cdd:COG5022   1013 EST----KQLKELPVEVAELQSASKIISSES-TELSILKPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQlE 1087
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907152066  326 LQKLTQEEINLKSLvhdLFQKVEEAKSS--LAMNRSRGKVLDAIIQEKKSGRI 376
Cdd:COG5022   1088 STENLLKTINVKDL---EVTNRNLVKPAnvLQFIVAQMIKLNLLQEISKFLSQ 1137
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
26-827 8.07e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 40.42  E-value: 8.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066   26 RLNEPIKVLCRRVEILNEHRGEKLNRVKMVEKEKDALEgEKNIAIEFLTLENEMFKKKNHICQYYIYDLQNRIAeITTQK 105
Cdd:TIGR00606  323 DCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQ-EHIRARDSLIQSLATRLELDGFERGPFSERQIKNF-HTLVI 400
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  106 EKIHEDTKEITEKSNVLSNEMKAKNSAVKDVEKKLNKVTKFIEQNKEKFTQLDLEDVQVREKLKHATSKAKKLekqLQKD 185
Cdd:TIGR00606  401 ERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRI---LELD 477
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  186 KEKVEELKSVPAKSKTVINETTTRNNSLEKEREKEEKKLKEVMDslkQETQGLQKEKEIQEKELMgfnksvnEARSKMEV 265
Cdd:TIGR00606  478 QELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLD---QEMEQLNHHTTTRTQMEM-------LTKDKMDK 547
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  266 AQSELDIYLSRHNTAVSQL------SKAKEALITASETLKERKAAIKDINTKLPQTQQELKEKEKELQKLTQEEINLKSL 339
Cdd:TIGR00606  548 DEQIRKIKSRHSDELTSLLgyfpnkKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDK 627
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  340 VHD-------------LFQKVEEAKSSLAMNRSRGKVLDAIIQE---KKSGRIPgiygrlgdLGAIDEKYDIAISSCCHA 403
Cdd:TIGR00606  628 LFDvcgsqdeesdlerLKEEIEKSSKQRAMLAGATAVYSQFITQltdENQSCCP--------VCQRVFQTEAELQEFISD 699
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  404 LDYIVVDSIDTAQECVNFLKKHNIGIATFIGLDKMTVWAKKMSKIQTPENTPRL----FDLVKVKNEEIRQAFYFALRDT 479
Cdd:TIGR00606  700 LQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLqkvnRDIQRLKNDIEEQETLLGTIMP 779
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  480 LVANNLDQATRVAYQRDRRWRVVTLQGQIIEQSGTMSGGGSKVMRGRMGSSVIDEisVEEVNKMESQLERHSKqAMQIQE 559
Cdd:TIGR00606  780 EEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEK--QHELDTVVSKIELNRK-LIQDQQ 856
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  560 QKVQHEEAVVKLRHSERdmrNTLEKFAASIQGLSEQEEYLCVQIKEL---------EANVLTTAPDRKQQKLLE------ 624
Cdd:TIGR00606  857 EQIQHLKSKTNELKSEK---LQIGTNLQRRQQFEEQLVELSTEVQSLireikdakeQDSPLETFLEKDQQEKEElisske 933
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  625 ENVSVFKKEYDAVAEKAGKVEAEIKRLHNTIIDINNRKLKAQQNKLDTINKQLDECASAITKAQVAIKTADRNL--KKAQ 702
Cdd:TIGR00606  934 TSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIdtQKIQ 1013
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  703 DSVCRTEKEIKDTEKEINDLKTELKNiedkaeeviNNTKTAETSLPEIQKEHRNLLQELKVIQENEhalqkdalsiklkl 782
Cdd:TIGR00606 1014 ERWLQDNLTLRKRENELKEVEEELKQ---------HLKEMGQMQVLQMKQEHQKLEENIDLIKRNH-------------- 1070
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....*
gi 1907152066  783 EQIDGHISEHNSKIKYWQKEISKIKLHPVEDNPVETVAVLSQEEL 827
Cdd:TIGR00606 1071 VLALGRQKGYEKEIKHFKKELREPQFRDAEEKYREMMIVMRTTEL 1115
PRK01156 PRK01156
chromosome segregation protein; Provisional
583-1037 8.40e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 40.27  E-value: 8.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  583 EKFAASIQGLSEQEEYLCVQIKELEANVLTTAPDRKQQKLLEENVSVFK-KEYDAVAEKAGKVEAEIKRLHNTIIDINNR 661
Cdd:PRK01156   465 EKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEiNKSINEYNKIESARADLEDIKIKINELKDK 544
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  662 KLKAQQnkldtinkqLDECASAITKAQVAIKTADRNLKKAQdsvcRTEKEIKDTEKEINDLKTELKNIEDKAEEVINNTK 741
Cdd:PRK01156   545 HDKYEE---------IKNRYKSLKLEDLDSKRTSWLNALAV----ISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFP 611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  742 TAET----SLPEIQKEHRNLLQELKVIQENehalqkdalsiKLKLEQIDGhisehnsKIKYWQKEISKIKlhPVEDNPVE 817
Cdd:PRK01156   612 DDKSyidkSIREIENEANNLNNKYNEIQEN-----------KILIEKLRG-------KIDNYKKQIAEID--SIIPDLKE 671
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  818 TVAVLSQEELEAIKNPESITNEIALLEAQCREMKPNLGAIAEYKkkedlylQRVAELDKITSERDNFRQAYEDLRKQRLN 897
Cdd:PRK01156   672 ITSRINDIEDNLKKSRKALDDAKANRARLESTIEILRTRINELS-------DRINDINETLESMKKIKKAIGDLKRLREA 744
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  898 EFMAGFYVITNKLKENYqMLTLGGDAELELVDSLDPFSEGIMFSVRPPKKSW-KKIFNLSGGEKTLSSLALVFALHHY-- 974
Cdd:PRK01156   745 FDKSGVPAMIRKSASQA-MTSLTRKYLFEFNLDFDDIDVDQDFNITVSRGGMvEGIDSLSGGEKTAVAFALRVAVAQFln 823
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907152066  975 KPTPLYFMDEIDAALDFKNVS----IVAFYIYEQTKNAQFIIISLRNNMFEISDRLIGIYKTYNSTK 1037
Cdd:PRK01156   824 NDKSLLIMDEPTAFLDEDRRTnlkdIIEYSLKDSSDIPQVIMISHHRELLSVADVAYEVKKSSGSSK 890
CCDC22 pfam05667
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ...
545-888 9.47e-03

Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.


Pssm-ID: 461708 [Multi-domain]  Cd Length: 600  Bit Score: 40.01  E-value: 9.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  545 SQLERHSKQAMQIQEQKVQHEEAVVKLRHSERD---------MRNTLEKFAASIQGLSEQEEYLCVQIKELeANVLTTAP 615
Cdd:pfam05667  207 SLLERNAAELAAAQEWEEEWNSQGLASRLTPEEyrkrkrtklLKRIAEQLRSAALAGTEATSGASRSAQDL-AELLSSFS 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  616 DRKQQKLLEENVSVFKKEYDAVAEKAGKVEAEIkrLHNTIIDINNRKLKaQQNKLDTINKQLDECASAITKAqvaiktad 695
Cdd:pfam05667  286 GSSTTDTGLTKGSRFTHTEKLQFTNEAPAATSS--PPTKVETEEELQQQ-REEELEELQEQLEDLESSIQEL-------- 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  696 rnlkkaqdsvcrtekeikdtEKEINDLKTELKNIEDKAEEvinnTKTAETSLPEIQKEHRNLLQELKVIQENEHALQKDA 775
Cdd:pfam05667  355 --------------------EKEIKKLESSIKQVEEELEE----LKEQNEELEKQYKVKKKTLDLLPDAEENIAKLQALV 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  776 LSIKLKLEQIDGHISEHNSKIkywqkeISKIKLHPVEdnpvetvavLSQEELEAiknpESITNEIALLEAQCREMkpnlg 855
Cdd:pfam05667  411 DASAQRLVELAGQWEKHRVPL------IEEYRALKEA---------KSNKEDES----QRKLEEIKELREKIKEV----- 466
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1907152066  856 aIAEYKKKEDLYLQRVAELDKITseRDNFRQAY 888
Cdd:pfam05667  467 -AEEAKQKEELYKQLVAEYERLP--KDVSRSAY 496
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
669-903 9.91e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 39.90  E-value: 9.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  669 KLDTINKQLDECASAITKAQVAIktadrnlkkaqdsvcrteKEIKDTEKEINDLKTELKNIEDKAEEVInNTKTAETSLP 748
Cdd:COG4913    611 KLAALEAELAELEEELAEAEERL------------------EALEAELDALQERREALQRLAEYSWDEI-DVASAEREIA 671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  749 EIQKEHRNLLQ---ELKVIQENEHALQKDALSIKLKLEQIDGHISEHNSKIKYWQKEI--SKIKLHPVEDNPVETVAVLS 823
Cdd:COG4913    672 ELEAELERLDAssdDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELdeLQDRLEAAEDLARLELRALL 751
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907152066  824 QEELEAIKNP-------ESITNEIALLEAQCREMKPNL-GAIAEYKKKEDLYLQRV-----------AELDKITSER-DN 883
Cdd:COG4913    752 EERFAAALGDaverelrENLEERIDALRARLNRAEEELeRAMRAFNREWPAETADLdadleslpeylALLDRLEEDGlPE 831
                          250       260
                   ....*....|....*....|
gi 1907152066  884 FRQAYEDLRKQRLNEFMAGF 903
Cdd:COG4913    832 YEERFKELLNENSIEFVADL 851
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH