NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1040670051|ref|XP_017207676|]
View 

CCR4-NOT transcription complex subunit 3b isoform X1 [Danio rerio]

Protein Classification

CNOT2/3/5 family protein( domain architecture ID 13694805)

CNOT2/3/5 family protein is a component of the CCR4-NOT complex which is one of the major cellular mRNA deadenylases and is linked to various cellular processes including bulk mRNA degradation, miRNA-mediated repression, translational repression during translational initiation and general transcription regulation

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Not3 pfam04065
Not1 N-terminal domain, CCR4-Not complex component;
3-229 3.52e-141

Not1 N-terminal domain, CCR4-Not complex component;


:

Pssm-ID: 461155 [Multi-domain]  Cd Length: 228  Bit Score: 417.72  E-value: 3.52e-141
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   3 DKRKLQGEIDRCLKKVTEGVEQFEDIWKKLHNAANANQKEKYESDLKKEIKKLQRLRDQIKTWVASNEIKDKRQLVENRK 82
Cdd:pfam04065   1 AARKLQQEIDRTLKKVAEGLEEFDDIYEKLESATNSSQKEKLEADLKKEIKKLQRLRDQIKTWLSSNDIKDKKKLLENRK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  83 LIETQMERFKIVERETKTKAYSKEGLGLAQ--KVDPAQKEKEEVGQWLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDK 160
Cdd:pfam04065  81 LIEEAMERFKAVEKESKTKAFSKEGLSLAAasKLDPKEKEKAEARDWLSDSIDELNRQIEALEAEIESLQAQKKKKKKDS 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1040670051 161 EKQdRIEELKRLIERHRFHIRMLETILRMLDNDSIQVDSIRKIKDDVEYYLDSSQDPDFEENEFLYDDL 229
Cdd:pfam04065 161 EKA-RLEELEKLIERHKFHIEKLELILRLLENGELDPEQVDDIKEDIEYYVESNQDPDFVEDEDIYDDL 228
NOT2_3_5 pfam04153
NOT2 / NOT3 / NOT5 family; NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that ...
752-877 1.01e-58

NOT2 / NOT3 / NOT5 family; NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5.


:

Pssm-ID: 461199  Cd Length: 121  Bit Score: 195.89  E-value: 1.01e-58
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 752 SAWAHMPHPSDSERIRQYLMRNPCPtlpfHHQMPPPHSDSVEFYQRLSTETLFFIFYYLEGTKAQYLAAKALKKQSWRFH 831
Cdd:pfam04153   1 YSTFNSPWPLDAERPYEPDFRNPAC----YPQEPLPRLESPEKFSKFDDETLFYIFYSMPGDYAQYLAAKELKKRSWRYH 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1040670051 832 TKYMMWFQRHEEPKTITDEFEQGTYIYFDYeKWGQRKKEGFTFEYR 877
Cdd:pfam04153  77 KKLRTWFTRDEEPKVITNEYERGTYIYFDY-TWEQRRKKDFTLDYE 121
PPE super family cl35037
PPE-repeat protein [Function unknown];
406-599 3.67e-05

PPE-repeat protein [Function unknown];


The actual alignment was detected with superfamily member COG5651:

Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 47.20  E-value: 3.67e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 406 STLGPLTPSSQSQTNLGMSLSS--------SLGLGKGTSALTGTNQM--PSLVLSGMPASL---SGFLSGSTPAPYAQAA 472
Cdd:COG5651   162 VALTPFTQPPPTITNPGGLLGAqnagsgntSSNPGFANLGLTGLNQVgiGGLNSGSGPIGLnsgPGNTGFAGTGAAAGAA 241
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 473 AAGCVGSGLSSAIGTNISSTNSSGVVTSLGTNGSLGSTSLLGSGSGILGLGSTFAAVE-------PLPLLAPSPVGSTLT 545
Cdd:COG5651   242 AAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGagaaaatGLGLGAGGAAGAAGA 321
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1040670051 546 SGSNVGVIGSSGGTSGTANATGSGTQPRPPSGPKQNGGSSYSAVVGDSTSDSAL 599
Cdd:COG5651   322 TGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAA 375
 
Name Accession Description Interval E-value
Not3 pfam04065
Not1 N-terminal domain, CCR4-Not complex component;
3-229 3.52e-141

Not1 N-terminal domain, CCR4-Not complex component;


Pssm-ID: 461155 [Multi-domain]  Cd Length: 228  Bit Score: 417.72  E-value: 3.52e-141
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   3 DKRKLQGEIDRCLKKVTEGVEQFEDIWKKLHNAANANQKEKYESDLKKEIKKLQRLRDQIKTWVASNEIKDKRQLVENRK 82
Cdd:pfam04065   1 AARKLQQEIDRTLKKVAEGLEEFDDIYEKLESATNSSQKEKLEADLKKEIKKLQRLRDQIKTWLSSNDIKDKKKLLENRK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  83 LIETQMERFKIVERETKTKAYSKEGLGLAQ--KVDPAQKEKEEVGQWLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDK 160
Cdd:pfam04065  81 LIEEAMERFKAVEKESKTKAFSKEGLSLAAasKLDPKEKEKAEARDWLSDSIDELNRQIEALEAEIESLQAQKKKKKKDS 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1040670051 161 EKQdRIEELKRLIERHRFHIRMLETILRMLDNDSIQVDSIRKIKDDVEYYLDSSQDPDFEENEFLYDDL 229
Cdd:pfam04065 161 EKA-RLEELEKLIERHKFHIEKLELILRLLENGELDPEQVDDIKEDIEYYVESNQDPDFVEDEDIYDDL 228
NOT2_3_5 pfam04153
NOT2 / NOT3 / NOT5 family; NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that ...
752-877 1.01e-58

NOT2 / NOT3 / NOT5 family; NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5.


Pssm-ID: 461199  Cd Length: 121  Bit Score: 195.89  E-value: 1.01e-58
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 752 SAWAHMPHPSDSERIRQYLMRNPCPtlpfHHQMPPPHSDSVEFYQRLSTETLFFIFYYLEGTKAQYLAAKALKKQSWRFH 831
Cdd:pfam04153   1 YSTFNSPWPLDAERPYEPDFRNPAC----YPQEPLPRLESPEKFSKFDDETLFYIFYSMPGDYAQYLAAKELKKRSWRYH 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1040670051 832 TKYMMWFQRHEEPKTITDEFEQGTYIYFDYeKWGQRKKEGFTFEYR 877
Cdd:pfam04153  77 KKLRTWFTRDEEPKVITNEYERGTYIYFDY-TWEQRRKKDFTLDYE 121
CDC36 COG5601
General negative regulator of transcription subunit [Transcription];
785-880 6.70e-15

General negative regulator of transcription subunit [Transcription];


Pssm-ID: 227888  Cd Length: 172  Bit Score: 73.47  E-value: 6.70e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 785 PPPHSDSVEFYQR---LSTETLFFIFYYLEGTKAQYLAAKALKKQSWRFHTKYMMW--FQRHEEPKTITDEFEQGTYIYF 859
Cdd:COG5601    73 NPPIFKVNIEDMKmdnFHDETLFYIFYSFPNDVLQEKAYDELLKRNWRFHKMLKCWltFNPGMSPATADHVKERGSYVFF 152
                          90       100
                  ....*....|....*....|.
gi 1040670051 860 DYEKWGQRKKEgFTFEYRYLE 880
Cdd:COG5601   153 DPFSWSKVSLD-FLLDYKAVR 172
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
5-190 1.24e-06

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 51.06  E-value: 1.24e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   5 RKLQGEIDRCLKKVTEGVEQFEDIWKKLhnaANANQKEKYESDLKKEIkklQRLRDQIKTWVAS--------NEIKDKRQ 76
Cdd:COG1340    74 KELKEERDELNEKLNELREELDELRKEL---AELNKAGGSIDKLRKEI---ERLEWRQQTEVLSpeeekelvEKIKELEK 147
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  77 LVENRKLIETQMErfKIVERETKTKAYSKEGLGLAQKV----DPAQKEKEEVGQwLTNTIDTLNMQVDQFESEVEslsvq 152
Cdd:COG1340   148 ELEKAKKALEKNE--KLKELRAELKELRKEAEEIHKKIkelaEEAQELHEEMIE-LYKEADELRKEADELHKEIV----- 219
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 1040670051 153 trkkkgdkEKQDRIEELKRLIERHRFHIRMLETILRML 190
Cdd:COG1340   220 --------EAQEKADELHEEIIELQKELRELRKELKKL 249
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
3-170 7.40e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 49.63  E-value: 7.40e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   3 DKRKLQGEIDRCLKKVTEGVEQFEDIwkklhnaananqkEKYESDLKKEIKKLQRLRDQIKTWVA--SNEIKDkrqlvEN 80
Cdd:TIGR04523 420 EKELLEKEIERLKETIIKNNSEIKDL-------------TNQDSVKELIIKNLDNTRESLETQLKvlSRSINK-----IK 481
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  81 RKLIETQMErfkIVERETKTKAYSKEGLGLAQKVDPAQKEKEEvgqwLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDK 160
Cdd:TIGR04523 482 QNLEQKQKE---LKSKEKELKKLNEEKKELEEKVKDLTKKISS----LKEKIEKLESEKKEKESKISDLEDELNKDDFEL 554
                         170       180
                  ....*....|....*....|
gi 1040670051 161 ----------EKQDRIEELK 170
Cdd:TIGR04523 555 kkenlekeidEKNKEIEELK 574
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
15-209 1.40e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.91  E-value: 1.40e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  15 LKKVTEGVEQFEDIWKKLHNaananQKEKYESDLKKE--IKKLQRLRDQIKTWVASNEIKDKRQLVENRKLIETQMERF- 91
Cdd:PRK03918  461 LKRIEKELKEIEEKERKLRK-----ELRELEKVLKKEseLIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLi 535
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  92 ------KIVERE-TKTKAYSKEGLGLAQKVDPAQKEKEEvgqwLTNTIDTLNMQ-VDQFESEVESLSVQTRK----KKGD 159
Cdd:PRK03918  536 klkgeiKSLKKElEKLEELKKKLAELEKKLDELEEELAE----LLKELEELGFEsVEELEERLKELEPFYNEylelKDAE 611
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1040670051 160 KEKQDRIEELKRL---IERHRFHIRMLETILRMLDNdsiQVDSIRKIKDDVEY 209
Cdd:PRK03918  612 KELEREEKELKKLeeeLDKAFEELAETEKRLEELRK---ELEELEKKYSEEEY 661
PPE COG5651
PPE-repeat protein [Function unknown];
406-599 3.67e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 47.20  E-value: 3.67e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 406 STLGPLTPSSQSQTNLGMSLSS--------SLGLGKGTSALTGTNQM--PSLVLSGMPASL---SGFLSGSTPAPYAQAA 472
Cdd:COG5651   162 VALTPFTQPPPTITNPGGLLGAqnagsgntSSNPGFANLGLTGLNQVgiGGLNSGSGPIGLnsgPGNTGFAGTGAAAGAA 241
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 473 AAGCVGSGLSSAIGTNISSTNSSGVVTSLGTNGSLGSTSLLGSGSGILGLGSTFAAVE-------PLPLLAPSPVGSTLT 545
Cdd:COG5651   242 AAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGagaaaatGLGLGAGGAAGAAGA 321
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1040670051 546 SGSNVGVIGSSGGTSGTANATGSGTQPRPPSGPKQNGGSSYSAVVGDSTSDSAL 599
Cdd:COG5651   322 TGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAA 375
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
395-729 1.23e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 1.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 395 PTSTSTSPpNTSTLGPLTPSsqsqTNLGMSLSSSLGLGKGTSALTGTNQMPSLVLSGMPASLSGFLSGSTPAPYAQAAAA 474
Cdd:pfam05109 425 PESTTTSP-TLNTTGFAAPN----TTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGT 499
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 475 GCVGSGLSSAIGTNISSTNSSGVVTSLGTNGSLGSTsllgsGSGILGLGSTFAAVEPLP-LLAPSPVGSTLTSGSNVGVI 553
Cdd:pfam05109 500 ESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNAT-----SPTLGKTSPTSAVTTPTPnATSPTPAVTTPTPNATIPTL 574
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 554 GSSGGTSGTANATGSGTQPR-PPSGPKQN------GGSSYSAVVGDSTSDSAlnsasqsqnsqssslnLAVTNQLMDSGS 626
Cdd:pfam05109 575 GKTSPTSAVTTPTPNATSPTvGETSPQANttnhtlGGTSSTPVVTSPPKNAT----------------SAVTTGQHNITS 638
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 627 SLLGSMTLAPSSPSPVYSESKAGGGS----LLNGPH-------SYTQPADTIKPQEPPSSLKSMAERVAQGSGlDGELSA 695
Cdd:pfam05109 639 SSTSSMSLRPSSISETLSPSTSDNSTshmpLLTSAHptggeniTQVTPASTSTHHVSTSSPAPRPGTTSQASG-PGNSST 717
                         330       340       350
                  ....*....|....*....|....*....|....
gi 1040670051 696 LHIPSDIFPSSTAPSGTPAAPQLSLSEVSIPPSL 729
Cdd:pfam05109 718 STKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTV 751
 
Name Accession Description Interval E-value
Not3 pfam04065
Not1 N-terminal domain, CCR4-Not complex component;
3-229 3.52e-141

Not1 N-terminal domain, CCR4-Not complex component;


Pssm-ID: 461155 [Multi-domain]  Cd Length: 228  Bit Score: 417.72  E-value: 3.52e-141
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   3 DKRKLQGEIDRCLKKVTEGVEQFEDIWKKLHNAANANQKEKYESDLKKEIKKLQRLRDQIKTWVASNEIKDKRQLVENRK 82
Cdd:pfam04065   1 AARKLQQEIDRTLKKVAEGLEEFDDIYEKLESATNSSQKEKLEADLKKEIKKLQRLRDQIKTWLSSNDIKDKKKLLENRK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  83 LIETQMERFKIVERETKTKAYSKEGLGLAQ--KVDPAQKEKEEVGQWLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDK 160
Cdd:pfam04065  81 LIEEAMERFKAVEKESKTKAFSKEGLSLAAasKLDPKEKEKAEARDWLSDSIDELNRQIEALEAEIESLQAQKKKKKKDS 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1040670051 161 EKQdRIEELKRLIERHRFHIRMLETILRMLDNDSIQVDSIRKIKDDVEYYLDSSQDPDFEENEFLYDDL 229
Cdd:pfam04065 161 EKA-RLEELEKLIERHKFHIEKLELILRLLENGELDPEQVDDIKEDIEYYVESNQDPDFVEDEDIYDDL 228
NOT2_3_5 pfam04153
NOT2 / NOT3 / NOT5 family; NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that ...
752-877 1.01e-58

NOT2 / NOT3 / NOT5 family; NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5.


Pssm-ID: 461199  Cd Length: 121  Bit Score: 195.89  E-value: 1.01e-58
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 752 SAWAHMPHPSDSERIRQYLMRNPCPtlpfHHQMPPPHSDSVEFYQRLSTETLFFIFYYLEGTKAQYLAAKALKKQSWRFH 831
Cdd:pfam04153   1 YSTFNSPWPLDAERPYEPDFRNPAC----YPQEPLPRLESPEKFSKFDDETLFYIFYSMPGDYAQYLAAKELKKRSWRYH 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1040670051 832 TKYMMWFQRHEEPKTITDEFEQGTYIYFDYeKWGQRKKEGFTFEYR 877
Cdd:pfam04153  77 KKLRTWFTRDEEPKVITNEYERGTYIYFDY-TWEQRRKKDFTLDYE 121
CDC36 COG5601
General negative regulator of transcription subunit [Transcription];
785-880 6.70e-15

General negative regulator of transcription subunit [Transcription];


Pssm-ID: 227888  Cd Length: 172  Bit Score: 73.47  E-value: 6.70e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 785 PPPHSDSVEFYQR---LSTETLFFIFYYLEGTKAQYLAAKALKKQSWRFHTKYMMW--FQRHEEPKTITDEFEQGTYIYF 859
Cdd:COG5601    73 NPPIFKVNIEDMKmdnFHDETLFYIFYSFPNDVLQEKAYDELLKRNWRFHKMLKCWltFNPGMSPATADHVKERGSYVFF 152
                          90       100
                  ....*....|....*....|.
gi 1040670051 860 DYEKWGQRKKEgFTFEYRYLE 880
Cdd:COG5601   153 DPFSWSKVSLD-FLLDYKAVR 172
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
5-190 1.24e-06

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 51.06  E-value: 1.24e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   5 RKLQGEIDRCLKKVTEGVEQFEDIWKKLhnaANANQKEKYESDLKKEIkklQRLRDQIKTWVAS--------NEIKDKRQ 76
Cdd:COG1340    74 KELKEERDELNEKLNELREELDELRKEL---AELNKAGGSIDKLRKEI---ERLEWRQQTEVLSpeeekelvEKIKELEK 147
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  77 LVENRKLIETQMErfKIVERETKTKAYSKEGLGLAQKV----DPAQKEKEEVGQwLTNTIDTLNMQVDQFESEVEslsvq 152
Cdd:COG1340   148 ELEKAKKALEKNE--KLKELRAELKELRKEAEEIHKKIkelaEEAQELHEEMIE-LYKEADELRKEADELHKEIV----- 219
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 1040670051 153 trkkkgdkEKQDRIEELKRLIERHRFHIRMLETILRML 190
Cdd:COG1340   220 --------EAQEKADELHEEIIELQKELRELRKELKKL 249
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
3-170 7.40e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 49.63  E-value: 7.40e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   3 DKRKLQGEIDRCLKKVTEGVEQFEDIwkklhnaananqkEKYESDLKKEIKKLQRLRDQIKTWVA--SNEIKDkrqlvEN 80
Cdd:TIGR04523 420 EKELLEKEIERLKETIIKNNSEIKDL-------------TNQDSVKELIIKNLDNTRESLETQLKvlSRSINK-----IK 481
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  81 RKLIETQMErfkIVERETKTKAYSKEGLGLAQKVDPAQKEKEEvgqwLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDK 160
Cdd:TIGR04523 482 QNLEQKQKE---LKSKEKELKKLNEEKKELEEKVKDLTKKISS----LKEKIEKLESEKKEKESKISDLEDELNKDDFEL 554
                         170       180
                  ....*....|....*....|
gi 1040670051 161 ----------EKQDRIEELK 170
Cdd:TIGR04523 555 kkenlekeidEKNKEIEELK 574
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
15-209 1.40e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.91  E-value: 1.40e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  15 LKKVTEGVEQFEDIWKKLHNaananQKEKYESDLKKE--IKKLQRLRDQIKTWVASNEIKDKRQLVENRKLIETQMERF- 91
Cdd:PRK03918  461 LKRIEKELKEIEEKERKLRK-----ELRELEKVLKKEseLIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLi 535
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  92 ------KIVERE-TKTKAYSKEGLGLAQKVDPAQKEKEEvgqwLTNTIDTLNMQ-VDQFESEVESLSVQTRK----KKGD 159
Cdd:PRK03918  536 klkgeiKSLKKElEKLEELKKKLAELEKKLDELEEELAE----LLKELEELGFEsVEELEERLKELEPFYNEylelKDAE 611
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1040670051 160 KEKQDRIEELKRL---IERHRFHIRMLETILRMLDNdsiQVDSIRKIKDDVEY 209
Cdd:PRK03918  612 KELEREEKELKKLeeeLDKAFEELAETEKRLEELRK---ELEELEKKYSEEEY 661
PPE COG5651
PPE-repeat protein [Function unknown];
406-599 3.67e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 47.20  E-value: 3.67e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 406 STLGPLTPSSQSQTNLGMSLSS--------SLGLGKGTSALTGTNQM--PSLVLSGMPASL---SGFLSGSTPAPYAQAA 472
Cdd:COG5651   162 VALTPFTQPPPTITNPGGLLGAqnagsgntSSNPGFANLGLTGLNQVgiGGLNSGSGPIGLnsgPGNTGFAGTGAAAGAA 241
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 473 AAGCVGSGLSSAIGTNISSTNSSGVVTSLGTNGSLGSTSLLGSGSGILGLGSTFAAVE-------PLPLLAPSPVGSTLT 545
Cdd:COG5651   242 AAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGagaaaatGLGLGAGGAAGAAGA 321
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1040670051 546 SGSNVGVIGSSGGTSGTANATGSGTQPRPPSGPKQNGGSSYSAVVGDSTSDSAL 599
Cdd:COG5651   322 TGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAA 375
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
45-235 8.30e-05

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 46.05  E-value: 8.30e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  45 ESDLKKEIKKLQRLRDQIKTwvASNEIKDKRQLVEN--RKLIETQMERFKIVER----ETKTKAYSKEGLGLAQKVDPAQ 118
Cdd:COG4372    30 SEQLRKALFELDKLQEELEQ--LREELEQAREELEQleEELEQARSELEQLEEEleelNEQLQAAQAELAQAQEELESLQ 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 119 KEKEEvgqwLTNTIDTLNMQVDQFESEVESL-SVQTRKKKGDKEKQDRIEELKRLIERHRFHIRMLETILRMLDNDSI-- 195
Cdd:COG4372   108 EEAEE----LQEELEELQKERQDLEQQRKQLeAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAeq 183
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 1040670051 196 QVDSIRK----------IKDDVEYYLDSSQDPDFEENEFLYDDLDLEEIP 235
Cdd:COG4372   184 ALDELLKeanrnaekeeELAEAEKLIESLPRELAEELLEAKDSLEAKLGL 233
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
16-234 8.89e-05

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 45.66  E-value: 8.89e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  16 KKVTEGVEQFEDIWKKLHNAAN-----ANQKEKYESDLKKEIKKLQRLRDQIKtwvasneiKDKRQLVENRKLIETQMER 90
Cdd:COG4372    31 EQLRKALFELDKLQEELEQLREeleqaREELEQLEEELEQARSELEQLEEELE--------ELNEQLQAAQAELAQAQEE 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  91 FKIVERETKT-----KAYSKEGLGLAQKVDPAQKEKEEVGQWLTNT---IDTLNMQVDQFESEVESLSvQTRKKKGDKEK 162
Cdd:COG4372   103 LESLQEEAEElqeelEELQKERQDLEQQRKQLEAQIAELQSEIAEReeeLKELEEQLESLQEELAALE-QELQALSEAEA 181
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1040670051 163 QDRIEELKRLIERHRFHIRMLETILRMLDNDSI-QVDSIRKIKDDVEYYLDSSQDPDFEENEFLYDDLDLEEI 234
Cdd:COG4372   182 EQALDELLKEANRNAEKEEELAEAEKLIESLPReLAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEE 254
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
5-202 9.72e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 46.21  E-value: 9.72e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   5 RKLQGEIDRCLKKVTEgVEQFEDIWKKLHNA-----ANANQKEKYESDLKKEIKKLQRLRDQIKTWVasneikdkrQLVE 79
Cdd:PRK03918  296 IKLSEFYEEYLDELRE-IEKRLSRLEEEINGieeriKELEEKEERLEELKKKLKELEKRLEELEERH---------ELYE 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  80 NRKLIETQMERFKivereTKTKAYSKEglGLAQKVDPAQKEKEEVgqwlTNTIDTLNMQVDQFESEVESLS---VQTRKK 156
Cdd:PRK03918  366 EAKAKKEELERLK-----KRLTGLTPE--KLEKELEELEKAKEEI----EEEISKITARIGELKKEIKELKkaiEELKKA 434
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1040670051 157 KG----------DKEKQDRIEELKRLIERHRFHIRMLETILRMLDNDSIQVDSIRK 202
Cdd:PRK03918  435 KGkcpvcgreltEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLK 490
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
2-191 1.07e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 46.21  E-value: 1.07e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051    2 ADKRKLQGEIDRCLKKVTEGVEQFEDIWKKLHnaananQKEKYESDLKKEIKKLQRLRD-QIKTWVASNEIkDKRQLVEN 80
Cdd:TIGR02169  237 RQKEAIERQLASLEEELEKLTEEISELEKRLE------EIEQLLEELNKKIKDLGEEEQlRVKEKIGELEA-EIASLERS 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   81 RKLIETQMERF-----KIVERETKTKAyskEGLGLAQKVDPAQKEKEEVGQWLTNTIDTLNMQVDQFEsEVESLSVQTRK 155
Cdd:TIGR02169  310 IAEKERELEDAeerlaKLEAEIDKLLA---EIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELE-EVDKEFAETRD 385
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1040670051  156 KKgdKEKQDRIEELKRLIERHRFHIRMLETILRMLD 191
Cdd:TIGR02169  386 EL--KDYREKLEKLKREINELKRELDRLQEELQRLS 419
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
395-729 1.23e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 1.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 395 PTSTSTSPpNTSTLGPLTPSsqsqTNLGMSLSSSLGLGKGTSALTGTNQMPSLVLSGMPASLSGFLSGSTPAPYAQAAAA 474
Cdd:pfam05109 425 PESTTTSP-TLNTTGFAAPN----TTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGT 499
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 475 GCVGSGLSSAIGTNISSTNSSGVVTSLGTNGSLGSTsllgsGSGILGLGSTFAAVEPLP-LLAPSPVGSTLTSGSNVGVI 553
Cdd:pfam05109 500 ESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNAT-----SPTLGKTSPTSAVTTPTPnATSPTPAVTTPTPNATIPTL 574
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 554 GSSGGTSGTANATGSGTQPR-PPSGPKQN------GGSSYSAVVGDSTSDSAlnsasqsqnsqssslnLAVTNQLMDSGS 626
Cdd:pfam05109 575 GKTSPTSAVTTPTPNATSPTvGETSPQANttnhtlGGTSSTPVVTSPPKNAT----------------SAVTTGQHNITS 638
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 627 SLLGSMTLAPSSPSPVYSESKAGGGS----LLNGPH-------SYTQPADTIKPQEPPSSLKSMAERVAQGSGlDGELSA 695
Cdd:pfam05109 639 SSTSSMSLRPSSISETLSPSTSDNSTshmpLLTSAHptggeniTQVTPASTSTHHVSTSSPAPRPGTTSQASG-PGNSST 717
                         330       340       350
                  ....*....|....*....|....*....|....
gi 1040670051 696 LHIPSDIFPSSTAPSGTPAAPQLSLSEVSIPPSL 729
Cdd:pfam05109 718 STKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTV 751
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
3-208 1.58e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 45.40  E-value: 1.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   3 DKRKLQGEIDRCLKKVTEGVEQFEDIWKKLHNAANANQKekyesdLKKEIKKLQRLRDQIKtwvasNEIKDKRQLVENRK 82
Cdd:TIGR04523 322 KLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSE------KQRELEEKQNEIEKLK-----KENQSYKQEIKNLE 390
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  83 LietqmerfKIVERETKTKAYSKEGLGLAQKVDPAQKEKEEVGQWLTN----------TIDTLNMQVDQFESEVESLsvq 152
Cdd:TIGR04523 391 S--------QINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERlketiiknnsEIKDLTNQDSVKELIIKNL--- 459
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1040670051 153 trKKKGDKEKQdRIEELKRLIERHRfhiRMLETILRMLDNDSIQVDSIRKIKDDVE 208
Cdd:TIGR04523 460 --DNTRESLET-QLKVLSRSINKIK---QNLEQKQKELKSKEKELKKLNEEKKELE 509
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
26-208 2.42e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 45.01  E-value: 2.42e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  26 EDIWKKLHN--AANANQKEKYESDLKKEIKKLQRLRDQI------KTWVASNEIKDKRQLVENRKLIETqmerfkiVERE 97
Cdd:TIGR04523 306 QDWNKELKSelKNQEKKLEEIQNQISQNNKIISQLNEQIsqlkkeLTNSESENSEKQRELEEKQNEIEK-------LKKE 378
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  98 TKTKAYSKEGLGLaQKVDPAQK--EKEEVGQWLTNTIDTLNMQVDQFESEVESLSVQTRKKKGD-KEKQDRIEELKRLIE 174
Cdd:TIGR04523 379 NQSYKQEIKNLES-QINDLESKiqNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEiKDLTNQDSVKELIIK 457
                         170       180       190
                  ....*....|....*....|....*....|....
gi 1040670051 175 RHRFHIRMLETILRMLDNdsiqvdSIRKIKDDVE 208
Cdd:TIGR04523 458 NLDNTRESLETQLKVLSR------SINKIKQNLE 485
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
46-202 4.42e-04

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 43.36  E-value: 4.42e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  46 SDLKKEIKKLQRLRD----QIKTWVAS--NEIKDKRQLVE-NRKLIEtqmERFKIVERETKTKAYSKEglgLAQKVDPAQ 118
Cdd:COG1340    18 EELREEIEELKEKRDelneELKELAEKrdELNAQVKELREeAQELRE---KRDELNEKVKELKEERDE---LNEKLNELR 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 119 KEKEEVGQwLTNTIDTLNMQVDQFESEVESL--SVQTRKKKGDKEKQ--DRIEELKRLIERhrfhIRMLETILRMLDNDS 194
Cdd:COG1340    92 EELDELRK-ELAELNKAGGSIDKLRKEIERLewRQQTEVLSPEEEKElvEKIKELEKELEK----AKKALEKNEKLKELR 166

                  ....*...
gi 1040670051 195 IQVDSIRK 202
Cdd:COG1340   167 AELKELRK 174
COG4487 COG4487
Uncharacterized conserved protein, contains DUF2130 domain [Function unknown];
36-175 5.22e-04

Uncharacterized conserved protein, contains DUF2130 domain [Function unknown];


Pssm-ID: 443580 [Multi-domain]  Cd Length: 425  Bit Score: 43.40  E-value: 5.22e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  36 ANANQKEKYESDLKKEIKKLQRLRDQIKtwvasNEIKDKRQLVENRKLIETQMERFKIVERETKTKAYSKEGLGLAQKvd 115
Cdd:COG4487    52 AEAKAKAQLQEQVAEKDAEIAELRARLE-----AEERKKALAVAEEKEKELAALQEALAEKDAKLAELQAKELELLKK-- 124
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1040670051 116 paQKEKEEVGQWLTNTI--------DTLNMQVDQFESEVESLSVQTRKKKGDKE---KQDRIEELKRLIER 175
Cdd:COG4487   125 --ERELEDAKREAELTVekerdeelDELKEKLKKEEEEKQLAEKSLKVAEYEKQlkdMQEQIEELKRKKEQ 193
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
3-236 1.25e-03

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 42.65  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051    3 DKRKLQGEIDRCLKKVTEGVEQFEDIWKKLHNAANA-NQKEKYESDLKKEIKKLQRLRDQIKTWVASNEIKDKRQLVENR 81
Cdd:pfam02463  230 DYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEkLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   82 KLIEtqmerFKIVERETKTKAYSKEGLGLAQKVDPAQKEKEEVgqwltntidtlNMQVDQFESEVESLSVQTRKKKgdKE 161
Cdd:pfam02463  310 VDDE-----EKLKESEKEKKKAEKELKKEKEEIEELEKELKEL-----------EIKREAEEEEEEELEKLQEKLE--QL 371
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1040670051  162 KQDRIEELKRLIERHRFHIRMLETILRMLDNDSIQVDSIRKIKDDVEYYLDSSQDPDFEENEFLYDDLDLEEIPL 236
Cdd:pfam02463  372 EEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKL 446
PPE COG5651
PPE-repeat protein [Function unknown];
485-714 1.44e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 41.80  E-value: 1.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 485 IGTNISSTNSSGVVTSLGTNGSLGSTSLLGSGSGILGLGSTFAAVEPLPLLAPSPVGSTLTSGSNVGVIGSSGGTSGTAN 564
Cdd:COG5651   173 TITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 565 ATGSGTQPRPPSGPkqNGGSSYSAVVGDSTSDSALNSASQSQNSQSSSLNLAVTNQLMDSGSSLLGSMTLAPSSPSPVYS 644
Cdd:COG5651   253 AGASAALASLAATL--LNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGA 330
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 645 ESKAGGGSLLNGphsytqpadtikpqePPSSLKSMAERVAQGSGLDGELSALHIPSDIFPSSTAPSGTPA 714
Cdd:COG5651   331 GAAAAAAGAAAG---------------AGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
22-204 1.49e-03

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 42.27  E-value: 1.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   22 VEQFEDIWKKLHN-----AANANQKEKYESDLKKEIKKLQRLRDQIktWVASNEIKDKRQLVENRK-LIETQMERFKIVE 95
Cdd:pfam02463  144 IEIIAMMKPERRLeieeeAAGSRLKRKKKEALKKLIEETENLAELI--IDLEELKLQELKLKEQAKkALEYYQLKEKLEL 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   96 RETKTKAYSKEGLglaqkvdpaqkekeeVGQWLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDKEKQDRIEELKRLIER 175
Cdd:pfam02463  222 EEEYLLYLDYLKL---------------NEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEE 286
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1040670051  176 HRFHIRMLETILR--MLDNDSIQVDSIRKIK 204
Cdd:pfam02463  287 ELKLLAKEEEELKseLLKLERRKVDDEEKLK 317
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
39-186 2.36e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 41.59  E-value: 2.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  39 NQKEKYESDLKKEIKKLQRLRDQIKTWVASNEIKDKRQLVENR--KLIETQMERFKIVERETKTKAYSKEGLG-LAQKVD 115
Cdd:PRK03918  269 EELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREieKRLSRLEEEINGIEERIKELEEKEERLEeLKKKLK 348
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1040670051 116 PAQKEKEEVGQW--LTNTIDTLNMQVDQFESEVESLSVQTRKKKGD---KEKQDRIEELKRLIERhrfhIRMLETI 186
Cdd:PRK03918  349 ELEKRLEELEERheLYEEAKAKKEELERLKKRLTGLTPEKLEKELEeleKAKEEIEEEISKITAR----IGELKKE 420
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
396-661 2.53e-03

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 41.69  E-value: 2.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 396 TSTSTSPPNTSTLGPLTPSSQSQTNLGMSLSSSLGLGKGTSALTGTNQMPSLVLSGMPASLSGFLSGSTPAPYAQAAAAG 475
Cdd:COG4625   218 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGG 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 476 CVGSGLSSAIGTNISSTNSSGVVTSLGTNGSLGSTSLLGSGSGILGLGSTFAAVEPLPLLAPSPVGSTLTSGSNVGVIGS 555
Cdd:COG4625   298 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGS 377
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 556 SGGTSGTANATGSGTQPRPPSGPKQNGGSSYSAVVGDSTSDSALNSASQSQNSQSSSLNLAVTNQLMDSGSSL-LGSMTL 634
Cdd:COG4625   378 GGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGgGGGGAG 457
                         250       260
                  ....*....|....*....|....*...
gi 1040670051 635 APSSPSPVYSESKAGGGSL-LNGPHSYT 661
Cdd:COG4625   458 GSGGGAGAGGGSGSGAGTLtLTGNNTYT 485
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1-177 3.46e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 41.05  E-value: 3.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051    1 MADKRKLQGEIDRClkkvtegVEQFEDIwKKLHNAAnanqkekyeSDLKKEIKKLQRLRDQIKTWvasneikdkRQLVEN 80
Cdd:COG4913    217 MLEEPDTFEAADAL-------VEHFDDL-ERAHEAL---------EDAREQIELLEPIRELAERY---------AAARER 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   81 RKLIETQMERFKIVERETKTKAYSKEGLGLAQKVDPAQKEKEEVGQwltntidtlnmQVDQFESEVESLSVQTRKKKGDk 160
Cdd:COG4913    271 LAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELERLEA-----------RLDALREELDELEAQIRGNGGD- 338
                          170
                   ....*....|....*..
gi 1040670051  161 ekqdRIEELKRLIERHR 177
Cdd:COG4913    339 ----RLEQLEREIERLE 351
mukB PRK04863
chromosome partition protein MukB;
42-202 6.03e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 40.33  E-value: 6.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   42 EKYESdLKKEIKKLQRLRDQIKTWVASN-----------EIKDKRQlveNRKLIETQMERFKIVERETKTKA-YSKEGLG 109
Cdd:PRK04863   800 ERYAT-LSFDVQKLQRLHQAFSRFIGSHlavafeadpeaELRQLNR---RRVELERALADHESQEQQQRSQLeQAKEGLS 875
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  110 LAQKVDPaqkekeevgqwLTNTIDtlnmqVDQFESEVESLsvqtrkkkgdKEKQDRIEELKRLIERHRFHIRMLETILRM 189
Cdd:PRK04863   876 ALNRLLP-----------RLNLLA-----DETLADRVEEI----------REQLDEAEEAKRFVQQHGNALAQLEPIVSV 929
                          170
                   ....*....|...
gi 1040670051  190 LDNDSIQVDSIRK 202
Cdd:PRK04863   930 LQSDPEQFEQLKQ 942
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
3-170 6.21e-03

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 40.59  E-value: 6.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051    3 DKRKLQGEIDRCLKKVTEGVEQFEDIWKKLHNAANANQKEKYESDLKKEIKKLQRLRDQIKTWVA--------------- 67
Cdd:PTZ00440  1316 DAKKELEKTDNLIKQVEAKIEQAKEHKNKIYGSLEDKQIDDEIKKIEQIKEEISNKRKEINKYLSniksnkekcdlhvrn 1395
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   68 SNEIKDKRQLVENRKLIETQM----ERFKIVERETKTKAYSKEGLGLAQKVDpaqKEKEEVGQWLTNTIDTLNmqvdqfe 143
Cdd:PTZ00440  1396 ASRGKDKIDFLNKHEAIEPSNskevNIIKITDNINKCKQYSNEAMETENKAD---ENNDSIIKYEKEITNILN------- 1465
                          170       180
                   ....*....|....*....|....*..
gi 1040670051  144 sEVESLSVQTRKKKGDKEKQDRIEELK 170
Cdd:PTZ00440  1466 -NSSILGKKTKLEKKKKEATNIMDDIN 1491
PRK01156 PRK01156
chromosome segregation protein; Provisional
8-230 6.62e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 40.27  E-value: 6.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051   8 QGEIDRCLKKV-TEGVEQFEDIWKKLHNAANANQKEKYESDLKKEIKKLQRLRDQIKTWVasNEIKdkrqlvENRKLIET 86
Cdd:PRK01156  137 QGEMDSLISGDpAQRKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSN--LELE------NIKKQIAD 208
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  87 QMERFKIVERETKtkAYSKEGLGLAQKVDPAQKEKEEVGQwLTNTIDTLNMQVDQFESEVESLSVQTRKKKGDKEKQDRI 166
Cdd:PRK01156  209 DEKSHSITLKEIE--RLSIEYNNAMDDYNNLKSALNELSS-LEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKI 285
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1040670051 167 EELKRLIERHRF-----HIRMLETILRMLDNDSIQV----DSIRKIKDdveyyLDSSQDpDFEENEFLYDDLD 230
Cdd:PRK01156  286 INDPVYKNRNYIndyfkYKNDIENKKQILSNIDAEInkyhAIIKKLSV-----LQKDYN-DYIKKKSRYDDLN 352
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
10-216 6.73e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 40.00  E-value: 6.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  10 EIDRCLKKVTEGVEQFEDIWKKLHNAAN--ANQKEKYESDLKKEIK------------KLQRLRDQIKTWVASNEIKDKR 75
Cdd:TIGR04523 135 ENKKNIDKFLTEIKKKEKELEKLNNKYNdlKKQKEELENELNLLEKeklniqknidkiKNKLLKLELLLSNLKKKIQKNK 214
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051  76 QLV--------ENRKLIET-QMERFKIVERETKTKAYSKEGLGLAQKVDPAQKEKEEVGQWLTNT---IDTLNMQVDQFE 143
Cdd:TIGR04523 215 SLEsqiselkkQNNQLKDNiEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNnkkIKELEKQLNQLK 294
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1040670051 144 SEVESL----------SVQTRKKKGDKEKQD----------RIEELKRLIERHRFHIRMLETilrmlDNDSIQ------V 197
Cdd:TIGR04523 295 SEISDLnnqkeqdwnkELKSELKNQEKKLEEiqnqisqnnkIISQLNEQISQLKKELTNSES-----ENSEKQreleekQ 369
                         250
                  ....*....|....*....
gi 1040670051 198 DSIRKIKDDVEYYLDSSQD 216
Cdd:TIGR04523 370 NEIEKLKKENQSYKQEIKN 388
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH