NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|966959689|ref|XP_015001352|]
View 

angiopoietin-1 isoform X1 [Macaca mulatta]

Protein Classification

fibrinogen-related domain-containing protein( domain architecture ID 10053370)

fibrinogen-related domain-containing protein contains a C terminal globular domain similar to that of fibrinogen, and may be involved in one or more of a variety of binding interactions and functions including complement activation, signaling and regulation

PubMed:  1304888
SCOP:  4002544

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
FReD cd00087
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is ...
280-495 7.77e-127

Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.


:

Pssm-ID: 238040 [Multi-domain]  Cd Length: 215  Bit Score: 368.11  E-value: 7.77e-127
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 280 KPFRDCADVYQAGFNKSGIYTIYINNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNE 359
Cdd:cd00087    1 PLPRDCSEVLQRGGRTSGVYTIQPPGSNEPFQVYCDMDTDGGGWTVIQRRGDGSVDFYRSWKEYKDGFGNLDGEFWLGLE 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 360 FIFAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSlILHGADFSTKDADNDNCMCKCA 439
Cdd:cd00087   81 KIHLLTSQGPYELRIDLEDWEGNTAYAEYDSFKVGSESEGYRLTLGGYSGTAGDALS-YHNGMKFSTFDRDNDGASGNCA 159
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 966959689 440 LMLTGGWWFDACGPSNLNGMFYTAGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 495
Cdd:cd00087  160 ESYSGGWWYNSCHASNLNGRYYSGGHRNEYDNGINWATWKGSTYSLKFTEMKIRPK 215
Mplasa_alph_rch super family cl37461
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
105-258 5.76e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


The actual alignment was detected with superfamily member TIGR04523:

Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 45.78  E-value: 5.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  105 VENMKSEMAQIQQNAVQNhtatmleIGTSLLSQTAEQTRKLTDVETQVLNQTSRleIQLLENSLSTYKLEKQLLQQTNEI 184
Cdd:TIGR04523 290 LNQLKSEISDLNNQKEQD-------WNKELKSELKNQEKKLEEIQNQISQNNKI--ISQLNEQISQLKKELTNSESENSE 360
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 966959689  185 LK--IHEKNSLLEHKILEMEGKhKEELDTLKEEKENLQGLVTRQTYIIQELEKQLNRATTNNSVLQKQQLELMDTV 258
Cdd:TIGR04523 361 KQreLEEKQNEIEKLKKENQSY-KQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETI 435
 
Name Accession Description Interval E-value
FReD cd00087
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is ...
280-495 7.77e-127

Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.


Pssm-ID: 238040 [Multi-domain]  Cd Length: 215  Bit Score: 368.11  E-value: 7.77e-127
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 280 KPFRDCADVYQAGFNKSGIYTIYINNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNE 359
Cdd:cd00087    1 PLPRDCSEVLQRGGRTSGVYTIQPPGSNEPFQVYCDMDTDGGGWTVIQRRGDGSVDFYRSWKEYKDGFGNLDGEFWLGLE 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 360 FIFAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSlILHGADFSTKDADNDNCMCKCA 439
Cdd:cd00087   81 KIHLLTSQGPYELRIDLEDWEGNTAYAEYDSFKVGSESEGYRLTLGGYSGTAGDALS-YHNGMKFSTFDRDNDGASGNCA 159
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 966959689 440 LMLTGGWWFDACGPSNLNGMFYTAGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 495
Cdd:cd00087  160 ESYSGGWWYNSCHASNLNGRYYSGGHRNEYDNGINWATWKGSTYSLKFTEMKIRPK 215
FBG smart00186
Fibrinogen-related domains (FReDs); Domain present at the C-termini of fibrinogen beta and ...
282-495 2.02e-124

Fibrinogen-related domains (FReDs); Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.


Pssm-ID: 214548 [Multi-domain]  Cd Length: 212  Bit Score: 361.59  E-value: 2.02e-124
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   282 FRDCADVYQAGFNKSGIYTIYINNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNEFI 361
Cdd:smart00186   2 PRDCSDVLQNGGKTSGLYTIYPDGSSRPLKVYCDMETDGGGWTVIQRRMDGSVDFYRDWKDYKEGFGNLAGEFWLGNENI 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   362 FAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSLILHGADFSTKDADNDNCMCKCALM 441
Cdd:smart00186  82 HLLTSQGKYELRIDLEDWEGNTAYALYDSFKVADEADGYRLHIGGYSGTAGDASLTYHNGMQFSTYDRDNDKYSGNCAEE 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 966959689   442 LTGGWWFDACGPSNLNGMFYtagQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 495
Cdd:smart00186 162 YGGGWWYNNCHAANLNGRYY---PNNNYDNGINWATWKGSWYSLKFTEMKIRPL 212
Fibrinogen_C pfam00147
Fibrinogen beta and gamma chains, C-terminal globular domain;
283-495 1.81e-75

Fibrinogen beta and gamma chains, C-terminal globular domain;


Pssm-ID: 395095 [Multi-domain]  Cd Length: 221  Bit Score: 236.65  E-value: 1.81e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  283 RDCADVYQAGFNKSGIYTIYINNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPS-GEYWLGNEFI 361
Cdd:pfam00147   3 RDCSDVYNKGAKTSGLYTIRPDGATKPFEVYCDMETDGGGWTVFQRRLDGSTNFKRNWKDYKAGFGNLSpGEFWLGNDKI 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  362 FAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQ-----SSLILH-GADFSTKDADNDNCM 435
Cdd:pfam00147  83 HLLTKQGPYVLRIDLEDWNGETVFALYDSFKVTNENDKYRLHVENYIGDAGDAldtagRSMTYHnGMQFSTWDRDNDSPD 162
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  436 CKCALMLTGGWWFDACGPSNLNGMFYTaGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 495
Cdd:pfam00147 163 GNCALSYGGGWWYNNCHAANLNGVYYY-GGTYSKQNGIIWATWKGRWYSMKKAEMKIRPL 221
GGGWT_bact NF040941
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
285-326 1.04e-08

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, fincolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, found in the PF00147 homology region of those proteins but also as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 51.02  E-value: 1.04e-08
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*
gi 966959689 285 CADVYQAGFN-KSGIYTIYINNMP--EPKKVFCNMDVNGGGWTVI 326
Cdd:NF040941   2 CWEILQAGPSaPSGVYWIDPDGMGglAPFQVYCDMTTDGGGWTLV 46
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
105-258 5.76e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 45.78  E-value: 5.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  105 VENMKSEMAQIQQNAVQNhtatmleIGTSLLSQTAEQTRKLTDVETQVLNQTSRleIQLLENSLSTYKLEKQLLQQTNEI 184
Cdd:TIGR04523 290 LNQLKSEISDLNNQKEQD-------WNKELKSELKNQEKKLEEIQNQISQNNKI--ISQLNEQISQLKKELTNSESENSE 360
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 966959689  185 LK--IHEKNSLLEHKILEMEGKhKEELDTLKEEKENLQGLVTRQTYIIQELEKQLNRATTNNSVLQKQQLELMDTV 258
Cdd:TIGR04523 361 KQreLEEKQNEIEKLKKENQSY-KQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETI 435
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
136-254 7.18e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 42.23  E-value: 7.18e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 136 SQTAEQTRKLTDVETQVLNQTSRLEIQLLENSLStyKLEKQLLQQTNEILKIHEKNSLLEHKILEMEGKHKEELDTLKEE 215
Cdd:COG1196  209 AEKAERYRELKEELKELEAELLLLKLRELEAELE--ELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEA 286
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 966959689 216 KENLQGLVTRqtyiIQELEKQLNRATTNNSVLQKQQLEL 254
Cdd:COG1196  287 QAEEYELLAE----LARLEQDIARLEERRRELEERLEEL 321
SHE3 pfam17078
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an ...
176-250 1.72e-03

SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an RNA-binding protein that binds specific mRNAs, including the mRNA of Ash1, which is invalid in cell-fate determination. She3 acts as an adapter protein that docks the myosin motor Myo4p onto an Ash1-She2p ribonucleoprotein complex. She3 seems to bind to Myo4p and Shep2p via different domains.


Pssm-ID: 293683 [Multi-domain]  Cd Length: 228  Bit Score: 40.11  E-value: 1.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  176 QLLQQTNEILKIHEKNSllehkilEMEGKHKEELDTLKEEKENLQGLVTRQTYIIQELEKQLN-------RATTNNSVLQ 248
Cdd:pfam17078  21 QLTVQSQNLLSKLEIAQ-------QKESKFLENLASLKHENDNLSSMLNRKERRLKDLEDQLSelknsyeELTESNKQLK 93

                  ..
gi 966959689  249 KQ 250
Cdd:pfam17078  94 KR 95
PRK10935 PRK10935
nitrate/nitrite two-component system sensor histidine kinase NarQ;
122-255 7.08e-03

nitrate/nitrite two-component system sensor histidine kinase NarQ;


Pssm-ID: 236800 [Multi-domain]  Cd Length: 565  Bit Score: 39.07  E-value: 7.08e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 122 NHTATmlEIGT---SLLSQTAEQTRKLTDV--ETQVLNQTSRLeiqllensLSTYKLEKQLLQQTNEILKIHEKNSLLEH 196
Cdd:PRK10935 216 NQMSS--ELHKlyrSLEASVEEKTRKLTQAnrSLEVLYQCSQA--------LNASQIDVHCFRHILQIVRDHEGLDYLEL 285
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 966959689 197 KILE------MEGKHKEELD----TLKEEKENL------QGLVTRQTYIIQELEKQLNRATT-NNSVLQKQQLELM 255
Cdd:PRK10935 286 EVGEnehwriSEGQPNPELPwqilPLTMEDTVLgylhwqASLPCPDEPLMNNVAQMLGRGLYfNQAQKQQQQLLLM 361
 
Name Accession Description Interval E-value
FReD cd00087
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is ...
280-495 7.77e-127

Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.


Pssm-ID: 238040 [Multi-domain]  Cd Length: 215  Bit Score: 368.11  E-value: 7.77e-127
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 280 KPFRDCADVYQAGFNKSGIYTIYINNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNE 359
Cdd:cd00087    1 PLPRDCSEVLQRGGRTSGVYTIQPPGSNEPFQVYCDMDTDGGGWTVIQRRGDGSVDFYRSWKEYKDGFGNLDGEFWLGLE 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 360 FIFAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSlILHGADFSTKDADNDNCMCKCA 439
Cdd:cd00087   81 KIHLLTSQGPYELRIDLEDWEGNTAYAEYDSFKVGSESEGYRLTLGGYSGTAGDALS-YHNGMKFSTFDRDNDGASGNCA 159
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 966959689 440 LMLTGGWWFDACGPSNLNGMFYTAGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 495
Cdd:cd00087  160 ESYSGGWWYNSCHASNLNGRYYSGGHRNEYDNGINWATWKGSTYSLKFTEMKIRPK 215
FBG smart00186
Fibrinogen-related domains (FReDs); Domain present at the C-termini of fibrinogen beta and ...
282-495 2.02e-124

Fibrinogen-related domains (FReDs); Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.


Pssm-ID: 214548 [Multi-domain]  Cd Length: 212  Bit Score: 361.59  E-value: 2.02e-124
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   282 FRDCADVYQAGFNKSGIYTIYINNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNEFI 361
Cdd:smart00186   2 PRDCSDVLQNGGKTSGLYTIYPDGSSRPLKVYCDMETDGGGWTVIQRRMDGSVDFYRDWKDYKEGFGNLAGEFWLGNENI 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   362 FAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSLILHGADFSTKDADNDNCMCKCALM 441
Cdd:smart00186  82 HLLTSQGKYELRIDLEDWEGNTAYALYDSFKVADEADGYRLHIGGYSGTAGDASLTYHNGMQFSTYDRDNDKYSGNCAEE 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 966959689   442 LTGGWWFDACGPSNLNGMFYtagQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 495
Cdd:smart00186 162 YGGGWWYNNCHAANLNGRYY---PNNNYDNGINWATWKGSWYSLKFTEMKIRPL 212
Fibrinogen_C pfam00147
Fibrinogen beta and gamma chains, C-terminal globular domain;
283-495 1.81e-75

Fibrinogen beta and gamma chains, C-terminal globular domain;


Pssm-ID: 395095 [Multi-domain]  Cd Length: 221  Bit Score: 236.65  E-value: 1.81e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  283 RDCADVYQAGFNKSGIYTIYINNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPS-GEYWLGNEFI 361
Cdd:pfam00147   3 RDCSDVYNKGAKTSGLYTIRPDGATKPFEVYCDMETDGGGWTVFQRRLDGSTNFKRNWKDYKAGFGNLSpGEFWLGNDKI 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  362 FAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQ-----SSLILH-GADFSTKDADNDNCM 435
Cdd:pfam00147  83 HLLTKQGPYVLRIDLEDWNGETVFALYDSFKVTNENDKYRLHVENYIGDAGDAldtagRSMTYHnGMQFSTWDRDNDSPD 162
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  436 CKCALMLTGGWWFDACGPSNLNGMFYTaGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 495
Cdd:pfam00147 163 GNCALSYGGGWWYNNCHAANLNGVYYY-GGTYSKQNGIIWATWKGRWYSMKKAEMKIRPL 221
GGGWT_bact NF040941
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
285-326 1.04e-08

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, fincolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, found in the PF00147 homology region of those proteins but also as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 51.02  E-value: 1.04e-08
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*
gi 966959689 285 CADVYQAGFN-KSGIYTIYINNMP--EPKKVFCNMDVNGGGWTVI 326
Cdd:NF040941   2 CWEILQAGPSaPSGVYWIDPDGMGglAPFQVYCDMTTDGGGWTLV 46
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
105-258 5.76e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 45.78  E-value: 5.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  105 VENMKSEMAQIQQNAVQNhtatmleIGTSLLSQTAEQTRKLTDVETQVLNQTSRleIQLLENSLSTYKLEKQLLQQTNEI 184
Cdd:TIGR04523 290 LNQLKSEISDLNNQKEQD-------WNKELKSELKNQEKKLEEIQNQISQNNKI--ISQLNEQISQLKKELTNSESENSE 360
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 966959689  185 LK--IHEKNSLLEHKILEMEGKhKEELDTLKEEKENLQGLVTRQTYIIQELEKQLNRATTNNSVLQKQQLELMDTV 258
Cdd:TIGR04523 361 KQreLEEKQNEIEKLKKENQSY-KQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETI 435
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
81-253 4.72e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 42.70  E-value: 4.72e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   81 QKLQHLEHVMENYTQWLQKLENYIvENMKSEMAQI--------QQNAVQNHTATMLEIGTSLLSQTAEQTRKLTDVETQv 152
Cdd:TIGR04523 391 SQINDLESKIQNQEKLNQQKDEQI-KKLQQEKELLekeierlkETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLET- 468
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  153 lnQTSRLEIQLLENSLSTYKLEKQLLQQTNEILKIHEKNSLLEHKILEMegkhKEELDTLKEEKENLQGLVTRQTYIIQE 232
Cdd:TIGR04523 469 --QLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDL----TKKISSLKEKIEKLESEKKEKESKISD 542
                         170       180
                  ....*....|....*....|.
gi 966959689  233 LEKQLNratTNNSVLQKQQLE 253
Cdd:TIGR04523 543 LEDELN---KDDFELKKENLE 560
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
79-279 5.17e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 42.75  E-value: 5.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689    79 SSQKLQHLEHVMENYTQWLQKLENYIvENMKSEMAQIQQNAVQNHTATMLEIgTSLLSQTAEQTRKLTDVETQVlnqtSR 158
Cdd:TIGR02169  675 ELQRLRERLEGLKRELSSLQSELRRI-ENRLDELSQELSDASRKIGEIEKEI-EQLEQEEEKLKERLEELEEDL----SS 748
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   159 LEIQLLENSLSTYKLEKQLLQQTNEILKIHEKNSLLEHKILEmegkhkEELDTLKEEKENLQGLVTRQTYIIQELEKQLN 238
Cdd:TIGR02169  749 LEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH------SRIPEIQAELSKLEEEVSRIEARLREIEQKLN 822
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 966959689   239 RATTNNSVLQKQ----QLELMDTVHNLVNLCTKEVLLKGGKREEE 279
Cdd:TIGR02169  823 RLTLEKEYLEKEiqelQEQRIDLKEQIKSIEKEIENLNGKKEELE 867
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
55-256 6.53e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 42.35  E-value: 6.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689    55 RESTTDQYNTNALQRDAPHvEPDFSSQKLQHLEHVMENYTQWLQKLENYIVE-NMKSEMAQIQQNAVQNHTATMLEIGTS 133
Cdd:TIGR02168  284 EELQKELYALANEISRLEQ-QKQILRERLANLERQLEELEAQLEELESKLDElAEELAELEEKLEELKEELESLEAELEE 362
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   134 LLSQTAEQTRKLTDVETQVLNQTSRL------------EIQLLENSLStyKLEKQLLQQTNEILKIHEKNSLLEHKILEM 201
Cdd:TIGR02168  363 LEAELEELESRLEELEEQLETLRSKVaqlelqiaslnnEIERLEARLE--RLEDRRERLQQEIEELLKKLEEAELKELQA 440
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 966959689   202 E-GKHKEELDTLKEE-------KENLQGLVTRQTYIIQELEKQLNRATTNNSVLQKQQLELMD 256
Cdd:TIGR02168  441 ElEELEEELEELQEElerleeaLEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG 503
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
136-254 7.18e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 42.23  E-value: 7.18e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 136 SQTAEQTRKLTDVETQVLNQTSRLEIQLLENSLStyKLEKQLLQQTNEILKIHEKNSLLEHKILEMEGKHKEELDTLKEE 215
Cdd:COG1196  209 AEKAERYRELKEELKELEAELLLLKLRELEAELE--ELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEA 286
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 966959689 216 KENLQGLVTRqtyiIQELEKQLNRATTNNSVLQKQQLEL 254
Cdd:COG1196  287 QAEEYELLAE----LARLEQDIARLEERRRELEERLEEL 321
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
82-250 9.16e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 41.93  E-value: 9.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   82 KLQHLEHVMENYTQWLQKLENYIVEnMKSEMAQIQQNAVQNHTAtmLEIGTSLLSQTAEQTRKLTDVETQVLNQTSRLEI 161
Cdd:TIGR04523 198 KLELLLSNLKKKIQKNKSLESQISE-LKKQNNQLKDNIEKKQQE--INEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQK 274
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  162 QLLENSLSTYKLEKQLLQQTNEILKIHEKNSLLEHKILemegkhKEELDTLKEEKENLQGLVTRQTYIIQEL-------E 234
Cdd:TIGR04523 275 ELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKEL------KSELKNQEKKLEEIQNQISQNNKIISQLneqisqlK 348
                         170
                  ....*....|....*.
gi 966959689  235 KQLNRATTNNSVLQKQ 250
Cdd:TIGR04523 349 KELTNSESENSEKQRE 364
SHE3 pfam17078
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an ...
176-250 1.72e-03

SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an RNA-binding protein that binds specific mRNAs, including the mRNA of Ash1, which is invalid in cell-fate determination. She3 acts as an adapter protein that docks the myosin motor Myo4p onto an Ash1-She2p ribonucleoprotein complex. She3 seems to bind to Myo4p and Shep2p via different domains.


Pssm-ID: 293683 [Multi-domain]  Cd Length: 228  Bit Score: 40.11  E-value: 1.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  176 QLLQQTNEILKIHEKNSllehkilEMEGKHKEELDTLKEEKENLQGLVTRQTYIIQELEKQLN-------RATTNNSVLQ 248
Cdd:pfam17078  21 QLTVQSQNLLSKLEIAQ-------QKESKFLENLASLKHENDNLSSMLNRKERRLKDLEDQLSelknsyeELTESNKQLK 93

                  ..
gi 966959689  249 KQ 250
Cdd:pfam17078  94 KR 95
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
77-261 2.15e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 40.81  E-value: 2.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689    77 DFSSQKLQHLEHVMENYTQWLQKLENYIVE------NMKSEMAQIQQNavqnhtatmleigtsLLSQTAEQTRKltDVET 150
Cdd:TIGR02168  242 EELQEELKEAEEELEELTAELQELEEKLEElrlevsELEEEIEELQKE---------------LYALANEISRL--EQQK 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   151 QVLNQtsrlEIQLLENSLSTYKLEKQLLQQTNEILKihEKNSLLEHKILEMEGKHKEELDTLKEEKENLQGLVTRqtyiI 230
Cdd:TIGR02168  305 QILRE----RLANLERQLEELEAQLEELESKLDELA--EELAELEEKLEELKEELESLEAELEELEAELEELESR----L 374
                          170       180       190
                   ....*....|....*....|....*....|.
gi 966959689   231 QELEKQLNRATTNNSVLQKQQLELMDTVHNL 261
Cdd:TIGR02168  375 EELEEQLETLRSKVAQLELQIASLNNEIERL 405
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
79-261 2.17e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 40.65  E-value: 2.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   79 SSQKLQhLEHVMENYTQWLQKLENYIVEN--MKSEMAQIQQN-AVQNHTATMLeiGTSLLSQTAEQTRKLTDVEtQVLNQ 155
Cdd:pfam07888 205 DTQVLQ-LQDTITTLTQKLTTAHRKEAENeaLLEELRSLQERlNASERKVEGL--GEELSSMAAQRDRTQAELH-QARLQ 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  156 TSRLEIQLLENSL------STYKLEKQLLQQTNEilKIHEKNSLLEHKILEMEGKHKEE------LDT-LKEEKENLQGL 222
Cdd:pfam07888 281 AAQLTLQLADASLalregrARWAQERETLQQSAE--ADKDRIEKLSAELQRLEERLQEErmerekLEVeLGREKDCNRVQ 358
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 966959689  223 VTRQTYIIQELEKQLNRATTNNSVLQKQQLELMDTVHNL 261
Cdd:pfam07888 359 LSESRRELQELKASLRVAQKEKEQLQAEKQELLEYIRQL 397
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
134-254 2.83e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 39.52  E-value: 2.83e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 134 LLSQTAEQTRKLTDVETQVlnQTSRLEIQLLENSLSTY-----KLEKQLLQQTN---------EILKIHEKNSLLEHKIL 199
Cdd:COG1579   36 LEDELAALEARLEAAKTEL--EDLEKEIKRLELEIEEVearikKYEEQLGNVRNnkeyealqkEIESLKRRISDLEDEIL 113
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 966959689 200 EMEgkhkEELDTLKEEKENLQGLVTRQTYIIQELEKQLNRATTNnsvLQKQQLEL 254
Cdd:COG1579  114 ELM----ERIEELEEELAELEAELAELEAELEEKKAELDEELAE---LEAELEEL 161
DUF724 pfam05266
Protein of unknown function (DUF724); This family contains several uncharacterized proteins ...
172-217 3.17e-03

Protein of unknown function (DUF724); This family contains several uncharacterized proteins found in Arabidopsis thaliana and other plants. This region is often found associated with Agenet domains and may contain coiled-coil.


Pssm-ID: 428400 [Multi-domain]  Cd Length: 188  Bit Score: 38.79  E-value: 3.17e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 966959689  172 KLEKQLLQQTNEILKIHEKNSLLEHKILEMEgkhkEELDTLKEEKE 217
Cdd:pfam05266 113 KLEKKIAEEESEKRKLEEEIDELEKKILELE----RQLALAKEKKE 154
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
77-255 3.44e-03

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 40.11  E-value: 3.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   77 DFSSQKLQHLEHVMENytqwLQKLENYIVENMKsEMAQIQQNAVQNhtatmLEIGTSLLSQTAEQTRKLTDVETQVLNQT 156
Cdd:pfam05557  44 DRESDRNQELQKRIRL----LEKREAEAEEALR-EQAELNRLKKKY-----LEALNKKLNEKESQLADAREVISCLKNEL 113
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  157 SRL--EIQLLENSLSTYKLEKQLLQQTNEILkiHEKNSLLEHKILEMEGKHKEELDtlKEEKenlqglvtrqtyiIQELE 234
Cdd:pfam05557 114 SELrrQIQRAELELQSTNSELEELQERLDLL--KAKASEAEQLRQNLEKQQSSLAE--AEQR-------------IKELE 176
                         170       180
                  ....*....|....*....|.
gi 966959689  235 KQLNRATTNNSVLQKQQLELM 255
Cdd:pfam05557 177 FEIQSQEQDSEIVKNSKSELA 197
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
45-254 6.14e-03

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 39.17  E-value: 6.14e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689  45 FILPEHDGNCRESTTDQYnTNALQRDAPHVEPDFSSQ--KLQHLEHVMENYTQWLQKLENYIVENMKSEMAQIQQNAVQN 122
Cdd:COG5185  238 FQDPESELEDLAQTSDKL-EKLVEQNTDLRLEKLGENaeSSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEE 316
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 123 HTATMlEIGTSLLSQTAEQTRKLTDVETQVLNQTSRLE--IQLLENSLSTYKLEKQLLQQTNEILKIhekNSLLEHKILE 200
Cdd:COG5185  317 QLAAA-EAEQELEESKRETETGIQNLTAEIEQGQESLTenLEAIKEEIENIVGEVELSKSSEELDSF---KDTIESTKES 392
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....
gi 966959689 201 MEGKHKEELDTLKEEKENLQGLVTRQTYIIQELEKQLNRATTNNSVLQKQQLEL 254
Cdd:COG5185  393 LDEIPQNQRGYAQEILATLEDTLKAADRQIEELQRQIEQATSSNEEVSKLLNEL 446
PRK10935 PRK10935
nitrate/nitrite two-component system sensor histidine kinase NarQ;
122-255 7.08e-03

nitrate/nitrite two-component system sensor histidine kinase NarQ;


Pssm-ID: 236800 [Multi-domain]  Cd Length: 565  Bit Score: 39.07  E-value: 7.08e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 122 NHTATmlEIGT---SLLSQTAEQTRKLTDV--ETQVLNQTSRLeiqllensLSTYKLEKQLLQQTNEILKIHEKNSLLEH 196
Cdd:PRK10935 216 NQMSS--ELHKlyrSLEASVEEKTRKLTQAnrSLEVLYQCSQA--------LNASQIDVHCFRHILQIVRDHEGLDYLEL 285
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 966959689 197 KILE------MEGKHKEELD----TLKEEKENL------QGLVTRQTYIIQELEKQLNRATT-NNSVLQKQQLELM 255
Cdd:PRK10935 286 EVGEnehwriSEGQPNPELPwqilPLTMEDTVLgylhwqASLPCPDEPLMNNVAQMLGRGLYfNQAQKQQQQLLLM 361
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
160-261 8.90e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 37.98  E-value: 8.90e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 160 EIQLLENSLStyKLEKQLLQQTNEILKIHEKNSLLEHKILEMEG---KHKEELDTLKEEKEnlqglvtrqtyiIQELEKQ 236
Cdd:COG1579   32 ELAELEDELA--ALEARLEAAKTELEDLEKEIKRLELEIEEVEArikKYEEQLGNVRNNKE------------YEALQKE 97
                         90       100
                 ....*....|....*....|....*
gi 966959689 237 LNRATTNNSVLQKQQLELMDTVHNL 261
Cdd:COG1579   98 IESLKRRISDLEDEILELMERIEEL 122
PRK12704 PRK12704
phosphodiesterase; Provisional
139-235 8.92e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 38.61  E-value: 8.92e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689 139 AEQTRK--LTDVETQVLNQTSRLEIQLLE--NSLStyKLEKQLLQQTNEILKIHEKNSLLEHKILEMEGKHKEELDTLKE 214
Cdd:PRK12704  51 AEAIKKeaLLEAKEEIHKLRNEFEKELRErrNELQ--KLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEK 128
                         90       100
                 ....*....|....*....|.
gi 966959689 215 EKENLQGLVTRQtyiIQELEK 235
Cdd:PRK12704 129 KEEELEELIEEQ---LQELER 146
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
83-258 9.53e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 38.90  E-value: 9.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689    83 LQHLEHVMENYTQWLQKLE---NYIVENMKSEMAQI------QQNAVQnhtATMLEIGT---SLLSQTAEQTRKLTDVET 150
Cdd:TIGR02169  246 LASLEEELEKLTEEISELEkrlEEIEQLLEELNKKIkdlgeeEQLRVK---EKIGELEAeiaSLERSIAEKERELEDAEE 322
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 966959689   151 QVLNQTSRL-----EIQLLENSLSTYKLEKQLLqqTNEILKIHEKNSLLEHKILEMEGKH----------KEELDTLKEE 215
Cdd:TIGR02169  323 RLAKLEAEIdkllaEIEELEREIEEERKRRDKL--TEEYAELKEELEDLRAELEEVDKEFaetrdelkdyREKLEKLKRE 400
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 966959689   216 KENLQGLVTRQTYIIQELEKQLNRATTNNSVLQKQQLELMDTV 258
Cdd:TIGR02169  401 INELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEK 443
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH