Conserved domains on [gi|755512387|ref|XP_011247731|]

protein lin-54 homolog isoform X4 [Mus musculus]



List of domain hits

Name Accession Description Interval E-value
TCR pfam03638
Tesmin/TSO1-like CXC domain, cysteine-rich domain; This family includes proteins that have two ...
375-410 3.04e-18

Tesmin/TSO1-like CXC domain, cysteine-rich domain; This family includes proteins that have two copies of a cysteine-rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family has also been called a CXC domain.



Pssm-ID: 461001  Cd Length: 38  Bit Score: 78.03  E-value: 3.04e-18
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 755512387  375 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN 410
Cdd:pfam03638   1 KKGCNCKKSKCLKLYCECFAAGVFCSSNCKCEGCKN 36
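
The motif quoted in the family description can be written as a regular expression and checked against the two aligned segments above. What follows is a minimal Python sketch, not the method CD-Search uses (the hit itself comes from a position-specific scoring matrix, as the Pssm-ID indicates). The pattern is a direct transcription of C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C, assuming each X stands for any single residue; the names `CXC_MOTIF` and `find_motif` are illustrative only.

```python
import re

# Transcription of the motif from the pfam03638 description,
# reading each X as "any single residue" (an assumption of this sketch).
CXC_MOTIF = re.compile(r"C.C.{4}C.{3}YC.C.{6}C.{3}C.C.{2}C")

# Aligned segments copied from the alignment above.
query_start = 375  # first query residue of the aligned segment
query_segment = "SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN"
model_segment = "KKGCNCKKSKCLKLYCECFAAGVFCSSNCKCEGCKN"


def find_motif(segment, offset=1):
    """Return the 1-based (start, end) of the first motif match, or None."""
    m = CXC_MOTIF.search(segment)
    if m is None:
        return None
    return (offset + m.start(), offset + m.end() - 1)


query_hit = find_motif(query_segment, query_start)
model_hit = find_motif(model_segment)
print("query:", query_hit)
print("model:", model_hit)
```

Under these assumptions the full motif falls at query residues 378-408, inside the reported 375-410 interval, and at model positions 4-34.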
TCR pfam03638
Tesmin/TSO1-like CXC domain, cysteine-rich domain; This family includes proteins that have two ...
301-324 9.91e-09

Tesmin/TSO1-like CXC domain, cysteine-rich domain; This family includes proteins that have two copies of a cysteine-rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family has also been called a CXC domain.



Pssm-ID: 461001  Cd Length: 38  Bit Score: 51.06  E-value: 9.91e-09
                          10        20
                  ....*....|....*....|....
gi 755512387  301 RKPCNCTKSLCLKLYCDCFANGEF 324
Cdd:pfam03638   1 KKGCNCKKSKCLKLYCECFAAGVF 24
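
How much of the 38-residue model each pfam03638 hit spans follows from the model coordinates on the `Cdd:` lines and the reported Cd Length. The Python sketch below works this out for the two hits; the helper name `model_coverage` is hypothetical, and coverage is taken here simply as the aligned model span divided by the model length.

```python
CD_LENGTH = 38  # "Cd Length" reported for pfam03638


def model_coverage(model_from, model_to, cd_length=CD_LENGTH):
    """Fraction of the domain model spanned by an alignment (1-based, inclusive)."""
    return (model_to - model_from + 1) / cd_length


# Model coordinates taken from the two "Cdd:pfam03638" alignment lines,
# keyed by the query interval of each hit.
hits = {
    "375-410": (1, 36),
    "301-324": (1, 24),
}

coverage = {interval: model_coverage(*span) for interval, span in hits.items()}
for interval, frac in coverage.items():
    print(f"{interval}: {frac:.1%} of the model")
```

The 375-410 hit spans 36 of 38 model positions, while the 301-324 hit stops at model position 24, which is in keeping with its lower bit score.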
SP1-4_N super family cl41773
N-terminal domain of transcription factor Specificity Proteins (SP) 1-4; Specificity Proteins ...
40-245 4.79e-04

N-terminal domain of transcription factor Specificity Proteins (SP) 1-4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. There are many SPs in vertebrates (9 SPs in humans and mice, 7 SPs in chicken, and 11 SPs in teleost fish), but arthropods only have 3 SPs. SPs belong to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains of the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP1-4.


The actual alignment was detected with superfamily member cd22540:

Pssm-ID: 425404 [Multi-domain]  Cd Length: 511  Bit Score: 42.61  E-value: 4.79e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755512387  40 TQASPPVVTGRVLSQSTPGTPSKTITISESgvigsTLNSTTQTPNKIAISPLKSPNKTVKSAVQTITVGGMSTSQFKTII 119
Cdd:cd22540  183 TNSASLQVPGNVIKLQSGGNVALTLPVNNL-----VGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVTVAEQVETVL 257
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755512387 120 PLATAPNVQQ-------IQVPGSKFHYVRLVTATTASSSAQPVSQSPsvntqplQQAKPVVVNTTPVRMSVPFVQAQAVK 192
Cdd:cd22540  258 IETTADNIIQagnnlliVQSPGTGQPAVLQQVQVLQPKQEQQVVQIP-------QQALRVVQAASATLPTVPQKPLQNIQ 330
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|...
gi 755512387 193 QVVPKPINSTSQIVTTSQPQQRLIMPATPLPQIQPNLTNLPPGTVLAPAPGTG 245
Cdd:cd22540  331 IQNSEPTPTQVYIKTPSGEVQTVLLQEAPAATATPSSSTSTVQQQVTANNGTG 383
 
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
40-245 4.79e-04

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains of the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 42.61  E-value: 4.79e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755512387  40 TQASPPVVTGRVLSQSTPGTPSKTITISESgvigsTLNSTTQTPNKIAISPLKSPNKTVKSAVQTITVGGMSTSQFKTII 119
Cdd:cd22540  183 TNSASLQVPGNVIKLQSGGNVALTLPVNNL-----VGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVTVAEQVETVL 257
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755512387 120 PLATAPNVQQ-------IQVPGSKFHYVRLVTATTASSSAQPVSQSPsvntqplQQAKPVVVNTTPVRMSVPFVQAQAVK 192
Cdd:cd22540  258 IETTADNIIQagnnlliVQSPGTGQPAVLQQVQVLQPKQEQQVVQIP-------QQALRVVQAASATLPTVPQKPLQNIQ 330
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|...
gi 755512387 193 QVVPKPINSTSQIVTTSQPQQRLIMPATPLPQIQPNLTNLPPGTVLAPAPGTG 245
Cdd:cd22540  331 IQNSEPTPTQVYIKTPSGEVQTVLLQEAPAATATPSSSTSTVQQQVTANNGTG 383
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-278 3.22e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.69  E-value: 3.22e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755512387    5 PAEIAKKPRTPTSGPVItklifakPINSKAVTGQTTQASPPVVTGrvlSQSTPGTPSKTITISESGVIGSTLNSTTQTPN 84
Cdd:PHA03247 2734 ALPAAPAPPAVPAGPAT-------PGGPARPARPPTTAGPPAPAP---PAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755512387   85 KIAI-SPLKSPNKTVKSAVQTITVGGMSTSQFKTIIPLATAPNVQQIQ-----VPGSKFHY---VRLVTATTASSSAQPV 155
Cdd:PHA03247 2804 PADPpAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvAPGGDVRRrppSRSPAAKPAAPARPPV 2883
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755512387  156 SQ--SPSVNTQPLQQAKPvvvNTTPVRMSVPFVQAQAVKQVVPKPinstsQIVTTSQPQQRLIMPATPLPQIQPNLTNLP 233
Cdd:PHA03247 2884 RRlaRPAVSRSTESFALP---PDQPERPPQPQAPPPPQPQPQPPP-----PPQPQPPPPPPPRPQPPLAPTTDPAGAGEP 2955
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*
gi 755512387  234 PGTVLAPAPGTGNVGYAVLPAQYVTQLQQSSYVSIASNSNFTGTS 278
Cdd:PHA03247 2956 SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHS 3000
MSL2_CXC cd13122
DNA-binding cysteine-rich domain of male-specific lethal 2 and related proteins; The CXC ...
375-410 3.34e-03

DNA-binding cysteine-rich domain of male-specific lethal 2 and related proteins; The CXC domain of Drosophila melanogaster MSL2 forms a Zn(3)Cys(9) cluster and is involved in recruiting members of the dosage compensation complex (DCC) to sites on the X chromosome.


Pssm-ID: 240555  Cd Length: 50  Bit Score: 35.83  E-value: 3.34e-03
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|...
gi 755512387 375 SKGCNCKRSG-------CLKNYCECYEAKIMCSSiCKCIGCKN 410
Cdd:cd13122    4 KKGCRCGTATqspgvltCRGQRCPCYSNGKSCLD-CKCRGCKN 45
 
MSL2-CXC pfam16682
CXC domain of E3 ubiquitin-protein ligase MSL2; MSL2-CXC is an autonomously folded domain ...
371-410 2.19e-03

CXC domain of E3 ubiquitin-protein ligase MSL2; MSL2-CXC is an autonomously folded domain that binds three zinc ions. It is found in the E3 ubiquitin-protein ligase MSL2 in eukaryotes. The CXC domain contributes critically to the DNA-binding activity of MSL2. It carries 9 invariant cysteines within a region of about 50 residues.


Pssm-ID: 435512  Cd Length: 55  Bit Score: 36.26  E-value: 2.19e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 755512387  371 DRRHSKGCNCKRSG-------CLKNYCECYEAKIMCSSiCKCIGCKN 410
Cdd:pfam16682   1 KPPEKKGCRCGTSTptppkltCRNQRCPCYSNGKSCTD-CKCRGCKN 46
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
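
The E-value threshold in the preset options can be checked against the hits reported on this page. The Python sketch below copies each distinct hit (name, accession, interval, E-value) from the lists above, keeps those at or below the threshold, and ranks them by E-value. Treating the cutoff as inclusive is an assumption of this sketch, and the variable names are illustrative.

```python
E_VALUE_THRESHOLD = 0.01  # "E-value threshold" from the preset options

# (name, accession, interval, E-value) copied from the hit lists above.
hits = [
    ("TCR", "pfam03638", "375-410", 3.04e-18),
    ("TCR", "pfam03638", "301-324", 9.91e-09),
    ("SP2_N", "cd22540", "40-245", 4.79e-04),
    ("MSL2-CXC", "pfam16682", "371-410", 2.19e-03),
    ("PHA03247", "PHA03247", "5-278", 3.22e-03),
    ("MSL2_CXC", "cd13122", "375-410", 3.34e-03),
]

# Keep hits at or below the cutoff (inclusive comparison is an assumption).
passing = [h for h in hits if h[3] <= E_VALUE_THRESHOLD]

# Rank from most to least significant.
ranked = sorted(passing, key=lambda h: h[3])
for name, accession, interval, evalue in ranked:
    print(f"{name:10s} {accession:10s} {interval:>8s} {evalue:.2e}")
```

All six hits fall under the 0.01 cutoff, with the two pfam03638 hits far more significant than the rest.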
