Conserved domains on  [gi|169234729|ref|NP_001108482|]
protein lin-54 homolog isoform 1 [Mus musculus]

Protein Classification

TCR domain-containing protein (domain architecture ID 10508328)

TCR domain-containing protein similar to Homo sapiens tesmin, a testis-specific metallothionein-like protein, and to Arabidopsis thaliana TSO1, a novel protein that modulates cytokinesis and cell expansion in Arabidopsis

Gene Ontology:  GO:0003700|GO:0003677|GO:0046872

List of domain hits

Name Accession Description Interval E-value
TCR  pfam03638  Tesmin/TSO1-like CXC domain, cysteine-rich domain  596-631  2.34e-20

Tesmin/TSO1-like CXC domain, cysteine-rich domain; This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in.


Pssm-ID: 461001  Cd Length: 38  Bit Score: 84.58  E-value: 2.34e-20
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 169234729  596 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN 631
Cdd:pfam03638   1 KKGCNCKKSKCLKLYCECFAAGVFCSSNCKCEGCKN 36
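A minimal sketch of how the two rows above can be compared column by column. The function name is hypothetical, and the rule that gap characters and lowercase residues are skipped as unaligned is an assumption about the display convention, not something stated on this page.

```python
def count_identities(query: str, subject: str) -> tuple[int, int]:
    """Return (identical columns, aligned columns) for two alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same width")
    identical = aligned = 0
    for q, s in zip(query, subject):
        # Assumption: gaps ('-') and lowercase residues are not aligned columns.
        if not (q.isupper() and s.isupper()):
            continue
        aligned += 1
        identical += q == s
    return identical, aligned


# Rows copied from the alignment above.
QUERY = "SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN"      # gi 169234729, residues 596-631
CONSENSUS = "KKGCNCKKSKCLKLYCECFAAGVFCSSNCKCEGCKN"  # pfam03638, positions 1-36

identical, aligned = count_identities(QUERY, CONSENSUS)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

Percent identity is only a rough descriptor here; the reported Bit Score and E-value come from the position-specific scoring matrix, not from this count.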
TCR  pfam03638  Tesmin/TSO1-like CXC domain, cysteine-rich domain  522-545  4.99e-10

Tesmin/TSO1-like CXC domain, cysteine-rich domain; This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in.


Pssm-ID: 461001  Cd Length: 38  Bit Score: 54.92  E-value: 4.99e-10
                          10        20
                  ....*....|....*....|....
gi 169234729  522 RKPCNCTKSLCLKLYCDCFANGEF 545
Cdd:pfam03638   1 KKGCNCKKSKCLKLYCECFAAGVF 24
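The cysteine motif quoted in the family description can be turned into a regular expression and tried against the two TCR segments above. This is a sketch under one assumption: "X" is read as any single residue and "Xn" as exactly n residues. The helper names are hypothetical.

```python
import re

MOTIF = "C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C"


def motif_to_regex(motif: str) -> str:
    """Translate the dash-separated motif into a regular expression."""
    parts = []
    for token in motif.split("-"):
        spacer = re.fullmatch(r"X(\d*)", token)
        if spacer:
            count = spacer.group(1)
            # Assumption: bare "X" is one residue, "Xn" is exactly n residues.
            parts.append("." if count == "" else ".{" + count + "}")
        else:
            parts.append(re.escape(token))  # literal residues such as "C" or "YC"
    return "".join(parts)


def find_motif(segment, first_residue, pattern):
    """Return the (start, end) protein coordinates of the first match, or None."""
    match = pattern.search(segment)
    if match is None:
        return None
    return first_residue + match.start(), first_residue + match.end() - 1


PATTERN = re.compile(motif_to_regex(MOTIF))
HIT_1 = ("SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKN", 596)  # interval 596-631
HIT_2 = ("RKPCNCTKSLCLKLYCDCFANGEF", 522)              # interval 522-545

for segment, first in (HIT_1, HIT_2):
    print(first, find_motif(segment, first, PATTERN))
```

Under this reading the 596-631 segment contains one complete copy of the motif, while the 522-545 segment is too short to contain a full copy, which is consistent with it aligning to only the first 24 positions of the model.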
PHA03247 super family  cl33720  large tegument protein UL36; Provisional  165-499  1.52e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 1.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729  165 QKVTAQAQPGDAKLPPQQ---------IKVVTIGGRPEVKPVIGVSALTPGSQLINTTTQPSVLQtQQLKTVQIAKKPRT 235
Cdd:PHA03247 2665 RRARRLGRAAQASSPPQRprrraarptVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR-QASPALPAAPAPPA 2743
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729  236 PTSGPVItklifakPINSKAVTGQTTQASPPVVTGrvlSQSTPGTPSKTITISESGVIGSTLNSTTQTPNKIAI-SPLKS 314
Cdd:PHA03247 2744 VPAGPAT-------PGGPARPARPPTTAGPPAPAP---PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPpAAVLA 2813
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729  315 PNKTVKSAVQTITVGGMSTSQFKTIIPLATAPNVQQIQ-----VPGSKFHY---VRLVTATTASSSAQPVSQ--SPSVNT 384
Cdd:PHA03247 2814 PAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvAPGGDVRRrppSRSPAAKPAAPARPPVRRlaRPAVSR 2893
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729  385 QPLQQAKPvvvNTTPVRMSVPFVQAQAVKQVVPKPinstsQIVTTSQPQQRLIMPATPLPQIQPNLTNLPPGTVLAPAPG 464
Cdd:PHA03247 2894 STESFALP---PDQPERPPQPQAPPPPQPQPQPPP-----PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965
                         330       340       350
                  ....*....|....*....|....*....|....*
gi 169234729  465 TGNVGYAVLPAQYVTQLQQSSYVSIASNSNFTGTS 499
Cdd:PHA03247 2966 ALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHS 3000
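Each alignment row carries a start coordinate, the aligned text, and an end coordinate, so a row can be sanity-checked by counting its residues. The sketch below assumes the row layout seen on this page (label, start, sequence, end, separated by whitespace) and that every letter, upper or lower case, consumes one coordinate while "-" consumes none. The function name is hypothetical.

```python
import re

# Layout assumed from the rows on this page: label, start, sequence, end.
ROW = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)$"
)


def check_row(line: str):
    """Return (label, residues counted in the row, span implied by coordinates)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError("not an alignment row: " + line)
    # Letters of either case are residues; '-' is a gap and is not counted.
    residues = sum(ch.isalpha() for ch in m["seq"])
    span = int(m["end"]) - int(m["start"]) + 1
    return m["label"], residues, span


# First block of the PHA03247 alignment, copied from above.
ROWS = [
    "gi 169234729  165 QKVTAQAQPGDAKLPPQQ---------IKVVTIGGRPEVKPVIGVSALTPGSQLINTTTQPSVLQtQQLKTVQIAKKPRT 235",
    "Cdd:PHA03247 2665 RRARRLGRAAQASSPPQRprrraarptVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR-QASPALPAAPAPPA 2743",
]

for row in ROWS:
    label, residues, span = check_row(row)
    print(label, residues, span, "ok" if residues == span else "MISMATCH")
```

A mismatch between the two numbers would suggest that a row was truncated or wrapped when the page was copied.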
 
MSL2_CXC  cd13122  DNA-binding cysteine-rich domain of male-specific lethal 2 and related proteins  596-631  2.79e-04

DNA-binding cysteine-rich domain of male-specific lethal 2 and related proteins; The CXC domain of Drosophila melanogaster MSL2 forms a Zn(3)Cys(9) cluster and is involved in recruiting members of the dosage compensation complex (DCC) to sites on the X chromosome.


Pssm-ID: 240555  Cd Length: 50  Bit Score: 39.30  E-value: 2.79e-04
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|...
gi 169234729 596 SKGCNCKRSG-------CLKNYCECYEAKIMCSSiCKCIGCKN 631
Cdd:cd13122    4 KKGCRCGTATqspgvltCRGQRCPCYSNGKSCLD-CKCRGCKN 45
SP2_N  cd22540  N-terminal domain of transcription factor Specificity Protein (SP) 2  208-466  4.32e-04

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9.
The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 43.38  E-value: 4.32e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729 208 QLINTTTQPSVLQTQQLKTVQIAKKPrtptsgpvitklifaKPINSKAVTGQTTQASPPVVTGRVLSQSTPGTPSKTITI 287
Cdd:cd22540  145 QIIPGTNQAIITPVQVLQQPQQAHKP---------------VPIKPAPLQTSNTNSASLQVPGNVIKLQSGGNVALTLPV 209
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729 288 SESgvigsTLNSTTQTPNKIAISPLKSPNKTVKSAVQTITVGGMSTSQFKTIIPLATAPNVQQ-------IQVPGSKFHY 360
Cdd:cd22540  210 NNL-----VGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVTVAEQVETVLIETTADNIIQagnnlliVQSPGTGQPA 284
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729 361 VRLVTATTASSSAQPVSQSPsvntqplQQAKPVVVNTTPVRMSVPFVQAQAVKQVVPKPINSTSQIVTTSQPQQRLIMPA 440
Cdd:cd22540  285 VLQQVQVLQPKQEQQVVQIP-------QQALRVVQAASATLPTVPQKPLQNIQIQNSEPTPTQVYIKTPSGEVQTVLLQE 357
                        250       260
                 ....*....|....*....|....*.
gi 169234729 441 TPLPQIQPNLTNLPPGTVLAPAPGTG 466
Cdd:cd22540  358 APAATATPSSSTSTVQQQVTANNGTG 383
Herpes_BLLF1  pfam05109  Herpes virus major outer envelope glycoprotein (BLLF1)  198-505  5.55e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 40.28  E-value: 5.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729  198 IGVSAL--TPGSQLINTTTQPSVLQTQQlktVQIAKKPRTPTSGPVITKLIFAKPINSKAVTGQT---TQASPPVVTGRV 272
Cdd:pfam05109 391 ITVSGLgtAPKTLIITRTATNATTTTHK---VIFSKAPESTTTSPTLNTTGFAAPNTTTGLPSSThvpTNLTAPASTGPT 467
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729  273 LSQSTPGTPSKTITISesgviGSTLNSTTQTPNKIAISPlKSPNKTV-KSAVQTITVGGMS-TSQFKTIIPLATAPNVQQ 350
Cdd:pfam05109 468 VSTADVTSPTPAGTTS-----GASPVTPSPSPRDNGTES-KAPDMTSpTSAVTTPTPNATSpTPAVTTPTPNATSPTLGK 541
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 169234729  351 iqvpgskfhyvrlVTATTASSSAQPVSQSPSvntqplqqakPVVVNTTPvRMSVPFVQAQAVKQVVPKPI-NSTSQIVTT 429
Cdd:pfam05109 542 -------------TSPTSAVTTPTPNATSPT----------PAVTTPTP-NATIPTLGKTSPTSAVTTPTpNATSPTVGE 597
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 169234729  430 SQPQQRLIMPATPLPQIQPNLTNLPPGTVLAPAPGTGNVGYAVLPAQYVTQLQQSSYVSIASNSNFTGTSGIQTQA 505
Cdd:pfam05109 598 TSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSA 673
 
MSL2-CXC  pfam16682  CXC domain of E3 ubiquitin-protein ligase MSL2  592-631  2.42e-04

CXC domain of E3 ubiquitin-protein ligase MSL2; MSL2-CXC is an autonomously folded domain that binds three zinc ions. It is found in the E3 ubiquitin-protein ligase MSL2 in eukaryotes. The CXC domain contributes critically to the DNA-binding activity of MSL2. It carries 9 invariant cysteines within a region of about 50 residues.


Pssm-ID: 435512  Cd Length: 55  Bit Score: 39.34  E-value: 2.42e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 169234729  592 DRRHSKGCNCKRSG-------CLKNYCECYEAKIMCSSiCKCIGCKN 631
Cdd:pfam16682   1 KPPEKKGCRCGTSTptppkltCRNQRCPCYSNGKSCTD-CKCRGCKN 46
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd  Low complexity filter: no  Composition Based Adjustment: yes  E-value threshold: 0.01
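
As a rough illustration of what the E-value threshold does, the sketch below re-applies a cutoff to the hits listed on this page. The Hit type and filter_hits function are hypothetical, the values are transcribed from the hit list, and treating the threshold as inclusive is an assumption.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int
    end: int
    evalue: float


# Transcribed from the domain hits on this page.
HITS = [
    Hit("TCR", "pfam03638", 596, 631, 2.34e-20),
    Hit("TCR", "pfam03638", 522, 545, 4.99e-10),
    Hit("PHA03247", "cl33720", 165, 499, 1.52e-04),  # superfamily accession
    Hit("MSL2-CXC", "pfam16682", 592, 631, 2.42e-04),
    Hit("MSL2_CXC", "cd13122", 596, 631, 2.79e-04),
    Hit("SP2_N", "cd22540", 208, 466, 4.32e-04),
    Hit("Herpes_BLLF1", "pfam05109", 198, 505, 5.55e-03),
]


def filter_hits(hits, threshold):
    """Keep hits at or below the threshold (assumed inclusive), best first."""
    kept = [h for h in hits if h.evalue <= threshold]
    return sorted(kept, key=lambda h: h.evalue)


for cutoff in (0.01, 1e-5):
    names = [f"{h.name}:{h.start}-{h.end}" for h in filter_hits(HITS, cutoff)]
    print(cutoff, names)
```

With the preset value of 0.01 every listed hit is retained; a stricter cutoff such as 1e-5 leaves only the two TCR hits, which are also the only ones that cover the cysteine-rich CXC regions with strong support.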
